Difference between revisions of "STEM Majors"

From Computer Science
Jump to: navigation, search
(Created page with "This page contains some advice for non-CS majors who are interested in programming or have some need for programming in their coursework or research. Note that this advice al...")
 
 
(4 intermediate revisions by 2 users not shown)
Line 5: Line 5:
  
 
== Python ==  
 
== Python ==  
Mention Sage, QGIS, python.org tutorial, how to think like a computer scientist. Mention numpy/scipy
+
For people new to python (or programming in general) there are many tutorials at [https://wiki.python.org/moin/BeginnersGuide/Programmers python.org's Beginner's Tutorial Page].
 +
Python scripts are grouped into "packages" for easy distribution. The most used packages for data analysis are listed here as well as some links to places to learn them:
 +
* numpy: [https://numpy.org/doc/stable/user/absolute_beginners.html Numpy Absolute Beginner's Guide]
 +
* scipy: [https://docs.scipy.org/doc/scipy/reference/ Scipy Documentation]
 +
* pandas: [https://pandas.pydata.org/pandas-docs/stable/user_guide/10min.html 10 Minutes to Pandas]
 +
* matplotlib: [https://matplotlib.org/stable/tutorials/introductory/pyplot.html#sphx-glr-tutorials-introductory-pyplot-py Pyplot Tutorial]
 +
* pytorch: [https://pytorch.org/tutorials/ Tutorials]
 +
 
 +
When working with large datasets, the bottleneck is usually around efficiently iterating and structuring through the data. It useful to learn and understand basic Algorithms and Data Structures as well. The information at [https://www.tutorialspoint.com/data_structures_algorithms/index.htm Tutorialspoint] is a good starting point on learning basic Data Structures and Search/Sort algorithms.
  
 
== R ==  
 
== R ==  
 
Mention Bioconductor, R information linked from GH 101 page, R for Data Science
 
Mention Bioconductor, R information linked from GH 101 page, R for Data Science
 +
 +
R is a programming language used primarily by mathematicians and data analysts. It's used for statistic computing and graphics, though it can be used for data mining as well. R can be used by [https://cran.r-project.org/ command line] or in third party programs like [https://www.rstudio.com/ RStudio]. A good tutorial of the language can be found at [https://www.tutorialspoint.com/r/index.htm Tutorialspoint].
 +
 +
Other Resources for R:
 +
* [https://www.bioconductor.org/ Bioconductor] (For Computational Biology, Genomes)
 +
* [https://r4ds.had.co.nz/ R for Data Science]
  
 
== Octave ==
 
== Octave ==
 +
Octave is a powerful alternative to MATLAB licensed under the GNU General Public License. It can be downloaded from their [https://www.gnu.org/software/octave/index website]. There is also a [https://wiki.octave.org/Using_Octave tutorial] and [https://octave.org/doc/v6.2.0/index.html#SEC_Contents full documentation] on their site.

Latest revision as of 15:27, 14 May 2021

This page contains some advice for non-CS majors who are interested in programming or have some need for programming in their coursework or research. Note that this advice also applies to CS majors who are working on interdisciplinary projects.

Programming Languages

The most popular (open-source) programming languages for science and mathematics are currently Python and R. Both are also popular currently because they are used a great deal in data science. There are a great many other languages which have been popular in the past or are still useful in particular situations. But if you are starting out fresh it is likely that Python or R would be your best bet for getting started. Another language/tool of note is GNU Octave, which is an open-source free alternative to MATLAB.

Python

For people new to python (or programming in general) there are many tutorials at python.org's Beginner's Tutorial Page. Python scripts are grouped into "packages" for easy distribution. The most used packages for data analysis are listed here as well as some links to places to learn them:

When working with large datasets, the bottleneck is usually around efficiently iterating and structuring through the data. It useful to learn and understand basic Algorithms and Data Structures as well. The information at Tutorialspoint is a good starting point on learning basic Data Structures and Search/Sort algorithms.

R

Mention Bioconductor, R information linked from GH 101 page, R for Data Science

R is a programming language used primarily by mathematicians and data analysts. It's used for statistic computing and graphics, though it can be used for data mining as well. R can be used by command line or in third party programs like RStudio. A good tutorial of the language can be found at Tutorialspoint.

Other Resources for R:

Octave

Octave is a powerful alternative to MATLAB licensed under the GNU General Public License. It can be downloaded from their website. There is also a tutorial and full documentation on their site.