# Difference between revisions of "STEM Majors"

(Added information for Python) |
|||

Line 7: | Line 7: | ||

== Python == | == Python == | ||

− | + | For people new to python (or programming in general) there are many tutorials at [https://wiki.python.org/moin/BeginnersGuide/Programmers python.org's Beginner's Tutorial Page]. | |

+ | Python scripts are grouped into "packages" for easy distribution. The most used packages for data analysis are listed here as well as some links to places to learn them: | ||

+ | * numpy: [https://numpy.org/doc/stable/user/absolute_beginners.html Numpy Absolute Beginner's Guide] | ||

+ | * scipy: [https://docs.scipy.org/doc/scipy/reference/ Scipy Documentation] | ||

+ | * pandas: [https://pandas.pydata.org/pandas-docs/stable/user_guide/10min.html 10 Minutes to Pandas] | ||

+ | * matplotlib: [https://matplotlib.org/stable/tutorials/introductory/pyplot.html#sphx-glr-tutorials-introductory-pyplot-py Pyplot Tutorial] | ||

+ | * pytorch: [https://pytorch.org/tutorials/ Tutorials] | ||

+ | |||

+ | When working with large datasets, the bottleneck is usually around efficiently iterating and structuring through the data. It useful to learn and understand basic Algorithms and Data Structures as well. The information at [https://www.tutorialspoint.com/data_structures_algorithms/index.htm Tutorialspoint] is a good starting point on learning basic Data Structures and Search/Sort algorithms. | ||

== R == | == R == |

## Revision as of 14:32, 14 May 2021

*Some brief notes, needs to be finished...*

This page contains some advice for non-CS majors who are interested in programming or have some need for programming in their coursework or research. Note that this advice also applies to CS majors who are working on interdisciplinary projects.

## Contents

## Programming Languages

The most popular (open-source) programming languages for science and mathematics are currently Python and R. Both are also popular currently because they are used a great deal in data science. There are a great many other languages which have been popular in the past or are still useful in particular situations. But if you are starting out fresh it is likely that Python or R would be your best bet for getting started. Another language/tool of note is GNU Octave, which is an open-source free alternative to MATLAB.

## Python

For people new to python (or programming in general) there are many tutorials at python.org's Beginner's Tutorial Page. Python scripts are grouped into "packages" for easy distribution. The most used packages for data analysis are listed here as well as some links to places to learn them:

- numpy: Numpy Absolute Beginner's Guide
- scipy: Scipy Documentation
- pandas: 10 Minutes to Pandas
- matplotlib: Pyplot Tutorial
- pytorch: Tutorials

When working with large datasets, the bottleneck is usually around efficiently iterating and structuring through the data. It useful to learn and understand basic Algorithms and Data Structures as well. The information at Tutorialspoint is a good starting point on learning basic Data Structures and Search/Sort algorithms.

## R

Mention Bioconductor, R information linked from GH 101 page, R for Data Science