Data Analysis

Python

Python is a really versatile language and useful for data analysis. It's open-source, and there are plenty of free tutorials online.

Getting Started: Anaconda

Anaconda is a user-friendly interface to launch multiple applications in which to write and execute code and manage different environments and packages.

Tutorial to install Anaconda: https://www.datacamp.com/community/tutorials/installing-anaconda-windows

Get Anaconda here: https://www.anaconda.com/


Jupyter Notebook

Jupyter Notebook is an easy-to-use interface to write and execute code. Some tutorials may be presented as Jupyter Notebooks. Jupyter should come with Anaconda, but if not here is how to get Jupyter Notebook:
https://test-jupyter.readthedocs.io/en/latest/install.html


Visual Studio Code

VSCode is another code editor. You can write in other languages using VSCode, not just Python. It has some great features to assist with debugging and runs a bit faster than the Jupyter Notebook, but isn't quite so beginner-friendly.

Getting started with VSCode: https://code.visualstudio.com/learn/get-started/basics

Get VSCode here: https://code.visualstudio.com/download

Getting started with Python in VSCode: https://code.visualstudio.com/docs/python/python-tutorial

How to get Jupyter Notebook in VSCode: https://code.visualstudio.com/docs/datascience/jupyter-notebooks


Python Tutorials

Find beginner Python and statistics workshops written by Vivien Walter on the NAS under:

storage > current > _VivienWorkshops > Python_and_statistics_workshops

These make use of Jupyter Notebooks.


Find Python tutorials for beginners with an emphasis on image analysis here: https://www.youtube.com/playlist?list=PLHae9ggVvqPgyRQQOtENr6hK0m1UquGaG


Datacamp has some really great Python courses for free.


Managing Packages and Setting Up Virtual Environments

Python has a package manager called 'pip' which will let you install the packages that you want to use. Sometimes when a package is updated, the new version may not work with older code you have written, so to avoid dependency issues it's a good idea to set up virtual environments to code in, which contain the correct versions of the packages that you need.

Install and use PIP: https://realpython.com/what-is-pip/

Intro to Python virtual environments: https://realpython.com/python-virtual-environments-a-primer/

Manage environments with anaconda: https://uoa-eresearch.github.io/eresearch-cookbook/recipe/2014/11/20/conda/

Python environments with VSCode: https://code.visualstudio.com/docs/python/environments


Basic Linux Commands

Want to feel like Mr. Robot and execute code from the terminal? Here are some basic Linux commands that are useful to know: https://maker.pro/linux/tutorial/basic-linux-commands-for-beginners