Summary and Setup
Machine learning is a set of techniques that enable computers to use data to improve their performance in a given task. This is similar in concept to how humans learn to make predictions based upon previous experience and knowledge. Machine learning encompasses a wide range of activities, but broadly speaking it can be used to: find trends in a dataset, classify data into groups or categories, make predictions based upon data, and even “learn” how to interact with an environment when provided with goals to achieve.
We will cover the basics of machine learning using the Scikit-Learn Python package, a popular library for machine learning in Python. Scikit-Learn provides a wide range of machine learning algorithms and tools for data preprocessing, model selection, and evaluation.
Participants will be assumed to have a basic understanding of Python. You should have a basic understanding of how to run Python code in a Jupyter Notebook, and be able to write for loops, conditional statements, use functions, and import libraries.
You will need to have python installed on your computer, as well as some additional packages. please refer to the setup instructions below for details.
Setup
Software
You will need a terminal, Python 3.8+, and the ability to create Python virtual environments.
Installing Python
Python is a popular language for scientific computing, and a frequent choice for machine learning as well. To install Python, follow the Beginner’s Guide or head straight to the download page.
Please set up your python environment at least a day in advance of the workshop. If you encounter problems with the installation procedure, ask your workshop organizers via e-mail for assistance so you are ready to go as soon as the workshop begins.
Packages
You will need the Seaborn, MatPlotLib, Pandas, Numpy and OpenCV packages.
Directory Setup
Create a new directory for the workshop, then launch a terminal in it:
Creating a new Virtual Environment
We’ll install the prerequisites in a virtual environment, to prevent them from cluttering up your Python environment and causing conflicts. First, create a new directory and ent
Deactivating/activating environment
To deactivate your virtual environment, simply run
deactivate in your terminal or prompt. If you close the
terminal, Git Bash, or Conda Prompt without deactivating, the
environment will automatically close as the session ends. Later, you can
reactivate the environment using the “Activate environment” instructions
above to continue working. If you want to keep coding in the same
terminal but no longer need this environment, it’s best to explicitly
deactivate it. This ensures that the software installed for this
workshop doesn’t interfere with your default Python setup or other
projects.
Fallback option: cloud environment
If a local installation does not work for you, it is also possible to run this lesson in Google colab. If you open a jupyter notebook there, the required packages are already pre-installed.