A repository containing Jupyter notebooks that introduce students to real-world examples using data science tools to make predictions for key Fusion parameters using real Fusion data from the Mega Amp Spherical Tokamak (MAST).
This project contains a number of Jupyter notebooks designed to introduce students to data science techniques applied to fusion energy research. Using data from the Fair-MAST project, students will learn how to process, analyze, and build predictive models using real experimental fusion data.
The notebooks focus on:
- Data exploration and visualization of fusion parameters
- Predictive modeling of key fusion performance indicators
- Machine learning applications in fusion research
The data used in this project comes from the FAIR-MAST project, which aims to make fusion research data more Findable, Accessible, Interoperable, and Reusable (FAIR). The MAST (Mega Amp Spherical Tokamak) is a fusion energy experiment based at Culham Centre for Fusion Energy in the UK.
- Python 3.11 or higher
- uv - A faster and more reliable Python package installer and resolver
-
Clone the repository
git clone https://github.com/yourusername/data-science-challenges.git
-
Navigate to the project directory
cd data-science-challenges -
Install uv if you don't have it already
pip install uv
-
Create a virtual environment and install dependencies using uv
uv venv uv pip install -e . -
Activate the virtual environment
# On Windows .venv\Scripts\activate # On Unix or MacOS source .venv/bin/activate
If you're using Google Colab to run the notebooks, you can install this package directly:
-
Using UV (Recommended)
!pip install uv !uv pip install git+https://github.com/Simon-McIntosh/data-science-challenges.git
-
Using Pip
!pip install git+https://github.com/Simon-McIntosh/data-science-challenges.git
-
After installation, restart your runtime for all changes to take effect by clicking on the "Runtime" menu and selecting "Restart runtime".
To run the Jupyter notebooks, make sure you've activated your virtual environment, then:
jupyter lab --notebook-dir notebooks/This will open a browser window with the Jupyter interface where you can select and run any of the notebooks.
- MAST Plasma Current - Infer the value plasma current produced by CCFE's Mega Ampere Spherical Tokamak from discrete magnetic diagnostic data.
- MAST Plasma Volume - Infer plasma volume from wide angle camera data.
- MAST Plasma Equilibrium - Infer plasma equilibria from a diverse set of diagnostic data.
Find sample data in the fair_mast_data directory. For more data:
- Visit the Fair-MAST Data Catalog
- Use the provided API to access the complete FAIR-MAST archive
This project is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0).
This means you are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially
Under the following terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made.
- ShareAlike — If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.
See the LICENSE file for more details.
