HMS - Harmful Brain Activity | Top 8%

This is Team Epoch's top 8% solution to the HMS - Harmful Brain Activity Classification competition.

A technical report is included in this repository.

Getting started

This section describes the steps needed to set up the project and fully reproduce our best submission on the private leaderboard. The project was developed on Windows 10/11 with Python 3.10.13 and pip 23.2.1.

0. Prerequisites

Models were trained on machines with the following specifications:

CPU: AMD Ryzen Threadripper Pro 3945WX 12-Core Processor / AMD Ryzen 9 7950X 16-Core Processor
GPU: NVIDIA RTX A5000 / NVIDIA RTX Quadro 6000 / NVIDIA RTX A6000
RAM: 96GB / 128GB
OS: Windows 10/11
Python: 3.10.13
Estimated training time: 3-6 hours per model on these machines.

For running inference, a machine with at least 32GB of RAM is recommended. We have not tested inference with less RAM on the full test data provided by DrivenData.
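As a quick sanity check before installing anything, the interpreter version can be compared against the one listed above. This is a minimal sketch, not part of the repository; the helper name `python_version_ok` is hypothetical and it only compares the major and minor version.

```python
import sys

# Major/minor version the project was developed on (3.10.13 per this README).
REQUIRED_PYTHON = (3, 10)


def python_version_ok(version_info=sys.version_info, required=REQUIRED_PYTHON):
    """Return True if the major.minor version matches `required`."""
    return tuple(version_info[:2]) == tuple(required)


def describe_python():
    """Return a short human-readable message about the running interpreter."""
    running = ".".join(str(part) for part in sys.version_info[:3])
    if python_version_ok():
        return f"Python {running} matches the 3.10 series used for development."
    return f"Python {running} differs from the 3.10 series used for development."
```

Calling `print(describe_python())` reports whether the current interpreter is in the same minor series; other versions may still work, but they were not what the project was developed on.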

1. Clone the repository

Make sure to clone the repository with your favourite git client or using the following command:

git clone TODO: UPDATE(...)

2. Install Python 3.10.13

You can download the required Python version here: Python 3.10.13

3. Install the required packages

Install the required packages using the following command. Installing into a virtual environment is recommended; a .venv takes around 7GB of disk space.

pip install -r requirements.txt
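If you prefer to script the virtual environment setup, the standard-library `venv` module can create the environment before the requirements are installed into it. The sketch below is an illustration under assumptions: the helper names (`venv_python`, `pip_install_command`, `create_env_and_install`) are hypothetical, and the environment directory `.venv` is just the conventional name mentioned above.

```python
import subprocess
import sys
import venv
from pathlib import Path


def venv_python(env_dir):
    """Return the path of the Python executable inside a virtual environment."""
    env_dir = Path(env_dir)
    if sys.platform == "win32":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def pip_install_command(env_dir, requirements="requirements.txt"):
    """Build the command that installs the requirements into the environment."""
    return [str(venv_python(env_dir)), "-m", "pip", "install", "-r", requirements]


def create_env_and_install(env_dir=".venv"):
    """Create the environment (with pip) and install requirements.txt into it."""
    venv.create(env_dir, with_pip=True)
    subprocess.run(pip_install_command(env_dir), check=True)
```

Running `create_env_and_install()` from the repository root is roughly equivalent to creating a `.venv` by hand and running the pip command above inside it.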

4. Setup the competition data

TODO: Explanation

5. Main files explanation

train.py: This file is used to train a model. It reads its configuration from conf/train.yaml, which specifies the model configuration to train along with additional training parameters such as test_size and the scorer to use. The model selected in conf/train.yaml can be found in the conf/model folder, where each file stores a complete model configuration (from preprocessing to postprocessing). When training is finished, the model is saved in the tm directory under a hash that depends on the specific preprocessing steps, pretraining steps, and model configuration.
- Command line arguments
- CUDA_VISIBLE_DEVICES: The GPU to use for training. If it is not specified, training uses DataParallel across multiple GPUs. If you have multiple GPUs, you can set it to choose which one to use.
submit.py: This file runs inference on the competition test data using a trained model or an ensemble of trained models. It reads its configuration from conf/submit.yaml, which contains the model or ensemble configuration to use for inference. Model configs can be found in the conf/model folder and ensemble configs in the conf/ensemble folder. Each ensemble config specifies the models (from conf/model) to include in the ensemble and the weight to use for each model.
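To illustrate what per-model ensemble weights mean in practice, here is a minimal sketch of a weighted average over class-probability predictions. It is not the repository's implementation: the function name `weighted_ensemble` is hypothetical, and dividing by the sum of the weights is an assumption about how the weights are applied.

```python
def weighted_ensemble(predictions, weights):
    """Combine per-model class probabilities with a weighted average.

    predictions: one entry per model, each a list of rows of class probabilities.
    weights: one non-negative weight per model.
    """
    if len(predictions) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    n_rows = len(predictions[0])
    n_classes = len(predictions[0][0])
    combined = []
    for r in range(n_rows):
        # Weighted sum over models for every class, normalised by the total weight.
        row = [
            sum(w * model[r][c] for model, w in zip(predictions, weights)) / total
            for c in range(n_classes)
        ]
        combined.append(row)
    return combined
```

For example, combining one row `[0.8, 0.2]` from a model with weight 3 and `[0.4, 0.6]` from a model with weight 1 gives approximately `[0.7, 0.3]`, so the more heavily weighted model dominates the result.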

6. Place the fitted models

(For DrivenData) Any additional supplied trained models or scalers (.pt / .gbdt / .scaler) should be placed in the tm directory. When these models were trained, they were saved under a hash that depends on the specific preprocessing steps, pretraining steps, and model configuration. This ensures that the correct saved model is loaded automatically when running submit.py.
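The idea of naming a saved model after a hash of its configuration can be sketched as follows. This is an illustration only: the actual hashing scheme used by the project is not described here, so the choice of SHA-256 over canonical JSON, and the names `config_hash` and `model_path`, are assumptions.

```python
import hashlib
import json
from pathlib import Path


def config_hash(config):
    """Return a stable hex digest for a JSON-serialisable configuration."""
    # Sorting keys makes the digest independent of dictionary ordering.
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def model_path(config, directory="tm", suffix=".pt"):
    """Build the file path a model with this configuration would be stored at."""
    return Path(directory) / f"{config_hash(config)}{suffix}"
```

Because the digest is derived from the full configuration, changing any preprocessing or model setting produces a different file name, which is what lets the matching model be found automatically at inference time.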

7. Run submit.py

To reproduce our best submission, run submit.py. This loads the already configured submit.yaml file and runs inference on the competition test data. submit.yaml is configured to what we think is our best and most robust solution.

If you get an error that a model path was not found, ensure that the correct trained model is in the tm directory. If you don't have the trained models, you can train them one by one using train.py and the conf/train.yaml file.
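When retraining models one by one on a multi-GPU machine, train.py can be launched with CUDA_VISIBLE_DEVICES set so that only the chosen GPU is visible to the process. The sketch below shows one way to do that from Python; the helper names `train_invocation` and `run_training` are hypothetical, and which model gets trained is still determined by conf/train.yaml.

```python
import os
import subprocess
import sys


def train_invocation(gpu_id=None, script="train.py", base_env=None):
    """Return the command and environment for one training run.

    If gpu_id is None, CUDA_VISIBLE_DEVICES is left untouched, so training
    can use every GPU that is already visible.
    """
    env = dict(os.environ if base_env is None else base_env)
    if gpu_id is not None:
        env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return [sys.executable, script], env


def run_training(gpu_id=None):
    """Run train.py as a subprocess, raising if it exits with an error."""
    cmd, env = train_invocation(gpu_id)
    subprocess.run(cmd, env=env, check=True)
```

For instance, `run_training(gpu_id=0)` would train the model currently selected in conf/train.yaml on the first GPU only.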

Quality Checks

Quality checks are performed using pre-commit hooks. To install these hooks, run:

pre-commit install

To run the pre-commit hooks locally, do:

pre-commit run --all-files

Documentation

Documentation is generated using Sphinx.

To make the documentation, run make html with docs as the working directory. The documentation can then be found in docs/_build/html/index.html.

Here's a short command to make the documentation and open it in the browser:

cd ./docs/;
./make.bat html; start chrome file://$PWD/_build/html/index.html
cd ../
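The command above assumes Windows and Chrome. As a hedged, cross-platform alternative, the built documentation can be opened in the default browser with Python's standard library. The function names below are hypothetical, and the sketch assumes the documentation has already been built into docs/_build/html.

```python
import webbrowser
from pathlib import Path


def docs_index_uri(docs_dir="docs"):
    """Return a file:// URI for the built documentation index page."""
    index = Path(docs_dir) / "_build" / "html" / "index.html"
    # as_uri() requires an absolute path, so resolve it first.
    return index.resolve().as_uri()


def open_docs(docs_dir="docs"):
    """Open the documentation index in the system's default browser."""
    return webbrowser.open(docs_index_uri(docs_dir))
```

Calling `open_docs()` from the repository root opens the same index.html file that the shell command targets.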

