math-lm

Repository for the Math-LM project, an open-source replication of the Minerva model. This repository hosts data and model training code. Evaluation code is hosted in a fork of the lm-evaluation-harness.

A WIP build of the proof-pile-v2 dataset is currently hosted on Huggingface.

Note that because this project contains submodules, you should clone this project with the --recurse-submodules flag or, alternatively, run git submodule update --init --recursive from within the project directory after cloning the project. After running git pull, you should also run git submodule update.

This project contains the following directories

analysis: scaling law analysis of training runs.
gpt-neox: git submodule containing a modified branch of EleutherAI/gpt-neox
proof-pile-v2: scripts for downloading and preprocessing data.
task-finetunes: scripts for fine-tuning models on task-specific datasets, such as MATH or GSM8k.

Name		Name	Last commit message	Last commit date
Latest commit History 184 Commits
analysis/group_0.4		analysis/group_0.4
gpt-neox @ e59c873		gpt-neox @ e59c873
pretraining		pretraining
proof-pile-v2		proof-pile-v2
task-finetunes		task-finetunes
.gitignore		.gitignore
.gitmodules		.gitmodules
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

math-lm

About

Releases

Packages

Languages

License

marco-dossantos/math-lm

Folders and files

Latest commit

History

Repository files navigation

math-lm

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages