Skip to content

Latest commit

 

History

History
8 lines (5 loc) · 605 Bytes

README.md

File metadata and controls

8 lines (5 loc) · 605 Bytes

BLOOM Config Files

This folder contains example configuration files to easily and quickly reproduce the processing flow of the ROOTS dataset, created by the BigScience initiative to train the BLOOM models.

Oscar

The raw data files can be downloaded as described in BLOOM/Oscar. Then use bloom-oscar.yaml to perform the whole processing.

An analysis of our reproduction will be published soon.