Sc2Mol/data at main · zhiruiliao/Sc2Mol

History

Name		Name	Last commit message	Last commit date
parent directory ..
new_scaffold_raw.7z		new_scaffold_raw.7z
readme.md		readme.md
test_raw.7z		test_raw.7z
token_utils.py		token_utils.py
training_raw.7z		training_raw.7z
vocab.txt		vocab.txt

readme.md

Firstly please unzip all *.7z to get *.txt raw data file.

Then run
python token_utils.py --input training_raw.txt --max_len 64 --split 10 --save_path .
python token_utils.py --input test_raw.txt --max_len 64 --split 1 --save_path .
to get preprocessed data.

More details of data can be found at: https://github.com/molecularsets/moses

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

readme.md

Files

data

Directory actions

More options

Directory actions

More options

Latest commit

History

data

Folders and files

parent directory

readme.md