
The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA

This code builds on our previous project, The Role of Model Architecture and Scale in Predicting Molecular Properties: Insights from Fine-Tuning RoBERTa, BART, and LLaMA.

Requirements

To install requirements:

pip install -r requirements_cuda118.txt

📋 The experiments were run with CUDA 11.8.
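
As an optional sanity check before training, you can verify that your PyTorch build matches the expected CUDA version:

```python
# Optional sanity check: confirm PyTorch sees a GPU and was built against CUDA 11.8.
import torch

print(torch.__version__)          # installed PyTorch version
print(torch.version.cuda)         # CUDA version PyTorch was compiled with, e.g. '11.8'
print(torch.cuda.is_available())  # True if a usable GPU is detected
```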

Dataset

  1. The ./dataset_ft/Abraham***_cleared.csv files are already preprocessed (see the inspection sketch below).
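
A minimal sketch for inspecting one of the preprocessed CSVs with pandas; the filename below is a placeholder, so substitute the actual Abraham***_cleared.csv file you want to load:

```python
# Inspect a preprocessed dataset file. The filename here is hypothetical;
# replace it with one of the actual Abraham***_cleared.csv files.
import pandas as pd

df = pd.read_csv("./dataset_ft/Abraham_example_cleared.csv")  # hypothetical filename
print(df.shape)   # number of rows and columns
print(df.head())  # first few records
```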

Training

To train the model(s) described in the paper, move to the AbraLLaMA main directory and run:

python run_auto_llama.py

Evaluation

To check the model's metrics and loss, move to AbraLLaMA/evaluations:

metric_1 (RMSE), metric_2/loss (MAE)
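
For reference, both metrics can be reproduced with scikit-learn; y_true and y_pred below are placeholders for your targets and model predictions:

```python
# Compute metric_1 (RMSE) and metric_2/loss (MAE) from predictions.
import numpy as np
from sklearn.metrics import mean_squared_error, mean_absolute_error

y_true = np.array([0.1, 0.5, 0.9])  # example targets
y_pred = np.array([0.2, 0.4, 1.0])  # example predictions

rmse = mean_squared_error(y_true, y_pred) ** 0.5  # metric_1
mae = mean_absolute_error(y_true, y_pred)         # metric_2 / loss
print(f"RMSE: {rmse:.4f}  MAE: {mae:.4f}")
```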

Pre-trained Models

We used one of the pretrained ChemLLaMA-MTR models from our previous project:

./model_mtr/ChemLlama_Medium_30m_vloss_val_loss=0.029_ep_epoch=04.ckpt
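
The val_loss/epoch naming suggests a PyTorch Lightning checkpoint. A hedged sketch of loading it, where ChemLlamaMTR stands in for the actual LightningModule class defined in this repository:

```python
# Sketch only: `ChemLlamaMTR` and its import path are hypothetical —
# import the actual LightningModule class used for pretraining.
from chemllama import ChemLlamaMTR  # hypothetical import

ckpt_path = "./model_mtr/ChemLlama_Medium_30m_vloss_val_loss=0.029_ep_epoch=04.ckpt"
model = ChemLlamaMTR.load_from_checkpoint(ckpt_path)
model.eval()  # switch to inference mode
```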

Demo Run

You can also train the AbraLLaMA demo version with Jupyter:

  1. Open run_demo.ipynb
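
For example, launch it from the main directory with:

jupyter notebook run_demo.ipynb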

Contributing

📋 This project is licensed under MIT.

Authors' Note

Please use this code only for social good and positive impact.
