Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner By abhishek • about 11 hours ago • 3
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • 2 days ago • 13
Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • 2 days ago • 5
Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework By Yescia • 2 days ago
A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 8 days ago • 12
Token Merging for fast LLM inference : Background and first trials with Mistral By samchain • 9 days ago • 1
Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ By Andyrasika • 13 days ago • 4
Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM By Pclanglais • 13 days ago • 9
Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner By abhishek • about 11 hours ago • 3
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • 2 days ago • 13
Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • 2 days ago • 5
Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework By Yescia • 2 days ago
A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • 8 days ago • 12
Token Merging for fast LLM inference : Background and first trials with Mistral By samchain • 9 days ago • 1
Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+ By Andyrasika • 13 days ago • 4
Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM By Pclanglais • 13 days ago • 9