RLHFlow
Code for the Workflow of Reinforcement Learning from Human Feedback (RLHF)
United States of America
Andrew
daia99
AI research needs artificial innovation (AI) |
AI Researcher @Aleph-Alpha; prev TCD, @FT-Autonomous
Aleph Alpha
Daniel Han
danielhanchen
Unsloth - 2x faster 70% less VRAM finetuning Llama-3.1, Mistral, Gemma-2, Phi-3
San Francisco
Roger Creus
roger-creus
Research MSc @mila-iqia @montrealrobotics. Deep Reinforcement Learning
Mila Québec Montréal, Québec, Canada.
Jason Cox
jasonacox
Maker, Learner, Engineer, Author, Artist - My Views/Opinions
- Jason's code
Los Angeles, CA
Chris Bamford
Bam4d
AI Scientist @mistralai. Reinforcement Learning + LLMs + Duct tape Expert
Mistral AI London
Donny Greenberg
dongreenberg
Chief Housekeeper @run-house 🏃‍♀️🏠
Prev. Product Lead @pytorch
Runhouse New York
James Le
khanhnamle1994
Data Journalist 📝 -> Data Scientist 📊 -> Machine Learning Researcher 🔍 -> Developer Advocate 🤝
Twelve Labs San Francisco, CA
Kye Gomez
kyegomez
$ pip install swarms
https://github.com/kyegomez/swarms
Join the agent and AI research community:
https://discord.gg/z3P8ahWF
Swarms Palo Alto