RLHFlow
RLHFlow
Code for the Workflow of Reinforcement Learning from Human Feedback (RLHF)
United States of America
Andrew
daia99
AI research needs artificial innovation |
AI Researcher @Aleph-Alpha; prev TCD, @FT-Autonomous
Aleph Alpha
Daniel Han
danielhanchen
Unsloth - 2x faster 70% less VRAM finetuning Llama-3.1, Mistral, Gemma-2, Phi-3
San Francisco
Roger Creus
roger-creus
Research MSc @mila-iqia @montrealrobotics. Deep Reinforcement Learning
Mila Québec Montréal, Québec, Canada.
Jason Cox
jasonacox
Maker, Learner, Engineer, Author, Artist - My Views/Opinions
- Jason's Code
Los Angeles, CA
Ahmed Khalifa
amidos2006
A Lecturer at the University of Malta and A Game Developer/Designer
Gzira, Malta
Chris Bamford
Bam4d
AI Scientist @mistralai. Reinforcement Learning + LLMs + Duct tape Expert
Mistral AI London
Donny Greenberg
dongreenberg
Chief Housekeeper @run-house 🏃‍♀️🏠
Prev. Product Lead @pytorch
Runhouse New York
James Le
khanhnamle1994
Data Journalist 📝 -> Data Scientist -> Machine Learning Researcher 🔍 -> Developer Advocate 🤝
Twelve Labs San Francisco, CA
Kye Gomez
kyegomez
$ pip install swarms
https://github.com/kyegomez/swarms
Join the agent and AI research community:
https://discord.gg/z3P8ahWF
Swarms Palo Alto
Adam
donadigo
I'm a 24-year-old who likes to write code in different languages like Vala, C++, or Python. I also occasionally contribute to @elementary.
Poland