Mountain View, California, United States
Contact Info
9K followers
500+ connections
About
Activity
-
We’ve trained a model, CriticGPT, to catch bugs in GPT-4’s code. We’re starting to integrate such models into our RLHF alignment pipeline to help…
We’ve trained a model, CriticGPT, to catch bugs in GPT-4’s code. We’re starting to integrate such models into our RLHF alignment pipeline to help…
Liked by Sourabh Medapati (csgator)
Experience & Education
Projects
-
Grandmaster-Level Chess Without Search
-
This paper investigates the impact of training at scale for chess. Unlike traditional chess engines that rely on complex heuristics, explicit search, or a combination of both, we train a 270M parameter transformer model with supervised learning on a dataset of 10 million chess games. We annotate each board in the dataset with action-values provided by the powerful Stockfish 16 engine, leading to roughly 15 billion data points. Our largest model reaches a Lichess blitz Elo of 2895 against…
This paper investigates the impact of training at scale for chess. Unlike traditional chess engines that rely on complex heuristics, explicit search, or a combination of both, we train a 270M parameter transformer model with supervised learning on a dataset of 10 million chess games. We annotate each board in the dataset with action-values provided by the powerful Stockfish 16 engine, leading to roughly 15 billion data points. Our largest model reaches a Lichess blitz Elo of 2895 against humans, and successfully solves a series of challenging chess puzzles, without any domain-specific tweaks or explicit search algorithms. We also show that our model outperforms AlphaZero's policy and value networks (without MCTS) and GPT-3.5-turbo-instruct.
Test Scores
-
Graduate Record Examination (GRE)
Score: 329 / 340
-
Test of English as a Foreign language ( TOEFL )
Score: 113 / 120
Languages
-
Marathi
Native or bilingual proficiency
-
Hindi
Native or bilingual proficiency
-
Telugu
Professional working proficiency
-
English
Professional working proficiency
More activity by Sourabh
-
Et voilà Gemma 2! Small models are getting soooo good! Even stronger 2B/9B + our new powerful 27B! Once again, open models by Google DeepMind with…
Et voilà Gemma 2! Small models are getting soooo good! Even stronger 2B/9B + our new powerful 27B! Once again, open models by Google DeepMind with…
Liked by Sourabh Medapati (csgator)
-
昨日開催した「Google for Japan 2024」のキーノートセッションにて、Google DeepMindとその日本での取り組みと、Googleの大規模マルチモーダル基盤モデル Gemini…
昨日開催した「Google for Japan 2024」のキーノートセッションにて、Google DeepMindとその日本での取り組みと、Googleの大規模マルチモーダル基盤モデル Gemini…
Liked by Sourabh Medapati (csgator)
-
ChatGPT crossed 1M users in 5 days. Luma crossed it in 4 days. https://lnkd.in/gs8JHMkT #LumaDreamMachine
ChatGPT crossed 1M users in 5 days. Luma crossed it in 4 days. https://lnkd.in/gs8JHMkT #LumaDreamMachine
Liked by Sourabh Medapati (csgator)
-
Sakana AI is proud to sponsor “LLM Merging Competition: Building LLMs Efficiently through Merging” at #NeurIPS2024 🤗 If you’re excited about pushing…
Sakana AI is proud to sponsor “LLM Merging Competition: Building LLMs Efficiently through Merging” at #NeurIPS2024 🤗 If you’re excited about pushing…
Liked by Sourabh Medapati (csgator)
-
It has been 3 weeks since I left Cruise and started my self-education summer break. First I went to the ML Sys conference in the Bay Area. I had a…
It has been 3 weeks since I left Cruise and started my self-education summer break. First I went to the ML Sys conference in the Bay Area. I had a…
Liked by Sourabh Medapati (csgator)
-
🚨Publication alert🚨 Today we are releasing the next version of the Foundation Model Transparency Index! In addition to our paper, we are publishing…
🚨Publication alert🚨 Today we are releasing the next version of the Foundation Model Transparency Index! In addition to our paper, we are publishing…
Liked by Sourabh Medapati (csgator)
-
🚀 Exciting News! I just submitted my first (ever) paper! Here’s a brief overview of our key findings and their practical benefits: 𝗠𝗮𝗶𝗻…
🚀 Exciting News! I just submitted my first (ever) paper! Here’s a brief overview of our key findings and their practical benefits: 𝗠𝗮𝗶𝗻…
Liked by Sourabh Medapati (csgator)
Other similar profiles
-
Shubham Phal
Research Engineer @ Google DeepMind | Founding Engineer, AI for Education
Connect -
Vihan Jain
Connect -
Nathan Hilliard
Connect -
Yuzhu Dong
Connect -
Zhuo Xu
Research Engineer at Google Deepmind
Connect -
Kurtis Evan David
Connect -
Grace Lam
Connect -
Nikhil Mehta
Connect -
Poorva Rane
Connect -
Yu Mao
Connect
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More