Boson AI is an early-stage startup building large language tools for interaction and entertainment. Our founders, Alex Smola, Mu Li, and a team of Deep Learning, Optimization, NLP, AutoML and Statistics scientists and engineers are working on high quality generative AI models for language and beyond.
We are seeking research scientists and engineers to join our team full-time in our Santa Clara office. As part of your role, you will work on modeling and training LLMs, understanding and interpreting model behavior and aligning models to human values. The ideal candidate will possess a strong background in machine learning, and have motivations for developing state-of-the-art models towards AGI.
Responsibilities
Design and verify novel model architectures and training objectives.
Investigate novel model alignment algorithms.
Write efficient and clean code for ML training.
Conduct large-scale experiments to verify the modeling choices and identify improvement areas.
Experience (great if you have it)
Summarize results and clearly communicate the motivations and observations in your work
Proficiency in at least one deep learning framework, such as PyTorch.
Participation in at least one research project related to LLM or multimodal models, e.g. experience in training or fine-tuning them.
Alignment research
Experience in large-scale distributed model training
Experience in writing GPU kernels in CUDA
Education
PhD or Master's degree with solid scientific contributions
Active GitHub repository
Active scientific track record
Excellent problem-solving skills
Employment type
Full-time
Referrals increase your chances of interviewing at Boson AI by 2x