Director of AI Startups @ Microsoft for Startups | LinkedIn Top AI Voice | 3x Top 10 Women in AI Award Recipient | Keynote Speaker | Startup Advisor | Responsible AI Advocate | EB1A “Einstein Visa” Recipient | x-IBM
There's now an open source solution to slash LLM costs!

🔍 𝐓𝐡𝐞 𝐓𝐋/𝐃𝐑:
- LMSYS launched an open source framework, "RouteLLM".
- It uses data from Chatbot Arena, plus advanced data augmentation techniques, to learn how to route queries to the most appropriate model.
- It routes intelligently based on query complexity and model capabilities.

📈 𝐈𝐦𝐩𝐫𝐞𝐬𝐬𝐢𝐯𝐞 𝐑𝐞𝐬𝐮𝐥𝐭𝐬:
- The team reduced costs by up to 85% while maintaining 95% of GPT-4's performance level.
- The routers were robust and could handle new model pairs, like Claude 3 Opus & Llama 3 8B, without retraining.
- This performance was on par with commercial products like Martian and Unify AI, but at 40% lower cost.

🤔 𝐄𝐱𝐜𝐢𝐭𝐞𝐝 𝐚𝐛𝐨𝐮𝐭 𝐜𝐨𝐬𝐭 𝐞𝐟𝐟𝐢𝐜𝐢𝐞𝐧𝐭 𝐀𝐈? 𝐋𝐞𝐭 𝐦𝐞 𝐤𝐧𝐨𝐰 𝐢𝐟 𝐲𝐨𝐮'𝐯𝐞 𝐜𝐨𝐦𝐞 𝐚𝐜𝐫𝐨𝐬𝐬 𝐚𝐧𝐲 𝐢𝐧𝐧𝐨𝐯𝐚𝐭𝐢𝐯𝐞 𝐬𝐨𝐥𝐮𝐭𝐢𝐨𝐧𝐬 𝐨𝐫 𝐤𝐧𝐨𝐰 𝐬𝐨𝐦𝐞𝐨𝐧𝐞 𝐬𝐨𝐥𝐯𝐢𝐧𝐠 𝐫𝐞𝐚𝐥 𝐩𝐫𝐨𝐛𝐥𝐞𝐦𝐬 𝐢𝐧 𝐭𝐡𝐢𝐬 𝐬𝐩𝐚𝐜𝐞!

Link to blog and white paper 👇
--------
🔔 If you like this, please repost it and share it with anyone who should know this ♻️ and follow me Heena Purohit, for more AI insights and trends.

#artificialintelligence #generativeAI #startups #enterpriseAI #AIforBusiness
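To make the routing idea concrete, here's a minimal toy sketch of the concept (not RouteLLM's actual API or learned router): score each query's "complexity" with a crude heuristic, then send easy queries to a cheap model and hard ones to a strong one. The model names, threshold, and scoring rules here are all illustrative placeholders; RouteLLM itself learns this decision from Chatbot Arena preference data.

```python
def complexity_score(query: str) -> float:
    """Crude stand-in for a learned router: longer queries and
    reasoning-heavy keywords push the score toward 1.0."""
    score = min(len(query.split()) / 50, 1.0)  # length signal, capped at 1.0
    if any(kw in query.lower() for kw in ("prove", "derive", "analyze")):
        score = max(score, 0.8)                # reasoning-keyword signal
    return score

def route(query: str, threshold: float = 0.5) -> str:
    """Pick a model tier: strong model only when the query looks hard."""
    return "strong-model" if complexity_score(query) >= threshold else "cheap-model"

print(route("What's the capital of France?"))                    # cheap-model
print(route("Prove that the sum of two even numbers is even."))  # strong-model
```

In practice the scoring function is a trained classifier rather than a keyword heuristic, and the threshold is how you trade cost against quality: lower it and more traffic goes to the strong (expensive) model.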
This is amazing. Now, we don't have to choose/know which model to use for what use case. We have an AI to choose which AI to use.
It's an interesting result. However, as other comments have mentioned:
- What test/benchmark was used?
- Is it safe to assume the tests used the public versions of each model?
- In a private environment, the opex costs of hosting all the models might be prohibitive.
- This approach doesn't address the data quality challenges that drive hallucinations.

A positive step, but many questions remain.
Heena Purohit - very cool! We at Sync Computing are in this space, helping enterprises achieve more cost-efficient use of Enterprise AI via Databricks compute optimization. We've demonstrated 60% cost savings for large enterprises!
There’s a saying in software: “Make it work, then make it work better.” RouteLLM can help optimize costs “better” because sometimes you don’t need the best model for a given task, just a model that's good enough.
Eventually this spread is going to become tighter and tighter until cost and performance are almost singular, and what matters is the proprietary data backing individual models.
Where are the proofs?
Out of date.
Great share, but there’s a big question: RouteLLM's results were measured against standard benchmarks, yet the most performant model for real-world scenarios emerges from much trial and error. I wonder how LMSYS solves for routing based on user preference versus a standardized metric.
To scale, we need a lot more cost optimization with open source tools like this.
Link to blog post: https://lmsys.org/blog/2024-07-01-routellm/
White paper for more info: https://arxiv.org/abs/2406.18665

Note: The idea of LLM routing isn't new. However, earlier routing solutions were based on *task-specific* routing (the concept that different models are better at different tasks); what's novel here is routing on preference data and query difficulty.