Don't miss Arash Taheri, Technical Lead, Groq Compiler, at the 8th Annual Toronto Machine Learning Summit (TMLS). He'll be speaking about modifying PyTorch models to enable custom data-types and persist precision information through Groq Compiler. https://hubs.la/Q02FjrDt0
Groq
Semiconductor Manufacturing
Mountain View, California 68,225 followers
Groq builds the world’s fastest AI inference technology.
About us
Groq is the creator of the LPU™ AI Inference Technology for fast, affordable, and energy efficient AI. With the LPU, we’re unlocking a new class of AI applications and use cases. Try GroqChat or start building on GroqCloud™ Developer Hub today. The LPU and related systems are designed, fabricated, and assembled in North America.
- Website
-
https://groq.com/
External link for Groq
- Industry
- Semiconductor Manufacturing
- Company size
- 51-200 employees
- Headquarters
- Mountain View, California
- Type
- Privately Held
- Founded
- 2016
- Specialties
- ai, ml, artificial intelligence, machine learning, engineering, hiring, compute, innovation, semiconductor, llm, large language model, gen ai, systems solution, generative ai, inference, LPU, and Language Processing Unit
Locations
-
Primary
400 Castro St
Mountain View, California 94041, US
-
Portland, OR 97201, US
Employees at Groq
-
John Carrillo
Hardware Validation Engineer at Groq
-
John Barrus
Product Manager, Entrepreneur - ML, Robotics, Cloud, IoT
-
Mohsen Moazami
Founder and Managing Partner at Seif Capital
-
Ad Boon
Group Chief People Officer & Member of the Executive Board of Pasha Holding LLC & Pasha Management Company (PMC) | Founder of Brave Souls -…
Updates
-
Groq reposted this
Gemma2
-
Groq reposted this
Minister of Communications and Information Technology, Eng. Abdullah bin Amer Al-Swaha, met with the CEO and founder of Groq Inc., Jonathan Ross, during his current visit to the United States of America. During the meeting, they discussed ways to enhance cooperation in advanced technology fields, focusing on developing artificial intelligence (AI) solutions to support innovations in various sectors..... Minister of Communications and Information Technology Meets with CEO and Founder of Groq Inc Minister of Communications and Information Technology, Eng. Abdullah bin Amer Al-Swaha, met with the CEO and founder of Groq Inc., Jonathan Ross, during his current visit to the United States of America. During the meeting, they discussed ways to enhance cooperation in advanced technology fields, focusing on developing artificial intelligence (AI) solutions to support innovations in various sectors..... https://bit.ly/3LckxhL Eng. Abdullah bin Amer Al-Swaha Ministry of Communications and Information Technology of Saudi Arabia Jonathan Ross | Groq #SaudiArabia #Tech #AI #GCC #MiddleEast Via SPA
-
Groq reposted this
My recent interview with Groq seems to be taking off on YouTube. If you're a fan of large language models, this is a must watch, because it seems that GPUs are not the answer if we really want great chatbots at scale. Ones that don't feel like you're talking to a tick-box exercise. Check this out~
How Groq’s LPUs overtake GPUs for the fastest LLM AI processing in large-scale deployments We’ve been wanting to release this ipXperience for a while, and ipXchange is thrilled to finally share this chat with Mark Heaps to explain just what makes Groq’s AI chips so disruptive. Learn what an LPU is, why it’s better than a GPU for deployment at scale, and what lowest-latency large language models enable by watching the full discussion on the ipXchange website: https://lnkd.in/eVkdzSB9 It’ll change the way you think about AI chips, and you can play with this functionality today! Keep designing! #EW24 #EW2024 #AI #LLM #largelanguagemodel #GPU #CPU #processor #chatGPT #electronics #datacentre #datacenter #electronicsengineering #artificialintelligence #disruptivetechnology #genAI #generativeAI
-
See you there!
7 Days to AI Revolution: IMAGINE AI LIVE - IMPACT NYC Countdown Begins! 🗽 ⚡ 🚀 In just one week, Cornell Tech transforms into the epicenter of AI innovation. How? Through the force of our AI-powered Human Village! • From concept to reality in 11 weeks • Announced only 6 weeks ago! • Now, 7 days from igniting the AI world 🔥! This isn't just an event. It's a testament to human ingenuity amplified by AI. Our global village of visionaries has moved mountains: • Cutting-edge sponsors • Innovative partners • World-class speakers • AI pioneers and enthusiasts • Incredible founders Together, we've harnessed AI agents and bleeding-edge tech to create an experience that will redefine the future of AI. 📽Watch our reel! Spot yourself or your company? Reshare and show the world you're part of this AI revolution! Got your ticket? Tell us what you're excited for: • Mind-bending AI workshops? • Networking with the AI elite? • Glimpsing the future of tech? 🎟Haven't secured your spot? There's still time! Visit imagineai.live now! This isn't just an event. It's the launchpad for the next era of AI innovation. Be there as we make history in the heart of NYC! 🗽 ⚡ 🚀 #ImagineAILive #AIRevolution #NYCTech #FutureIsNow #AIVillage #AIforImpact
-
Join Jonathan Ross at VB Transform on July 9. He'll delve into the transformative impact of AI inference on enterprise technology, show a live demo highlighting Groq capabilities & share why by next year, over half of the world's inference computing will run on Groqchips. https://hubs.la/Q02Dr67T0
-
We have a mission to drive the cost of compute to zero. It's an infinite goal that pushes us to always search for efficiencies in our technology stack. We just posted a new paper relating to our LPU, AI Inference Technology and power usage. https://lnkd.in/gGBMq7db
-
🚀🚀🚀 🙏 Artificial Analysis
Fast to launch & very fast output speed! Groq has launched their Gemma 2 9B offering and is serving it at ~600 output tokens/s Gemma 2 9B is worthy alternative to Llama 3 8B and other smaller models. It is particularly attractive for generalist and communication-focused use-cases as shown by its Chatbot Arena (1185) & MMLU (71%) score exceeding Llama 3 8B (1153, 68%). For more specific use-cases it is worth conducting more narrow tests, e.g. for coding Gemma 2 9B well underperforms Llama 3 8B (40% vs. 62% on HumanEval). Groq is offering the model at $0.2 per 1M Input & Output tokens, in-line with Fireworks. Congratulations Groq on the fast-launch and impressive performance. We look forward to benchmarking other providers as they begin to host the Gemma 2 models, including potentially Google itself Analysis of Gemma 2 Instruct (9B): https://lnkd.in/gC6Xnj3a Analysis of providers: https://lnkd.in/gb9S5khK
-
Groq reposted this
Government Solution Architect | E government consultant | Emerging Technology Advisor | Technology Analyst
Just tried out Groq Speech-to-Text, and I am thoroughly impressed! I fed it a 15-minute Arabic podcast, and it transcribed the entire thing in just 5 seconds. The speed and accuracy are truly remarkable. Groq, you never fail to amaze with your cutting-edge technology and innovative solutions. For context, the Groq Whisper Large V3 operates at an astounding ~172x speed factor and costs only $0.03 per hour transcribed. In comparison, OpenAI's service costs $0.006 per minute and $0.36 per hour, and is significantly slower than Groq. This efficiency and cost-effectiveness make Groq's solution a game-changer for anyone working with audio content. #SpeechToText #AI #MachineLearning #Groq #Productivity #Innovation #TechExcellence #CostEfficiency
-
Powered by Groq 🚀
🏆 Introducing the third place winner of the Build Together hackathon: HereToHelp.ai 🏆 HereToHelp.ai, has created an innovative solution to support crisis workers. With real-time transcription, contextual insights, and emotional support, they are revolutionizing how crisis support is provided 🚔. Congratulations to Jeffrey Tan, Rakshith Ramprakash, Sahil Kumar, and Jordan M. for their outstanding work! The platform utilizes advanced AI and real-time analysis technology to provide meaningful support for crisis workers. Powered by: - Vercel: build and deploy web experiences - Groq: world's fastest inference - OpenAI API Read their full story and see how they built this incredible project from the ground up in the comments 👇 https://lnkd.in/gaTEpbif
Hackathon Spotlight: HereToHelp.ai
builder-club.beehiiv.com