Lorenzo Thione’s Post


Public Speaker & Investor in Artificial Intelligence / Broadway Producer / 🏳️🌈 Advocate

Groq’s demo is compelling proof that the chip race for better AI inference is only starting. Generally, there is going to be a lot of really interesting chip-side innovation aimed at putting more computation at the edge. Groq’s chips are designed specifically for running LLMs (they’re “LPUs”, language processing units), and they can run Mixtral 8x7B at almost 500 tokens per second; by comparison, GPT-3.5 Turbo runs at around 100 tokens/s. This, coming from a roughly 200-person company, is a massive feat when compared to huge players like NVIDIA.

And NVIDIA is pushing forward as well: just last week it launched Chat with RTX, a tool that runs an LLM locally on your PC and lets it ingest the data stored on your computer. These are both super cool releases that will help make LLMs more usable and more pervasive.

Between inference at the edge, privacy and locality, and more speed at lower compute cost, the revolution is well underway. I expect we’ll see even more advancement, and likely specialization, from these and other chipmakers in the future, including Apple, which is rumored to have several AI-first chips ready to release. #Gaingels #AI #ArtificialIntelligence #NVIDIA #Groq
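To put those throughput numbers in perspective, here is a quick back-of-the-envelope sketch. The 500 and 100 tokens/s figures come from the post; the 1,000-token response length is an illustrative assumption, not a benchmark.

```python
# Back-of-the-envelope: wall-clock time to generate one response
# at the decode throughputs quoted above. The throughput figures
# are from the post; the response length is a made-up example.

def generation_time(num_tokens: int, tokens_per_second: float) -> float:
    """Seconds to generate num_tokens at a given decode throughput."""
    return num_tokens / tokens_per_second

RESPONSE_TOKENS = 1_000  # hypothetical response length

groq_mixtral = generation_time(RESPONSE_TOKENS, 500)  # Groq LPU, Mixtral 8x7B
gpt35_turbo = generation_time(RESPONSE_TOKENS, 100)   # GPT-3.5 Turbo baseline

print(f"Groq Mixtral 8x7B: {groq_mixtral:.1f}s")  # 2.0s
print(f"GPT-3.5 Turbo:     {gpt35_turbo:.1f}s")   # 10.0s
```

A ~5x speedup at decode time is the difference between a response that streams faster than you can read it and one you visibly wait on, which is why inference-specific silicon matters for interactive use cases.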

📹 Aaron Jones

Co-Founder @ Yepic AI | Edge-Based Generative Video Chatbots


The edge really is the only future we have. Making models more accessible, efficient, and privacy-focused will unlock a tsunami of new use cases, like the App Store did for Apple.

Francesco Cracolici

Posting about tech investments🤑, startup growth hacks 👽(mostly emerging markets) 🌎


For real, how did you learn all this stuff?😍🤣


