Prasanna Krishnamoorthy’s Post

Managing Partner- Upekkha (AI fund and accelerator) Download EY-Upekkha Report - upekkha.io/ey-upekkha-report

A common and obvious assumption about generative models (including LLMs) is that they can only be as good as the data they're trained on; they can't "do better" than that. Can they? This interesting new paper, "Transcendence: Generative Models Can Outperform The Experts That Train Them", shows that these models can sometimes perform better than any of the experts who trained them! While this was demonstrated in the context of chess, there is no obvious reason it couldn't hold in other domains as well. Personally, I find this especially useful when I munge together two frameworks from different areas to apply to a problem I have. Doing that myself would be challenging, but having the LLM transcend my abilities to do it gives me a great starting point. I like the term "Transcendence" as well - good marketing :)
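The paper's explanation, roughly, is that low-temperature sampling lets the model act like a majority vote over the many imperfect experts it was trained on, denoising their individual mistakes. Here is a toy simulation (not the paper's code; the expert count, error rate and move labels are invented for illustration) of why that kind of pooling can beat any single expert:

import random
from collections import Counter

random.seed(0)

N_EXPERTS = 11        # hypothetical pool of imperfect experts
P_CORRECT = 0.6       # each expert picks the best move with 60% probability
WRONG_MOVES = ["b", "c", "d", "e"]  # errors are spread across many bad moves
TRIALS = 10_000

def expert_move():
    # One noisy expert: usually right, and its errors are uncorrelated with the others'.
    if random.random() < P_CORRECT:
        return "a"            # "a" stands in for the best move
    return random.choice(WRONG_MOVES)

single_correct = 0
ensemble_correct = 0
for _ in range(TRIALS):
    votes = [expert_move() for _ in range(N_EXPERTS)]
    single_correct += votes[0] == "a"    # accuracy of any one expert alone
    # Low-temperature sampling over the pooled distribution behaves roughly like
    # taking the most common choice (argmax), which cancels out scattered errors.
    ensemble_correct += Counter(votes).most_common(1)[0][0] == "a"

print(f"single expert accuracy:    {single_correct / TRIALS:.3f}")    # ~0.60
print(f"low-temp 'model' accuracy: {ensemble_correct / TRIALS:.3f}")  # noticeably higher

Because each expert's mistakes land on different wrong moves, the pooled distribution concentrates on the correct move, which is the intuition behind the "transcendence" result.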

Prasanna Krishnamoorthy

1mo

Paper link: https://arxiv.org/pdf/2406.11741 (hat tip to Ethan Mollick on Twitter for reviewing this)

Amandeep Singh Minhas

Customer-obsessed Global Delivery | 1x SaaSpreneur | PreSales Solutions | Implementation & Onboarding | Customer Success | AI Enthusiast

1mo

The fundamental fact about LLMs that most are unaware of is that LLMs are basically n-gram models: their job is to statistically predict the nth word as output, given the previous n-1 words as input. There is no real visibility into what exactly has gone into training the LLMs; no one knows what exactly the LLM knows. So what most think of as emergence is actually recall. And yes, the data used to train and fine-tune is the lifeblood of an LLM. No LLM can perform better than what is possible based on the input data.
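For readers unfamiliar with that framing, here is a minimal count-based sketch of the next-word-prediction objective described above. The corpus and names are illustrative only; real LLMs learn neural representations rather than literal count tables, but the training objective (predict the next token from the previous ones) is the same.

from collections import Counter, defaultdict

# Build a tiny bigram table: for each word, count which words follow it.
corpus = "the model predicts the next word given the previous words".split()

bigrams = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def predict(prev_word):
    # Return the statistically most likely next word seen after prev_word.
    return bigrams[prev_word].most_common(1)[0][0]

print(predict("the"))   # -> 'model' (first of the equally common continuations)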


The only challenge with the chess analogy is that it's easy to measure performance in chess. I continue to believe that domain-specific evaluation datasets, driven by hand-written test cases, for comparing models (or tools built on top of them) are a big open opportunity. MMLU and other typical benchmarks don't cut it. Third-party, trusted evals and benchmarks will be the new G2.
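A minimal sketch of the kind of domain-specific eval described above, assuming nothing beyond hand-written test cases and a model exposed as a prompt-to-answer function (all names and cases below are hypothetical):

from typing import Callable

TEST_CASES = [
    # (prompt, check the answer must satisfy) - written by hand by a domain expert
    ("Convert 2 hours 30 minutes to minutes.", lambda ans: "150" in ans),
    ("What is the capital of France?",         lambda ans: "paris" in ans.lower()),
]

def evaluate(model: Callable[[str], str]) -> float:
    """Return the fraction of hand-written cases the model passes."""
    passed = sum(check(model(prompt)) for prompt, check in TEST_CASES)
    return passed / len(TEST_CASES)

def dummy_model(prompt):
    # Stand-in for a real model or tool call behind the same interface.
    return "Paris has 150 minutes"

print(f"pass rate: {evaluate(dummy_model):.0%}")

The same harness can be pointed at any model or tool, which is what would make a trusted third party running such suites valuable.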
