Everybody's talking about GPT-4o and Google's Gemini releases this week... and most folks are focusing on the more "wow factor" features. I personally feel like most of the demos were "smoke and mirrors": mostly cool marketing.
I'm not saying that the new models and their capabilities aren't impressive. They are incredible engineering feats. Training some of these models with all three modalities is certainly very impressive and super complex (I know audio features are complex to handle!).
But these models are far from perfect, full-featured replacements for existing products... What they do is show what's possible, show what's coming, and help set a direction for the general public. Making them free and/or cheap to access and use unlocks possibilities for innovators to try new ideas, iterate quickly, and find product-market fit fast. 📈
The underappreciated problems (and main barriers to AI) remain: bias in, bias out; environmental impact (cost of training, etc.); and reliability.
At Rev, we focus on smaller models that perform their tasks reliably, quickly, and in an unbiased manner. Let's continue to focus on solving real problems in an ethical manner. 💪
Having said this... I am glad to see Google include some of these AI features in their amazing products. I'm a huge fan of Google's products (Gmail, Google Maps, Pixel phones, etc.). It'll be fun to see these features appear over time.
#asr #ai #translation #bias #greentech
Loved this demo. Super innovative product! Giving AssemblyAI and Otter.ai a run for their money... 😁