Ezra Sandzer-bell’s Post

Fractional CMO for B2B and B2C Music Tech Startups | Founder at AudioCipher Technologies | Marketing at Audio Design Desk

2mo

Last week Suno teased a new audio-to-audio feature, but they're not alone in this space. 👀 You probably saw the demo - it shows a person tapping on a watering can and turning it into a heavy psych rock track. (It's on their Twitter account here if you missed it: https://lnkd.in/gYMcJfYZ) Some people mocked the demo, pointing out that it's just BPM-matching. I have a feeling that the actual feature will go deeper, based on what other companies like SoundGen, CassetteAI and Stability AI previously brought to market. Before I dive into competitors and alternatives, let me say this. When I saw Suno's announcement, I nearly fell out of my chair. If they've got a multimodal system that combines audio input with text prompts, it's probably going to blow everyone else out of the water. 🤺 But they have at least one serious competitor to look out for. Back in November 2023, Google DeepMind shared a screenshot of an interface for their unreleased Lyria model. Lyria's design piggybacked on claims from the MusicLM team (January 2023), saying they could combine melodic conditions with text to generate song arrangements. That never materialized. When they rebranded as MusicFX, audio-to-audio was still missing. Even this month, with the big AI music reveal at Google's I/O event, we saw combinatory music seeds but no audio-to-audio. 🙄 So what the heck, Google? You going to release this thing or not? Anyway, pulling back to look at the big picture, there are several subcategories under the umbrella of "audio to audio" and only a few of them turn melodies into songs. In this new AudioCipher Technologies article, I've mapped out some subcategories and identified the big players. Here's a high level summary: 🎶 Melodic conditioning: Humming, whistling, or performing a solo melody on an instrument and turning it into a complete arrangement 💿 Music samples into songs: Turning multi-instrumental music arrangements into new music clips and extending them to generate new song sections. 🎙 Voice cloning for singers: Transferring an audio recording of a singing voice into another vocalist's style and timbre. 🎸 👉 🎷 Tone transfer: Using AI models that were trained on an instrument to turn user input, like a guitar performance, into new instruments like a violin or saxophone. 😚 👉 🎸🥁 Style transfer: Changing the style of audio and music inputs. It combines melody conditioning, tone transfer, and remixing in a single function. 🎹 Audio-to-midi-to-audio: A more precise approach to tone transfer. ML is used in the first step, but standard virtual instruments are used during the MIDI-to-audio step. Check out the article below for a complete overview, and some video demos of how these tools work. I'll follow up with another report when Suno and Google roll out their models to the public. #ai #aimusic #suno #audio #music https://lnkd.in/gfnPiC5G

Audio to Audio AI: Melody-to-Song, Style Transfer & More

audiocipher.com

31 Comments

Georg Zoeller

2mo

All the same things you can do with images and video you can do with audio. But many of these features would trigger an immediate boss battle with the music industry which is why nobody wants to be first. Not a technical hurdle at all

1 Reaction

Kord Taylor

2mo

👏 Once again another fine breakdown. "Audio-to-midi-to-audio: A more precise approach to tone transfer. ML is used in the first step, but standard virtual instruments are used during the MIDI-to-audio step." Thanks so much for calling this out. I have to say that some of the "AI" demos of "Tone transfer" really feel like the kind of demos we did at Opcode in the 90's with Studio Vision audio-to-midi and Steinberg Media Technologies audio-to-midi demos from a few years ago (2007 or ?). Funny enough but both did a sax to a MIDI instrument. People are amazed. I know you wouldn't be 😀

2 Reactions

Diamond Duggal

Music Producer | AI Music Consultant | Founder at DesiRock | Founder at SUPRODA

2mo

Great article..! Audio to audio is definitely the missing link in original AI music creation for professionals especially on Suno and Udio and trying to force the melody is currently a frustrating process of creating many versions and multiple edits...

3 Reactions

Drew Thurlow

Entertainment Executive | Music Tech & AI | Streaming & DSPs | Artist & Label Relations | Recorded Music & Publishing

2mo

Interestingly, Mikey at Suno told me their actual musical training data is multimodal

1 Reaction

Christopher Wieduwilt (The AI Musicpreneur)

Helping music creators grow with AI (aimusicpreneur.com)

2mo

Great insights as always Ezra! Here's me waiting patiently for that Suno release:

1 Reaction

Bence Csernak

AI Builder and Engineer • AI Adoption and Integration Specialist • Strategic Product Designer • User Experience

2mo

Adrian Tineo, Ph.D.

2 Reactions

See more comments

To view or add a comment, sign in

More Relevant Posts

Leon Furze

Guiding educators through the practical and ethical implications of GenAI. Consultant & Author | PhD Candidate | Director @ Young Change Agents & Reframing Autism
10mo
Report this post
In the second post in this series I'm exploring audio generation through ElevenLabs, MusicLM, and Stable Audio. Voice, music, and sound effect generation is now easy and (mostly) cheap, but comes with some ethical and privacy concerns. #ai #aieducation

Hands on with AI audio generation: GAI voice, music, and sound effects

http://leonfurze.com

5 Comments
Like Comment
To view or add a comment, sign in
Dr. Antonio Ali Di Fenza

Alignment Researcher: AI, Art, Fitness 👁️🌻🤸🏻
1y
Report this post
🎵 Ever felt challenged by generating music audio using text-to-audio techniques? You're not alone! 📖 Hiromu Yakura and Masataka Goto tackle this in their innovative paper, "IteraTTA: An Interface for Exploring Both Text Prompts and Audio Priors in Generating Music with Text-to-Audio Models". 💡 They introduce IteraTTA, a user-friendly interface that takes the mystery out of music generation. It provides computational guidance, enabling you to construct initial prompts and iteratively refine the generated audio. And the secret ingredient? A novel technique of audio priors! 🎧 🔄 The result? A dual-sided iterative exploration of text prompts and audio priors. Now you can turn your text prompts into beautiful music and continually refine the sound by exploring different audio priors and adjusting your prompts. It's like having a music studio at your fingertips! 🎚️🎹 🔍 Curious to learn more about this groundbreaking approach to music generation? Dive into the full paper. Link in the comments! 📲 #MusicGeneration #AI #TextToAudio #IteraTTA

IteraTTA: An interface for exploring both text prompts and audio priors in generating music with text-to-audio models

arxiv.org

1 Comment
Like Comment
To view or add a comment, sign in
Stig Larsson
6mo
Report this post
Will AI replace composers and artists? For some time I have been reading and thinking about how AI will affect music creation and it is clear that there are many aspects to analyze. One use of AI is to create new music for specific artists based on old material. Another is that many tools that producers use today are using AI to speed up processes, for example for mixing and mastering, and to create new sounds. Also, new music can be created through generative AI. So is there a future for human composers and songwriters? https://lnkd.in/dm2fvA7Y

AI and music

andthensoclear.com

3 Comments
Like Comment
To view or add a comment, sign in
Stephan Theron

AI Engineer, Business Analyst, iGaming Specialist, Marketing & Strategic Advisor, Martech Solutions, Author
10mo
Report this post
Stability AI's new text-to-audio tool is like a Midjourney for music samples | TechRadar: Stability AI's new text-to-audio tool is like a Midjourney for music samples ... Stability AI is taking its generative AI tech into the world of music ...

Stability AI's new text-to-audio tool is like a Midjourney for music samples

techradar.com
Like Comment
To view or add a comment, sign in
John Porter

Project Manager at Tundra Federal | Program and Project Management
1mo
Report this post
Exciting development in AI-generated music! Stable Audio Open, a generative model from Stability AI, has been trained using ~486,000 samples from free music libraries FreeSound and the Free Music Archive. This innovative model takes a text description, like "Rock beat played in a treated studio, session drumming on an acoustic kit," and can produce a recording up to 47 seconds in length. 🎵🤖 #AI #MusicGeneration #Innovation

Stability AI releases a sound generator | TechCrunch

https://techcrunch.com
Like Comment
To view or add a comment, sign in
Zoe Scaman

Founder and Keynote Speaker at Bodacious - a strategy studio.
6mo Edited
Report this post
A brilliant overview of how different music artists are leveraging AI - from assistance with lyrics, to isolating stems, to translating music to imagery, to ever changing songs that adjust to different contexts. Mind blowing.

✘ Musicians harnessing AI, while doing no harm

musicx.substack.com

3 Comments
Like Comment
To view or add a comment, sign in
Mohammad Waris

The founder and the driving force behind the platform FUTURE TREND FLOW.
6mo
Report this post
Have you ever imagined an AI that can not only create music but also write lyrics, generate sound effects, and revolutionize media creation? Suno.AI is here to make your imagination a reality. Let’s explore the capabilities of Suno.AI and its impact on the music and media landscape.

Suno AI: Revolutionizing Music Creation and More

https://futuretrendflow.in
Like Comment
To view or add a comment, sign in
Chance Neihouse

In Development
6mo
Report this post
MU-LLaMA builds on Meta’s LLaMA model and specializes in understanding music, generating subtitles for music files, and answering music-related questions. M2UGen excels in music captioning, outperforming other models in quality and accuracy. These advancements could have significant implications for AI-driven music production and understanding, enhancing user experience in platforms focused on music technology. Check out the full article for more insights: MarkTechPost Article #AI #MusicTechnology #MU-LLaMA #M2UGen

Can a Single Model Revolutionize Music Understanding and Generation? This Paper Introduces the Groundbreaking MU-LLaMA and M2UGen Models

https://www.marktechpost.com
Like Comment
To view or add a comment, sign in
Codefact

35 followers
4mo
Report this post
The landscape of music creation is undergoing a transformative shift with the advent of sophisticated artificial intelligence (AI) algorithms. Far from the dystopian visions of creativity’s demise, we’re witnessing an era where AI not only augments human creativity but also democratizes music production, blurring the lines between professional musicians and amateurs. The essence of this transformation lies not in the displacement of human creativity but in its amplification and diversification, as pointed out by an article on The Economist.

The Harmonious Blend of AI and Artistry in Music's New Era

codefact.xyz
Like Comment
To view or add a comment, sign in
Fondo

5,205 followers
4mo
Report this post
🚀 Soundry AI launched! 🔊 Generative AI for musicians 🤖 "AI for Musicians, by Musicians." 🌐 www.soundry.ai ✅ Your new best friend in the music creation process: Text-to-sound AI generator for musicians ▶ personalized generations & training data hand-picked by your favorite artists — all that a user needs to do is click a button to get accurate stylized sounds 🖱 📊 Creates musical building blocks perfect for reducing friction in the music creation process: Proprietary foundational latent diffusion transformer offers state-of-the-art audio quality 🌟 🎶 For professional & amateurs alike: Music producers build songs with samples that they've hand-crafted using Soundry's AI, and sound designers can incorporate generated sound effects into film, TV, and video games 💯 Better than sample libraries, Faster than sound design, Unlimited variations, Super easy to use, Glossary for inspiration, & Completely unique results. Try the sample generator here! ➡ https://soundry.ai/ Congrats on the launch Mark Buckler, PhD, Justin Parus, & Diandre Ruiz!! https://lnkd.in/g3d9cur8

The Journal by Fondo | 🔊 Soundry AI launches generative AI for musicians

tryfondo.com
Like Comment
To view or add a comment, sign in

3,060 followers

125 Posts

View Profile Follow

Ezra Sandzer-bell’s Post

More Relevant Posts

Explore topics