- ThePrompt
- Posts
- The Language of Audio 📻
The Language of Audio 📻
AudioLDM2: AI generation model for all audio "types
Hi folks!👋🏻 This is The Prompt! No fishy AI stories here. Just the catch of the day.
Let's reel in the news
FEATURED
AudioLDM 2: The Language of Audio
You’d think that it would be easy to train an AI model that will generate music, sound effects, and speech all at once.
These audio types are actually quite similar. But they actually have lots of biases, so building a general AI was tough.
At least so far.
AudioLDM 2 is the latest work in that field, that can generalize and create different audio files with just one AI model.
How does it work?
The goal is to have a universal representation of audio.
So, they built a framework that can generalize well between different types of audio; they call it the “Language of Audio” or LOA.
It's like it has a universal key to all sounds, and it uses that key to learn about them on its own, without needing anyone to tell it what each sound means.
Resources
They gave us a list of 350 AI-generated audio files, plus you can try to create your own on HuggingFace.
You can also find some examples on their project page.
🚨 What else is going on
Seven of AI’s top companies met with Biden, and all of them committed to investing in research and safety for their AI models
TikTok now lets you add an “AI-generated” label to your videos and warns that it may take down content if it isn’t labeled properly.
The famous module for generative agents that can simulate believable human behaviors, and adapt to their game environment is now open-source.
📕 Resources
[a must read] Artificial General Intelligence, an Intro
[useful] Curated list of the best LLMOps tools for developers
[great overview] Why you should use Custom Instructions in ChatGPT
🧰 Tools of the trade
Rubbrband: Detect deformities in AI-generated images
Notion to Chatbot: Turn your notion docs into chatbots
EasySEO: AI writing tool for SEO-optimized blog posts
LocalBot: AI bot for small businesses
PolicyPro: Chat with your company policy
Supabase AI SQL editor: Write SQL without knowing SQL
✍🏼 Prompt of the Day
TOOL
Midjourney
PROMPT
breathtaking landscape shot, [LOCATION] --ar 3:2 --style raw
RESULT