ThePrompt
Posts
The Language of Audio 📻

The Language of Audio 📻

AudioLDM2: AI generation model for all audio "types

Anita Kirkovska
August 11, 2023

Hi folks!👋🏻 This is The Prompt! No fishy AI stories here. Just the catch of the day.

Let's reel in the news

FEATURED

AudioLDM 2: The Language of Audio

You’d think that it would be easy to train an AI model that will generate music, sound effects, and speech all at once.

These audio types are actually quite similar. But they actually have lots of biases, so building a general AI was tough.

At least so far.

AudioLDM 2 is the latest work in that field, that can generalize and create different audio files with just one AI model.

How does it work?

The goal is to have a universal representation of audio.

So, they built a framework that can generalize well between different types of audio; they call it the “Language of Audio” or LOA.

It's like it has a universal key to all sounds, and it uses that key to learn about them on its own, without needing anyone to tell it what each sound means.

Resources

They gave us a list of 350 AI-generated audio files, plus you can try to create your own on HuggingFace.

You can also find some examples on their project page.

🚨 What else is going on

Seven of AI’s top companies met with Biden, and all of them committed to investing in research and safety for their AI models
TikTok now lets you add an “AI-generated” label to your videos and warns that it may take down content if it isn’t labeled properly.
The famous module for generative agents that can simulate believable human behaviors, and adapt to their game environment is now open-source.

📕 Resources

^{[a must read]} Artificial General Intelligence, an Intro
^[useful]Curated list of the best LLMOps tools for developers
^{[great overview]}Why you should use Custom Instructions in ChatGPT

🧰 Tools of the trade

Rubbrband: Detect deformities in AI-generated images
Notion to Chatbot: Turn your notion docs into chatbots
EasySEO: AI writing tool for SEO-optimized blog posts
LocalBot: AI bot for small businesses
PolicyPro: Chat with your company policy
Supabase AI SQL editor: Write SQL without knowing SQL

✍🏼 Prompt of the Day

TOOL

Midjourney

PROMPT

breathtaking landscape shot, [LOCATION] --ar 3:2 --style raw

RESULT

https://twitter.com/chaseleantj/status/1689970489463349248/photo/4

source: https://www.reddit.com/r/midjourney/comments/11bwgl7/midjourney_version_comparison_using_prompt_photo/

Subscribe to keep reading

This content is free, but you must be subscribed to ThePrompt to continue reading.

Already a subscriber?Sign In.Not now