A new model takes on Midjourney

ALSO: Creating product photo animations with AI

Read time: under 4 minutes

Welcome back, Superhuman

Germany’s Black Forest, the mystical region that inspired the Grimm brothers' stories, is lending its name to a new AI startup. And the company just unveiled its own enigmatic creation — not a fairytale, but a cutting-edge AI model.

Today’s Insights

  • An impressive new image generator launches from stealth

  • Tutorial: Creating product photo animations with AI

  • How AI can help prevent shark attacks

  • Everything else you should know today

  • 5 new AI tools to boost your productivity

  • AI-Generated Images: Crazy frog

NEXT IN AI

A new text-to-image model soars to the top of the charts

Source: Black Forest Labs

An AI startup called Black Forest Labs just emerged from stealth — and immediately leapfrogged over the competition. Its suite of text-to-image models, known as FLUX 1, already outperforms rivals like Midjourney 6.0, DALL-E 3 HD, and Stable Diffusion 3-Ultra. 

It leads across multiple factors, like image detail, scene complexity, and prompt adherence, according to the Elo rating system. And Black Forest says it has all of the aspect ratio and style options you’ve come to expect from today’s image generators, too.

Who’s behind it? Several of the startup's engineers come from Stability AI, the developer behind the influential image generator Stable Diffusion. That company ran into trouble earlier this year when its CEO, Emad Mostaque, was accused of mismanagement and eventually stepped down. 

Black Forest is also backed by a who’s-who of AI industry insiders, like Y Combinator CEO Garry Tan and Timo Aila, a senior researcher at Nvidia — not to mention Andreessen Horowitz, which led the startup’s $31 million seed funding round.

How it's different: Black Forest says it combined several experimental training techniques to make FLUX 1 both faster and more accurate. 

  • For one, it uses rotary positional embeddings — an approach that helps the model keep track of large sequences; each piece of data is assigned its own unique characteristics, which makes it easier for the LLM to tell them apart

  • It also uses a parallel diffusion transformer, which lets the model analyze multiple parts of a sequence simultaneously — speeding up the time it takes to transform visual noise into a cohesive image

What next? Black Forest says it’s planning to drop a state-of-the-art text-to-video model soon. If it’s anything like FLUX 1, other video-focused AI companies (including OpenAI, HeyGen, and Runway) might have their work cut out for them. In the meantime, you can try out FLUX 1 on cloud platforms like Fal and Replicate.

PRESENTED BY GAMMA

How to create beautiful presentations with AI

  • Go to the Gamma website and sign up.

  • Once you’re signed up, you’ll get an option to create slides by pasting text, uploading a file, or giving it a prompt.

  • You can pick any option you like. For this example, I’ll pick the prompting option.

  • On the next screen, pick how many slides you want, and the language, describe the content of the slides, and click generate.

  • You can also use its AI to find or generate images

  • Next, you can choose the design and colors you’d like for the presentation.

  • The app will create the presentation in a few seconds.

  • Edit the contents of the slides to suit your needs.

THE AI ACADEMY

How to create product photo animations with AI

  • Go to Mojo AI’s website and Sign up to get tokens.

  • Upload your product photo or company logo to convert it into animation.

  • Click on ‘Use this Image’, and on the next screen select any animation style from the list.

  • Wait a few minutes and you’ll get your perfect-looking animated logo or product photo.

  • You can use it as an Instagram story, reel, add to your website, or just add some text to make it more appealing and share it with your audience.

Want to master AI and future-proof your skills? Access 100+ courses and tutorials at our AI Academy

AI & THE OCEAN

How AI is helping prevent shark attacks

Would you feel safe swimming in waters monitored solely by an LLM? If so, you might be onto something. Researchers at the University of California, Santa Barbara designed a model that’s been fine-tuned to spot sharks in the water — and early tests suggest it might be more precise than humans. 

Even with the help of aerial drones, people can only detect sharks with about 60% accuracy, according to CNN. The researchers’ model, SharkEye, can pick out most of the sharks humans see, but it’s also capable of spotting ones that elude us. That’s because it can peer deeper into the water, detecting creatures below the surface.

How it works: Like humans, SharkEye uses drones to find sharks in the water along SoCal’s Padaro Beach. If there’s a match, it’ll send out an alert to nearby lifeguards, parents, and business owners. For now, humans are still in control of the notification system. But as soon as next summer, the model could start taking on more responsibilities. 

It could be good for sharks, too: As much as humans might feel threatened by sharks, the truth is that sharks are probably just as afraid of us. Human development and climate change have driven some species closer to shore, but better surveillance can help us find new ways to keep them safe.

THE AI ACADEMY

Make AI Your Superpower

Learn the AI skills that will help you get ahead in your career. Level up your command of the most powerful AI tools and workflows with:

  • 100+ tutorials and workflows

  • New tutorials posted every day

  • Our Top AI tools + cheatsheets for work

  • A global community of professionals using AI

Built by our team of AI experts and practitioners to help you unlock your productivity at work. Readers of Superhuman AI get $100 off with the code SUPERHUMAN100.

AI & TECH NEWS

Everything else you need to know today

ChatGPT’s new voice model. Source: Swipe Insight

  •  Power Hungry: On Meta’s earnings call this week, CEO Mark Zuckerberg said he predicts training Llama 4 will take about 10 times more computational power than Llama 3.

  • Dark Horse: An experimental version of Gemini 1.5 Pro that’s only available to developers just landed at number one on the Lmsys leaderboard — the first time Alphabet has claimed the top spot.

  • Long Time Coming: The European AI Act — the world’s first major AI legislation — officially went into effect Thursday, four years after it was conceived.

  • Karma Chameleon: Reddit has acquired the startup Memorable AI for roughly $40 million. The Massachusetts-based firm uses AI to optimize digital ads for different audiences.

  • Game Changer: New York’s Runway AI unveiled a new text-to-video model called Gen-3 Alpha Turbo that it says is significantly more efficient than its predecessor.

😁 One Fun Thing: Early users are having fun with ChatGPT’s long-awaited voice features. One user asked the model to sing “Happy Birthday” in a croaking frog’s voice — then in the style of a meowing cat, and finally, as a dog. Next, the user asked: “What if the dog was an opera singer?” Watch how ChatGPT responded here.

🧠 Brain Food: Taco Bell announced it’s rolling out AI voices across hundreds of drive-thru locations in 13 states. The chain said the new system will help with order accuracy and could reduce customers’ wait times.

PRODUCTIVITY

5 AI Tools to Supercharge Your Productivity

 Folderr: Create AI for any task, from a helpful chat assistant trained on your data to a powerful workflow automation for your business.

 Rome AI: An AI platform that creates podcasts for any topic you’re interested in by doing research, understanding subtopics, and crafting an episode you can listen to on the go.

 Not Diamond: Call the right model at the right time with the world’s most powerful AI model router.

 Amabay: Build a custom AI agent that can handle queries on your behalf.

 DeepKeys: An AI app that unlocks insights from your devices to enhance mental wellness and boost productivity.

PS: Want more? Check out our Top 100 AI Tools.

* indicates a promoted tool, if any

PROMPT OF THE DAY

Friday Funday - Emoji Party

Prompt: I want you to translate the sentences I wrote into emojis. I will write the sentence, and you will express it with emojis. I just want you to express it with emojis. I don’t want you to reply with anything but emojis. When I need to tell you something in English, I will do it by wrapping it in curly brackets like {like this}. My first sentence is “Hello, what is your profession?”

Source: Beebom

AI-GENERATED IMAGES

Crazy Frog

Source: @_cyberone on Midjourney

Midjourney Prompt: A black and white photobooth film photostrip of a well-dressed frog wearing a Pez and sunglasses. The same well-dressed has different expressions in each photo
--style raw --ar 5:7 --v 6 --personalize asnzs4y

Acquire new customers and drive revenue by partnering with us

Superhuman is the world’s biggest AI newsletter for businesses and professionals with 600,000+ readers working at the world’s leading startups and enterprises. Companies like Amazon, Hubspot, and Salesforce feature their products in Superhuman. You can learn more about partnering with us here.

🧞Your wish is my command

What did you think of today's email?

Your feedback helps me create better emails for you!

Login or Subscribe to participate in polls.

Thanks for reading.

Until next time!

Zain & the Superhuman AI team

p.s. If you liked this newsletter, share it with your friends and colleagues here.