- Superhuman AI
- Posts
- Gemini goes multimodal
Gemini goes multimodal
ALSO: How to turn text into infographics with AI
Read time: under 4 minutes
While OpenAI was slowly revealing its hand of AI updates this past week, Alphabet kept its lips zipped. On Wednesday, it finally put all its cards on the table, revealing everything from a multimodal Gemini to an agent that can navigate Chrome on its own.
💡 Share your genius: Tell us how you’re using AI to level up your work or play, and you could be featured in our new Community Spotlight—submit your story today.
Today’s Insights
Today in AI: What’s new in iOS 18.2, plus Microsoft AI’s health pivot
Tutorial: How to turn text into infographics with AI
Alphabet unveils Gemini 2, AI agents, and more
Everything else you should know today
5 new AI tools to boost your productivity
TECH STOCKS & FUNDING
▲ Tesla | $424.77 | +5.93% |
▲ Amazon | $230.26 | +2.32% |
▲ Meta | $632.68 | +2.16% |
▲ Alphabet | $196.71 | +5.46% |
▲ Nasdaq | 20,034.89 | +1.77% |
▲ S&P 500 | 6,084.19 | +0.82% |
*Stock data as of market close.
📈 Markets: Amazon, Alphabet, Tesla, and Meta each soared to record highs Wednesday, boosting the tech-heavy Nasdaq above 20,000 points for the first time. Apple and Nvidia also joined in on the green wave ahead of expected interest rate cuts.
💰️ Funding: Three major chipmakers — Nvidia, Intel, and AMD — each backed Ayar Labs, a startup building optical AI chips, in a $155M funding round. SpaceX is now valued at $350B after the Elon Musk-led company announced it would buy back shares from employees. And Singapore-based startup Sapient raised $22M to develop unorthodox AI models that buck the transformer trend.
TODAY IN AI
New Apple Intelligence features are rolling out across Macs, iPads, and iPhones. Source: Apple
1. Apple drops long-awaited AI updates: While the AI-powered Siri is still under construction, iPhone 15 Pro (or newer) users can now use ChatGPT directly on their phones. (That is, unless OpenAI experiences another major outage.)
With iOS 18.2, the iPhone 16 also gets Visual Intelligence, or the ability to see your surroundings through your camera lens
It can do things like translate a menu, add printed phone numbers to your contacts list, or find where you can buy a particular product
New GPT-powered writing tools can now generate content for you across apps
The new Image Playground and Genmoji features give you free rein to design your own art and emojis
2. Microsoft AI head scoops up former colleagues for health push: Mustafa Suleyman is assembling a new crew to develop health-related AI products at Microsoft. And he’s turning to some familiar names, including a neonatal doctor and a surgeon with whom he’d previously worked at DeepMind. The FT reported that the new team will work on a consumer-facing platform that lets users ask questions about their health — something that nearly half of AI users already do with general chatbots.
PRESENTED BY SANA
The AI assistant made for work (that actually works)
Most AI chatbots and assistants are great – but they're not made for work.
You can use them for one-off tasks but can't connect them to your work apps like Google Drive, Zoom, and Notion to automate tasks consistently.
Sana is the all-in-one AI assistant for your work that saves you hours:
Write client emails
Draft reports
Compare invoices
Summarize meetings
Analyze documents and data sets
Try it yourself and see the difference. PS: You can even watch Superhuman’s CEO Zain Kahn take you through how to create an AI assistant with Sana here.
THE AI ACADEMY
How to turn text into infographics with AI
Go to Infografix AI and log in or sign up with a new account.
Click on Get Started and then click on Use AI.
Write in your prompt, select your required template, and wait for it to generate a response.
Sample prompt: Create a historical timeline of the US presidents
You can use it to create data visualizations, mind maps, SWOT analyses, QAs, lists, and more.
You can edit or customize your infographic once created by clicking on the customize tab.
FROM THE FRONTIER
Alphabet’s AI blowout: Gemini 2, agents, and more
Project Astra now has a better understanding of your surroundings. Source: Alphabet
Talk about a difference in strategy. While OpenAI is giving us a steady drip of updates across 12 days, Alphabet just unleashed a firehose of announcements all in one go. Let’s break it all down:
Gemini goes multimodal: Gemini 2.0 Flash beats 1.5 Pro on multiple benchmarks while being twice as fast. It can now easily move between videos, images, and audio. Plus, a new “deep research” mode lets it think through multi-step problems, similar to OpenAI’s o1.
Project Mariner: This new prototype can navigate Chrome for you. For example, you can ask it to find the websites for a list of companies and save them to your bookmarks while you sit back and watch.
Project Astra: Alphabet’s yet-to-be-released “universal AI assistant” can now help with more tasks, like teaching you how to use a washing machine, remembering a door code, or identifying plants. You can even show it a bus and ask where it’s headed.
Everything else: Alphabet’s new Trillium chip delivers 4x the speed as its predecessor while being 67% more efficient. A new coding assistant called Jules can catch bugs and fix errors on its own. And AI Overviews will soon be able to answer complicated math and coding questions.
PRESENTED BY INNOVATING WITH AI
Want to become an AI Consultant?
Innovating with AI just welcomed 200 new students into The AI Consultancy Project, their new program that trains you to build a business as an AI consultant:
Tools, frameworks, and a 6-month plan to build a 6-figure AI consulting business
AI & TECH NEWS
Everything else you need to know today
GM announced it’s closing its Cruise robo-taxi division. Source: Getty
🤝 Compute Collab: Apple is teaming up with semiconductor maker Broadcom to design what would be the company’s first AI server chip, which could go into production by 2026.
🤺 Data Duel: The EU vowed to invest more than $1.5B in seven supercomputer sites across the continent in a bid to compete with US-based AI startups.
🚗 End of the Road: US auto giant GM announced it’s shuttering its Cruise robo-taxi division and pivoting to autonomous personal vehicles — highlighting the daunting economics behind the driverless taxi industry.
PRODUCTIVITY
5 AI Tools to Supercharge Your Productivity
✅ SocialBlaze: An all-in-one platform for social media management, offering seamless posting, analytics tracking, and AI-driven post creation.
✅ Bricks: An AI spreadsheet that does tasks for you using natural language prompts — no formulas or hours of data cleanup needed.
✅ BrandWell*: Turn traffic into leads with TrafficID. Identify the actual people on your website without forms. Try it unlimited for 7 days and boost your ROI.
✅ Quizdom: Easily create, customize, and grade high-quality assessments, quizzes, and tests with AI-powered precision.
✅ Shortcut: Use AI to ask questions, organize ideas, or role-play conversations — all through natural dialogue.
* indicates a promoted tool, if any
PROMPT OF THE DAY
Content Creation for Websites
Prompt: I have a small business focusing on [enter your business details here, plus any additional info like location or services offered]. I am looking to create a website, can you suggest an outline for said website? Please also offer suggestions on optimizing the page for more conversions.
Follow-up prompt: Can you suggest a witty headline for and a short introductory paragraph for the website?
What type of prompt would you like to see for Friday? |
Source: Inspired by Elegant Themes
AI-GENERATED IMAGES
Day to Night
Source: Inspired by @surprised0887 on Midjourney
Midjourney Prompt: Aerial photography of the frozen river in winter, surrounded by white snow and a black sunset sky, with dense forests on both sides. The trees, covered thickly under heavy snow, were lush, and the dark light shone through from behind to illuminate them. In front is an endless, winding valley stream along which they grow. It's like a fairyland in reality. High-definition photography, a super-wide-angle lens, delicate details, and beautiful scenery. I can't believe how beautiful it was. --ar 9:16 --style raw
Acquire new customers and drive revenue by partnering with us
Superhuman is the world’s biggest AI newsletter for businesses and professionals with 800,000+ readers and 1.5 Million followers on socials working at the world’s leading startups and enterprises. Companies like Amazon, Hubspot, and Salesforce feature their products in Superhuman. You can learn more about partnering with us here.
Your opinion matters!
What did you think of today's email?Your feedback helps me create better emails for you! |
Got more feedback or just want to get in touch? Reply to this email and we’ll get back to you.
Thanks for reading.
Until next time!
Zain & the Superhuman AI team