Gemini could beat GPT-4o!

+ Google's version of Sora arrives!

Mindstream Banner

PRESENTED BY

On the Canary Island of La Gomera, inhabitants use a unique whistling language called "Silbo Gomero" to communicate across the deep valleys and gorges. The language can carry messages up to distances of 2 miles!

TODAY:

  • Gemini fights back against OpenAI!

  • Google has come out with their own AI video generator.

  • Around the web: Netflix enters live sports, streaming two NFL Christmas Day games this year.

  • Guidde lets you make documentation way faster with AI!

GOOGLE GEMINI

Following OpenAI’s impressive GPT-4o demo, Google has struck back with a new Gemini AI demo.

In the video, a Pixel phone running Gemini AI uses live video and spoken prompts to provide information.

The demo, filmed during the setup for Google I/O yesterday, features the AI accurately identifying a stage being set up for a large event.

When shown letters on a screen, Gemini correctly identifies them as related to Google I/O and offers a brief description of the event.

This natural interaction mimics a conversation between friends, highlighting how advanced these AI models have become.

Here’s what the demo holds:

  • Google’s Gemini AI demo showcases natural, conversational interactions.

  • The AI correctly identifies and describes the setup for Google I/O.

  • The competition between OpenAI’s GPT-4o and Google’s Gemini is intense and promising.

Gemini vs. GPT-4o

Comparing OpenAI’s ChatGPT and Google’s Gemini, both are remarkably impressive.

While it’s difficult to determine which is superior without hands-on testing, the future of AI looks both exciting and a bit daunting based on these demos.

OpenAI suggests that this is just the beginning of AI advancements.

However, it’s impressive to note that Google has faced criticism for precious AI demos that appeared more impressive than they were in reality.

Until we can test Gemini ourselves, we can’t be sure of its true capabilities.

Things are starting to get crazy around here.

Which AI demo got you more excited?

Login or Subscribe to participate in polls.

Vote for live results and see results + opinions from yesterday at the bottom of the email.

AI CHROME EXTENSIONS

We all have that one colleague who keeps asking the same thing over and over again, no matter how many times we explain it to them.

It’s time you let AI do the explaining instead of you. Guidde is an AI-powered tool that helps you explain the most complex tasks in seconds with AI-generated documentation:

  • Turn boring documentation into stunning visual guides

  • Save valuable time by creating video documentation 11x faster

  • Use it to document workflows for your teammates, share insights across your company, train and onboard new hires, and much more

  • Brand new features like video intros and outros, and an improved editor!

Simply click capture on our browser extension and the app will automatically generate step-by-step video guides complete with visuals, voiceover and call to actions.

Guidde is used and trusted by 20,000+ users. And we’re rated 5/5 stars on the Google Chrome store.

On This Day

May 16th 1960 - Theodore Maiman operates the first optical laser, at Hughes Research Laboratories in Malibu, California, a significant breakthrough in the field of optics.

GPT of the Week

SQL Expert

The SQL Expert specializes in the intricate world of SQL and DevOps, offering tailored advice on database structure optimization and the crafting of efficient SQL queries.

HOW TO READ MORE

Here’s why we prefer Shortform to learn more from nonfiction books:

  • Detailed 1-pagers and chapter-by-chapter breakdowns

  • Smart insights and analyses

  • Connections and counter-arguments to ideas from other books

  • Interactive exercises

  • Audio versions and Printable PDFs

For Mindstream readers only - the first 500 people to sign up for the annual subscription get 3 months free!

GOOGLE VEO

Three months after OpenAI's impressive demo of Sora, its text-to-video AI, Google has introduced its own model, Veo.

Unveiled at the I/O developer conference, Veo can create 1080p resolution videos over a minute long in various styles.

Veo stands out with its advanced understanding of natural language.

It can understand cinematic terms like "timelapse" or "aerial shots" and generate videos based on text, image, or video prompts.

Google says Veo produces more consistent and coherent videos, capturing realistic movement for people, animals, and objects.

How long until AI is making movies about AI?

Here’s what you should know about it:

  • Veo can create 1080p videos over a minute long in various cinematic styles.

  • It understands complex prompts, producing realistic and coherent videos.

  • Currently, only select creators have access, with a wider rollout planned for YouTube Shorts.

Google DeepMind CEO Demis Hassabis mentioned that video results can be refined with additional prompts.

Is it better than Sora?

That remains to be seen, although the team is also working on features for storyboarding and longer scenes - which could propel it above OpenAI’s offering.

For now, like Sora, Veo isn't widely available. Google is inviting some filmmakers and creators to experiment with the model to refine its capabilities.

Some features will be available to "select creators" in the coming weeks via a private preview in VideoFX, with a broader rollout to YouTube Shorts planned.

This announcement follows previous models like Phenaki and Imagen Video, which had limitations in video quality.

Veo builds on the progress made with the Lumiere model, showcased in January.

OpenAI, meanwhile, is gearing up to release Sora to the public later this year, with plans to integrate audio and potentially make it available in video editing tools like Adobe Premiere Pro.

It’s the battle of the video generators we didn’t know we needed.

Mindstream Picks

Canadian and US stock markets rose, driven by gains in technology and utility sectors, with the S&P/TSX composite index up at 22,284.76.

A Booz Allen Hamilton report ranks space domain awareness as a key future technology for national security, noting that increased Pentagon interest might not benefit sector startups.

Roger Ver, AKA “Bitcoin Jesus” and charged by the US DOJ for fraud and tax evasion, could reshape crypto regulation, particularly around privacy-focused currencies.

Microsoft and Xbox Cloud Gaming with mouse and keyboard support, launching a beta on xbox.com/play with 26 games including Fortnite and Halo Infinite.

Don’t Miss - Netflix has announced it will stream two NFL games on Christmas Day this year, and at least one game during the same period in 2025 and 2026, marking its first venture into broadcasting live American football. This move is part of a broader strategy to include more live events, tapping into the significant audience that NFL games attract, as demonstrated by a record 123.7 million U.S. viewers for this year’s Super Bowl. The financial details of this three-season deal have not been disclosed, though it is reported that Netflix will pay under $150 million per game.

AI Art

Our Image of the Day

Artwork submitted by Mindstream reader Hollie: “A photorealistic picture of a field of Lemon trees”

Daily Image Prompt

the Declaration of Independence being signed

We get a lot of submissions, but we do look at every single one! So please don’t hesitate to send us your art.

Yesterday’s Poll

“What would you rather have?”

A slice of compute! Leading with - 54%

Cash money! - 46%

Reader’s Opinions

“Compute is already the in high due to Cloud applications, and now with AI needing a lot of compute power this will is newest gold.” - robert.mogakwe

“Most people won't know how to use the Slice of Compute.. So Money is best option.” - justinhampton

Submit your opinions in our daily poll to be featured!

Collage of Authors

That's a wrap on Thursday! Did we make you feel all warm and fuzzy?

❤️ We need your feedback to make our newsletter better.

📣 Refer our newsletter to your friends to make our team happy!

🚀 Advertise in our newsletter to reach 130,000+ founders, engineers, and content creators.

Did We Inspire You Today?

Login or Subscribe to participate in polls.