Generative AI Publication

Generative AI Publication

Share this post

Generative AI Publication
Generative AI Publication
Google IO 2024: 5 Biggest AI Announcements

Google IO 2024: 5 Biggest AI Announcements

Google just unveiled a slew of new products and impressive AI updates in Google IO 2024. Here are my top 5 picks.

Jim Clyde Monge's avatar
Jim Clyde Monge
May 15, 2024
∙ Paid

Share this post

Generative AI Publication
Generative AI Publication
Google IO 2024: 5 Biggest AI Announcements
1
Share

OpenAI vs Google is the biggest beef I’ve seen by far in the AI space.

Just one day after OpenAI unveiled its highly advanced and impressive GPT-4o model, Google fired back with several huge updates to Gemini and brand new AI products at the Google IO 2024 conference.

Honestly, the nearly 2-hour event was a lot to take in. While Google packed in numerous new and updated features, here are my top five picks that stood out:

1. Project Astra

2. Imagen 3 (Text-to-image)

3. Veo (Text-to-video)

4. Gemini in Google Search

5. Gemini in Google Photos

Let’s dive into each of these exciting developments.


1. Project Astra (Gemini Live)

Demis Hassabis, head of Google DeepMind, showed off a very early version of Project Astra, a real-time, multimodal AI assistant aimed at becoming a universal assistant.

This is arguably the most intriguing new product Google unveiled, directly competing with OpenAI’s real-time voice assistant powered by GPT-4o.

Google IO 2024 Project Astra (Gemini Live)
Image from Google IO 2024

According to Google, public access to Astra will come through the Gemini app later this year. The vision is to evolve beyond chatbots to AI agents that know everything about you and can work 24/7. Bots that don’t just talk with you but actually accomplish stuff on your behalf.

If this lives up to the hype, I would be very happy to use it on a daily basis.

Generative AI Publication is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.

2. Imagen 3 (Text-to-image)

It looks like Midjourney now has a strong competitor. The initial results shown at the demo look very promising. I mean just take a look at this example.

Prompt: Three women stand together laughing, with one woman slightly out of focus in the foreground. The sun is setting behind the women, creating a lens flare and a warm glow that highlights their hair and creates a bokeh effect in the background. The photography style is candid and captures a genuine moment of connection and happiness between friends. The warm light of the golden hour lends a nostalgic and intimate feel to the image.

Prompt: Three women stand together laughing, with one woman slightly out of focus in the foreground. The sun is setting behind the women, creating a lens flare and a warm glow that highlights their hair and creates a bokeh effect in the background. The photography style is candid and captures a genuine moment of connection and happiness between friends. The warm light of the golden hour lends a nostalgic and intimate feel to the image.
Image from Google IO 2024

It looks so photorealistic. Aside from better quality, Google also improved the model to produce better interpretation and better text generation. Here’s an example:

Prompt: Word “light” made from rainbow feathers, black background

Image from Google IO 2024. Word “light” made from rainbow feathers, black background
Image from Google IO 2024

It’s funny that they had to put “unedited raw output” below the image since Google has been widely criticized for faking demo images and videos.

3. Veo (Text-to-video)

Keep reading with a 7-day free trial

Subscribe to Generative AI Publication to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 Jim Clyde Monge
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share