Artificial Intelligence

Google I/O 2024: Gemini AI, Project Astra, Android 15, and More

By Sunita Chhipa

Posted on May 15, 2024

Google Unveil Gemini AI, Project Astra, Android 15, and More

After Google I/O 2024, held at the Shoreline Amphitheater in Mountain View, Google CEO Sundar Pichai highlighted the prominence of artificial intelligence (AI) in their latest innovations. Here are the critical updates unveiled during the two-hour keynote.

Project Astra: A Universal AI Assistant

Google introduced Project Astra, an advanced AI-powered assistant to enhance daily life. Demonstrated through a video filmed in one take, the assistant interacts seamlessly with the environment. The user navigates Google’s London office, engaging in natural conversations with Astra via their camera. Astra accurately identifies the location of the user’s misplaced glasses without prior mention. The video hints at the development of smart glasses with onboard cameras, potentially rivaling Meta’s Ray-Ban smart glasses.

Veo and Imagen 3: New AI Media Creation Engines

Google launched Veo and Imagen 3, two powerful AI-driven media creation tools. Veo competes with OpenAI’s Sora, generating high-quality 1080p videos over a minute long, with an understanding of cinematic concepts like timelapse. Imagen 3, a text-to-image generator, surpasses its predecessor by producing highly detailed, photorealistic images with minimal artifacts, positioning it against OpenAI’s DALLE-3.

Gemini Integration with Android 15

The upcoming release of Android 15 will feature direct integration with Gemini, enabling context-specific interactions. Users can access Gemini as an overlay to ask questions about the app, image, or video currently used. The future of Google Assistant remains uncertain, as it was notably absent from the keynote.

Transformative Updates to Google Search

Google announced significant changes to Search functionality. New features, such as answering complex queries and planning meals or vacations, will be accessible via Search Labs, allowing users to test experimental features. A notable addition is AI Overviews, which will provide AI-generated answers at the top of search results. This feature, tested for a year, will soon be available to millions of users in the US and eventually to over a billion worldwide by year-end.

Enhanced Google Photos with AI

Google Photos is set to become even more brilliant for Google One subscribers in the US. Users can ask complex questions like “Show me the best photo from each national park I’ve visited,” leveraging GPS data and AI to select the best images. Additionally, users can generate captions for social media posts, enhancing the overall experience.

Introducing Gemini 1.5 Flash and Updates to Gemini Pro

Google revealed Gemini 1.5 Flash, a new AI model optimized for speed and efficiency. Positioned between Gemini 1.5 Pro and Gemini 1.5 Nano, Flash caters to developers seeking a cost-effective solution with a long context window of one million tokens. Later this year, Google plans to double Gemini’s context window to two million tokens, enabling the processing of extensive video, audio, code, and text content simultaneously.

These announcements highlight Google’s commitment to integrating AI across its platforms, promising innovative enhancements for users worldwide.