Issue #1: Google and GenAI
Google introduces live image generation to Imagen 2; Gemini 1.5 Pro is now available in 180+ countries
Welcome to Issue #1 of One Minute AI, your daily AI news companion. This issue will cover two of Google’s latest updates in the AI world.
Google introduces live image generation to Imagen 2
At Google Cloud Next ’24, Google announced support for “live images” in Imagen 2, its image generation tool.
With this update, Imagen 2 can create four-second videos from text prompts, similar to GenAI tools such as Runway and Pika. Right now, live images are limited to a resolution of 360 pixels by 640 pixels; however, Google pledges to improve this in the future.
Gemini 1.5 Pro is now available in 180+ countries
Google made Gemini 1.5 Pro available in 180+ countries via the Gemini API in public preview and added two new features: native audio (speech) understanding capability and a new File API to simplify handling files.
With the new audio (speech) understanding capability, Gemini can now reason across both image (frames) and audio (speech) for videos uploaded in Google AI Studio (API support coming soon).
The File API lets you store up to 20GB of files (images, text, videos, etc.) per project (with a max file size of 2GB) for up to 48 hours to use in your prompts.
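To make those limits concrete, here is a minimal Python sketch that checks whether a batch of files would fit within the quotas quoted above. It does not call the Gemini API; the function name `fits_file_api_limits` is hypothetical, and treating 1GB as 10^9 bytes is an assumption, since the announcement does not specify the unit.

```python
# Limits as quoted in this issue. Treating 1GB as 10**9 bytes is an assumption.
GB = 10**9
MAX_FILE_BYTES = 2 * GB       # max size of a single file
MAX_PROJECT_BYTES = 20 * GB   # max total storage per project


def fits_file_api_limits(file_sizes, already_stored=0):
    """Hypothetical helper: return (ok, reason) for a batch of file sizes in bytes."""
    for size in file_sizes:
        if size > MAX_FILE_BYTES:
            return False, f"file of {size} bytes exceeds the 2GB per-file limit"
    total = already_stored + sum(file_sizes)
    if total > MAX_PROJECT_BYTES:
        return False, f"total of {total} bytes exceeds the 20GB per-project limit"
    return True, "ok"


if __name__ == "__main__":
    # A 1.5GB video and a 200MB image fit; a single 3GB file does not.
    print(fits_file_api_limits([1_500_000_000, 200_000_000]))
    print(fits_file_api_limits([3 * GB]))
```

Since files are kept for up to 48 hours, a project's stored total changes as older uploads expire, so `already_stored` should reflect only what is still stored.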
Want to help?
If you liked this issue, help spread the word and share One Minute AI with your peers and community.
You can also join our chat on Substack to share feedback or AI news you’d like to see featured.