Google I/O 2024 top highlights
At its Google I/O 2024 event, online giant Google introduced several significant upgrades, including the release of new Gemini and Gemma models, the latest AI features for Android, a new AI voice assistant, a text-to-video generator, and much more.
Many people are drawing comparisons to the fictional "Jarvis" from the Marvel movie Iron Man, suggesting that Google's advancements are bringing science fiction closer to reality.
Here are the top highlights from the Google I/O event:
Gemini 1.5
Google introduced Gemini 1.5 Flash, a new model in the Gemini family. The company says it is lighter than the existing Gemini 1.5 Pro but still fast and capable. The model is multimodal, meaning it can work with different kinds of input, and it can process a large amount of information at once. Google built Gemini 1.5 Flash for tasks that need quick responses.
New ways for developers to build with Gemini → https://t.co/xP0yqhtSoH
😎 Gemini 1.5 Flash joins 1.5 Pro in public preview via the Gemini API in Google AI Studio
🥳 A new Context Caching feature in the Gemini API
🤗 A preview of our 2 million context window #GoogleIO pic.twitter.com/WEgF2CMP6U
— Google for Developers (@googledevs) May 15, 2024
Gemma
Google also announced an improved version of Gemma, its family of open models. The new version is faster and performs better. It has been optimized to run on TPUs and GPUs, and it has a large number of parameters.
Navarasa, a fine-tuned Gemma variant trained on 15 Indic languages, is taking important steps towards making AI more inclusive in India.
Learn more → https://t.co/CCnkb8NoXY pic.twitter.com/9sQrxGmyWA
— Google for Developers (@googledevs) May 15, 2024
Ask Photos
Google will soon add a new feature to Google Photos called Ask Photos. It lets you ask questions about your photo library in plain language and tries to return helpful answers. The feature will start rolling out in the summer, with more capabilities to follow.
With Ask Photos on @GooglePhotos, you’ll soon be able to search for specific moments more intuitively. Say you want to see photos from your national park tour — you can type “show me a photo from each national park I’ve visited.” Learn more → https://t.co/rsO4J85LrR pic.twitter.com/Q66g3B1O0a
— Google (@Google) May 15, 2024
Project Astra Announced
This is the most interesting announcement for AI agents. So, what is it? Project Astra is the closest thing yet to what many people picture when they think of "Jarvis," the AI assistant from the Marvel movie Iron Man. It answers spoken questions about whatever your camera is pointed at, as the video below shows.
In the demo, the user says, 'Tell me when you see something that makes sound.' When the camera shows a speaker, Project Astra responds that it sees a speaker, which makes sound.
When asked to name the parts of the speaker, it provides those names as well. You can see this in the video below.
We’re sharing Project Astra: our new project focused on building a future AI assistant that can be truly helpful in everyday life. 🤝
Watch it in action, with two parts - each was captured in a single take, in real time. ↓ #GoogleIO pic.twitter.com/x40OOVODdv
— Google DeepMind (@GoogleDeepMind) May 14, 2024
In that video, the user shows Project Astra a diagram of a system and asks, 'What can I add here to make this system faster?' Project Astra replies, 'Adding a cache between the server and database could improve the speed.'
After hearing the answer in the Google office, everyone cheered, as Project Astra had given an exact answer.
Astra is 'a universal AI agent that can be truly helpful in everyday life,' said Demis Hassabis, CEO of Google DeepMind. The pace and quality of interaction with Astra feel natural.
Imagen 3 For Super-Realistic Images
Imagen 3 is a text-to-image model that generates highly realistic and detailed pictures. According to Doug Eck, it can even show you the tiny hairs on a wolf's nose! It is also better at understanding prompts written in natural language, the way a person would talk.
You can start using Imagen 3 today on ImageFX, and it will soon be available to developers and enterprise customers too.
We’re introducing Imagen 3: our highest quality text-to-image generation model yet. 🎨
It produces visuals with incredible detail, realistic lighting and fewer distracting artifacts.
From quick sketches to very high-res imagery, here’s a look at what it can create. 👀 #GoogleIO pic.twitter.com/XMrQYGeSiO
— Google DeepMind (@GoogleDeepMind) May 14, 2024
Google's New Video Generation Model - Veo
Google introduced Veo, a model that generates high-quality videos from text instructions. You can choose from different cinematic styles and edit the videos further using text commands. This appears to be Google's answer to Sora, the text-to-video model from OpenAI. You can try out Veo on a website called VideoFX.
Veo could help make high-quality video production accessible to everyone. ✨
It can understand many kinds of effects and even captures the nuance and tone of a prompt - offering an unprecedented level of creative control. → https://t.co/IplcinAVrK pic.twitter.com/IZl7vQfSNa
— Google DeepMind (@GoogleDeepMind) May 15, 2024
LearnLM wants to be your personal teacher
Imagine if you could have your own AI tutor to help you learn anything. It is an idea that comes up often in Silicon Valley, home to many tech companies. LearnLM is a family of new models based on Gemini and built specifically to help with learning. The goal is to make it possible for everyone to have their own smart tutor.
Google Introduces LearnLM: Empowering Educators with Generative AI
Google has recently unveiled LearnLM, a groundbreaking AI-powered tool designed to revolutionize the classroom experience. This innovative platform aims to empower educators by providing them with cutting-edge… pic.twitter.com/SNFpeYKXyl
— SuperBrain (@superbrain) May 15, 2024