Google unveils Gemini AI

What Google's new AI technology means for you and your organisation.

On December 6, Google revealed its latest and most powerful AI model, named “Gemini”. They are claiming that it represents a significant leap forward in the field of artificial intelligence, boasting capabilities far exceeding any previous model.

What makes this AI model different, is that it was built from the ground up to be multimodal. That means Gemini can understand and process information from various sources, including text, images, audio, video, and code. It can also transform any type of input into any type of output. This sets it apart from earlier models that were limited to handling specific types of data.

Gemini capabilities

As a result, Gemini can:

Generate text and images: This should result in more engaging and interactive experiences, and even open the doors to new forms of artistic expression.
Answer complex questions: With its multimodal understanding, Gemini is able to tackle intricate queries that span multiple domains.
Explain complex concepts: Through its sophisticated reasoning abilities, Gemini can break down complicated ideas into easily digestible explanations.
Write code: Gemini can understand and generate code in multiple languages, making it a valuable tool for programmers.
Surpass human experts: On the MMLU benchmark, Gemini outperformed human experts, demonstrating its superior knowledge and problem-solving skills in over 50 different domains.

Applications of Gemini

If all this is true, the applications could be almost endless.

Science: By analysing vast amounts of data, Gemini could accelerate scientific discoveries and breakthroughs.
Education: With Gemini’s ability to understand diverse information, personalised learning experiences could be tailor-built to match individual needs.
Healthcare: Gemini could assist with medical diagnosis and treatment by analysing complex data and making custom recommendations.
Arts: Gemini could empower artists and creators to explore new forms of expression and push the boundaries of creativity.

Versions

Gemini will be available in three sizes:

Gemini Ultra: The largest and most powerful model for highly complex tasks.
Gemini Pro: Best performing model for a wide range of tasks.
Gemini Nano: The most efficient model for use on mobile devices.

Availability

Starting December 2023, Gemini will be integrated into various Google products and services including:

Bard: Google’s AI chatbot is already utilising Gemini Pro for advanced reasoning and understanding. Gemini Ultra will be added to Bard early next year to create a new experience called Bard Advanced.
Pixel: Pixel 8 Pro will be the first smartphone to run Gemini Nano, powering new features like Summarise in the Recorder app.
Search: Gemini will be used to provide more relevant and informative search results.
Ads: Gemini will optimise ad targeting for greater effectiveness.
Chrome: Gemini will enhance the browsing experience with personalised features.
Duet AI: Gemini will power Duet AI for more seamless and natural interactions.

Gemini and the Future of AI

With its exceptional capabilities, Gemini could be a significant leap forward in AI development. It just might have the potential to transform the way we live, work, and interact with the world.

Interested in AI skills training? We offer a range of solutions. Explore our AI training courses, including generative AI courses and AI security and governance training, or read more on how to become an AI engineer.