ARTICLE AD BOX
Google Deepmind has unveiled Gemma 3, a new generation of open AI models designed to deliver high performance with a relatively small footprint, making them suitable for running on individual GPUs or TPUs.
The Gemma 3 family includes four models ranging from 1 to 27 billion parameters. Despite their compact size, these models outperform much larger LLMs like Llama-405B and DeepSeek-V3 in initial tests, according to Google Deepmind.
The models can handle more than 140 languages, with 35 requiring no additional training. They process text, images (except for the 1B version), and short videos using a 128,000-token context window. Google says their function calling and structured output capabilities make them well-suited for agentic tasks.
 Google's Gemma 3 family includes five AI models ranging from 1 billion to 27 billion parameters, each optimized for different use cases. | Image: Google
Google's Gemma 3 family includes five AI models ranging from 1 billion to 27 billion parameters, each optimized for different use cases. | Image: GoogleAll models underwent distillation training followed by specialized post-training using various reinforcement learning approaches. These techniques specifically target improvements in mathematics, chat functionality, instruction following, and multilingual communication.
Ad
THE DECODER Newsletter
The most important AI news straight to your inbox.
✓ Weekly
✓ Free
✓ Cancel at any time
Making LLMs more efficient
For the first time, Google is officially offering quantized versions that reduce memory and computing requirements while maintaining accuracy. The company says Gemma 3 will reproduce less verbatim text than previous versions and avoid reproducing personal data.
Human evaluators in the chatbot arena gave Gemma 3-27B-IT an Elo score of 1338, ranking it among the top 10 AI models. The smaller 4B model performs surprisingly well, matching the capabilities of the larger Gemma 2-27B-IT. The 27B version shows similar performance to Gemini 1.5-Pro across many benchmark tests.
 Human evaluators ranked AI models in the chatbot arena, with Grok-3 and GPT-4.5 achieving the highest Elo scores. Gemma-3-27B-IT placed in the top 10 by March 8, 2025 during testing. | Image: via Google
Human evaluators ranked AI models in the chatbot arena, with Grok-3 and GPT-4.5 achieving the highest Elo scores. Gemma-3-27B-IT placed in the top 10 by March 8, 2025 during testing. | Image: via GoogleAlongside the multimodal Gemma models, Google introduced ShieldGemma 2, a specialized 4-billion-parameter security checker designed to identify dangerous content, explicit material, and depictions of violence in images.
 Example using Gemma's vision capability: When AI helps to analyze bills, the Zürcher Geschnetzeltes is no longer so heavy on the stomach. | Image: via Google
Example using Gemma's vision capability: When AI helps to analyze bills, the Zürcher Geschnetzeltes is no longer so heavy on the stomach. | Image: via GoogleGemma 3 models are available through Hugging Face, Kaggle, and Google AI Studio. They support common frameworks including PyTorch, JAX, and Keras. Academics can access $10,000 in cloud credits through the Gemma 3 Academic Program. The models run on NVIDIA GPUs, Google Cloud TPUs, and AMD GPUs, with Gemma.cpp available for CPU use.

 7 months ago
                                22
                        7 months ago
                                22
                    

