Google Gemma 3

Gemma 3: The Most Powerful AI You Can Run on a Single GPU or TPU

Google DeepMind has introduced Gemma 3, the latest generation of its open model family, designed to deliver state-of-the-art performance on local devices. With variants of 1B, 4B, 12B, and 27B parameters, these models allow developers to create AI applications that can run on phones, laptops, workstations, and local servers.

Key Features of Gemma 3:

  • Best performance on a single accelerator: Outperforms models like Llama 3 405B and DeepSeek-V3 in preliminary human preference evaluations on the LMArena leaderboard.

  • Support for over 140 languages: Out-of-the-box support for more than 35 languages and pretrained support for more than 140, which makes it well suited to global applications.

  • Advanced text and image analysis: Enables multimodal reasoning over text, images, and short videos (the 1B model is text-only).

  • 128K-token context window: Handles extensive information in a single query (the 1B model is limited to 32K tokens).

  • Official quantized models: Reduce model size and compute requirements while maintaining high accuracy.

  • Compatible with popular frameworks: Integrates with Hugging Face, Ollama, PyTorch, vLLM, Google AI Edge, and Gemma.cpp.

Run AI Locally with a Single GPU  

Unlike many large models that need multi-accelerator setups, Gemma 3 is optimized to run on a single GPU or TPU host, making it an accessible option for researchers and developers.
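As a rough illustration of why parameter count and quantization matter for single-GPU use, the sketch below estimates the memory needed just to hold the model weights at different precisions. The bytes-per-parameter values are generic assumptions (16-bit, 8-bit, and 4-bit storage), the parameter counts are the nominal sizes from the model names, and the helper name `weight_memory_gb` is hypothetical. The estimate ignores the KV cache, activations, and framework overhead, so real requirements are higher.

```python
# Nominal parameter counts taken from the model names (in billions).
GEMMA3_SIZES_B = {"1B": 1, "4B": 4, "12B": 12, "27B": 27}

# Assumed storage cost per parameter; illustrative, not official figures.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billion: float, precision: str) -> float:
    """Return approximate weight-only memory in GB (1 GB = 1e9 bytes)."""
    return params_billion * 1e9 * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    for name, size in GEMMA3_SIZES_B.items():
        row = ", ".join(
            f"{p}: {weight_memory_gb(size, p):.1f} GB" for p in BYTES_PER_PARAM
        )
        print(f"Gemma 3 {name} -> {row}")
```

Under these assumptions, the 27B model needs about 54 GB for its weights at 16-bit precision and about 13.5 GB at 4-bit, which suggests why quantized checkpoints are what bring the larger variants within reach of a single consumer GPU.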

Security Innovations with ShieldGemma 2

Alongside Gemma 3, Google launched ShieldGemma 2, a 4B-parameter image safety checker built on Gemma 3 that labels images across three categories: dangerous content, sexually explicit content, and violence.

Flexible Deployment Options  

Gemma 3 can be used in the cloud through Vertex AI and the Google GenAI API, and it also runs in local environments, on Google Colab, and even on gaming GPUs. Additionally, NVIDIA has optimized the model for top performance on its GPUs.
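For local deployment, one common route is Ollama, which serves models over a local HTTP API. The following is a minimal sketch, assuming Ollama is installed and running on its default port (11434) and that a Gemma 3 model has already been pulled under the tag `gemma3:4b`. The tag and the helper names `build_request` and `generate` are assumptions for illustration.

```python
import json
import urllib.request

# Assumed defaults: Ollama's local endpoint and a Gemma 3 model tag.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL_TAG = "gemma3:4b"


def build_request(prompt: str, model: str = MODEL_TAG) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, model: str = MODEL_TAG, timeout: float = 120.0) -> str:
    """Send the prompt and return the generated text (needs a running server)."""
    with urllib.request.urlopen(build_request(prompt, model), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))["response"]


# Example call, to be run only when the local server is up:
# print(generate("Explain what a context window is in one sentence."))
```

Keeping request construction separate from the network call makes the payload easy to inspect or adapt, for example by pointing `model` at a different Gemma 3 size.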

An Expanding Ecosystem: The Gemmaverse

With over 100 million downloads and more than 60,000 community-created variants, the Gemma family anchors a growing ecosystem that Gemma 3 now joins. Google has also launched the Gemma 3 Academic Program, which offers Google Cloud credits to researchers interested in exploring the model’s potential.

Try Gemma 3 now in Google AI Studio or download it from Hugging Face and Kaggle. 

Google AI Studio

Developers Google Blog

Gemma 3 Technical Report