Google Launches Gemma 3 1B For Mobile and Web Apps

Requiring a "mere" 529MB, Gemma 3 1B is a small language model (SLM) specifically meant for distribution across mobile and Web apps, where models must download quickly and be responsive to keep user engagement high.

Thanks to its reduced footprint, Gemma 3 1B can be downloaded quickly and run locally even when no WiFi or cellular connection is available. Running on-device, the model offers minimal latency and incurs no cloud costs. More importantly, user data stays private, since it never needs to leave the device.

The main use case for adopting Gemma 3 1B is integrating a natural language interface into your app:

By including Gemma 3 1B in your app, you can use natural language to drive your application or generate content from in-app data or context, all fully customizable and fine-tunable.

This includes generating descriptions and captions for data, supporting conversation, ingesting long documents to answer user questions using the AI Edge RAG SDK, creating dialog based on current app state, and more.
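The retrieval-augmented flow behind the document-ingestion use case can be sketched without the SDK itself. The snippet below is a generic illustration, not the AI Edge RAG SDK API: it ranks document chunks against the question with a naive word-overlap score (where the SDK would use embeddings and a vector store) and stuffs the best matches into the prompt; the generate parameter stands in for an on-device model call such as the one shown in the Android snippet further below.

```kotlin
// Generic RAG sketch -- NOT the AI Edge RAG SDK API. Retrieval here is a
// naive word-overlap ranking standing in for embedding-based vector search.

// Count how many of the question's words appear in a chunk.
fun overlap(question: String, chunk: String): Int {
    val words = question.lowercase().split(Regex("\\W+")).toSet()
    return chunk.lowercase().split(Regex("\\W+")).count { it in words }
}

// Answer a question by prepending the best-matching chunks to the prompt.
// `generate` abstracts the on-device model call, e.g. LlmInference's
// generateResponse shown below.
fun answer(
    question: String,
    chunks: List<String>,
    topK: Int = 3,
    generate: (String) -> String
): String {
    val context = chunks
        .sortedByDescending { overlap(question, it) }
        .take(topK)
        .joinToString("\n---\n")
    return generate("Answer using only this context:\n$context\n\nQuestion: $question")
}
```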

Gemma 3 1B can be fine-tuned through a variety of methods, including training on a synthetic reasoning dataset, LoRA adapters, and more. Google provides a ready-to-use Colab notebook showing how to combine those two methods and then convert the resulting model to the LiteRT format, the new name for the TensorFlow Lite format.

To make it easier for developers to integrate Gemma 3, Google also provides a sample chat app for Android showing how to use the model for text generation, information retrieval and summarization, email drafting, and more. The app uses the MediaPipe LLM Inference API, although the model can also be integrated using the LiteRT stack directly.
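For the MediaPipe route, the integration is compact. The sketch below uses the LLM Inference API from the com.google.mediapipe:tasks-genai artifact; the model path and file name are assumptions and depend on where the app downloads the Gemma 3 1B bundle:

```kotlin
import android.content.Context
import com.google.mediapipe.tasks.genai.llminference.LlmInference

// Build the LLM Inference task around a locally stored Gemma 3 1B bundle.
// The path is an assumption: in practice the app first downloads the .task
// file (e.g. from Hugging Face) into its own storage.
fun createGemma(context: Context): LlmInference {
    val options = LlmInference.LlmInferenceOptions.builder()
        .setModelPath("/data/local/tmp/llm/gemma3-1b-it-int4.task")
        .setMaxTokens(512) // combined budget for prompt and response
        .build()
    return LlmInference.createFromOptions(context, options)
}

// Blocking single-shot generation; a chat UI would typically stream
// partial results via the API's asynchronous variant instead.
fun draftReply(llm: LlmInference, email: String): String =
    llm.generateResponse("Draft a short, polite reply to this email:\n$email")
```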

No Gemma 3 sample app is available for iOS yet: since the MediaPipe LLM Inference API for iOS does not yet support the new model, Google only provides an older sample app based on Gemma 2.

Google provided performance figures showing that Gemma 3 1B significantly outperforms Gemma 2 2B while requiring only 20% of its deployment size. As Google engineers explain, these improvements were achieved through extensive optimizations: quantization-aware training, improved KV cache performance, faster loading thanks to optimized weight layouts, and weight sharing across the prefill and decode phases.

While these optimizations apply to all open-weight models, not just Gemma, actual results may vary considerably depending on the device running the model and on its runtime configuration.

Gemma 3 1B can run on either the CPU or the GPU of a mobile device; for best performance, the device should have at least 4GB of memory. The model is available for download from Hugging Face under Google's usage license.
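On Android, the choice of processor can be steered when the MediaPipe task is created. Recent tasks-genai releases expose a preferred-backend option on the builder; this is an assumption worth verifying against the MediaPipe version you ship:

```kotlin
// Ask for the GPU delegate explicitly; MediaPipe can fall back to the CPU
// on devices without a capable GPU. Model path as in the earlier snippet.
val gpuOptions = LlmInference.LlmInferenceOptions.builder()
    .setModelPath("/data/local/tmp/llm/gemma3-1b-it-int4.task")
    .setPreferredBackend(LlmInference.Backend.GPU)
    .build()
```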
