Gemini 1.5 Flash is now Faster, Cheaper, and More Accessible Than Ever

Janice Wong Artificial Intelligence, Google Cloud


Gemini 1.5 Flash price drop, tuning rollout complete and improvements to Gemini API and Google AI Studio

Last week, an experimental updated version of Gemini 1.5 Pro (0801) was launched. It ranked #1 on the LMSYS leaderboard for both text and multi-modal queries. The Google team was so encouraged by the immediate response to this model that it raised the limits for testing it.

Today, a series of improvements was announced across AI Studio and the Gemini API:

Significant reduction in costs for Gemini 1.5 Flash, with input token costs decreasing by 78% and output token costs decreasing by 71%
1.5 Flash tuning is now available to all developers
Expanding the Gemini API to support queries in 100+ additional languages
Expanded AI Studio access for Google Workspace customers
Revamped documentation UI and API reference and more!

Gemini 1.5 Flash price decrease

Gemini 1.5 Flash is the most popular Gemini model among developers who want to build high-volume, low-latency use cases such as summarization, categorization, and multi-modal understanding. To make this model even more affordable, as of August 12 the input price has been reduced by 78% to $0.075 per 1 million tokens and the output price by 71% to $0.30 per 1 million tokens, for prompts under 128K tokens. The reductions also carry over to the tier for prompts over 128K tokens and to caching. With these prices and tools like context caching, developers should see major cost savings when building with Gemini 1.5 Flash’s long-context and multimodal capabilities.

Gemini 1.5 Flash reduced prices effective August 12, 2024. See full price list at ai.google.dev/pricing.
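
To make the new rates concrete, here is a minimal Python sketch that estimates the cost of a single request in the under-128K tier. It uses only the two prices quoted above. The function name, the reading of "128K" as 128,000 tokens, and the sample workload are assumptions made for illustration, so check ai.google.dev/pricing before relying on the numbers.

```python
# Prices quoted in the announcement, in USD per 1 million tokens,
# for prompts under 128K tokens.
INPUT_USD_PER_MILLION = 0.075
OUTPUT_USD_PER_MILLION = 0.30

# Assumption: "128K" is read here as 128,000 tokens.
SHORT_PROMPT_LIMIT = 128_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in the under-128K pricing tier."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens >= SHORT_PROMPT_LIMIT:
        # The announcement does not list the long-prompt rates, so do not guess.
        raise ValueError("prompt is in the >128K tier; see ai.google.dev/pricing")
    input_cost = input_tokens * INPUT_USD_PER_MILLION / 1_000_000
    output_cost = output_tokens * OUTPUT_USD_PER_MILLION / 1_000_000
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 10,000 summarization requests, each with a
    # 2,000-token prompt and a 200-token summary.
    per_request = estimate_cost_usd(2_000, 200)
    print(f"per request: ${per_request:.6f}")
    print(f"10,000 requests: ${per_request * 10_000:.2f}")
```

Under these assumptions, each request costs about $0.00021, so the 10,000-request workload comes to roughly $2.10 before any savings from context caching.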

Expanded Gemini API language availability

Language understanding is being expanded for both the Gemini 1.5 Pro and Flash models to cover more than 100 languages. This will enable developers across the globe to prompt and receive outputs in the language of their choice. The expansion is expected to eliminate responses that the Gemini API blocks with a “language” finish reason.
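
For applications that still want to guard against this case, the check can be sketched as below. The sketch assumes the response has already been parsed into a dictionary shaped like the REST API's JSON output, where each candidate carries a `finishReason` string and the language block is reported as `"LANGUAGE"`. The helper name and the sample responses are hypothetical.

```python
from typing import Any


def language_blocked(response: dict[str, Any]) -> bool:
    """Return True if any candidate stopped because of an unsupported language.

    Assumes `response` mirrors the REST JSON layout:
    {"candidates": [{"finishReason": "..."}, ...]}
    """
    candidates = response.get("candidates", [])
    return any(c.get("finishReason") == "LANGUAGE" for c in candidates)


if __name__ == "__main__":
    # Hypothetical parsed responses, trimmed to the field being checked.
    blocked = {"candidates": [{"finishReason": "LANGUAGE"}]}
    normal = {"candidates": [{"finishReason": "STOP"}]}
    print(language_blocked(blocked))
    print(language_blocked(normal))
```

A caller could use this to fall back to a different prompt language or to surface a clear error, instead of treating an empty answer as a model failure.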

Google AI Studio access for Google Workspace

By default, Google Workspace users can now access Google AI Studio without having to enable any additional settings. This change gives millions of users frictionless access. Account admins retain control to manage AI Studio access.

Gemini 1.5 Flash tuning rollout now complete

Gemini 1.5 Flash text tuning has now been rolled out to all developers via the Gemini API and Google AI Studio. Tuning enables developers to customize base models and improve performance on specific tasks by providing the model with additional data. It helps reduce the context size of prompts, reduces latency and in some cases cost, and increases the accuracy of the model on those tasks.
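
One way to picture the "additional data" is as a list of input/output pairs: examples that would otherwise be pasted into every prompt become training data instead, which is where the smaller prompts come from. The sketch below only prepares such a list. The `text_input` and `output` field names follow the example format shown in the Gemini API tuning documentation as best understood here, and the helper name and sample pairs are hypothetical, so verify the exact schema before submitting a tuning job.

```python
from typing import Iterable


def build_tuning_examples(pairs: Iterable[tuple[str, str]]) -> list[dict[str, str]]:
    """Convert (prompt, expected answer) pairs into tuning examples.

    Assumption: each example is a dict with "text_input" and "output" keys.
    """
    examples = []
    for prompt, answer in pairs:
        prompt, answer = prompt.strip(), answer.strip()
        if not prompt or not answer:
            raise ValueError("every example needs a non-empty input and output")
        examples.append({"text_input": prompt, "output": answer})
    return examples


if __name__ == "__main__":
    # Hypothetical categorization data for a support-ticket classifier.
    pairs = [
        ("My invoice shows the wrong amount.", "billing"),
        ("The app crashes when I open settings.", "bug"),
    ]
    for example in build_tuning_examples(pairs):
        print(example)
```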

Improved developer documentation

Developer documentation is core to the experience of building with the Gemini API. Google recently released a series of improvements: updated content, navigation, and look and feel, plus a revamped API reference.

Improved developer documentation experience for the Gemini API on ai.google.dev/pricing.

PDF Vision and Text understanding

The Gemini API and AI Studio now support PDF understanding through both text and vision. If a PDF includes graphs, images, or other non-text visual content, the model uses native multi-modal capabilities to process the PDF. This feature can be tried out via Google AI Studio or in the Gemini API.
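
As a rough illustration of the API route, the sketch below builds a request body that pairs a PDF with a text question. It assumes the REST `generateContent` request layout, in which a file is sent inline as base64 under `inline_data` with a `mime_type` of `application/pdf`. The helper name and the placeholder bytes are hypothetical, and sending the request (endpoint, API key, model name) is deliberately left out. Larger files are usually uploaded separately instead of being inlined.

```python
import base64
import json


def build_pdf_request(pdf_bytes: bytes, question: str) -> dict:
    """Build a generateContent-style request body with an inline PDF.

    Assumption: the API accepts a part of the form
    {"inline_data": {"mime_type": ..., "data": <base64 string>}}.
    """
    encoded = base64.b64encode(pdf_bytes).decode("ascii")
    return {
        "contents": [
            {
                "parts": [
                    {"inline_data": {"mime_type": "application/pdf", "data": encoded}},
                    {"text": question},
                ]
            }
        ]
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real file read with open(path, "rb").
    body = build_pdf_request(b"%PDF-1.4 placeholder", "Summarize the charts in this document.")
    print(json.dumps(body, indent=2)[:200])
```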

Google AI Studio improvements

Read more on the Google Cloud Blog.
