Gemini 1.5 Flash is now Faster, Cheaper, and More Accessible Than Ever
Janice Wong2024-08-27T09:45:21+08:00Gemini 1.5 Flash is now
Faster, Cheaper, and More Accessible Than Ever
Gemini 1.5 Flash price drop, tuning rollout complete and improvements to Gemini API and Google AI Studio
Last week, an experimental updated version of Gemini 1.5 Pro (0801) was launched. It was ranked #1 on the LMSYS leaderboard for both text and multi-modal queries. The Google team was so excited by the immediate response to this model that they raised the limits to test with it.
Today, a series of improvements were announced across AI Studio and the Gemini API:
- Significant reduction in costs for Gemini 1.5 Flash, with input token costs decreasing by 78% and output token costs decreasing by 71%
- 1.5 Flash tuning is now available to all developers
- Expanding the Gemini API to support queries in 100+ additional languages
- Expanded AI Studio access for Google Workspace customers
- Revamped documentation UI and API reference and more!
Gemini 1.5 Flash price decrease
Gemini 1.5 Flash is the most popular Gemini model amongst developers who want to build high volume, low latency use cases such as summarization, categorization, multi-modal understanding and more. To make this model even more affordable, as of August 12, the input price has been reduced by 78% to $0.075/1 million tokens and the output price by 71% to $0.3/1 million tokens for prompts under 128K tokens (cascading the reductions across the >128K tokens tier as well as caching). With these prices and tools like context caching, developers should see major cost savings when building with Gemini 1.5 Flash’s long context and multimodal capabilities.
Expanded Gemini API language availability
Language understanding is being expanded for both Gemini 1.5 Pro and Flash models to cover more than 100 languages. This will enable developers across the globe to prompt and receive outputs in the language of their choice. It is expected that this expansion will eliminate model “language” block finish reasons via the Gemini API.
Google AI Studio access for Google Workspace
Google Workspace users can now access Google AI Studio without having to enable any additional settings by default. This change unlocks frictionless access for millions of users. Account admins will still retain control to manage AI Studio access.
Gemini 1.5 Flash tuning rollout now complete
Gemini 1.5 Flash text tuning has now been rolled out to all developers via the Gemini API and Google AI Studio. Tuning enables developers to customize base models and improve performance for tasks by providing the model additional data. This helps reduce the context size of prompts, reduces latency and in some cases cost, while also increasing the accuracy of the model on tasks.
Improved developer documentation
Google’s developer documentation is core to the experience of building with the Gemini API. They recently released a series of improvements, updated the content, navigation, look and feel, and released a revamped API reference.
PDF Vision and Text understanding
The Gemini API and AI Studio now support PDF understanding through both text and vision. If a PDF includes graphs, images, or other non-text visual content, the model uses native multi-modal capabilities to process the PDF. This feature can be tried out via Google AI Studio or in the Gemini API.
Google AI Studio improvements
The Gemini API and AI Studio now support PDF understanding through both text and vision. If a PDF includes graphs, images, or other non-text visual content, the model uses native multi-modal capabilities to process the PDF. This feature can be tried out via Google AI Studio or in the Gemini API.
Read more on Google Cloud Blog: Click Here