Get all your news in one place.
100’s of premium titles.
One app.
Start reading
Geekflare
Geekflare
Keval Vachharajani

Gemini 2.5 Flash-Lite Is Here: Low Cost, High Speed

Google has finally rolled out a stable version of Gemini 2.5 Flash-Lite. This is the fastest and most cost-effective model in the Gemini 2.5 family. It is said to be designed for performance at scale, and is now available to developers building latency-sensitive applications like real-time translation, classification, and content processing.

The main spotlight of this release is on its low cost and high speed. At just $0.10 per million input tokens and $0.40 per million output tokens, Flash-Lite offers the best price-to-performance ratio in the Gemini 2.5 lineup. It delivers lower latency than previous Flash-Lite and Flash models, making it ideal for high-throughput tasks without compromising on quality.

It’s also worth keeping in mind that this is the smallest model in the Gemini 2.5 series. Despite that, it supports advanced capabilities such as one million-token context window, native reasoning, and integrations with tools like Grounding with Google Search and Code Execution. Developers can even toggle reasoning modes on or off, depending on the task’s complexity, an option that adds flexibility for different use cases.

According to the company, some early adopters are already seeing real-world benefits. For example, Satlyt, a space-tech startup, reported a 45% drop in latency and 30% lower power usage for its satellite diagnostics. Apart from that, video avatar company HeyGen uses the model to automate video planning and translate content into 180+ languages. 

With this stable release, developers can now use the model by specifying “gemini-2.5-flash-lite” in their code. The company is also planning to retire the “preview” alias by August 25. 

Google has also been rolling out updates to the Gemini API, including a conversational image segmentation feature. It allows developers to interact with images using natural language prompts like “the person holding the umbrella” or “the car that is farthest away,” without needing detailed labels or boxes. 

That’s all about Google and Gemini for now. But if you want the next tech or AI update delivered straight to your phone, join us on WhatsApp. 

Sign up to read this article
Read news from 100’s of titles, curated specifically for you.
Already a member? Sign in here
Related Stories
Top stories on inkl right now
One subscription that gives you access to news from hundreds of sites
Already a member? Sign in here
Our Picks
Fourteen days free
Download the app
One app. One membership.
100+ trusted global sources.