- Sarvam AI claims its Sarvam Vision model beats Gemini and ChatGPT on major OCR benchmarks
- The startup focuses on India’s 22 official languages
- Its “Sovereign AI” approach aims to create technology specifically tailored to India’s needs.
ChatGPT, Gemini, and other AI chatbots are often very good at reading English and many other languages, but while they can interpret Hindi, they start to falter when confronted with more complex scripts or regional nuances among Indian languages.
Now, a Bengaluru startup called Sarvam AI is ramping up its models that it claims can outperform global competitors in optical character recognition (OCR) and multilingual speech, especially when it comes to subcontinent languages.
Regarding Indian languages, Sarvam Vision is by far the best model, while supporting all 22 Indian languages programmed pic.twitter.com/nM4Ujz0wvPFebruary 5, 2026
The Sarvam Vision and Bulbul V3 models are designed keeping in mind the linguistic complexity of India. Sarvam Vision can interpret complex tables, understand graphics, recognize text in real-world scenes and generate captions, while Bulbul V3 handles the text-to-speech system. They support all 22 official Indian languages.
With 35 voices, Bulbul is able to still speak like a local. As many multilingual users know, the difficulty of hearing their language pronounced as if it were a distant cousin of English can make someone reluctant to try this technology. A well-trained text-to-speech model that captures rhythm and tone more accurately can make people feel more comfortable using it.
And while OCR doesn’t sound glamorous, it silently powers everything from scanning a document with your phone, uploading a PDF, or scanning an old recording. Garbled characters, misread names, and missing context can be a real problem. Sarvam says this will help small business owners and government offices convert documents into searchable records more quickly and accurately than would otherwise be possible.
Sovereign AI
Sarvam AI presents itself as a builder of sovereign AI. The idea is to distinguish yourself from foreign platforms. With the spread of AI models in government, business and education, the question of who builds them and whose data they understand is very important. Sarvam wants to have tools adapted to India.
The emergence of Sarvam is also sparking a broader debate about the origins of innovation. The AI boom has often been portrayed as a race between a few dominant players. Yet breakthroughs increasingly come from focused teams solving specific problems. Sarvam appears to have identified a gap in high-quality, language-rich OCR and voice systems for Indian scripts.
Of course, benchmarks are snapshots and not guarantees of performance, especially in the real world. The proof of Sarvam’s impact will be in adoption. Additionally, if Sarvam’s claims hold up, large AI companies will feel compelled to improve their own support for more languages and scripts.
At its best, the Sarvam AI story goes beyond beating Gemini or ChatGPT in a ranking and becomes a way to showcase the technology in the image of the people who use it. If AI is to shape the next decade of digital life, it will need to be fluent in many languages and read more than just plain English text.
Sarvam is betting that attention to detail and cultural specificity can rival scale. For the millions of users who feel underserved by traditional AI tools, this bet may seem safer.
And of course you can too follow TechRadar on TikTok for news, reviews, unboxings in video form and receive regular updates from us on WhatsApp Also.




