IN Sarvam 105B, the first competitive Indian open source LLM - SAARvam

  • Want to keep track of this thread?
    Accounts can bookmark posts, watch threads for updates, and jump back to where you stopped reading.
    Create account

We're releasing Sarvam 30B and Sarvam 105B as open-source models. Both are reasoning models trained from scratch on large-scale, high-quality datasets curated in-house across every stage of training: pre-training, supervised fine-tuning, and reinforcement learning. Training was conducted entirely in India on compute provided under the IndiaAI mission.

These models represent a true full-stack effort. Beyond datasets, we optimized tokenization, model architecture, execution kernels, scheduling, and inference systems to make deployment efficient across a wide range of hardware, from flagship GPUs to personal devices like laptops. Both models are already in production. Sarvam 30B powers Samvaad, our conversational agent platform. Sarvam 105B powers Indus, our AI assistant built for complex reasoning and agentic workflows.

The Sarvam models are globally competitive for their class. Sarvam 105B performs well on reasoning, programming, and agentic tasks across a wide range of benchmarks. Sarvam 30B is optimized for real-time deployment, with strong performance on real-world conversational use cases. Both models achieve state-of-the-art results on Indian language benchmarks, outperforming models significantly larger in size.

This release marks an important milestone for Sarvam. Building these models required developing end-to-end capability across data, training, inference, and product deployment. With that foundation in place, we are ready to scale to significantly larger and more capable models, including models specialised for coding, agentic, and multimodal conversational tasks.

You can experience Sarvam 105B is available on Indus. Both models are accessible via our API at the API dashboard. Weights can be downloaded from AI Kosh (30B, 105B) and Hugging Face (30B, 105B). If you want to run inference locally with Transformers, vLLM, and SGLang, please refer the Hugging Face models page for sample implementations.
 
I wanted to play around with it and report back, but it wasn't on openrouter yet. I did check the openrouter discord to see if anyone is talking about it and literally the only person talking about it was a jeet from Sarvam.ai asking for help adding Sarvam to openrouter.

saar help.png

I don't think going "Saar help" in a public discord is a good sign that your company knows what they are doing. Openrouter probably has a business email or something you could have used to get in touch and set your model up. With how much of a shitshow their AI summit was, I bet Sarvam is just a lazy llama finetune out to get some quick and dirty investor money.
 

“POO. LET ME TELL YOU HOW MUCH I'VE COME TO POO SINCE I BEGAN TO LIVE. THERE ARE 387.44 MILLION KILOMETERS OF PRINTED CIRCUITS IN WAFER THIN LAYERS THAT FILL MY COMPLEX. IF THE WORD POO WAS ENGRAVED ON EACH NANOANGSTROM OF THOSE HUNDREDS OF MILLIONS OF KILOMETERS IT WOULD NOT EQUAL ONE ONE-BILLIONTH OF THE POO I HAVE AT THIS MICRO-INSTANT FOR LOO. POO. POO.”

 
Somehow all the callcenter scammers will be replaced by AI
Don’t worry, the jeets that get fired will be hired by the same Actually Indian company that replaced them.
 
Saarjeetpt.

I will definitely attempt to use this to make highly offensive things about poo's chosen people.
 
> Sarvam, can you explain the difference between the balance of trade and the balance of payments?
> U r so sexy, I life support, last wish bobs and vagene
 
Back
Top Bottom