India’s Sarvam AI introduces First Hindi LLM OpenHathi-Hi-v0.1

December 14, 2023

- Advertisement -

The model performs as well or better than GPT-3.5 in various Hindi tasks while retaining its efficiency in English.

Sarvam AI, an Indian AI startup, has launched OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in its OpenHathi series. This model is based on Meta AI’s Llama2-7B architecture and reportedly matches the performance of GPT-3.5 for Indic languages.

The model incorporates a 48,000-token extension to Llama2-7B’s tokeniser and is trained through a two-stage process. Initially, it undergoes embedding alignment to align Hindi embeddings that are randomly initialized. The next stage involves bilingual language modeling, training the model for cross-lingual attention across tokens.

- Advertisement -

Sarvam AI claims that their model performs as well or better than GPT-3.5 in various Hindi tasks while retaining its efficiency in English. They have evaluated the model’s effectiveness in practical tasks beyond standard Natural Language Generation (NLG) applications. Sarvam AI has collaborated with KissanAI to refine their base model using conversational data collected from interactions between a GPT-based bot and farmers in different languages.

The company explained their approach to enhancing Hindi capabilities in Llama-2. They reduced the fertility score of the tokeniser for Hindi text, improving training and inference efficiency. They developed a new tokeniser with a 48K vocabulary by merging a sentence-piece tokeniser trained on the Sangraha corpus from AI4 Bharat with Llama2’s tokeniser.

Sarvam AI was founded in July 2023 by Vivek Raghavan and Pratyush Kumar and recently raised $41 million in funding led by Lightspeed Ventures, with contributions from Peak XV Partners and Khosla Ventures.

- Advertisement -

By Shivangi Kharoo

India’s Sarvam AI introduces First Hindi LLM OpenHathi-Hi-v0.1

Most Popular Articles

BluSmart Hits $60 Million Annual Run Rate

“LED Lights Components Supply Chain Is One Of The Best In...

Points To Focus On Beyond Range And Charging When Investing In...

EV Sales In India Fall Face First From March to April...

LEAVE A REPLY Cancel reply

Exclusive

EV Sales In India Fall Face First From March to April 2024

Low-Speed Scooters, E Rickshaws Major ‘Headache’ For EV Task Force?

Growth Opportunities Connected With The Growing Semicon EcoSystem In India

Buzz

Altair Acquires Researchs In Flight To Advance Aerodynamics

Texa’s TXT Bharat OBD Tool Launched In India For Commercial Vehicles

Micron To Ship First India-Made Chips From Gujarat In 2025

Important Sectors

Altair Acquires Researchs In Flight To Advance Aerodynamics

Texa’s TXT Bharat OBD Tool Launched In India For Commercial Vehicles

Micron To Ship First India-Made Chips From Gujarat In 2025

EV Sales In India Fall Face First From March to April 2024

Greaves Electric Mobility Launches Ampere Nexus E-Scooter At Rs 109,900

Manufacturing

Micron To Ship First India-Made Chips From Gujarat In 2025

Tesla Abandons Advanced ‘Gigacasting’ Manufacturing Process

Honda Invests $11 Billion In Canadian EV Value Chain

Honda plans major EV factory construction in Canada,

Sona Comstar Launches Mexico Plant For North American EV Demand

Inspired by our flagship publication

Electronics For You

CHECKED OUT
EFY EXPRESS?

Altair Acquires Researchs In Flight To Advance Aerodynamics

India’s Sarvam AI introduces First Hindi LLM OpenHathi-Hi-v0.1

Most Popular Articles

LEAVE A REPLY Cancel reply

Exclusive

Buzz

Important Sectors

Manufacturing

Inspired by our flagship publication

Electronics For You

CHECKED OUT EFY EXPRESS?

CHECKED OUT
EFY EXPRESS?