India’s Sarvam AI Introduces Its First Hindi LLM, OpenHathi-Hi-v0.1


The model reportedly performs as well as or better than GPT-3.5 on various Hindi tasks while retaining its English performance.

Sarvam AI, an Indian AI startup, has launched OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in its OpenHathi series. This model is based on Meta AI’s Llama2-7B architecture and reportedly matches the performance of GPT-3.5 for Indic languages.

The model extends Llama2-7B’s tokeniser to a 48,000-token vocabulary and is trained in two stages. The first stage, embedding alignment, aligns the embeddings of the newly added Hindi tokens, which start out randomly initialised. The second stage, bilingual language modelling, trains the model to attend cross-lingually across Hindi and English tokens.
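As a rough illustration of why the new embeddings need an alignment stage, the sketch below extends a toy embedding table: the original rows are copied unchanged and the rows for the added tokens are filled with small random values. This is not Sarvam AI’s code. The function name `extend_embeddings`, the toy sizes and the initialisation scale of 0.02 are all assumptions made for the example.

```python
import random


def extend_embeddings(base_rows, new_vocab_size, seed=0, std=0.02):
    """Copy the original rows unchanged and append randomly initialised rows.

    Hypothetical helper for illustration only; std=0.02 is an assumed
    initialisation scale, not a value taken from the article.
    """
    rng = random.Random(seed)
    dim = len(base_rows[0])
    extended = [list(row) for row in base_rows]
    for _ in range(new_vocab_size - len(base_rows)):
        extended.append([rng.gauss(0.0, std) for _ in range(dim)])
    return extended


# Toy sizes stand in for the real ones (the article cites a 48K target vocabulary).
BASE_VOCAB, EXTENDED_VOCAB, DIM = 4, 6, 3
base = [[float(i)] * DIM for i in range(BASE_VOCAB)]
extended = extend_embeddings(base, EXTENDED_VOCAB)

# Rows at index BASE_VOCAB and above belong to the added tokens. They carry
# no learned meaning yet, which is what an alignment stage has to fix.
new_rows = list(range(BASE_VOCAB, EXTENDED_VOCAB))
print(len(extended), new_rows)
```

The existing rows keep whatever the base model already learned, so only the appended rows start from noise.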


Sarvam AI claims that its model performs as well as or better than GPT-3.5 on various Hindi tasks while retaining its English performance. The company says it has evaluated the model on practical tasks beyond standard Natural Language Generation (NLG) applications. Sarvam AI has also collaborated with KissanAI to fine-tune the base model on conversational data collected from interactions between a GPT-based bot and farmers in different languages.

The company explained its approach to enhancing Hindi capabilities in Llama 2. It reduced the tokeniser’s fertility score for Hindi text, that is, the average number of tokens produced per word, which improves training and inference efficiency. To do this, it built a new tokeniser with a 48K vocabulary by merging a SentencePiece tokeniser trained on AI4Bharat’s Sangraha corpus with Llama 2’s tokeniser.
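The effect of merging vocabularies on fertility can be shown with a toy example. The sketch below is a simplification, not Sarvam AI’s tokeniser: it uses a greedy longest-match segmenter with a single-character fallback instead of SentencePiece, and the vocabularies, sample text and function names are invented for illustration.

```python
def merge_vocab(base_vocab, new_vocab):
    """Append tokens from new_vocab that base_vocab lacks, keeping base order."""
    merged = list(base_vocab)
    seen = set(base_vocab)
    for tok in new_vocab:
        if tok not in seen:
            merged.append(tok)
            seen.add(tok)
    return merged


def tokenize(word, vocab):
    """Greedy longest-match segmentation with single-character fallback."""
    vocab_set = set(vocab)
    max_len = max(len(t) for t in vocab)
    tokens, i = [], 0
    while i < len(word):
        # Try the longest candidate first; a single character always matches.
        for j in range(min(len(word), i + max_len), i, -1):
            if word[i:j] in vocab_set or j == i + 1:
                tokens.append(word[i:j])
                i = j
                break
    return tokens


def fertility(text, vocab):
    """Average number of tokens per whitespace-separated word."""
    words = text.split()
    return sum(len(tokenize(w, vocab)) for w in words) / len(words)


# Hypothetical toy vocabularies: an English-only base and a Hindi add-on.
base_vocab = ["the", "farm", "er", "s"]
hindi_vocab = ["किसान", "खेत", "पानी", "the"]
merged_vocab = merge_vocab(base_vocab, hindi_vocab)

hindi_text = "किसान खेत पानी"
english_text = "the farmers"

print("Hindi fertility, base vocab:   ", fertility(hindi_text, base_vocab))
print("Hindi fertility, merged vocab: ", fertility(hindi_text, merged_vocab))
print("English fertility, base vocab: ", fertility(english_text, base_vocab))
print("English fertility, merged vocab:", fertility(english_text, merged_vocab))
```

With the base vocabulary every Hindi word is split into individual characters, while the merged vocabulary covers each word with one token; the English segmentation is the same under both. Fewer tokens per word means shorter sequences for the same text, which is where the efficiency gain comes from.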

Sarvam AI was founded in July 2023 by Vivek Raghavan and Pratyush Kumar. It recently raised $41 million in funding led by Lightspeed Venture Partners, with participation from Peak XV Partners and Khosla Ventures.

