Google DeepMind's Project Morni Aims to Digitize and Preserve 125 Indian Languages


August 30, 2024

Google DeepMind’s Indian team is working on an exciting new project called Morni (Multimodal Representation for India) that aims to create advanced artificial intelligence for 125 Indian languages and dialects. This initiative highlights a major push to enhance language technology in India, which is known for its rich linguistic diversity.
Manish Gupta, the director at Google DeepMind in India, shared details about the project at the Global Fintech Fest in Mumbai. He explained that while India officially recognizes 22 languages, the project is targeting over 100 languages. This is because there are about 60 Indian languages with more than a million speakers and over 125 languages with at least 100,000 speakers each.

One significant challenge faced by the project is the lack of digital data for many Indian languages. Gupta revealed that 73 of the 125 targeted languages had no existing digital text data. Even Hindi, spoken by roughly 10% of the global population, accounts for only 0.1% of text available online. This lack of representation in digital media poses a significant barrier to developing language technologies.

To address this issue, Google DeepMind launched Project Vaani, a collaborative effort involving Google, the Indian Institute of Science, and ARTPARK (Artificial Intelligence & Robotics Technology Park). The first phase of Project Vaani is complete; it produced an open-source database with over 14,000 hours of speech data, collected from 80,000 speakers across 80 districts in India. The overall goal of Project Vaani is to gather and transcribe 154,000 hours of anonymized speech data from all regions of India.

Currently, the project is in its second phase, expanding to cover 160 districts across the country. This expansion will further enrich the database and support the development of AI technologies for Indian languages.

In addition to Project Vaani, Google recently expanded its language capabilities in Google Translate. The company added 110 new languages, including five Indian languages, making it one of the largest updates ever. This expansion was achieved using PaLM 2, a transformer model designed to understand over 1,500 languages globally. The inclusion of these new languages helps bridge communication gaps for more than 600 million people.

Google is also working on developing a digital agri-stack, which could revolutionize agricultural practices in India. This system aims to provide farmers with access to loans, credit, affordable crop insurance, and various government subsidies. By leveraging data-driven solutions, this initiative could greatly improve the efficiency and effectiveness of agricultural support programs.

