WAXAL is an open-source speech database designed to support the development of voice-based artificial intelligence for African languages.

Google joins push to localise AI for African languages with speech database

Google has collaborated with African universities and research institutions to launch WAXAL, an open-source speech database designed to support the development of voice-based artificial intelligence for African languages. 

African institutions, including Makerere University in Uganda, the University of Ghana, Digital Umuganda in Rwanda, and the African Institute for Mathematical Sciences (AIMS), participated in the data collection for this initiative. The dataset provides foundational data for 21 Sub-Saharan African languages, including Hausa, Luganda, Yoruba, and Acholi.

WAXAL is designed to support the development of speech recognition systems, voice assistants, text-to-speech tools, and other voice-enabled applications across sectors such as education, healthcare, agriculture, and public services.

“This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages,” said Aisha Walcott-Bryant, Head of Google Research Africa.

WAXAL’s launch comes amid growing efforts across Africa to develop language technologies that reflect local cultures and realities. 

In September 2025, the Nigerian government unveiled N-ATLAS, an open-source language model that can recognise and transcribe speech and generate text in Yoruba, Hausa, Igbo, and Nigerian-accented English.

Similar initiatives are emerging in the private sector, where startups such as South Africa’s Lelapa AI are building tools like Vulavula, which offers speech recognition, translation, and sentiment analysis.

By making this speech dataset openly accessible, WAXAL provides the fuel for a growing wave of homegrown efforts to bring African languages into the digital age.

Although Sub-Saharan Africa is home to more than 2,000 languages, reports suggest that fewer than 5% of them have the resources needed for natural language processing (NLP), the field that enables computers to process and understand human language. This lack of representation in training datasets limits the effectiveness of speech recognition and text-to-speech systems for African users.

Developed over three years with funding and technical support from Google, WAXAL addresses a major gap in global AI development.

WAXAL provides speech data for 21 Sub-Saharan African languages, including Fulani (Fula), Hausa, Igbo, Ikposo (Kposo), Swahili, and Yoruba. The dataset contains more than 11,000 hours of speech drawn from nearly two million individual recordings. 

Under the project’s partnership model, contributing institutions retain ownership of the data they collected, while making it openly available to researchers and developers worldwide.

“For AI to have a real impact in Africa, it must speak our languages and understand our contexts,” Joyce Nakatumba-Nabende, Senior Lecturer at Makerere University’s School of Computing and Information Technology, said. 

“The WAXAL dataset gives our researchers the high-quality data they need to build speech technologies that reflect our unique communities.”
