Google adds Kikuyu and Luo to its WAXAL speech dataset

Google has expanded its WAXAL speech dataset to include Luo, Kikuyu, and Luganda, aiming to improve AI understanding of African languages and bridge the digital divide. This move will accelerate the development of voice-enabled AI tools for millions of African-language speakers previously excluded from these technologies.

The WAXAL dataset features 1,250 hours of transcribed natural speech and over 20 hours of high-quality studio recordings, providing a strong foundation for building reliable language technologies. Google’s goal is to enhance access to AI-powered tools in East Africa, including voice assistants, speech-to-text services, educational platforms, and digital public services.

“The ultimate impact of WAXAL is the empowerment of people in Africa,” says Aisha Walcott-Bryant, Head of Google Research Africa. “This dataset provides the critical foundation for students, researchers, and entrepreneurs to build technology on their own terms, in their own languages, finally reaching over 100 million people.”

The inclusion of these languages will transform sectors like education, agriculture, and healthcare by delivering information in local languages, especially in communities with limited English proficiency.

Taking its name from the Wolof word for “speak,” the WAXAL dataset was developed over three years to empower researchers and drive the development of inclusive technology across Africa.
