- OneNine, founded by Senegalese entrepreneur Doudou Ba and Czech-Vietnamese engineer Duc Anh Tran, is building an AI infrastructure focused on African languages.
- The startup already works with over 160 native contributors and several data companies on pilot projects launched in August 2025.
- OneNine plans to process more than 500,000 hours of linguistic data to position itself as a global leader in underrepresented language datasets.
OneNine, a Sweden-based startup co-founded by Senegalese entrepreneur Doudou Ba and Czech-Vietnamese engineer Duc Anh Tran, is developing an artificial intelligence platform that understands and communicates in African languages.
The company emerged from a key observation: most AI systems are trained on dominant languages such as English, French, or Chinese, overlooking hundreds of languages spoken by millions across Africa. OneNine seeks to bridge this gap by collecting, sorting, annotating, and validating voice and text data in native African tongues.
#OneNine is the Data Supply Chain for AI.
— Doudou BA (@doudou_onenine) October 23, 2025
We provide production ready dataset to AI labs like @OpenAI @Meta @Google @AnthropicAI @xai @netflix @YouTube saving them 70-80 % FTE.
We are highly specialized in low resource languages and mission to make AI understand everyone.… pic.twitter.com/FxVt4aUHqN
Launched in August 2025, OneNine’s platform relies on a network of more than 160 African language contributors, supported by automated data-processing tools. The company has already initiated pilot projects with several linguistic data and research firms.
“Many people cannot read or write, but they can speak — maybe not in English, but in their mother tongue. We want AI to hear them,” said co-founder Doudou Ba.
Ba emphasized that Africa has a crucial role in shaping the future of AI. “The next frontier of AI will not depend solely on building more powerful models, but on creating richer and more diverse datasets. Africa, with its hundreds of languages, holds the world’s largest untapped data resource,” he added.
OneNine’s long-term goal is to become a global leader in linguistic data for underrepresented languages. The startup is building a pipeline estimated at over 500,000 hours of audio and text data.
The company recently joined the Google for Startups program and participated in Norrsken Africa Week, an event dedicated to innovation, entrepreneurship, and investment across the continent.
In the short term, OneNine plans to collaborate with major AI laboratories to refine its models and expand its reach. In the long term, it aims to establish the foundation for a truly inclusive artificial intelligence ecosystem that reflects the world’s linguistic diversity.
This article was initially published in French by Adoni Conrad Quenum
Adapted in English by Ange Jason Quenum