The MBZUAI research initiative led by Dr. Thamar Solorio has received $1 million USD in funding from Google.org to develop high-performance AI systems tailored to the linguistic diversity of the Middle East and North Africa region.
Dr. Solorio serves as Vice Provost of Faculty Excellence and Advancement and Professor of Natural Language Processing at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). The funding supports a research framework designed to address what the initiative describes as the “data divide” — a structural gap that limits AI accuracy for speakers of underrepresented languages compared to English.
The Problem: AI Models Skewed Toward Western Languages
Current AI language models rely heavily on data from high-resource, Western languages. When applied to the MENA region, these models often fail to capture cultural nuance and technical precision. The region’s linguistic landscape includes Arabic in multiple dialects, alongside numerous other languages, each capable of expressing a single sentiment in a wide variety of ways.
Rather than adapting existing Western-built technologies, the MBZUAI research initiative establishes a native research framework grounded in the sociocultural and linguistic realities specific to the MENA region. This represents a shift in approach from localization to foundational development.
MBZUAI Research Initiative: Building Resource-Lean AI
A central component of the project focuses on developing “resource-lean” AI systems. As global AI models grow larger and more expensive to maintain, this initiative creates new training frameworks that require less manually annotated data and lower computational power. The goal is to lower barriers for local institutions, startups, and researchers who lack access to large-scale infrastructure or capital.
This efficiency-focused approach aims to enable sophisticated AI development across the MENA region without dependence on the resources typically available only to large technology companies.
Quotes from Key Stakeholders
“This funding allows us to take our research from an early exploratory phase to a level that can not only redefine the field, but lead to impact in people’s lives. This support is vital because it allows us to move beyond adaptation of high-resource models to linguistically grounded AI for MENA languages, which highlights a much-needed paradigm shift in the field.”
Dr. Thamar Solorio, Vice Provost of Faculty Excellence and Advancement and Professor of Natural Language Processing, MBZUAI
“We are happy to collaborate with MBZUAI, which is deeply rooted in advancing AI research and fostering regional academic talent. By focusing on low-resource languages in Large Language Models, we are progressing on the MENA AI Opportunity Initiative’s commitment to providing access to the most innovative AI technology in Arabic, its dialects and other languages spoken in the region.”
Yossi Matias, Vice President, Google, and Head of Google Research
Talent Development and Real-World Applications
Beyond technical research, the funding supports a new generation of postdoctoral and early-career researchers at MBZUAI. These researchers will receive sustained leadership development in a field that has historically seen limited representation from the MENA region.
The initiative targets practical applications in education, cultural preservation, and digital communication. By developing speech and language technologies tailored to specific community needs, the project aims to ensure broader participation in the evolution of artificial intelligence across the region.
The funding aligns with Google’s MENA AI Opportunity Initiative, which focuses on expanding access to AI tools in Arabic and other regional languages. Matias said the collaboration reflects Google’s goal to accelerate scientific discovery through cooperation that delivers real-world impact.