In a landmark step for Artificial Intelligence (AI) development in Southeast Asia, Cambodia and Singapore have signed a Memorandum of Understanding (MoU) to enhance the Khmer Language model using AI technologies.
The agreement was formalized yesterday at the Cambodia University of Technology and Science (CamTech University) between Chhem Siriwat, President of AI Forum Cambodia, and Darius Liu, Head of Strategy, Partnership, and Growth at AI Singapore. This collaboration is part of AI Singapore’s Southeast Asian Language is One Network (Sea-Lion) project, a family of open-source Large Language Models (LLM) designed to better understand the diverse languages, contexts, and cultures of the region.
The Sea-Lion project, under AI Singapore’s Products Pillar, seeks to address the challenges faced by under-represented population groups and low-resource languages in Southeast Asia. The development of the Khmer LLM represents a significant milestone in this mission.
Bridging Heritage and Innovation
During the signing ceremony, Chhem Siriwat highlighted the importance of the partnership, describing it as a step forward for Cambodia’s AI future.
“This partnership represents a pivotal step in the realization of the Sea-Lion project, a shared endeavor to develop the Khmer LLM for Cambodia’s future in AI,” Siriwat stated. “Together, we stand at the intersection of ancient heritage and cutting-edge technology, where our collective efforts will open the doors for innovation and opportunities across generations.”
Siriwat emphasized the cultural significance of the Khmer language, describing it as more than a means of communication but a legacy of creativity and resilience, exemplified by Cambodia’s rich heritage, including the Angkor Empire.
“Today, we honor this legacy by ensuring the richness of Khmer language becomes a cornerstone of AI’s future,” he added. “Through the Sea-Lion project, we embrace open development, a collaborative approach to building AI that benefits researchers, entrepreneurs, and educators alike.”
Laying the Foundation for Khmer AI
William Tjhi, Head of Applied Research for Foundation Models (ARF) at AI Singapore, explained that the initial phase of the project will focus on data collection and foundational development.
“In the case of Khmer, the base foundation itself is not strong enough to perform advanced instruction tasks. Therefore, we need to enhance the base capability of the AI,” Tjhi said.
He outlined strategies for gathering data, including converting PDF documents into text, transcribing speech and voice data, and other methods to expand the volume of raw Khmer data. These efforts will strengthen the AI’s foundational capacity, enabling more robust applications for the language.
The Khmer LLM project is expected to bridge linguistic and digital divides, empowering local communities while positioning Cambodia as a contributor to the global AI landscape.
This collaboration marks a crucial milestone for both nations, demonstrating the potential of regional partnerships to foster innovation and inclusivity in the evolving field of artificial intelligence.
Source: Khmer Times