Singapore, September 10, 2024 – In a groundbreaking partnership, Sony Research and AI Singapore (AISG) have signed a memorandum of understanding (MOU) to co-develop large language models (LLMs) specifically tailored to Southeast Asian languages. This collaborative effort will kick off with a focus on the Tamil language and the SEA-LION (Southeast Asian Languages In One Network) family of models, aiming to ensure that AI technologies are more globally representative.
Sony Research, through its AI division, will work closely with AISG to refine and enhance the SEA-LION models, particularly in their performance for Tamil and other Southeast Asian languages. This collaboration seeks to address the significant gap in LLM representation for Southeast Asia, where over a thousand languages are spoken. The research will incorporate best practices from both organizations, leveraging Sony Research’s expertise in content analysis, speech generation, and recognition, with a particular focus on Indian languages, including Tamil.
Linguistic Diversity and AI: Addressing a Global Challenge
As the AI landscape continues to evolve, there has been an increasing need to create models that can cater to the linguistic diversity of the world. Tamil, a language spoken by an estimated 60-85 million people globally, serves as an important first step in addressing this gap. The collaboration between Sony Research and AISG marks a pivotal moment for AI development in Southeast Asia, as it aims to push the boundaries of how LLMs can be used to support underrepresented languages.
“Access to LLMs that address the global landscape of language and culture has been a barrier to driving research and developing new technologies that are representative and equitable for the global populations we serve,” said Hiroaki Kitano, President of Sony Research. “As a global company, diversity and localization are vital forces. In Southeast Asia specifically, there are more than a thousand different languages spoken by the citizens of the region. This linguistic diversity underscores the importance of ensuring AI models and tools are designed to support the needs of all populations around the world. We look forward to our collaboration with AISG and the potential to make AI work for everyone.”
Also Read: Vietjet Posts Stellar H1 2024 Results, Reports Significant Growth in Revenue and Profit
Boosting AI Innovation in Southeast Asia
AI Singapore’s involvement in the SEA-LION project will provide critical support in testing and refining the models for various Southeast Asian languages. The organization’s expertise in LLM development and regional knowledge will be instrumental in optimizing the models for linguistic and cultural contexts.
“AI Singapore is excited to collaborate with Sony Research in this groundbreaking partnership. The integration of the SEA-LION model, with its Tamil language capabilities, holds great potential to boost the performance of new solutions. We are particularly eager to contribute to the testing and refinement of the SEA-LION models for Tamil and other Southeast Asian languages, while also sharing our expertise and best practices in LLM development. We look forward to seeing how this collaboration will drive innovation in multilingual AI technologies,” said Leslie Teo, Senior Director of AI Products, AI Singapore.
With an initial focus on Tamil, the partnership between Sony Research and AI Singapore is set to pave the way for more inclusive AI solutions that cater to the linguistic needs of Southeast Asia. The two organizations are committed to ensuring that the benefits of AI reach all communities by creating technology that truly represents global diversity. For more information on AI Singapore, please visit https://www.aisingapore.org.