The field of Natural Language Processing (NLP) has seen rapid advancements in recent years, leading to the development of sophisticated algorithms for text and speech processing in various languages, with a majority of the work focusing on English. The Semitic language group, which includes Arabic spoken by 422 million people, Amharic, Hebrew, Tigrinya, Syriac, and Maltese, presents unique linguistic challenges such as complex morphological processes and challenging syntax. To tackle these challenges, a robust and high-quality corpus linguistic infrastructure is necessary.
The Israeli Association of Human Language Technologies (IAHLT) is a non-profit organization made up of leading industry players, academia, and government agencies, with a focus on enhancing the understanding of Hebrew and Arabic in computer systems. The IAHLT is building a cutting-edge corpus linguistic infrastructure and providing open-source tools to facilitate research and development in NLP/GenAI/LLM and Speech technologies. Funding for the IAHLT is sourced from its members (1/3) and the Israel Innovation Authority and National Digital Israel Initiative (2/3).