The codebase for the "ALDi: Quantifying the Arabic Level of Dialectness of Text" paper accepted to EMNLP 2023.
-
Updated
Sep 15, 2024 - HTML
The codebase for the "ALDi: Quantifying the Arabic Level of Dialectness of Text" paper accepted to EMNLP 2023.
WIBARAB is a project in the field of Arabic dialectology. It consists of various regional sub-projects (four PhD projects) and a large database about bedouin-type dialects of Arabic. The Feature Database will be the main point of integrating the results of the sub-projects. In this repository we collect the primary data of the database in TEI/XML.
Add a description, image, and links to the arabic-dialects topic page so that developers can more easily learn about it.
To associate your repository with the arabic-dialects topic, visit your repo's landing page and select "manage topics."