A simple, consistent and extendable toolkit for IndicTrans2
-
Updated
Aug 27, 2024 - Python
A simple, consistent and extendable toolkit for IndicTrans2
Fine-tuned and compared 3 🤗 pre-trained Multilingual LLMs
Setu dashboard is a all-in-one streamlit application that allows users to provide feedback on the outputs of the setu data cleaning pipeline for @AI4Bharat
This repository contains Python implementations for processing multilingual text data, focusing on language classification and translation tasks. The project addresses two distinct tasks: language classification and English translation, each involving different complexities in the processing of text data.
Add a description, image, and links to the ai4bharat topic page so that developers can more easily learn about it.
To associate your repository with the ai4bharat topic, visit your repo's landing page and select "manage topics."