Skip to content
@IS2AI

ISSAI

Institute of Smart Systems and Artificial Intelligence

Popular repositories Loading

  1. Kazakh_TTS Kazakh_TTS Public

    An expanded version of the previously released Kazakh text-to-speech (KazakhTTS) synthesis corpus. In KazakhTTS2, the overall size has increased from 93 hours to 271 hours, the number of speakers h…

    Shell 112 21

  2. SpeakingFaces SpeakingFaces Public

    A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication, facial recognition, speech recognition, and human-computer …

    Python 75 8

  3. TurkicASR TurkicASR Public

    A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

    Python 54 7

  4. ISSAI_SAIDA_Kazakh_ASR ISSAI_SAIDA_Kazakh_ASR Public

    the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 …

    Shell 44 6

  5. TurkicTTS TurkicTTS Public

    A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Turkmen, Uyghur, and Uzbek.

    Python 42 3

  6. thermal-facial-landmarks-detection thermal-facial-landmarks-detection Public

    SF-TL54: Thermal Facial Landmark Dataset with Visual Pairs.

    Jupyter Notebook 36 6

Repositories

Showing 10 of 56 repositories
  • IS2AI/AnyFacePP’s past year of commit activity
    Python 0 GPL-3.0 0 0 0 Updated Jun 28, 2024
  • IS2AI/unified_multimodal_transformer’s past year of commit activity
    Python 0 1 0 0 Updated Jun 28, 2024
  • TurkicASR Public

    A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uyghur, and Uzbek.

    IS2AI/TurkicASR’s past year of commit activity
    Python 54 CC-BY-4.0 7 1 1 Updated Jun 26, 2024
  • COHI-O365 Public

    The most diverse in number of images/labels/classes fisheye synthetic dataset with source codes and models. As well as a benchmarking testing real dataset.

    IS2AI/COHI-O365’s past year of commit activity
    Python 1 MIT 0 0 0 Updated Jun 19, 2024
  • city-sustainability-indexes Public

    This repo contains code and models for detecting city sustainability indexes

    IS2AI/city-sustainability-indexes’s past year of commit activity
    Python 1 MIT 0 0 0 Updated Jun 11, 2024
  • HPE-depth-fisheye Public

    This project used synthetic data created using Nvidia Omniverse to train a camera-view invariant multi-pose HPE model for depth and fisheye cameras.

    IS2AI/HPE-depth-fisheye’s past year of commit activity
    0 MIT 0 0 0 Updated Jun 7, 2024
  • Central-Asian-Food-Dataset Public

    42 food classes from Kazakh National and Central Asian cuisine

    IS2AI/Central-Asian-Food-Dataset’s past year of commit activity
    Python 13 MIT 0 0 0 Updated Jun 6, 2024
  • Enhancing-Ambient-Assisted-Living-with-Multi-Modal-Vision-and-Language-Models Public

    This project is aimed at detecting the abnormal behaviour or emergency cases using vision-language model (VLM), large language model (LLM), human detection model, text-to-speech (TTS) and speech-to-text models (STT). The framework can detect the subtle sings of emergency and actively interact with the user to make an accurate decision.

    IS2AI/Enhancing-Ambient-Assisted-Living-with-Multi-Modal-Vision-and-Language-Models’s past year of commit activity
    0 0 0 0 Updated May 22, 2024
  • TatarSCR Public

    An Open-Source Speech Commands Dataset for the Tatar Language

    IS2AI/TatarSCR’s past year of commit activity
    Jupyter Notebook 0 Apache-2.0 0 0 0 Updated May 16, 2024
  • KazQAD Public

    An open-source Kazakh Question Answering Dataset

    IS2AI/KazQAD’s past year of commit activity
    2 CC-BY-SA-4.0 0 0 0 Updated Apr 23, 2024

Top languages

Loading…

Most used topics

Loading…