@cisnlp

Deep NLP @ CIS - LMU

Deep Natural Language Processing Group at the Center for Information and Language Processing (CIS), University of Munich (LMU)

Popular repositories

  1. simalign Public

    Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

    Python · 343 stars · 47 forks

  2. Glot500 Public

    Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023

    Python · 96 stars · 3 forks

  3. GlotLID Public

    GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

    Python · 76 stars · 7 forks

  4. semi-markov-crf Public

    Code for paper "Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging"

    Python · 17 stars · 4 forks

  5. parcoure Public

    ParCourE - Parallel Corpus Explorer

    Python · 12 stars

  6. GlotScript Public

    GlotScript: A Resource and Tool for Low Resource Writing System Identification -- LREC 2024

    Python · 12 stars · 1 fork

Repositories

Showing 10 of 24 repositories
  • cisnlp.github.io Public

    Homepage of cisnlp

    SCSS 3 MIT 0 0 0 Updated Jun 17, 2024
  • GlotWeb Public

    GlotWeb: Web Indexing for Low-Resource Languages -- under construction.

    Python 5 CC0-1.0 0 0 0 Updated Jun 13, 2024
  • analogical_reasoning
    JavaScript 0 0 0 0 Updated Jun 13, 2024
  • GlotCC Public

    GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages

    Jupyter Notebook 9 CC0-1.0 0 0 0 Updated Jun 12, 2024
  • MaskLID Public

    MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024

    Python 3 MIT 1 0 0 Updated Jun 11, 2024
  • GlotScript Public

    GlotScript: A Resource and Tool for Low Resource Writing System Identification -- LREC 2024

    Python 12 MIT 1 0 0 Updated Jun 7, 2024
  • Taxi1500 Public
    Python 5 Apache-2.0 0 1 0 Updated May 31, 2024
  • TransMI Public

    TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data

    Python 4 0 0 0 Updated May 30, 2024
  • TransliCo Public

    TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models

    Python 4 0 0 0 Updated May 23, 2024
  • Spatial_Schemas
    JavaScript 1 0 0 0 Updated May 23, 2024
