-
Jožef Stefan Institute
- Ljubljana, Slovenia
- @TajaKuzman
- in/taja-kuzman
Block or Report
Block or report TajaKuzman
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
-
Parlamint-translation Public
A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XLM and CONLL-u files, linguistic…
-
A benchmark for evaluating robustness of automatic genre identification models to test their usability for the automatic enrichment of large text collections with genre information.
benchmarking benchmark text-classification genres-classification genre-identification cross-dataset-biasJupyter Notebook UpdatedJan 23, 2024 -
NER-recognition Public
An evaluation of various encoder Transformer-based large language models on the named entity recognition task. The models are compared on 6 datasets, manually-annotated with named entitites.
Jupyter Notebook UpdatedDec 28, 2023 -
-
Achademio Public
AI assistant, based on the GPT-3.5 model by OpenAI, designed to enhance your proficiency in writing research papers. Allows you to adapt your content to academic standards, transform bullet points …
-
semshift_esslli2023 Public
Forked from lmphcs/semshift_esslli2023Hands-on sessions for ESSLLI course "Computational approaches to semantic change detection"
Jupyter Notebook UpdatedAug 10, 2023 -
-
-
Training and evaluating topic classification models (fastText and Transformer-based language models) for topic classification of Slovenian news texts. The repository can be used as a tutorial to le…
-
Analysing different text representations for genre identification. I parse CONLL-u files and extract various representations of a text (running text, lemmas, part-of-speech), then train a Fasttext …
Jupyter Notebook UpdatedAug 18, 2022 -
GINCO-Genre-Annotation-Guidelines Public
Forked from spyysalo/annodocGenre Annotation Guidelines for GINCO corpora
-
Jupyter Notebook Updated
Jul 28, 2022 -
Hate-Speech-Classification Public
Classification of hate speech and implicitness of hate speech, using Transformer language models (BERT). This repository can be used as an introduction to text classification with BERT-like models.
Jupyter Notebook MIT License UpdatedJul 18, 2022 -
A ML web app which detect objectivity of the text
Jupyter Notebook MIT License UpdatedJun 2, 2022 -
machinetranslate.org Public
Forked from machinetranslate/machinetranslate.orgOpen resources and community for machine translation
HTML Creative Commons Attribution Share Alike 4.0 International UpdatedJun 2, 2022 -
tdm-notebooks Public
Forked from ithaka/constellate-notebooksExample notebooks and tutorials from Constellate, the text analysis service from ITHAKA.
Jupyter Notebook UpdatedMay 20, 2022 -
Transformers-GINCO-Experiments Public
Forked from 5roop/task5_webgenresJupyter Notebook UpdatedMar 10, 2022 -
-
notion_widgets Public
Forked from ShoroukAziz/notion_widgetsA set of HTML widgets that could be embedded into Notion.so https://www.notion.so/ pages. For more see https://blog.shorouk.dev/notion-widgets-gallery/
HTML UpdatedFeb 21, 2022