-
Insight Centre for Data Anaytics / University of Galway Library
- Galway, Ireland
- @ancatmara
- in/oksana-dereza
Block or Report
Block or report ancatmara
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A machine learning software for extracting information from scholarly documents
Let's build better datasets, together!
Parse JSON response of Amazon Textract
A deep learning toolkit specialized for handwritten document analysis
Convert between Tesseract hOCR and ALTO XML using XSL stylesheets
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
Tesseract Open Source OCR Engine (main repository)
Images of example pages from Transkribus model training sets to make it easier to find a match.
TensorFlow code and pre-trained models for BERT
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at CLEF 2020.
Public repository for Coptic SCRIPTORIUM Corpora Releases
Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)
BERT and ELECTRA models trained on Europeana Newspapers
Source code for the submissions to SIGTYP 2024, EvaLatin 2024, and AXOLOTL 2024 shared tasks
Repository for "Towards Robust Named Entity Recognition for Historic German"
Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a framework for Chinese language learners to explore Chinese.
Multilingual BERT model for Ancient and Historical Languages for SIGTYP Shared Task 2024
Code for the paper "Language Models are Unsupervised Multitask Learners"
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
Official style files for papers submitted to venues of the Association for Computational Linguistics
danielhers / semeval-ucca
Forked from bethard/semeval-codalabSample CodaLab competition for use as a template for SemEval tasks
Curated list of valuable salary negotiation advice.