Stars
CHisIEC: An Information Extraction Corpus for Ancient Chinese History
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A joint Chinese word segmenter and POS tagger based on bidirectional GRU-CRF
An Open-Source Package for Neural Relation Extraction (NRE)
Tokenizer, POS-tagger, and Dependency-parser for Classical Chinese
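ctbparser is an open-source Chinese language processing toolkit implemented in C++ (GBK encoding) for word segmentation, POS tagging, and dependency parsing, following the Chinese Penn Treebank (Chinese Tree Bank, CTB) standard.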
PyTorch Implementation of Our NAACL 2021 Paper "Incorporating Syntax and Semantics in Coreference Resolution with Heterogeneous Graph Attention Network"
A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text
NER built on the original BERT-BILSTM-CRF model, fused with GCN, POS tags, and other features
Named entity recognition with graph convolutional neural networks
Implementation of Graph Convolutional Networks in TensorFlow
Graph Convolutional Networks for Text Classification. AAAI 2019
A Benchmark for Classical Chinese Based on a Crowdsourcing System.
A PyTorch implementation of the BI-LSTM-CRF model.
Simple Solution for Multi-Criteria Chinese Word Segmentation
Resources for the MRQA 2019 Shared Task
Convert BERT TF checkpoint to PyTorch
Chinese and English NER datasets, English-Chinese machine translation datasets, and Chinese word segmentation datasets
The pkuseg toolkit for multi-domain Chinese word segmentation
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
This repo contains the code for the ACL 2020 paper "Coreference Resolution as Query-based Span Prediction"
Jiayan (甲言), the first NLP toolkit designed for Classical Chinese, supports lexicon construction, tokenization, POS tagging, sentence segmentation, and punctuation.