- NCKU/FZU/MCU
- China
- eternalfeather.github.io
Stars
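Chinese LLM capability evaluation leaderboard: currently covers 128 large models, including commercial models such as chatgpt, gpt-4o, Google gemini, Baidu ERNIE Bot, Alibaba Tongyi Qianwen, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as open-source models such as qwen2.5, llama3.1, glm4, InternLM2.5, openbuddy, and AquilaChat. It provides not only a capability score leaderboard but also the raw outputs of all models.
Chinese personal-name corpus and name generator. Covers Chinese full names, surnames, given names, forms of address, Japanese names, transliterated names, and English names. Can be used for Chinese word segmentation and person-name entity recognition.
Mainly a collection of good articles I have come across in my day-to-day reading, kept for my own reference and shared with everyone. Articles I have read get a brief commentary; those I have not read yet are listed first, and the commentary is added once I have read them. Topics cover search, recommendation, and natural language processing.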
Pre-trained Chinese ELECTRA models
An Open-source Neural Hierarchical Multi-label Text Classification Toolkit
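Curated public resources for Chinese medical NLP: terminologies, corpora, word embeddings, pre-trained models, knowledge graphs, named entity recognition, QA, information extraction, models, papers, etc.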
Deep learning model zoo with TensorFlow 2.X (& Keras)
A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, large-scale Chinese pre-trained ALBERT models
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …
Chinese and English NER datasets, English-Chinese machine translation datasets, and Chinese word segmentation datasets.
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Facilitating the design, comparison and sharing of deep text matching models.
Datasets and SOTA results for every field of Chinese NLP
Unsupervised text tokenizer for Neural Network-based text generation.
Neural machine translation and sequence learning using TensorFlow
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
Applications of deep learning in recommender systems, with a summary of related papers.
High level Python client for Elasticsearch
Word2vec per-user personalized search + Scrapy2.3.0 (crawls data) + ElasticSearch7.9.1 (stores data and exposes an external RESTful API) + Django3.1.1 search
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
Ongoing research training transformer models at scale
Easy-to-use word-to-word translations for 3,564 language pairs.