Starred repositories
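A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, domain-specific fine-tuning and applications, datasets, and tutorials.
🐙 A comprehensive collection of guides, papers, lectures, notebooks, and resources on prompt engineering (automatically and continuously updated)
Learning paths and knowledge summaries for machine learning and deep learning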
Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misinformation", accepted by AI Magazine 2024
potato: portable text annotation tool
A repository of useful research and skill-upgrading talks and articles in the NLP/CV/AI area (in Chinese).
Accepted paper lists for several conferences (including AI, ML, and robotics)
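Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of unspaced English text, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …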
Machine learning program to identify when a news source may be producing fake news.
DomainWordsDict: a Chinese domain dictionary knowledge base covering 68 domains and 9.16 million words in total, usable for natural language processing applications such as text classification, knowledge enhancement, and domain vocabulary expansion.
Conditional GAN for generating synthetic tabular data.
hitchenwenhao521 / funNLP
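Forked from fighting41love/funNLP
Chinese and English sensitive words, language detection, domestic and international mobile/landline number location and carrier lookup, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, violence and terrorism word list, Traditional/Simplified Chinese conversion, English approximation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of unspaced English text, various Chinese word vectors, company name collection, classical poetry corpus, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …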
Compare to The Knowledge: Graph Neural Fake News Detection with External Knowledge (ACL 2021)
DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.
Code and Data for "Characterizing Multi-Domain False News on Weibo and the Underlying User Effects"
akkarimi / aeda_nlp
Forked from jasonwei20/eda_nlp
Data augmentation for NLP, accepted at EMNLP 2021 Findings
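PyTorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021
Natural language processing study notes: principles and examples of machine learning and deep learning based on the TensorFlow and PyTorch frameworks; detailed walkthroughs of recent pretrained models such as Transformer, BERT, and ALBERT and their source code; and using pretrained models for various NLP tasks. Model deployment.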
Global Encoding for Abstractive Summarization (ACL 2018)
Library for fast text representation and classification.
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
Contextual augmentation, a text data augmentation method using a bidirectional language model.