- Guilin【桂林】
- https://blog.csdn.net/rensihui
-
Pytorch-NLU Public
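Pytorch-NLU, a Chinese text classification and sequence labeling toolkit. It supports multi-class and multi-label classification of long and short Chinese texts, as well as sequence labeling tasks such as Chinese named entity recognition, part-of-speech tagging, word segmentation, and extractive text summarization.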
-
nlg-yongzhuo Public
A text summarization toolkit for Chinese text generation (NLG), with corpus data. Extractive summarization with Lead3, keyword, TextRank, TextTeaser, word significance, LDA, LSI, and NMF. (graph, feature, topic model, summarize to…
-
Keras-TextClassification Public
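Chinese long-text classification, short-sentence classification, multi-label classification, and two-sentence similarity (Chinese text classification with Keras NLP: multi-label or sentence classification, long or short); base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN,…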
-
near-synonym Public
near-synonym, a Chinese antonym/synonym toolkit based on large language models (LLMs).
-
LLM-SFT Public
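Chinese LLM fine-tuning (LLM-SFT), with the math instruction dataset MWP-Instruct. Supported models: ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B. Supports LoRA, QLoRA, DeepSpeed, UI, and TensorboardX, as well as fine-tuning, inference, evaluation, and an API.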
-
qwen2-sft Public
Qwen1.5-SFT (Alibaba): fine-tuning (transformers), LoRA (peft), and inference for Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat.
-
LLaMA3-SFT Public
LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct fine-tuning (transformers), LoRA (peft), and inference; supports Chinese (chinese, zh).
-
gemma-sft Public
Gemma-SFT, gemma-2b/gemma-7b fine-tuning (finetune, transformers), LoRA (peft), and inference.
-
Qwen-SFT Public
Alibaba Tongyi Qianwen (Qwen-7B-Chat/Qwen-7B): fine-tuning, LoRA, and inference.
-
InternLM-SFT Public
InternLM-7B fine-tuning, SFT/LoRA, instruction finetune
-
char-similar Public
Glyph, pinyin, and semantic similarity for Chinese characters (single characters; can be used for data augmentation and for CSC typo detection and recognition tasks, for example building confusion sets).
-
MacroGPT-Pretrain Public
Full pretraining of the macrogpt large model (1b3, 32 layers), with multi-GPU DeepSpeed or single-GPU Adafactor.
-
ChatGLM3-SFT Public
chatglm3-6b: fine-tuning, LoRA, inference, single-machine multi-GPU training, DeepSpeed, and multi-turn dialogue support.
-
chatglm-maths Public
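chatglm-6b fine-tuning, LoRA, PPO, and inference; the samples are automatically generated integer and decimal addition, subtraction, multiplication, and division problems; runs on GPU or CPU.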
-
Llama2-SFT Public
Llama2-SFT, Llama-2-7B fine-tuning (transformers), LoRA (peft), and inference.
-
ChatGLM2-SFT Public
ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune
-
Tft-Preprocess Public
tensorflow-transformer(tft) of pre-processing and post-processing of text-classification
-
Macadam Public
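Macadam is a natural language processing toolkit built on Tensorflow (Keras) and bert4keras, focused on text classification, sequence labeling, and relation extraction. It supports embeddings including RANDOM, WORD2VEC, FASTTEXT, BERT, ALBERT, ROBERTA, NEZHA, XLNET, ELECTRA, and GPT-2; it supports FineTune, FastText, TextCNN, CharCNN, BiRN…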
-
Macropodus Public
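Macropodus, a natural language processing tool based on an Albert+BiLSTM+CRF deep learning architecture: Chinese word segmentation, part-of-speech tagging, named entity recognition, new word discovery, keywords, text summarization, text similarity, scientific calculator, conversion between Chinese numerals and Arabic (or Roman) numerals, Traditional/Simplified Chinese conversion, and pinyin conversion. Toolkit (tool) for NLP, CWS (Chinese word segmentation), POS (part-of-speech tagging), NE…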
-
pytorch-model-to-tensorflow Public
transformers-model of pytorch1.x to tensorflow2.x, deploy for tf-serving
-
layoutlmv3-layoutxlm-chinese Public
chinese document classification of layoutlmv3 and layoutxlm
-
Text-Analysis Public
Text data analysis, Text-Analysis
-
nlp_xiaojiang Public
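Natural language processing (nlp): the Xiaojiang robot (retrieval-based chitchat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNET sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner, bert+bilstm+crf), data augmentation (text augment, data enhance), paraphrase and synonym generation, sentence…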
-
pytorch-loss Public
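PyTorch loss functions, adapted from the 科学空间 (Scientific Spaces) articles "Mitigating class imbalance through the idea of mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification".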
-
Chinese open information extraction system, open-information-extraction-system, builds an open knowledge graph (SPO, subject-predicate-object) with pyltp (version==3.4.0)
-
Tookit-Sihui Public
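Tookit-Sihui, a collection of common algorithms: AI mixed-text scientific calculator (calculator-sihui), sentence term frequency-inverse document frequency (TF-IDF), BM25 search, keyword search with a prefix tree (trietree), template matching with recursive functions (func_recursive), Chinese numeral to Arabic numeral conversion (chinese to number), Arabic numeral to Chinese numeral conversion, HMM,…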
-
nni Public
Forked from microsoft/nni
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.
Python MIT License Updated Aug 19, 2020
-
Kashgari Public
Forked from BrikerMan/Kashgari
Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.
Python Apache License 2.0 Updated Jul 5, 2020
-
leetcode-in-out Public
Python code for some popular LeetCode problem types, including input and output.