Skip to content
View yongzhuo's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report yongzhuo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Pytorch-NLU Public

    Pytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词、抽取式文本摘要等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi labe…

    Python 313 50 Apache License 2.0 Updated Jul 18, 2024
  • 中文文本生成(NLG)之文本摘要(text summarization)工具包, 语料数据(corpus data), 抽取式摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。(graph,feature,topic model,summarize to…

    Python 405 53 MIT License Updated Jun 17, 2024
  • 中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN,…

    Python 1,741 404 MIT License Updated Jun 17, 2024
  • near-synonym, 基于大模型LLM的中文反义词/近义词(antonym/synonym)工具包.

    Python 4 Apache License 2.0 Updated May 29, 2024
  • LLM-SFT Public

    中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微调, 推理, 测评, 接口)等.

    Python 144 9 Apache License 2.0 Updated May 17, 2024
  • qwen2-sft Public

    Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理

    Python 31 2 Apache License 2.0 Updated May 17, 2024
  • LLaMA3-SFT Public

    LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)

    Python 14 6 Apache License 2.0 Updated May 17, 2024
  • gemma-sft Public

    Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)

    Python 25 4 Apache License 2.0 Updated May 17, 2024
  • Qwen-SFT Public

    阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理

    Python 59 7 Apache License 2.0 Updated May 17, 2024
  • InternLM-7B微调, SFT/LoRA, instruction finetune

    Python 12 Apache License 2.0 Updated May 17, 2024
  • 汉字字形/拼音/语义相似度(单字, 可用于数据增强, CSC错别字检测识别任务(构建混淆集)) Chinese character font/pinyin/semantic similarity (single character, can be used for data augmentation, CSC misclassified character detection and rec…

    Python 9 2 Apache License 2.0 Updated Feb 20, 2024
  • macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor

    Python 12 2 Apache License 2.0 Updated Nov 30, 2023
  • chatglm3-6b, 微调/LORA/推理/单机多卡/deepspeed/支持多轮对话

    Python 16 3 Apache License 2.0 Updated Nov 30, 2023
  • chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu

    Python 163 17 Apache License 2.0 Updated Aug 24, 2023
  • Llama2-SFT Public

    Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理

    Python 20 Apache License 2.0 Updated Jul 26, 2023
  • ChatGLM2-6B微调, SFT/LoRA, instruction finetune

    Python 105 11 Apache License 2.0 Updated Jul 19, 2023
  • tensorflow-transformer(tft) of pre-processing and post-processing of text-classification

    Python 1 1 Apache License 2.0 Updated Mar 24, 2023
  • Macadam Public

    Macadam是一个以Tensorflow(Keras)和bert4keras为基础,专注于文本分类、序列标注和关系抽取的自然语言处理工具包。支持RANDOM、WORD2VEC、FASTTEXT、BERT、ALBERT、ROBERTA、NEZHA、XLNET、ELECTRA、GPT-2等EMBEDDING嵌入; 支持FineTune、FastText、TextCNN、CharCNN、BiRN…

    Python 323 38 MIT License Updated Mar 24, 2023
  • Macropodus Public

    自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要,文本相似度,科学计算器,中文数字阿拉伯数字(罗马数字)转换,中文繁简转换,拼音转换。tookit(tool) of NLP,CWS(chinese word segnment),POS(Part-Of-Speech Tagging),NE…

    Python 653 94 MIT License Updated Mar 24, 2023
  • transformers-model of pytorch1.x to tensorflow2.x, deploy for tf-serving

    Python 1 MIT License Updated Dec 9, 2022
  • chinese document classification of layoutlmv3 and layoutxlm

    Python 34 5 Apache License 2.0 Updated Oct 25, 2022
  • web-demo Public

    web-demo of http and ui

    Python 1 Apache License 2.0 Updated Aug 5, 2022
  • 文本数据分析, Text-Analysis

    Python 5 5 Apache License 2.0 Updated Nov 1, 2021
  • 自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子…

    Python 1,517 394 MIT License Updated Sep 23, 2021
  • pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】

    12 Apache License 2.0 Updated Aug 22, 2021
  • 中文开放信息抽取系统, open-information-extraction-system, build open-knowledge-graph(SPO, subject-predicate-object) by pyltp(version==3.4.0)

    Python 8 1 Apache License 2.0 Updated Aug 2, 2021
  • Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(func_recursive),中文数字转阿拉伯数字(chinese to number),阿拉伯数字转汉语数字, HMM,…

    Python 23 15 MIT License Updated Apr 9, 2021
  • nni Public

    Forked from microsoft/nni

    An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

    Python MIT License Updated Aug 19, 2020
  • Kashgari Public

    Forked from BrikerMan/Kashgari

    Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

    Python Apache License 2.0 Updated Jul 5, 2020
  • leetcode一些热门题型的python代码,包括输入输出。leetcode of hot, which Includes input and output.

    Python 2 Updated May 4, 2020