MySQL database of China's five-level administrative divisions

1 Updated Jul 16, 2021

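A simple trie structure implemented in Python that supports adding, looking up, and deleting keywords; intended for keyword matching, stop-word removal, and similar tasks on Chinese text.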

Python 65 14 Updated Apr 29, 2020

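Free downloads of English-language magazines, including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats; updated weekly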

HTML 19,853 1,514 Updated Jul 12, 2024

Python library which implements the Ethereum Trie structure.

Python 104 49 Updated Apr 22, 2024

📙 Chinese Xinhua Dictionary database, covering xiehouyu (two-part allegorical sayings), idioms, words, and Chinese characters.

Python 10,771 2,534 Updated Dec 26, 2023

Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset (continuously updated)

Python 122 12 Updated Jul 3, 2024

The GUI for Milvus

TypeScript 1,056 109 Updated Jul 17, 2024

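A visual no-code/code-free web crawler/spider. 易采集: visual software for browser-automation testing, data collection, and crawling that lets you design and run crawler tasks graphically without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.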

JavaScript 30,268 3,566 Updated Jul 13, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 3,750 271 Updated Jul 17, 2024

Official release of InternLM2.5 7B base and chat models. 1M context support

Python 5,823 418 Updated Jul 17, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,325 266 Updated Jul 13, 2024

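FinQwen: a project that aims to build an open, stable, high-quality financial large language model, using large models to build an intelligent question-answering system for financial scenarios and using open source to advance "AI + finance".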

Jupyter Notebook 198 25 Updated Jun 11, 2024

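Legal-Eagle-InternLM is a legal question-answering bot based on InternLM (书生浦语), the large language model released by SenseTime and Shanghai AI Laboratory. It is a legal-domain large model intended to provide users with professional, intelligent, and comprehensive legal services that follow the 3H principles (Helpful, Honest, Harmless).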

Python 29 4 Updated Feb 26, 2024

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Python 3,011 234 Updated Apr 14, 2024

[COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset

Python 547 54 Updated Jun 19, 2023

A generative speech model for daily dialogue.

Python 27,820 3,020 Updated Jul 16, 2024

S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models

26 3 Updated Jun 24, 2024

pip install nb_http_client. nb_http_client claims to be the highest-performance HTTP client in Python's history, many times faster than any other request package.

Python 33 7 Updated May 28, 2024

ChatGPT DAN, Jailbreaks prompt

6,138 575 Updated Jul 10, 2024

Generates the large volumes of data needed by text-correction models such as Gector.

Python 14 1 Updated Jan 5, 2023

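A text and paper duplicate-checking system developed in C# and C++; checks against a paper library on the scale of 100 million characters within seconds. Related: duplicate-checking algorithms, data deduplication, document duplicate checking, text deduplication, bid-document duplicate checking, help preventing bid collusion, homework duplicate checking, duplicate check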

C# 393 106 Updated Mar 28, 2023

LERT: A Linguistically-motivated Pre-trained Language Model

Python 189 15 Updated Mar 29, 2023

Chinese text classification in natural language processing (using spam SMS detection as an example)

Python 16 3 Updated Jun 4, 2020

text correction papers

279 17 Updated Jan 23, 2024

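Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, and Weibo posts and comments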

Python 15,187 4,941 Updated Jul 15, 2024

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 26,514 3,278 Updated Jul 16, 2024

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 6,261 354 Updated Jul 16, 2024

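A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at lower cost, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.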

13,356 1,236 Updated Jul 15, 2024