Skip to content
View ydeng117's full-sized avatar

Block or report ydeng117

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 93,020 7,340 Updated Oct 12, 2024

一个 GPT-2 模型。致谢:https://github.com/Morizeyao/GPT2-Chinese

11 3 Updated Apr 22, 2021

code for piccolo embedding model from SenseTime

Python 98 5 Updated May 21, 2024

Minimal keyword extraction with BERT

Python 3,485 345 Updated Jul 16, 2024

TextNet: A deep neural network framework for text matching

C++ 107 38 Updated May 15, 2019

Unofficial API for zhihu.

JavaScript 264 52 Updated Jul 16, 2017

获取知乎内容信息,包括问题,答案,用户,收藏夹信息

Python 2,289 858 Updated Feb 8, 2022

[不再维护] 后继者 zhihu-oauth https://github.com/7sDream/zhihu-oauth 已被 DMCA,亦不再开发,仅提供代码存档:

Python 1,038 358 Updated Sep 17, 2016

INFO-SPIDER 是一个集众多数据源于一身的爬虫工具箱🧰,旨在安全快捷的帮助用户拿回自己的数据,工具代码开源,流程透明。支持数据源包括GitHub、QQ邮箱、网易邮箱、阿里邮箱、新浪邮箱、Hotmail邮箱、Outlook邮箱、京东、淘宝、支付宝、中国移动、中国联通、中国电信、知乎、哔哩哔哩、网易云音乐、QQ好友、QQ群、生成朋友圈相册、浏览器浏览历史、12306、博客园、CSDN博客…

Python 7,808 1,490 Updated Aug 20, 2024

一个简单的知乎爬虫,支持多账户、多线程、爬取代理、中断恢复,数据通过API获取。

Python 14 Updated May 9, 2018

Zhihu API for Humans

Python 966 256 Updated Aug 6, 2021

An R package for Keyword Assisted Topic Models

R 100 13 Updated Sep 14, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,011 1,742 Updated Oct 12, 2024

GPT Meet Zotero.

TypeScript 5,033 200 Updated Sep 23, 2024

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Python 22,284 1,666 Updated Sep 26, 2024

中文常用停用词表(哈工大停用词表、百度停用词表等)

4,621 2,220 Updated Jan 25, 2024

常用中文停用词表及对比

63 45 Updated Feb 20, 2019

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,947 3,897 Updated Oct 2, 2024

An R package for the Quantitative Analysis of Textual Data

R 840 188 Updated Oct 6, 2024

Estimate life table survivorship from orphanhood and vice versa

Stata 2 Updated Sep 20, 2021

Code for the paper "Forecasting of cohort fertility under a hierarchical Bayesian approach" by Joanne Ellison, Erengul Dodd and Jonathan J. Forster

R 1 1 Updated Nov 2, 2023

GATHER compliant code for the GBD

R 1 Updated Apr 15, 2024

人民网-习近平系列重要讲话爬虫,Java简单爬虫,Jsoup爬虫

Java 8 Updated Aug 1, 2018

人民日报(1946-2023)、习近平系列重要讲话数据库

45 2 Updated May 4, 2023

Tools for multimodal and multilevel network analysis

HTML 39 7 Updated Sep 4, 2024

北京理工大学硕士博士研究生学位论文 LaTeX 模板 — 非官方

TeX 8 Updated Nov 25, 2022

[📂内容存档]习近平系列重要讲话内容

HTML 6 1 Updated Mar 25, 2019

《统计学习方法》的代码实现

Jupyter Notebook 1 Updated Aug 22, 2023

《统计学习方法》的代码实现

Jupyter Notebook 18,905 6,284 Updated Aug 22, 2023

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 33,218 7,640 Updated Jul 23, 2024
Next