- beijing
Stars
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
Foundational Models for State-of-the-Art Speech and Text Translation
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
An open-source chatgpt tool ecosystem where you can combine tools with chatgpt and use natural language to do anything.
A Github Action that executes jobs/commands on non-x86 cpu architectures (ARMv6, ARMv7, aarch64, s390x, ppc64le, riscv64) via QEMU
Apache Spark - A unified analytics engine for large-scale data processing
The source code used for self-supervised taxonomy expansion method TaxoExpan, published in WWW 2020
PyTorch deep learning projects made easy.
📄 🇨🇳 📃 论文阅读笔记(分布式系统、虚拟化、机器学习)Papers Notebook (Distributed System, Virtualization, Machine Learning)
Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.
Notes talking about the design and implementation of Apache Spark
Google Coding Competitions Solutions(Maybe) & Codes
🌟 Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)
📚 经典技术书籍 PDF 文件,持续更新...
It is open source ebook about TensorFlow kernel and implementation mechanism.
An annotated implementation of the Transformer paper.
The source code used for automatic taxonomy construction method HiExpan, published in KDD 2018
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification