Skip to content
View zhiyxu's full-sized avatar

Block or report zhiyxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Meta's fleetwide profiler framework

C++ 39 6 Updated Aug 2, 2024

Learning eBPF, published by O'Reilly - out now! Here's where you'll find a VM config for the examples, and more

C 1,228 260 Updated Aug 19, 2024

⚡️SwanLab: your ML experiment notebook. 你的AI实验笔记本,跟踪与可视化你的机器学习全流程

Python 470 48 Updated Oct 20, 2024

Fast and memory-efficient exact attention

Python 13,812 1,273 Updated Oct 15, 2024

🔥 经典编程书籍大全,涵盖:计算机系统与网络、系统架构、算法与数据结构、前端开发、后端开发、移动开发、数据库、测试、项目与团队、程序员职业修炼、求职面试等

17,516 2,544 Updated Dec 3, 2023

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,024 670 Updated Oct 19, 2024

Kubernetes Scheduler for Deep Learning

Go 252 38 Updated May 22, 2022

we want to create a repo to illustrate usage of transformers in chinese

Shell 2,245 387 Updated Aug 18, 2024

基于《cuda编程-基础与实践》(樊哲勇 著)的cuda学习之路。

Cuda 235 51 Updated Jan 15, 2024
XSLT 113 8 Updated May 2, 2024

A robust web archive analytics toolkit

Cython 81 13 Updated Aug 30, 2024

Command-line JSON processor

C 30,378 1,576 Updated Oct 5, 2024

深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系[email protected] 版权所有,违权必究 Tan 2018.06

JavaScript 54,637 15,869 Updated Jun 26, 2024

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 4,535 253 Updated Aug 22, 2024

Numbers every LLM developer should know

4,089 139 Updated Jan 16, 2024

a unified scheduler for online and offline tasks

Go 441 73 Updated Oct 18, 2024
Python 937 132 Updated Oct 10, 2024

All-in-one text de-duplication

Python 607 69 Updated May 21, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,487 5,685 Updated Oct 20, 2024

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

Jupyter Notebook 2,133 379 Updated Sep 29, 2023

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 286 23 Updated Dec 20, 2023

BetterAndBetter 是一款包含很多功能的 macOS 软件。

AppleScript 513 31 Updated Jul 29, 2020

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,736 94 Updated Jan 21, 2024

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,546 349 Updated Oct 17, 2024

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 68,469 14,465 Updated May 10, 2024

Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

Python 3,559 256 Updated Oct 18, 2024

Retrieval and Retrieval-augmented LLMs

Python 7,192 522 Updated Oct 17, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,771 455 Updated May 3, 2024

Inference code for CodeLlama models

Python 15,959 1,852 Updated Aug 12, 2024

The Cloud-Native API Gateway

Lua 14,439 2,511 Updated Oct 19, 2024
Next