19 stars written in Python

🦜🔗 Build context-aware reasoning applications

Python 89,252 14,065 Updated Jul 17, 2024

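Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.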

Python 59,340 10,682 Updated Jul 6, 2024

Inference code for Llama models

Python 54,254 9,329 Updated Jul 13, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 34,840 5,357 Updated Jul 14, 2024

Write scalable load tests in plain Python 🚗💨

Python 24,232 2,924 Updated Jul 16, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,182 2,446 Updated Jul 15, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 14,482 1,109 Updated Jul 17, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,080 530 Updated Jul 16, 2024

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 8,771 790 Updated Jul 17, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,645 609 Updated Jul 25, 2023

Semantic Image Synthesis with SPADE

Python 7,575 987 Updated Aug 7, 2023

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,399 410 Updated Jun 22, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,138 155 Updated Jul 16, 2024

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,640 83 Updated Jan 21, 2024

Unofficial implementation of InstantID for ComfyUI

Python 1,239 71 Updated May 22, 2024

Serving multiple LoRA-finetuned LLMs as one

Python 897 40 Updated May 8, 2024

A text generation benchmarking platform

Python 861 202 Updated Jul 3, 2021

Unified Reinforcement Learning Framework

Python 598 59 Updated Jun 6, 2024

Implementing Recurrent Neural Network from Scratch

Python 464 152 Updated May 28, 2018