Skip to content
View Taring's full-sized avatar
Block or Report

Block or report Taring

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An open-source, high-performance SQL vector database built on ClickHouse.

C++ 745 33 Updated Jun 18, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 13,739 1,054 Updated Jun 28, 2024

Building a quick conversation-based search demo with Lepton AI.

TypeScript 7,440 954 Updated Jun 22, 2024

Unofficial implementation of InstantID for ComfyUI

Python 1,207 67 Updated May 22, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,571 401 Updated Jun 28, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 33,610 3,499 Updated Jun 11, 2024

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 18,141 1,878 Updated Jun 2, 2024

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,610 82 Updated Jan 21, 2024

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

330 16 Updated Apr 11, 2024

Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.

Python 8,660 771 Updated Jun 29, 2024

Serving multiple LoRA finetuned LLM as one

Python 883 40 Updated May 8, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,058 149 Updated Jun 12, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 34,307 5,260 Updated Jun 27, 2024

Inference code for Llama models

Python 54,071 9,301 Updated May 15, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,076 2,434 Updated Jun 24, 2024

纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行

C++ 3,179 320 Updated Jun 27, 2024

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,643 611 Updated Jul 25, 2023

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,350 403 Updated Jun 22, 2024

Unified Reinforcement Learning Framework

Python 586 59 Updated Jun 6, 2024

🦜🔗 Build context-aware reasoning applications

Python 88,228 13,825 Updated Jun 29, 2024

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,067 528 Updated Apr 19, 2024

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 58,372 10,586 Updated Jun 22, 2024

Semantic Image Synthesis with SPADE

Python 7,559 983 Updated Aug 7, 2023

A text generation benchmarking platform

Python 860 202 Updated Jul 3, 2021
C++ 2 Updated Sep 30, 2016

AirConcurrentMap is a fast, memory efficient Java ConcurrentNavigableMap implementation

Java 31 4 Updated Sep 15, 2017

HttpRunner 是一个开源的 API/UI 测试工具,简单易用,功能强大,具有丰富的插件化机制和高度的可扩展能力。

Go 4,006 1,273 Updated May 16, 2024

Write scalable load tests in plain Python 🚗💨

Python 24,111 2,919 Updated Jun 28, 2024

Implementing Recurrent Neural Network from Scratch

Python 461 152 Updated May 28, 2018

ACM_Website_2014_Develop

JavaScript 2 3 Updated Aug 16, 2014