Jeffwan

Jiaxin Shan Jeffwan

Software Engineer @ Bytedance

372 followers · 290 following

Bytedance
Seattle, WA

Achievements

x3 x3

Achievements

x3 x3

Highlights

Organizations

Stars

IBM / LLM-performance-prediction

Predict the performance of LLM inference services

Jupyter Notebook 14 Updated Jun 27, 2024

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda 640 26 Updated Sep 21, 2024

v6d-io / v6d

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

C++ 835 122 Updated Nov 12, 2024

AlibabaPAI / llumnix

Efficient and easy multi-instance LLM serving

Python 219 12 Updated Nov 22, 2024

volcengine / veTurboIO

A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files.

Python 13 3 Updated Jun 9, 2024

S-LoRA / S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,758 98 Updated Jan 21, 2024

spegel-org / spegel

Stateless cluster local OCI registry mirror.

Go 1,298 70 Updated Nov 22, 2024

ServerlessLLM / ServerlessLLM

Serverless LLM Serving for Everyone.

Python 358 33 Updated Nov 23, 2024

nianhuatiandi / Fast-Distributed-Inference-Serving-for-Large-Language-Models

Fast Distributed Inference Serving for Large Language Models

3 Updated Oct 18, 2023

kserve / modelmesh

Distributed Model Serving Framework

Java 154 64 Updated Oct 11, 2024

DataDog / watermarkpodautoscaler

Custom controller that extends the Horizontal Pod Autoscaler

Go 212 25 Updated Nov 16, 2024

lambda7xx / awesome-AI-system

paper and its code for AI System

216 13 Updated Aug 29, 2024

Hsword / SpotServe

SpotServe: Serving Generative Large Language Models on Preemptible Instances

102 8 Updated Feb 22, 2024

predibase / lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,211 145 Updated Nov 23, 2024

xlang-ai / OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,001 444 Updated Nov 18, 2024

letta-ai / letta

Letta (formerly MemGPT) is a framework for creating LLM services with memory.

Python 12,909 1,415 Updated Nov 23, 2024

BeachWang / DAIL-SQL

A efficient and effective few-shot NL2SQL method on GPT-4.

Python 440 71 Updated Jun 4, 2024

jbilcke-hf / ai-comic-factory

Generate comic panels using a LLM + SDXL. Powered by Hugging Face 🤗

TypeScript 1,058 220 Updated Oct 15, 2024

Zjh-819 / LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

2,653 169 Updated Nov 28, 2023

ACL2023-Retrieval-LM / ACL2023-Retrieval-LM.github.io

https://acl2023-retrieval-lm.github.io/

JavaScript 154 13 Updated Oct 18, 2023

InternLM / InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,495 460 Updated Nov 21, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 30,681 4,655 Updated Nov 23, 2024

CloudNativeGame / aigc-gateway

A user gateway that provides serverless AIGC experience.

Go 41 8 Updated Apr 17, 2024

huggingface / text-generation-inference

Large Language Model Text Generation Inference

Python 9,138 1,075 Updated Nov 22, 2024

conceptofmind / toolformer

Python 344 38 Updated Mar 10, 2023

eosphoros-ai / DB-GPT

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 13,798 1,858 Updated Nov 22, 2024

allegroai / clearml-serving

ClearML - Model-Serving Orchestration and Repository Solution

Python 137 40 Updated Aug 15, 2024

LLMFlow / LLMFlow

Easy, Fast, Secure and Cost-Efficient LLM Pipelines to generate GhatGPT-like private domain models and knowledgeable agents for your organization.

6 1 Updated May 25, 2023

chenfei-wu / TaskMatrix

Python 34,547 3,321 Updated Jan 6, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 95,212 15,443 Updated Nov 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiaxin Shan Jeffwan

Achievements

Achievements

Highlights

Organizations

Block or report Jeffwan

Stars

IBM / LLM-performance-prediction

efeslab / Nanoflow

v6d-io / v6d

AlibabaPAI / llumnix

volcengine / veTurboIO

S-LoRA / S-LoRA

spegel-org / spegel

ServerlessLLM / ServerlessLLM

nianhuatiandi / Fast-Distributed-Inference-Serving-for-Large-Language-Models

kserve / modelmesh

DataDog / watermarkpodautoscaler

lambda7xx / awesome-AI-system

Hsword / SpotServe

predibase / lorax

xlang-ai / OpenAgents

letta-ai / letta

BeachWang / DAIL-SQL

jbilcke-hf / ai-comic-factory

Zjh-819 / LLMDataHub

ACL2023-Retrieval-LM / ACL2023-Retrieval-LM.github.io

InternLM / InternLM

vllm-project / vllm

CloudNativeGame / aigc-gateway

huggingface / text-generation-inference

conceptofmind / toolformer

eosphoros-ai / DB-GPT

allegroai / clearml-serving

LLMFlow / LLMFlow

chenfei-wu / TaskMatrix

langchain-ai / langchain