MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,862 174 Updated Sep 11, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,186 4,167 Updated Oct 11, 2024

Fast and memory-efficient exact attention

Python 13,719 1,257 Updated Oct 11, 2024

AirLLM: 70B inference with a single 4GB GPU

Jupyter Notebook 4,543 361 Updated Sep 25, 2024

TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API and RTC. In addition, TEN Agent also has vision and RAG capabilities.

Python 597 85 Updated Oct 11, 2024

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

C++ 1,420 97 Updated Aug 10, 2024

The Memory layer for your AI apps

Python 22,247 2,047 Updated Oct 11, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 10,637 2,115 Updated Oct 10, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 35,046 4,061 Updated Oct 10, 2024

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

C++ 7,050 2,218 Updated Oct 11, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 63,757 32,404 Updated Oct 7, 2024

A resource for learning about Machine Learning & Deep Learning

Python 7,535 2,684 Updated Aug 17, 2024

List of Chinese mainland mirrors

Makefile 159 13 Updated Oct 10, 2024

Tensor library for machine learning

C++ 10,981 1,011 Updated Oct 9, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 93,334 15,015 Updated Oct 10, 2024

LlamaIndex is a data framework for your LLM applications

Python 36,009 5,125 Updated Oct 10, 2024

BLAS-like Library Instantiation Software Framework

C 2,275 365 Updated Oct 10, 2024

Open weights LLM from Google DeepMind.

Python 2,424 306 Updated Sep 20, 2024

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 643 128 Updated Sep 15, 2024

A curated list of awesome End-to-End Autonomous Driving resources (continually updated)

381 19 Updated Aug 13, 2023

[CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Python 153 10 Updated Jun 29, 2023

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Jupyter Notebook 13,286 4,194 Updated Aug 19, 2024

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Python 133,217 26,605 Updated Oct 11, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,075 821 Updated Oct 3, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,805 2,583 Updated Sep 30, 2024

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

2,162 218 Updated Aug 15, 2024

A JAX-based simulator for autonomous driving research.

Python 832 93 Updated Mar 22, 2024

The Python package installer

Python 9,501 3,011 Updated Oct 8, 2024

LLM inference in C/C++

C++ 66,130 9,499 Updated Oct 11, 2024

OpenMMLab Computer Vision Foundation

Python 5,851 1,631 Updated Sep 26, 2024