- Woodinville, WA
Lists (2)
Sort Name ascending (A-Z)
Stars
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Interact with your documents using the power of GPT, 100% privately, no data leaks
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
The simplest, fastest repository for training/finetuning medium-sized GPTs.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Convert PDF to markdown quickly with high accuracy
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Retrieval and Retrieval-augmented LLMs
Supercharge Your LLM Application Evaluations 🚀
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
min(DALL·E) is a fast, minimal port of DALL·E Mini to PyTorch
🔮 A refreshing functional take on deep learning, compatible with your favorite libraries
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
A fast, efficient universal vector embedding utility package.
Collective Knowledge (CK, CM, CM4MLOps and CMX) is an educational project to learn how to run AI, ML and other emerging workloads in the most efficient and cost-effective way across diverse models,…
Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"
Constrained Decoding for LLMs against JSON Schema
A large-scale simulation framework for LLM inference
Common utilities for ONNX converters
A comprehensive deep dive into the world of tokens
Excel spreadsheet crawler and table parser for data extraction and querying