- Santa Clara
Stars
A multilingual large voice generation model with full-stack support for inference, training, and deployment.
An automated pipeline for evaluating LLMs on role-playing.
A high-throughput and memory-efficient inference and serving engine for LLMs
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Materials for the Hugging Face Diffusion Models Course
Large Language Model Text Generation Inference
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
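📚 A summary of fundamental knowledge for C/C++ technical interviews, covering the language, libraries, data structures, algorithms, systems, networking, and linking and loading of libraries, along with interview experience, job postings, and referral information. This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, in…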
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
Algorithm-hardware Co-design for Deformable Convolution
Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators
A list of ICs and IPs for AI, Machine Learning and Deep Learning.
📐 Jekyll theme for building a personal site, blog, project documentation, or portfolio.
The CORE-V CVA6 is an Application class 6-stage RISC-V CPU capable of booting Linux