Stars
Learn AI's role in addressing complex challenges. Build skills combining human and machine intelligence for positive real-world impact using AI.
Open neural machine translation models and web services
A list of awesome Machine Translation frameworks, libraries, software and papers
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
A tutorial about neural machine translation including tips on building practical systems
Facebook Low Resource (FLoRes) MT Benchmark
Robust Speech Recognition via Large-Scale Weak Supervision
Efficient Deep Learning Systems course materials (HSE, YSDA)
Benchmarking Neural Network Inference on Mobile Devices
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Penn CIS 5650 (GPU Programming and Architecture) Final Project
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
AI for all: Build the large graph of the language models
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Awesome LLMs on Device: A Comprehensive Survey
Research work on multimodal cognitive AI
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Awesome LLM compression research papers and tools.
(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Fast and memory-efficient exact attention
Fast inference from large language models via speculative decoding