Skip to content
View princenimo's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report princenimo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Learn AI's role in addressing complex challenges. Build skills combining human and machine intelligence for positive real-world impact using AI

Jupyter Notebook 13 15 Updated Jul 20, 2023

Open neural machine translation models and web services

Python 610 71 Updated Oct 7, 2024

A list of awesome Machine Translation frameworks, libraries, software and papers

170 23 Updated Jul 15, 2024

Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.

PHP 41 17 Updated Dec 19, 2023

A tutorial about neural machine translation including tips on building practical systems

Perl 371 76 Updated Nov 16, 2016

Facebook Low Resource (FLoRes) MT Benchmark

Python 698 122 Updated Nov 20, 2023

State-of-the-art LLM-based translation models.

Ruby 404 34 Updated Oct 7, 2024

A programming framework for agentic AI 🤖

C# 31,993 4,655 Updated Oct 16, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 69,630 8,203 Updated Sep 30, 2024

SLING - A natural language frame semantics parser

C++ 1,930 268 Updated Jan 22, 2021

Efficient Deep Learning Systems course materials (HSE, YSDA)

Jupyter Notebook 656 105 Updated Mar 20, 2024

Benchmarking Neural Network Inference on Mobile Devices

C++ 358 57 Updated Apr 10, 2023

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 630 43 Updated Sep 27, 2024

Penn CIS 5650 (GPU Programming and Architecture) Final Project

C++ 23 3 Updated Dec 11, 2023

LLM inference in C/C++

C++ 66,330 9,540 Updated Oct 16, 2024

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,926 1,163 Updated Oct 15, 2024

AI for all: Build the large graph of the language models

Python 234 20 Updated Jun 3, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,926 409 Updated Sep 6, 2024

Awesome LLMs on Device: A Comprehensive Survey

835 112 Updated Oct 8, 2024

research work on multimodal cognitive ai

Python 56 11 Updated Aug 28, 2024

papers of llm compression

7 Updated Mar 6, 2024

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 1,900 151 Updated Mar 27, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,431 183 Updated Oct 10, 2024

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python 288 28 Updated Oct 16, 2024

Awesome LLM compression research papers and tools.

1,127 68 Updated Oct 15, 2024

(ICML 2024) BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Python 181 13 Updated May 27, 2024

Fast and memory-efficient exact attention

Python 13,783 1,266 Updated Oct 15, 2024

Fast inference from large lauguage models via speculative decoding

Python 538 54 Updated Aug 22, 2024

Compartmental SEIR model

Python 9 4 Updated Nov 3, 2023
Next