T5 onnxruntime cpp

C++ 2 Updated Jul 28, 2024

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 53 4 Updated Jul 11, 2024

Question answering system for PDF files

Python 576 292 Updated Oct 30, 2023

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 61,055 30,848 Updated Aug 20, 2024

A library & tools to evaluate predictive language models.

Python 63 14 Updated Aug 9, 2023

📝 A text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

Python 10,406 1,816 Updated Jun 16, 2024

Rust libraries and programs focused on succinct data structures

Rust 112 8 Updated Aug 22, 2024

Modern spell checking library - accurate, fast, multi-language

C++ 600 99 Updated May 23, 2024

Tutorial for Porting PyTorch Transformer Models to Candle (Rust)

Rust 227 12 Updated Jul 22, 2024

SCOWL (and friends).

Python 376 87 Updated Jul 25, 2024

Typo Detector using Transformers ⚡.

Python 8 Updated Jul 2, 2024

A modern C++ header only SQLite3 wrapper

C++ 7 2 Updated Nov 30, 2023

SQLite3++ - C++ wrapper of SQLite3 API

C++ 593 175 Updated Nov 12, 2023

MLX: An array framework for Apple silicon

C++ 16,226 925 Updated Aug 22, 2024

This repository contains the official implementation of the research paper, "An Improved One millisecond Mobile Backbone".

Swift 716 61 Updated Jul 25, 2022

C++ project structure with Google Test (gtest) example using CMake as build system.

C++ 111 23 Updated May 14, 2023

[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning

Python 439 80 Updated Mar 29, 2024

Transformer related optimization, including BERT, GPT

C++ 5,727 882 Updated Mar 27, 2024

Typing Assistant provides the ability to autocomplete words and suggests predictions for the next word. This makes typing faster, more intelligent and reduces effort.

CSS 51 15 Updated Dec 27, 2017

NeuSpell: A Neural Spelling Correction Toolkit

Python 655 100 Updated Jul 31, 2023

Prune a model while finetuning or training.

Jupyter Notebook 391 57 Updated Jun 21, 2022

Group Fisher Pruning for Practical Network Compression (ICML 2021)

Python 150 15 Updated May 24, 2023

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 3,777 1,161 Updated Aug 21, 2024

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Python 1,794 101 Updated Nov 30, 2023

Inference code for Llama models

Python 55,188 9,410 Updated Aug 18, 2024

Bolt is a deep learning library with high performance and heterogeneous flexibility.

C++ 906 158 Updated Jul 30, 2024

Android Keyboard with 180+ dictionaries. Support swipe input (sliding input), Emoji keyboard, AI predictions, dictionaries downloading, and keyboard themes.

Java 196 75 Updated May 27, 2020

oneAPI Threading Building Blocks (oneTBB)

C++ 5,544 1,005 Updated Aug 22, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,460 1,481 Updated Aug 22, 2024

A simple framework for mobile system design interviews

4,043 423 Updated Aug 9, 2024