Skip to content
View rrkarim's full-sized avatar
🌊
🌊

Organizations

@ShareChat @derintelligence @geoopt
Block or Report

Block or report rrkarim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

YaFSDP: Yet another Fully Sharded Data Parallel

Python 806 37 Updated Jul 29, 2024

Open Platform for Embodied Agents

Python 186 13 Updated Aug 4, 2024

Code for QuaRot, an end-to-end 4-bit inference of large language models.

Python 229 17 Updated Jul 22, 2024

A Native-PyTorch Library for LLM Fine-tuning

Python 3,750 317 Updated Aug 9, 2024

Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians

C++ 494 33 Updated Jul 9, 2024

Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry, Materials Science and Biology

Python 5,330 1,646 Updated Aug 8, 2024
Python 1,722 53 Updated Jun 28, 2024

An open source, standard data file format for graph data storage and retrieval.

C++ 204 44 Updated Aug 9, 2024

MLX: An array framework for Apple silicon

C++ 16,103 914 Updated Aug 9, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 11,542 960 Updated Jul 5, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,806 172 Updated Aug 5, 2024

R-friendly threading in C++

C++ 54 5 Updated Feb 22, 2024

MSCCL++: A GPU-driven communication stack for scalable AI applications

C++ 200 30 Updated Aug 7, 2024

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,280 554 Updated May 25, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 24,471 3,525 Updated Aug 9, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,861 858 Updated Aug 8, 2024

Probabilistic Machine Learning: Advanced Topics

1,376 118 Updated Jun 27, 2024

(Asyncio OR Threadsafe) Google Cloud Client Library for Python

Python 265 89 Updated Aug 8, 2024

asyncio client for kafka

Python 1,103 226 Updated Jul 23, 2024

Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

61,571 6,382 Updated Jul 30, 2024

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

Python 677 176 Updated Aug 8, 2024

A YAML parser and emitter in C++

C++ 4,994 1,795 Updated Aug 6, 2024

A permissively licensed C and C++ Task Scheduler for creating parallel programs. Requires C++11 support.

C++ 1,682 140 Updated May 29, 2024

Inference Llama 2 in one file of pure 🔥

Mojo 2,087 140 Updated May 21, 2024

The Mojo Programming Language

Mojo 22,536 2,566 Updated Aug 8, 2024

Author's implementation of SIGGRAPH 2023 paper, "A Practical Walk-on-Boundary Method for Boundary Value Problems."

Cuda 49 4 Updated Oct 3, 2023

VRS is a file format optimized to record & playback streams of sensor data, such as images, audio samples, and any other discrete sensors (IMU, temperature, etc), stored in per-device streams of ti…

C++ 296 53 Updated Aug 8, 2024

3D Gaussian Splatting, reimagined: Unleashing unmatched speed with C++ and CUDA from the ground up!

C 870 73 Updated Dec 26, 2023

An open benchmarking platform for medical artificial intelligence using Federated Evaluation.

Python 140 27 Updated Aug 6, 2024

A scalable inference server for models optimized with OpenVINO™

C++ 650 203 Updated Aug 8, 2024
Next