Skip to content
View ryantd's full-sized avatar
🏎️
🏎️

Organizations

@kubeflow
Block or Report

Block or report ryantd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

57 stars written in C++
Clear filter

LLM inference in C/C++

C++ 62,450 8,959 Updated Jul 26, 2024

Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)

C++ 32,263 1,468 Updated Jul 26, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 29,682 3,501 Updated Jul 26, 2024

Cross-platform, customizable ML solutions for live and streaming media.

C++ 26,333 5,057 Updated Jul 25, 2024

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 25,849 8,689 Updated Jul 26, 2024

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 24,835 3,929 Updated Jun 22, 2024

Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences

C++ 19,041 585 Updated Jul 24, 2024

MLX: An array framework for Apple silicon

C++ 15,945 906 Updated Jul 25, 2024

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

C++ 13,993 3,410 Updated Jul 26, 2024

Development repository for the Triton language and compiler

C++ 12,083 1,441 Updated Jul 26, 2024

Tensor library for machine learning

C++ 10,427 966 Updated Jul 25, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 10,335 2,089 Updated Jul 18, 2024

A General-purpose Task-parallel Programming System using Modern C++

C++ 9,858 1,165 Updated Jul 24, 2024

Diablo devolved - magic behind the 1996 computer game

C++ 8,675 917 Updated Apr 17, 2024

cuDF - GPU DataFrame Library

C++ 8,080 875 Updated Jul 26, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,691 407 Updated Jul 15, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,674 834 Updated Jul 25, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,014 610 Updated Jul 25, 2024

Header-only C++/python library for fast approximate nearest neighbors

C++ 4,174 614 Updated Jul 25, 2024

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

C++ 4,173 689 Updated Jul 26, 2024

cuML - RAPIDS Machine Learning Library

C++ 4,070 525 Updated Jul 25, 2024

oneAPI Deep Neural Network Library (oneDNN)

C++ 3,539 973 Updated Jul 25, 2024

Go-style concurrency in C++11

C++ 3,161 756 Updated Jul 3, 2023

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,138 323 Updated May 16, 2023

Fast inference engine for Transformer models

C++ 3,090 274 Updated Jul 25, 2024

A distributed graph deep learning framework.

C++ 2,885 559 Updated Aug 19, 2023

CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.

C++ 2,287 210 Updated Jun 28, 2024

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 1,609 221 Updated Jul 25, 2024

A lightweight parameter server interface

C++ 1,523 542 Updated Jan 11, 2023

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,455 194 Updated Jun 12, 2023
Next