Skip to content
View dbuades's full-sized avatar

Organizations

@clinia

Block or report dbuades

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
29 stars written in C++
Clear filter

An Open Source Machine Learning Framework for Everyone

C++ 185,313 74,151 Updated Sep 4, 2024

The new Windows Terminal and the original Windows console host, all in the same place!

C++ 94,835 8,212 Updated Sep 3, 2024

LLM inference in C/C++

C++ 64,424 9,218 Updated Sep 3, 2024

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,760 6,800 Updated Oct 25, 2023

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 16,506 3,821 Updated Sep 3, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 14,021 2,830 Updated Sep 4, 2024

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 13,059 1,152 Updated Jul 29, 2024

Development repository for the Triton language and compiler

C++ 12,444 1,506 Updated Sep 4, 2024

Google's Operations Research tools:

C++ 10,954 2,094 Updated Sep 3, 2024

PS4 emulator for Windows,Linux,MacOS

C++ 8,222 396 Updated Sep 3, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,079 890 Updated Sep 3, 2024

Bringing Characters to Life with Computer Brains in Unity

C++ 7,778 1,055 Updated Jul 23, 2024

Fit interpretable models. Explain blackbox machine learning.

C++ 6,196 726 Updated Sep 3, 2024

A flexible, high-performance serving system for machine learning models

C++ 6,156 2,190 Updated Sep 4, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,053 615 Updated Sep 2, 2024

Head tracking software for MS Windows, Linux, and Apple OSX

C++ 3,573 443 Updated Aug 6, 2024

A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.

C++ 3,399 1,112 Updated Sep 3, 2024

A lightweight process isolation tool that utilizes Linux namespaces, cgroups, rlimits and seccomp-bpf syscall filters, leveraging the Kafel BPF language for enhanced security.

C++ 2,913 272 Updated Jul 29, 2024

Stan development repository. The master branch contains the current release. The develop branch contains the latest stable development. See the Developer Process Wiki for details.

C++ 2,570 368 Updated Aug 26, 2024

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 1,636 223 Updated Sep 3, 2024

Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.

C++ 1,328 89 Updated Aug 10, 2024

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,266 51 Updated Aug 30, 2024

Fast Neural Machine Translation in C++

C++ 1,217 227 Updated Aug 25, 2023

Puffer is a free live TV streaming website and a research study at Stanford using machine learning to improve video streaming

C++ 832 130 Updated Aug 4, 2024

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 736 313 Updated Sep 4, 2024

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

C++ 698 282 Updated Sep 3, 2024

TinyChatEngine: On-Device LLM Inference Library

C++ 682 67 Updated Jul 4, 2024

ggml implementation of BERT

C++ 459 56 Updated Feb 23, 2024

torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.

C++ 173 35 Updated Jun 20, 2024