Skip to content
View dbuades's full-sized avatar

Organizations

@clinia
Block or Report

Block or report dbuades

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
28 stars written in C++
Clear filter

An Open Source Machine Learning Framework for Everyone

C++ 183,735 74,023 Updated Jun 22, 2024

The new Windows Terminal and the original Windows console host, all in the same place!

C++ 94,095 8,134 Updated Jun 22, 2024

LLM inference in C/C++

C++ 60,598 8,635 Updated Jun 22, 2024

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,730 6,810 Updated Oct 25, 2023

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

C++ 16,301 3,795 Updated Jun 22, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 13,308 2,727 Updated Jun 22, 2024

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 12,870 1,151 Updated Feb 6, 2024

Development repository for the Triton language and compiler

C++ 11,767 1,383 Updated Jun 22, 2024

Google's Operations Research tools:

C++ 10,720 2,080 Updated Jun 21, 2024

Bringing Characters to Life with Computer Brains in Unity

C++ 7,358 1,024 Updated May 16, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,250 780 Updated Jun 20, 2024

Fit interpretable models. Explain blackbox machine learning.

C++ 6,117 719 Updated Jun 22, 2024

A flexible, high-performance serving system for machine learning models

C++ 6,114 2,195 Updated Jun 18, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 4,970 609 Updated Jun 19, 2024

Head tracking software for MS Windows, Linux, and Apple OSX

C++ 3,432 430 Updated Jun 21, 2024

A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.

C++ 3,256 1,070 Updated Jun 22, 2024

A lightweight process isolation tool that utilizes Linux namespaces, cgroups, rlimits and seccomp-bpf syscall filters, leveraging the Kafel BPF language for enhanced security.

C++ 2,837 264 Updated Feb 14, 2024

Stan development repository. The master branch contains the current release. The develop branch contains the latest stable development. See the Developer Process Wiki for details.

C++ 2,540 367 Updated Jun 22, 2024

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 1,576 218 Updated Jun 22, 2024

🛰️ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.

C++ 1,205 48 Updated Jun 18, 2024

Fast Neural Machine Translation in C++

C++ 1,193 223 Updated Aug 25, 2023

Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing the workload and dividing the RAM usage.

C++ 999 68 Updated Jun 22, 2024

Puffer is a free live TV streaming website and a research study at Stanford using machine learning to improve video streaming

C++ 813 129 Updated May 20, 2024

Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure

C++ 708 306 Updated Jun 22, 2024

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

C++ 691 279 Updated Jun 21, 2024

TinyChatEngine: On-Device LLM Inference Library

C++ 607 57 Updated Jun 20, 2024

ggml implementation of BERT

C++ 443 56 Updated Feb 23, 2024

torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters in a single C++ process.

C++ 166 36 Updated Jun 20, 2024