Skip to content
View hiro-v's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report hiro-v

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,236 205 Updated Oct 2, 2024

Llama3.1 learns to Listen

Python 151 5 Updated Sep 30, 2024
Python 886 86 Updated Sep 17, 2024

Generative AI extensions for onnxruntime

C++ 449 107 Updated Oct 2, 2024

Estimate Your LLM's Token Toll Across Various Platforms and Configurations

Python 28 4 Updated Aug 8, 2024

LLM training in simple, raw C/CUDA

Cuda 23,642 2,645 Updated Oct 2, 2024

Ikigai is an AI-powered Open Assignment System

TypeScript 26 Updated Aug 27, 2024

Development repository for the Triton language and compiler

C++ 12,911 1,565 Updated Oct 2, 2024

A natural language interface for computers

Python 52,441 4,627 Updated Sep 26, 2024

A platform for community discussion. Free, open, simple.

Ruby 41,963 8,279 Updated Oct 2, 2024

Grok open release

Python 49,461 8,329 Updated Aug 30, 2024

A SQLite extension for efficient vector search, based on Faiss!

C++ 1,706 62 Updated May 5, 2024

Next generation BLAS implementation for ROCm platform

C++ 339 161 Updated Oct 2, 2024

Examples using MLX Swift

Swift 935 97 Updated Sep 30, 2024

WebAssembly binding for llama.cpp - Enabling in-browser LLM inference

C++ 371 18 Updated Sep 25, 2024

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 179 25 Updated Sep 10, 2024

OpenAI compatible API for TensorRT LLM triton backend

Rust 154 25 Updated Aug 1, 2024

Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.

C++ 37 2 Updated Sep 26, 2024

Scheduling infrastructure for absolutely everyone.

TypeScript 31,653 7,700 Updated Oct 2, 2024

A privacy-first, open-source platform for knowledge management and collaboration. Download link: https://github.com/logseq/logseq/releases. roadmap: https://trello.com/b/8txSM12G/roadmap

Clojure 32,320 1,882 Updated Oct 2, 2024

:octocat: Browser extension that simplifies the GitHub interface and adds useful features

TypeScript 24,430 1,479 Updated Oct 2, 2024

A curated list of awesome remote jobs and resources. Inspired by https://github.com/vinta/awesome-python

29,273 3,464 Updated Aug 12, 2024

OBS Studio - Free and open source software for live streaming and screen recording

C 59,149 7,862 Updated Sep 27, 2024

Package conda environments for redistribution

Python 516 91 Updated Sep 25, 2024

Stable Diffusion with Core ML on Apple Silicon

Python 16,731 934 Updated Sep 18, 2024

Swift Package to implement a transformers-like API in Swift

Swift 657 70 Updated Oct 2, 2024

Everything we actually know about the Apple Neural Engine (ANE)

2,020 75 Updated Sep 23, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 709 40 Updated Sep 28, 2024

NVIDIA Federated Learning Application Runtime Environment

Python 612 174 Updated Oct 2, 2024
Next