Skip to content
View negrinho's full-sized avatar

Block or report negrinho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Model components of the Llama Stack APIs

Python 3,445 491 Updated Oct 9, 2024

[NeurlPS D&B 2024] Generative AI for Math: MathPile

Python 383 20 Updated Sep 27, 2024

Collaborative book Machine Learning Systems

TeX 1,013 128 Updated Oct 8, 2024

Efficient Triton Kernels for LLM Training

Python 3,152 167 Updated Oct 5, 2024

An interactive framework to visualize and analyze your AutoML process in real-time.

Python 70 11 Updated Oct 8, 2024

Open source hardware and software platform to build a small scale self driving car.

Python 3,132 1,292 Updated Sep 15, 2024

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)

Python 63 8 Updated Sep 23, 2024

An automated pipeline for evaluating LLMs for role-playing.

Python 124 3 Updated Sep 14, 2024

Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance. Accepted to ACL 2024.

Python 130 6 Updated Sep 2, 2024

Official inference repo for FLUX.1 models

Python 14,659 1,054 Updated Oct 8, 2024

AI wearables

C 3,527 414 Updated Oct 9, 2024

Utilities intended for use with Llama models.

Python 4,369 772 Updated Oct 8, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,017 337 Updated Oct 9, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,067 4,144 Updated Oct 9, 2024

Explorations into some recent techniques surrounding speculative decoding

Python 197 16 Updated Oct 9, 2023

Simple Byte pair Encoding mechanism used for tokenization process . written purely in C

C 119 3 Updated Jul 7, 2024

LLM101n: Let's build a Storyteller

29,236 1,600 Updated Aug 1, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,737 548 Updated Oct 2, 2024

Must-read Papers on LLM Agents.

1,721 93 Updated Sep 10, 2024

A self-organizing file system with llama 3

Jupyter Notebook 4,872 303 Updated Aug 9, 2024

The easiest way to use Agentic RAG in any enterprise

TypeScript 3,629 374 Updated Sep 25, 2024

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 167 16 Updated May 29, 2024

🙌 OpenHands: Code Less, Make More

Python 32,819 3,757 Updated Oct 9, 2024

A list of LLMs Tools & Projects

126 20 Updated Oct 1, 2024

A repo lists papers related to LLM based agent

Python 1,014 74 Updated Aug 1, 2024

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,654 312 Updated Sep 13, 2024

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,116 66 Updated Feb 14, 2024

Neural Networks: Zero to Hero

Jupyter Notebook 11,684 1,460 Updated Aug 18, 2024

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

1,082 74 Updated Sep 2, 2024

Reconquer the canvas: beautiful Tikz figures without clunky Tikz code

Python 375 34 Updated Nov 18, 2020
Next