Renato Negrinho (negrinho)

Applied Scientist @ AWS Bedrock

93 followers · 85 following

Starred repositories

meta-llama / llama-stack

Model components of the Llama Stack APIs

Python 3,445 491 Updated Oct 9, 2024

GAIR-NLP / MathPile

[NeurIPS D&B 2024] Generative AI for Math: MathPile

Python 383 20 Updated Sep 27, 2024

harvard-edge / cs249r_book

Collaborative book Machine Learning Systems

TeX 1,013 128 Updated Oct 8, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 3,152 167 Updated Oct 5, 2024

automl / DeepCAVE

An interactive framework to visualize and analyze your AutoML process in real-time.

Python 70 11 Updated Oct 8, 2024

autorope / donkeycar

Open-source hardware and software platform to build a small-scale self-driving car.

Python 3,132 1,292 Updated Sep 15, 2024

thunlp / Ouroboros

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)

Python 63 8 Updated Sep 23, 2024

boson-ai / RPBench-Auto

An automated pipeline for evaluating LLMs for role-playing.

Python 124 3 Updated Sep 14, 2024

whyNLP / LCKV

Layer-Condensed KV cache with 10 times larger batch size, fewer params, and less computation. Dramatic speedup with better task performance. Accepted to ACL 2024.

Python 130 6 Updated Sep 2, 2024

black-forest-labs / flux

Official inference repo for FLUX.1 models

Python 14,659 1,054 Updated Oct 8, 2024

BasedHardware / omi

AI wearables

C 3,527 414 Updated Oct 9, 2024

meta-llama / llama-models

Utilities intended for use with Llama models.

Python 4,369 772 Updated Oct 8, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,017 337 Updated Oct 9, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,067 4,144 Updated Oct 9, 2024

lucidrains / speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Python 197 16 Updated Oct 9, 2023

ash-01xor / bpe.c

Simple byte pair encoding mechanism used for tokenization, written purely in C.

C 119 3 Updated Jul 7, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

29,236 1,600 Updated Aug 1, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,737 548 Updated Oct 2, 2024

zjunlp / LLMAgentPapers

Must-read Papers on LLM Agents.

1,721 93 Updated Sep 10, 2024

iyaja / llama-fs

A self-organizing file system with llama 3

Jupyter Notebook 4,872 303 Updated Aug 9, 2024

ragapp / ragapp

The easiest way to use Agentic RAG in any enterprise

TypeScript 3,629 374 Updated Sep 25, 2024

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 167 16 Updated May 29, 2024

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 32,819 3,757 Updated Oct 9, 2024

PetroIvaniuk / llms-tools

A list of LLMs Tools & Projects

126 20 Updated Oct 1, 2024

AGI-Edgerunners / LLM-Agents-Papers

A repo that lists papers related to LLM-based agents

Python 1,014 74 Updated Aug 1, 2024

google-research / timesfm

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 3,654 312 Updated Sep 13, 2024

hao-ai-lab / LookaheadDecoding

[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Python 1,116 66 Updated Feb 14, 2024

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook 11,684 1,460 Updated Aug 18, 2024

ibm-granite / granite-code-models

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

1,082 74 Updated Sep 2, 2024

negrinho / sane_tikz

Reconquer the canvas: beautiful TikZ figures without clunky TikZ code

Python 375 34 Updated Nov 18, 2020

Starred topics

tikz

hyperparameter-optimization