- Santo Domingo
- https://gabrielebaez.github.io/
Starred repositories
A data visualization and analytics component, especially well-suited for large and/or streaming datasets.
Python library for solving reinforcement learning (RL) problems using generative models.
Code for "SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning" by Zolman et al.
Dataset Crafting and Efficient Fine-Tuning Using Only Free Open-Source Tools
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
🤗 LeRobot: End-to-end Learning for Real-World Robotics in PyTorch
A minimal GPU design in Verilog to learn how GPUs work from the ground up
A collection of resources that showcase the intersection of simulation and LLM-agents.
A library for building software agents using behavior trees and language models.
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…
The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
Reasoning Computers. Lambda Calculus, Fully Differentiable. Also Neural Stacks, Queues, Arrays, Lists, Trees, and Latches.
Fine-tune LLM agents with online reinforcement learning
Official Repository for LEURN: Learning Explainable Univariate Rules with Neural Networks
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
DSPy: The framework for programming—not prompting—foundation models
Self-Alignment with Principle-Following Reward Models
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
SOTA RAG engine with automatic knowledge graph construction
The official implementation of Self-Play Fine-Tuning (SPIN)