bdok23

bdok23

3 followers · 7 following

Highlights

Starred repositories

huggingface / trl

Train transformer language models with reinforcement learning.

Python 10,034 1,269 Updated Nov 14, 2024

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,004 2,608 Updated Sep 30, 2024

princeton-nlp / SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 707 49 Updated Nov 4, 2024

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,973 5,773 Updated Nov 14, 2024

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—foundation models

Python 18,746 1,437 Updated Nov 13, 2024

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,164 180 Updated Aug 11, 2024

bifurcated-attn-icml-2024 / gpt-fast-parallel-sampling

Python 6 1 Updated Jun 3, 2024

bdok23 / Treehacks

hackathon

JavaScript 2 1 Updated Feb 18, 2024

bdok23 / Infinitus_App

JavaScript 1 Updated Apr 26, 2024

bdok23 / puppersim

Forked from jietan/puppersim

Simulation for DJI Pupper v2 robot

Jupyter Notebook 1 Updated Mar 2, 2024

bdok23 / Algorithms

1 Updated May 29, 2020

bdok23 / ProgrammingNotes

Notes on programming

1 Updated Jun 26, 2020

bdok23 / Data-Science-Algorithms

Jupyter Notebook 1 Updated Apr 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly