bdok23

Starred repositories

Train transformer language models with reinforcement learning.

Python 10,034 1,269 Updated Nov 14, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 15,004 2,608 Updated Sep 30, 2024

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 707 49 Updated Nov 4, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,973 5,773 Updated Nov 14, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 18,746 1,437 Updated Nov 13, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,164 180 Updated Aug 11, 2024

hackathon

JavaScript 2 1 Updated Feb 18, 2024
JavaScript 1 Updated Apr 26, 2024

Simulation for DJI Pupper v2 robot

Jupyter Notebook 1 Updated Mar 2, 2024
1 Updated May 29, 2020

Notes on programming

1 Updated Jun 26, 2020
Jupyter Notebook 1 Updated Apr 28, 2024