Skip to content
View bcui19's full-sized avatar
Block or Report

Block or report bcui19

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • FastChat Public

    Forked from lm-sys/FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    Python Apache License 2.0 Updated Aug 2, 2024
  • RewardBench: the first evaluation tool for reward models.

    Python Apache License 2.0 Updated Jun 21, 2024
  • Arena-Hard benchmark

    Jupyter Notebook Apache License 2.0 Updated May 23, 2024
  • Python Apache License 2.0 Updated Dec 5, 2023
  • composer Public

    Forked from mosaicml/composer

    Train neural networks up to 7x faster

    Python Apache License 2.0 Updated Aug 31, 2023
  • Fast and flexible reference benchmarks

    Python Apache License 2.0 Updated Jun 15, 2023
  • scratch Public

    scratch work

    Python 1 Updated Jun 6, 2023
  • RL4LMs Public

    Forked from allenai/RL4LMs

    A modular RL library to fine-tune language models to human preferences

    Python Apache License 2.0 Updated Mar 20, 2023
  • toolbox Public

    Forked from stas00/ml-engineering

    Essential guides and programming tools in my toolbox (with focus on ML Training)

    Python Apache License 2.0 Updated Mar 12, 2023
  • streaming Public

    Forked from mosaicml/streaming

    A Data Streaming Library for Efficient Neural Network Training

    Python Apache License 2.0 Updated Mar 9, 2023
  • NeMo Public

    Forked from NVIDIA/NeMo

    NeMo: a toolkit for conversational AI

    Python Apache License 2.0 Updated Dec 22, 2022
  • rlmeta Public

    Forked from facebookresearch/rlmeta

    RLMeta is a light-weight flexible framework for Distributed Reinforcement Learning Research.

    Python MIT License Updated Jul 22, 2022
  • Implementation of the Off Belief Learning algorithm.

    Python Other Updated Jul 18, 2022
  • A reinforcement learning toolkit for compiler optimizations

    Python MIT License Updated Nov 12, 2021
  • A pytorch implementation of the paper "Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control"

    Python MIT License Updated Feb 17, 2020
  • Probabilistic reasoning and statistical analysis in TensorFlow

    Jupyter Notebook Apache License 2.0 Updated Nov 22, 2019
  • CS-148 Public

    Updated Oct 1, 2017
  • Glooko Public

    Updated Mar 27, 2017
  • Computation using data flow graphs for scalable machine learning

    C++ Apache License 2.0 Updated Mar 9, 2017