Block or Report
Block or report zaemyung
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
RewardBench: the first evaluation tool for reward models.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 40+ MLLMs. (Qwen2, GLM4, Internlm2.5, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Go ahead and axolotl questions
Unify Efficient Fine-Tuning of 100+ LLMs
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
A framework for few-shot evaluation of language models.
A curated list of reinforcement learning with human feedback resources (continually updated)
Cycle-consistent Generative Adversarial Network (CycleGAN) with Convolutional Block Attention Module (CBAM) - Cycle-CBAM. Modified UNet with CBAM - CBAM-UNet.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Pytorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.
Non-official implement of Paper:CBAM: Convolutional Block Attention Module
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
This repository contains the example of fine-tuning panoptic segmentation model.
This repository contains code used to train U-Net on a multi-class segmentation dataset.
This repository contains demos I made with the Transformers library by HuggingFace.
A JAX research toolkit for building, editing, and visualizing neural networks.
Tree edit distance using the Zhang Shasha algorithm
Rich is a Python library for rich text and beautiful formatting in the terminal.
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
Modeling, training, eval, and inference code for OLMo
RUCAIBox / RLMEC
Forked from Timothy023/RLMECThe official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"