Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

Python 29 2 Updated Jun 4, 2024

RewardBench: the first evaluation tool for reward models.

Python 277 26 Updated Jul 3, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,713 159 Updated Jul 4, 2024

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 40+ MLLMs. (Qwen2, GLM4, Internlm2.5, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 2,222 214 Updated Jul 4, 2024

Go ahead and axolotl questions

Python 6,823 748 Updated Jul 2, 2024

Unify Efficient Fine-Tuning of 100+ LLMs

Python 25,615 3,170 Updated Jul 4, 2024

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,797 1,159 Updated Jun 30, 2023
Python 42 6 Updated Feb 23, 2024

A framework for few-shot evaluation of language models.

Python 5,761 1,536 Updated Jul 3, 2024

Mamba SSM architecture

Python 11,500 941 Updated Jul 3, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,007 192 Updated Jun 24, 2024

Cycle-consistent Generative Adversarial Network (CycleGAN) with Convolutional Block Attention Module (CBAM) - Cycle-CBAM. Modified UNet with CBAM - CBAM-UNet.

Python 17 1 Updated Mar 27, 2023

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 29,326 7,321 Updated Jul 2, 2024

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python 26,199 5,447 Updated Nov 20, 2023

PyTorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.

Python 2,601 591 Updated Jun 30, 2023

Unofficial implementation of the paper "CBAM: Convolutional Block Attention Module"

Python 1,300 284 Updated Jul 12, 2023

[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"

Python 15 2 Updated Mar 28, 2024

A pure NumPy implementation of Mamba.

Python 202 9 Updated Jun 5, 2024

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Python 7,470 957 Updated Jun 27, 2024

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 9,097 1,618 Updated Jun 20, 2024

This repository contains the example of fine-tuning panoptic segmentation model.

Jupyter Notebook 2 Updated Dec 7, 2023

This repository contains code used to train U-Net on a multi-class segmentation dataset.

Python 48 14 Updated Apr 20, 2023

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 8,465 1,328 Updated Jul 1, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,534 45 Updated Jul 4, 2024

Tree edit distance using the Zhang Shasha algorithm

Python 424 64 Updated Oct 15, 2020

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 48,115 1,690 Updated Jul 4, 2024

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Python 874 68 Updated Mar 3, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,189 390 Updated Jul 4, 2024

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

Python 26 2 Updated Jan 12, 2024