zaemyung

Follow

Zae Myung Kim zaemyung

Follow

24 followers · 34 following

https://zaemyung.github.io/

Achievements

Achievements

Block or Report

Block or report zaemyung

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Stars

YangRui2015 / RiC

Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

Python 29 2 Updated Jun 4, 2024

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 277 26 Updated Jul 3, 2024

OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,713 159 Updated Jul 4, 2024

modelscope / swift

ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 40+ MLLMs. (Qwen2, GLM4, Internlm2.5, Yi, Llama3, Llava, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)

Python 2,222 214 Updated Jul 4, 2024

OpenAccess-AI-Collective / axolotl

Go ahead and axolotl questions

Python 6,823 748 Updated Jul 2, 2024

hiyouga / LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Python 25,615 3,170 Updated Jul 4, 2024

databrickslabs / dolly

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,797 1,159 Updated Jun 30, 2023

databrickslabs / doc-qa

Python 42 6 Updated Feb 23, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 5,761 1,536 Updated Jul 3, 2024

state-spaces / mamba

Mamba SSM architecture

Python 11,500 941 Updated Jul 3, 2024

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,007 192 Updated Jun 24, 2024

AAleka / Cycle-CBAM-and-CBAM-UNet

Cycle-consistent Generative Adversarial Network (CycleGAN) with Convolutional Block Attention Module (CBAM) - Cycle-CBAM. Modified UNet with CBAM - CBAM-UNet.

Python 17 1 Updated Mar 27, 2023

facebookresearch / detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 29,326 7,321 Updated Jul 2, 2024

facebookresearch / Detectron

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python 26,199 5,447 Updated Nov 20, 2023

LeeJunHyun / Image_Segmentation

Pytorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.

Python 2,601 591 Updated Jun 30, 2023

luuuyi / CBAM.PyTorch

Non-official implement of Paper：CBAM: Convolutional Block Attention Module

Python 1,300 284 Updated Jul 12, 2023

halfrot / ALaRM

[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"

Python 15 2 Updated Mar 28, 2024

idoh / mamba.np

A pure NumPy implementation of Mamba.

Python 202 9 Updated Jun 5, 2024

lucidrains / denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 7,470 957 Updated Jun 27, 2024

qubvel / segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Python 9,097 1,618 Updated Jun 20, 2024

kryvokhyzha / panoptic-segmentation-experiments

This repository contains the example of fine-tuning panoptic segmentation model.

Jupyter Notebook 2 Updated Dec 7, 2023

hamdaan19 / UNet-Multiclass

This repository contains code used to train U-Net on a multi-class segmentation dataset.

Python 48 14 Updated Apr 20, 2023

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 8,465 1,328 Updated Jul 1, 2024

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,534 45 Updated Jul 4, 2024

Weixin-Liang / Mapping-the-Increasing-Use-of-LLMs-in-Scientific-Papers

Python 18 Updated May 14, 2024

timtadh / zhang-shasha

Tree edit distance using the Zhang Shasha algorithm

Python 424 64 Updated Oct 15, 2020

Textualize / rich

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 48,115 1,690 Updated Jul 4, 2024

redotvideo / mamba-chat

Mamba-Chat: A chat LLM based on the state-space model architecture 🐍

Python 874 68 Updated Mar 3, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,189 390 Updated Jul 4, 2024

RUCAIBox / RLMEC

Forked from Timothy023/RLMEC

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

Python 26 2 Updated Jan 12, 2024