Starred repositories
Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve Llama-3's scientific reasoning and Chinese language abilities
TensorFlow Implementation of "Enhanced Doubly Robust Learning for Debiasing Post-click Conversion Rate Estimation" in SIGIR'21
This is the repository for the Tool Learning survey.
A Go web framework for quickly building recommendation online services based on JSON configuration.
Making LLaVA Tiny via MoE-Knowledge Distillation
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
A modified version of Google's tool for plain text files
Label, clean and enrich text datasets with LLMs.
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or rejection sampling fine-tuning.
Code for Paper (Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity)
This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".
[EMNLP24] Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
Aligning Query Representation with Rewritten Query and Relevance Judgments for Conversational Search. Codebase for a paper accepted at CIKM 2024.
Source code for "CoEdPilot: Recommending Code Edits with Learned Prior Edit Relevance, Project-wise Awareness, and Interactive Nature"
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
Continual learning to fine-tune a pre-trained generative transformer model with DPO, using real examples and a knowledge retrieval system
Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling
Some methods for sampling data points from a given distribution.
Source code for Self-Evaluation Guided MCTS for online DPO.
Distill a Small Static Model from any Sentence Transformer