Starred repositories

Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | Continued pre-training to improve Llama-3's scientific reasoning and Chinese-language capabilities

27 3 Updated Aug 18, 2024

TensorFlow Implementation of "Enhanced Doubly Robust Learning for Debiasing Post-click Conversion Rate Estimation" in SIGIR'21

Python 24 4 Updated Jun 22, 2023

This is the repository for the Tool Learning survey.

235 9 Updated Oct 28, 2024

A Go web framework for quickly building recommendation online services based on JSON configuration.

Go 62 12 Updated Oct 30, 2024

Making LLaVA Tiny via MoE-Knowledge Distillation

Python 53 2 Updated Oct 24, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,065 202 Updated Oct 29, 2024

A modified version of Google's tool for plain text files

Rust 4 Updated Mar 7, 2022

Label, clean and enrich text datasets with LLMs.

Python 2,072 147 Updated Nov 1, 2024

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

C++ 2,934 334 Updated Jul 31, 2024

This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or rejection sampling fine-tuning.

Python 14 2 Updated Sep 22, 2024

Code for Paper (Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity)

Python 4 Updated Oct 23, 2024

This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".

Python 97 6 Updated Oct 24, 2024

[EMNLP24] Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence

Python 7 Updated Oct 24, 2024

Named entity recognition using the W2NER model

Python 8 Updated Nov 20, 2022

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 274 7 Updated Jul 15, 2024

Aligning Query Representation with Rewritten Query and Relevance Judgments for Conversational Search. A code base for CIKM 2024 accepted paper.

Python 3 1 Updated Jul 21, 2024

Source code for "CoEdPilot: Recommending Code Edits with Learned Prior Edit Relevance, Project-wise Awareness, and Interactive Nature"

Python 7 Updated May 2, 2024

State-of-the-art Parameter-Efficient MoE Fine-tuning Method

Python 88 9 Updated Aug 22, 2024
Python 9 2 Updated Jul 31, 2022
Python 11 Updated Oct 17, 2024

Continuous learning to fine-tune a pre-trained generative transformer model with DPO from real examples and a knowledge retrieval system

1 Updated Oct 4, 2024

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 590 26 Updated Oct 19, 2024
Python 4 2 Updated May 10, 2022

HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling

Python 150 19 Updated Oct 4, 2024

Some methods for sampling data points from a given distribution.

Python 15 4 Updated Jul 16, 2018

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 180 22 Updated Aug 6, 2024

Distill a Small Static Model from any Sentence Transformer

Python 375 17 Updated Nov 2, 2024