An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 1,779 170 Updated Jul 17, 2024

Arena-Hard-Auto: An automatic LLM benchmark.

Jupyter Notebook 325 33 Updated Jul 15, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,377 302 Updated Jul 17, 2024

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Python 14 Updated Jul 16, 2024

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Python 159 2 Updated Jul 15, 2024

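A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.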

13,376 1,238 Updated Jul 17, 2024

A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com

Python 3,136 378 Updated Jul 5, 2024

Robust recipes to align language models with human and AI preferences

Python 4,231 361 Updated Jul 17, 2024

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 829 72 Updated Apr 29, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,322 200 Updated Jul 16, 2024

Bob is a translation and OCR application for macOS.

8,876 510 Updated Feb 21, 2024

Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters

Python 15 2 Updated May 30, 2024

The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"

Python 8 Updated Nov 25, 2023

[AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities

Python 12 Updated Apr 26, 2024

Automatically split your PyTorch models on multiple GPUs for training & inference

Python 603 37 Updated Jan 2, 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Python 69 7 Updated Jun 24, 2024

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 5,370 488 Updated Jul 13, 2024

Python 7 Updated Jun 11, 2024

[ICLR 2021] Contrastive Learning with Adversarial Perturbations for Conditional Text Generation

Python 83 5 Updated Oct 11, 2022

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,068 3,269 Updated Jul 17, 2024

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Python 81 3 Updated Mar 26, 2024

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers.

HTML 1,528 179 Updated Jun 2, 2024

A natural language interface for computers

Python 50,916 4,444 Updated Jul 16, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 1,873 145 Updated May 23, 2024

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Python 104 4 Updated Jun 5, 2024

Knowledge Verification to Nip Hallucination in the Bud

Python 18 Updated Mar 10, 2024

Python 143 9 Updated May 31, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 3,285 346 Updated Jul 17, 2024

A framework for few-shot evaluation of language models.

Python 5,870 1,565 Updated Jul 17, 2024

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 245 32 Updated May 19, 2024