kl3259

Follow

Kangshuo Li kl3259

Follow

2 followers · 6 following

Block or Report

Block or report kl3259

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Stars

lmmlzn / Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

737 64 Updated Jun 15, 2024

lifan-yuan / OOD_NLP

[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations".

Python 26 1 Updated Jun 8, 2023

csitfun / LogiQA2.0

Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks

Python 69 10 Updated Aug 11, 2023

asaparov / prontoqa

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Python 98 12 Updated Oct 21, 2023

mlr-org / mlr3fairness

mlr3 extension for Fairness in Machine Learning

HTML 14 2 Updated Jun 5, 2024

openai / transformer-debugger

Python 3,981 232 Updated Jun 4, 2024

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,398 2,551 Updated Jul 13, 2024

deeplearning-wisc / vos

source code for ICLR'22 paper "VOS: Learning What You Don’t Know by Virtual Outlier Synthesis"

Python 303 52 Updated Oct 1, 2023

CDEIUK / bias-mitigation

Machine Learning Bias Mitigation

Jupyter Notebook 7 6 Updated May 9, 2022

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,783 238 Updated Jul 18, 2024

nyu-mll / crows-pairs

This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Masked Language Models" (EMNLP 2020).

HTML 93 24 Updated Mar 1, 2024

rudinger / winogender-schemas

Data for evaluating gender bias in coreference resolution systems.

Python 63 11 Updated May 14, 2019

tslearn-team / tslearn

The machine learning toolkit for time series analysis in Python

Python 2,842 335 Updated Jul 1, 2024

carla-simulator / carla

Open-source simulator for autonomous driving research.

C++ 10,885 3,499 Updated Jul 17, 2024

wangcunxiang / QA-Eval

The repository for paper <Evaluating Open-QA Evaluation>

Python 19 Updated Apr 9, 2024

OpenMOSS / Say-I-Dont-Know

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 56 4 Updated Feb 5, 2024

sunericd / TISSUE

TISSUE (Transcript Imputation with Spatial Single-cell Uncertainty Estimation) provides tools for estimating well-calibrated uncertainty measures for gene expression predictions in single-cell spat…

Python 25 4 Updated Mar 10, 2024

IINemo / lm-polygraph

Python 74 18 Updated Jul 17, 2024

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,227 399 Updated Jul 19, 2024

torrvision / focal_calibration

Code for the paper "Calibrating Deep Neural Networks using Focal Loss"

Jupyter Notebook 146 25 Updated Jan 10, 2024

bhaweshiitk / ConformalLLM

Extending Conformal Prediction to LLMs

Jupyter Notebook 51 6 Updated Jun 21, 2024

amirarsalan90 / TabFairGAN

Python 17 6 Updated Mar 9, 2023

dchen236 / FairFace

Python 438 90 Updated Apr 6, 2023

joojs / fairface

FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age

404 17 Updated Sep 28, 2023

facebookresearch / metaseq

Repo for external large-scale work

Python 6,436 722 Updated Apr 27, 2024

lorenzkuhn / semantic_uncertainty

Python 104 16 Updated Jun 20, 2024

charan223 / FairDeepLearning

Python 35 13 Updated Mar 4, 2023

LiJunnan1992 / DivideMix

Code for paper: DivideMix: Learning with Noisy Labels as Semi-supervised Learning

Python 525 81 Updated Sep 14, 2020

jxzhangjhu / Awesome-LLM-Uncertainty-Reliability-Robustness

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

582 42 Updated Jun 18, 2024

weijiaheng / Advances-in-Label-Noise-Learning

A curated (most recent) list of resources for Learning with Noisy Labels

640 58 Updated Feb 29, 2024