Skip to content
View seraphinatarrant's full-sized avatar
Block or Report

Block or report seraphinatarrant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Universal and Transferable Attacks on Aligned Language Models

Python 3,127 436 Updated Jun 6, 2024

LLM prompt attacks for hacker CTFs via CTFd.

Python 8 1 Updated Dec 17, 2023

LLM Prompt Injection Detector

TypeScript 1,021 71 Updated Jul 16, 2024

A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, accompanying The 'Problem' of Human Label Variation: On Grou…

70 8 Updated Apr 15, 2024

How Contextual are Contextualized Word Representations?

Python 38 9 Updated Apr 29, 2020

Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"

Python 18 6 Updated May 31, 2021
Jupyter Notebook 84 21 Updated Jun 6, 2022
Python 16 5 Updated Jan 17, 2024

Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics

Jupyter Notebook 188 62 Updated Jul 19, 2022

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 11,466 953 Updated Jul 5, 2024

Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"

Python 26 6 Updated Oct 3, 2021

calatan

Jupyter Notebook 3 1 Updated Dec 13, 2021

Sparse Additive Generative Model of Text

MATLAB 86 27 Updated Sep 2, 2016

Ontonotes-5-parsing: parser of Ontonotes 5.0 to transform this corpus to a simple JSON format

Python 8 3 Updated Jun 9, 2021

A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.

84 20 Updated Apr 26, 2024

GPT2 for Multiple Languages, including pretrained models. GPT2 多语言支持, 15亿参数中文预训练模型

Python 1,713 334 Updated May 22, 2023

Curated research at the intersection of causal inference and natural language processing.

767 94 Updated Feb 1, 2024

Papers on fairness in NLP

419 51 Updated May 2, 2024

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 10,047 10,757 Updated Jul 25, 2024

Data and code repository of " Multilingual Fairness Evaluation for Hate Speech Detection ". LREC 2020.

Python 20 3 Updated Dec 8, 2022

codes for EMNLP2020 LOGAN paper

Jupyter Notebook 7 1 Updated Feb 15, 2021

A Python package to assess and improve fairness of machine learning models.

Python 1,866 405 Updated Jul 24, 2024

Code for the paper "Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias"

Python 71 17 Updated Aug 25, 2021

An overview and exploration of the concept of missing datasets.

454 16 Updated Jan 25, 2018

Joint Bilingual Sentiment Embeddings and Classifier

Python 31 9 Updated Jul 13, 2021

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 1,991 203 Updated Jan 9, 2024

LAnguage Model Analysis

Python 1,329 179 Updated Jul 7, 2024
Python 2 Updated May 17, 2020

An example python package as a starter for good research code.

Python 9 1 Updated Apr 29, 2020
Next