Stars
Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!
Process Common Crawl data with Python and Spark
Statistics of Common Crawl monthly archives mined from URL index files
Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically similar, but not exactly identical).
KokoMind: Can LLMs Understand Social Interactions?
Official repository for "Action-Based Conversations Dataset: A Corpus for Building More In-Depth Task-Oriented Dialogue Systems"
Perform data science on data that remains in someone else's server
Training PyTorch models with differential privacy
Library for training machine learning models with privacy for training data
Code for "Structured Attention for Unsupervised Dialogue Structure Induction".
A curated list of awesome imitation learning resources and publications
Multilingual Sentence & Image Embeddings with BERT