Stars
This is the official repository of the EMNLP 2024 paper: Defending Against Social Engineering Attacks in the Age of LLMs.
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
Streamlit — A faster way to build and share data apps.
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time (see the weight-averaging sketch after this list).
TMLS 2024 Workshop: A Practitioner's Guide To Safeguarding Your LLM Applications
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
Full code for the sparse probing paper.
Improving Alignment and Robustness with Circuit Breakers
Every practical and proposed defense against prompt injection.
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning
Simplify and improve the job hunting experience by integrating LLMs to automate tasks such as resume and cover letter generation, as well as application submission, saving users time and effort.
[TACL] Code for "Red Teaming Language Model Detectors with Language Models"
[ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text
Modeling, training, eval, and inference code for OLMo
Code accompanying "How I learned to start worrying about prompt formatting".
Official code for "Large Language Models as Optimizers"
Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency
Code for our paper titled "PEACE: Cross-Platform Hate Speech Detection - A Causality-guided Framework"
Robust machine learning for responsible AI
Use large language models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs and employs Iterative Active Learning for continuous improvement.
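
The model soups entry above describes the technique in a single line: element-wise averaging of the weights of several models fine-tuned from the same base. A minimal PyTorch sketch of that "uniform soup" idea follows, assuming all checkpoints share an identical architecture and parameter shapes; the function name `uniform_soup` and the checkpoint paths are illustrative, not the paper's released code:

```python
import torch

def uniform_soup(state_dicts):
    """Element-wise average of parameter tensors across fine-tuned
    checkpoints of one architecture (the "uniform soup")."""
    soup = {}
    for key in state_dicts[0]:
        # Cast to float so integer buffers (e.g. step counters) average cleanly.
        stacked = torch.stack([sd[key].float() for sd in state_dicts])
        soup[key] = stacked.mean(dim=0)
    return soup

# Hypothetical usage: checkpoints fine-tuned from the same base model.
# checkpoints = [torch.load(p, map_location="cpu") for p in ("ft_a.pt", "ft_b.pt")]
# model.load_state_dict(uniform_soup(checkpoints))
```

Because the averaging happens once, offline, the resulting model is a single checkpoint: inference cost is unchanged, which is the point of the technique.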