uvafan

Eli Lifland uvafan

Achievements

Stars

princeton-nlp / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,216 1,293 Updated Aug 31, 2024

elicit / machine-learning-list

A curriculum for learning about foundation models, from scratch to the frontier

922 69 Updated Jun 6, 2024

thestephencasper / everything-you-need

we got you bro

30 Updated Jul 29, 2024

princeton-nlp / SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,698 288 Updated Aug 20, 2024

neoneye / arc-notes

My writings about ARC (Abstraction and Reasoning Corpus)

53 1 Updated Aug 25, 2024

lukasberglund / reversal_curse

Python 258 18 Updated Nov 17, 2023

Sage-Future / fatebook

The fastest way to make and track predictions

TypeScript 26 8 Updated Aug 27, 2024

hamishhuggard / AI-alignment-map

A map of the AI alignment landscape

JavaScript 10 2 Updated Aug 26, 2024

anthropics / evals

228 23 Updated Jul 2, 2024

collin-burns / discovering_latent_knowledge

Python 245 36 Updated Mar 2, 2024

rethinkpriorities / squigglepy

Squiggle programming language for intuitive probabilistic estimation features in Python

Python 63 8 Updated Aug 20, 2024

quantified-uncertainty / squiggle

An estimation language

TypeScript 150 23 Updated Sep 2, 2024

quantified-uncertainty / squiggle-models

Experimental models by the QURI team & others

4 1 Updated Aug 2, 2023

manifoldmarkets / manifold

Manifold Markets: A market for every question

TypeScript 413 156 Updated Sep 1, 2024

nyu-mll / quality

Python 113 8 Updated Jan 31, 2024

QData / TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Python 2,883 385 Updated Jul 25, 2024

orpatashnik / StyleCLIP

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

HTML 3,950 553 Updated May 30, 2023

csinva / imodels

Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).

Jupyter Notebook 1,357 120 Updated Aug 16, 2024

hendrycks / math

The MATH Dataset (NeurIPS 2021)

Python 813 76 Updated Aug 5, 2024

nilesc / Long-Structured-Debate-Generation-and-Evaluation

Python 13 8 Updated Dec 8, 2022

kingoflolz / mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Python 6,268 890 Updated Jan 21, 2023

sylinrl / TruthfulQA

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook 571 65 Updated Nov 6, 2023

AI21Labs / lm-evaluation

Evaluation suite for large-scale language models.

Python 123 14 Updated Aug 15, 2021

reglab / casehold

Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings"

Python 80 16 Updated Mar 27, 2023

hendrycks / test

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,134 86 Updated May 28, 2023

hendrycks / ethics

Aligning AI With Shared Human Values (ICLR 2021)

Python 227 37 Updated Apr 21, 2023

quantified-uncertainty / metaforecast

Fetch forecasts from prediction markets/forecasting platforms to make them searchable. Integrate these forecasts into other services.

TypeScript 57 6 Updated Aug 29, 2024

gruns / icecream

🍦 Never use print() to debug again.

Python 8,859 182 Updated Jul 12, 2024

textflint / textflint

Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Python 633 93 Updated Sep 27, 2022

rrmenon10 / ADAPET

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

Python 153 15 Updated Jun 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eli Lifland uvafan

Achievements

Achievements

Block or report uvafan

Stars

princeton-nlp / SWE-agent

elicit / machine-learning-list

thestephencasper / everything-you-need

princeton-nlp / SWE-bench

neoneye / arc-notes

lukasberglund / reversal_curse

Sage-Future / fatebook

hamishhuggard / AI-alignment-map

anthropics / evals

collin-burns / discovering_latent_knowledge

rethinkpriorities / squigglepy

quantified-uncertainty / squiggle

quantified-uncertainty / squiggle-models

manifoldmarkets / manifold

nyu-mll / quality

QData / TextAttack

orpatashnik / StyleCLIP

csinva / imodels

hendrycks / math

nilesc / Long-Structured-Debate-Generation-and-Evaluation

kingoflolz / mesh-transformer-jax

sylinrl / TruthfulQA

AI21Labs / lm-evaluation

reglab / casehold

hendrycks / test

hendrycks / ethics

quantified-uncertainty / metaforecast

gruns / icecream

textflint / textflint

rrmenon10 / ADAPET