Skip to content
View uvafan's full-sized avatar

Block or report uvafan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,216 1,293 Updated Aug 31, 2024

A curriculum for learning about foundation models, from scratch to the frontier

922 69 Updated Jun 6, 2024

we got you bro

30 Updated Jul 29, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,698 288 Updated Aug 20, 2024

My writings about ARC (Abstraction and Reasoning Corpus)

53 1 Updated Aug 25, 2024

The fastest way to make and track predictions

TypeScript 26 8 Updated Aug 27, 2024

A map of the AI alignment landscape

JavaScript 10 2 Updated Aug 26, 2024

Squiggle programming language for intuitive probabilistic estimation features in Python

Python 63 8 Updated Aug 20, 2024

An estimation language

TypeScript 150 23 Updated Sep 2, 2024

Experimental models by the QURI team & others

4 1 Updated Aug 2, 2023

Manifold Markets: A market for every question

TypeScript 413 156 Updated Sep 1, 2024
Python 113 8 Updated Jan 31, 2024

TextAttack πŸ™ is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Python 2,883 385 Updated Jul 25, 2024

Official Implementation for "StyleCLIP: Text-Driven Manipulation of StyleGAN Imagery" (ICCV 2021 Oral)

HTML 3,950 553 Updated May 30, 2023

Interpretable ML package πŸ” for concise, transparent, and accurate predictive modeling (sklearn-compatible).

Jupyter Notebook 1,357 120 Updated Aug 16, 2024

The MATH Dataset (NeurIPS 2021)

Python 813 76 Updated Aug 5, 2024

Model parallel transformers in JAX and Haiku

Python 6,268 890 Updated Jan 21, 2023

TruthfulQA: Measuring How Models Imitate Human Falsehoods

Jupyter Notebook 571 65 Updated Nov 6, 2023

Evaluation suite for large-scale language models.

Python 123 14 Updated Aug 15, 2021

Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings"

Python 80 16 Updated Mar 27, 2023

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,134 86 Updated May 28, 2023

Aligning AI With Shared Human Values (ICLR 2021)

Python 227 37 Updated Apr 21, 2023

Fetch forecasts from prediction markets/forecasting platforms to make them searchable. Integrate these forecasts into other services.

TypeScript 57 6 Updated Aug 29, 2024

🍦 Never use print() to debug again.

Python 8,859 182 Updated Jul 12, 2024

Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Python 633 93 Updated Sep 27, 2022

[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training

Python 153 15 Updated Jun 10, 2022
Next