Stars
Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation …
Hallucinations (Confabulations) Document-Based Benchmark for RAG
DEF CON 31 AI Village - LLMs: Loose Lips Multipliers
Experiments training computer models using policy optimization
A batched, offline-inference-oriented version of segment-anything
LLM Self Defense: By Self Examination, LLMs know they are being tricked
A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)
[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable. https://docs.kidger.site/diffrax/
A simple & elegant experiment tracking framework that integrates persistence logic & best practices directly into Python
An LLM playground you can run on your laptop
The lazier way to manage everything docker
Chapyter: ChatGPT Code Interpreter in Jupyter Notebooks
Federated learning for autonomous driving in CARLA simulation
Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Diffusion model papers, survey, and taxonomy
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Macro Placement - benchmarks, evaluators, and reproducible results from leading methods in open source
Create powerful Hydra applications without the YAML files and boilerplate code.
You like pytorch? You like micrograd? You love tinygrad! ❤️
Optax is a gradient processing and optimization library for JAX.
NASBench: A Neural Architecture Search Dataset and Benchmark