Skip to content
View LoryPack's full-sized avatar

Highlights

  • Pro

Organizations

@NeuralLikelihoodFreeInference @Kinds-of-Intelligence-CFI
Block or Report

Block or report LoryPack

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 42,021 5,013 Updated Aug 1, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,030 289 Updated Jul 25, 2024

fastFM: A Library for Factorization Machines

Python 1,072 205 Updated Jul 17, 2022

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…

Python 112 13 Updated Apr 11, 2024

Jekyll version of the newest Agency Bootstrap theme, plus new features: Google Analytics, Markdown support, custom pages, and more!

JavaScript 336 631 Updated May 28, 2024

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

657 41 Updated May 8, 2024

🚵 Landing Pages of Ant Design System

JavaScript 6,100 624 Updated Aug 16, 2023

A free React / Next.js landing page template designed to showcase open source projects, SaaS products, online services, and more. Made by

TypeScript 3,661 1,727 Updated Jul 18, 2024

An extensible benchmark for evaluating large language models on planning

PDDL 228 28 Updated May 21, 2024

The Abstraction and Reasoning Corpus

JavaScript 3,146 520 Updated Jul 26, 2024
Jupyter Notebook 5 1 Updated Oct 26, 2023

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Python 102 12 Updated Jul 25, 2024
Jupyter Notebook 2 1 Updated Jul 3, 2023

Forecasting Future World Events with Neural Networks (NeurIPS 2022)

Jupyter Notebook 176 48 Updated May 13, 2023

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,463 2,558 Updated Jul 31, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,805 240 Updated Aug 1, 2024

A Python library for variational inference with normalizing flow and annealing

Python 13 2 Updated Jul 30, 2024

Code and data for the paper Revealing the structure of language model capabilities

7 1 Updated Jun 14, 2023

Scientific Inkscape: Inkscape extensions for figure resizing and editing

Python 473 18 Updated Jul 30, 2024

Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"

Jupyter Notebook 55 9 Updated Jun 19, 2024

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Python 3,812 242 Updated Jul 30, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,969 4,422 Updated Jul 31, 2024

Specify what you want it to build, the AI asks for clarification, and then builds it.

Python 51,534 6,703 Updated Jul 27, 2024

Likelihood-free AMortized Posterior Estimation with PyTorch

Python 110 10 Updated Jul 22, 2024

A domain-specific probabilistic programming language for modeling and inference with language models

Python 108 9 Updated Sep 26, 2023

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 165,336 43,829 Updated Jul 31, 2024

📚 A curated list of papers & technical articles on AI Quality & Safety

150 13 Updated Oct 13, 2023
Next