LoryPack

Lorenzo Pacchiardi LoryPack

Research Associate at the Leverhulme Centre for the Future of Intelligence, University of Cambridge

25 followers · 3 following

University of Cambridge
Cambridge, UK
lorenzopacchiardi.me/
https://orcid.org/0000-0003-4760-7638

Achievements

x2 x2

Achievements

x2 x2

Highlights

Organizations

Block or Report

Block or report LoryPack

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

stylianos-kampakis / supervisedPCA-Python

Python 36 6 Updated Jan 31, 2016

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 42,021 5,013 Updated Aug 1, 2024

lixin4ever / Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Jupyter Notebook 4,030 289 Updated Jul 25, 2024

ibayer / fastFM

fastFM: A Library for Factorization Machines

Python 1,072 205 Updated Jul 17, 2022

simulation-based-inference / simulation-based-inference.github.io

Website

HTML 10 3 Updated Jul 28, 2024

microsoft / SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support futu…

Python 112 13 Updated Apr 11, 2024

raviriley / agency-jekyll-theme

Jekyll version of the newest Agency Bootstrap theme, plus new features: Google Analytics, Markdown support, custom pages, and more!

JavaScript 336 631 Updated May 28, 2024

tjunlp-lab / Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

657 41 Updated May 8, 2024

ant-design / ant-design-landing

🚵 Landing Pages of Ant Design System

JavaScript 6,100 624 Updated Aug 16, 2023

cruip / open-react-template

A free React / Next.js landing page template designed to showcase open source projects, SaaS products, online services, and more. Made by

TypeScript 3,661 1,727 Updated Jul 18, 2024

karthikv792 / LLMs-Planning

An extensible benchmark for evaluating large language models on planning

PDDL 228 28 Updated May 21, 2024

fchollet / ARC-AGI

The Abstraction and Reasoning Corpus

JavaScript 3,146 520 Updated Jul 26, 2024

xiaomeng-ma / ToMChallenges

Jupyter Notebook 5 1 Updated Oct 26, 2023

asaparov / prontoqa

Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.

Python 102 12 Updated Jul 25, 2024

yaelmd / TFG

Jupyter Notebook 2 1 Updated Jul 3, 2023

andyzoujm / autocast

Forecasting Future World Events with Neural Networks (NeurIPS 2022)

Jupyter Notebook 176 48 Updated May 13, 2023

lukasberglund / reversal_curse

Python 258 18 Updated Nov 17, 2023

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 14,463 2,558 Updated Jul 31, 2024

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 1,805 240 Updated Aug 1, 2024

desResLab / LINFA

A Python library for variational inference with normalizing flow and annealing

Python 13 2 Updated Jul 30, 2024

RyanBurnell / revealing-LLM-capabilities

Code and data for the paper Revealing the structure of language model capabilities

7 1 Updated Jun 14, 2023

burghoff / Scientific-Inkscape

Scientific Inkscape: Inkscape extensions for figure resizing and editing

Python 473 18 Updated Jul 30, 2024

LoryPack / LLM-LieDetector

Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"

Jupyter Notebook 55 9 Updated Jun 19, 2024

Giskard-AI / giskard

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Python 3,812 242 Updated Jul 30, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,969 4,422 Updated Jul 31, 2024

gpt-engineer-org / gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Python 51,534 6,703 Updated Jul 27, 2024

probabilists / lampe

Likelihood-free AMortized Posterior Estimation with PyTorch

Python 110 10 Updated Jul 22, 2024

probcomp / LLaMPPL

A domain-specific probabilistic programming language for modeling and inference with language models

Python 108 9 Updated Sep 26, 2023

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 165,336 43,829 Updated Jul 31, 2024

Giskard-AI / awesome-ai-safety

📚 A curated list of papers & technical articles on AI Quality & Safety

150 13 Updated Oct 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lorenzo Pacchiardi LoryPack

Achievements

Achievements

Highlights

Organizations

Block or report LoryPack

Stars

stylianos-kampakis / supervisedPCA-Python

geekan / MetaGPT

lixin4ever / Conference-Acceptance-Rate

ibayer / fastFM

simulation-based-inference / simulation-based-inference.github.io

microsoft / SmartPlay

raviriley / agency-jekyll-theme

tjunlp-lab / Awesome-LLMs-Evaluation-Papers

ant-design / ant-design-landing

cruip / open-react-template

karthikv792 / LLMs-Planning

fchollet / ARC-AGI

xiaomeng-ma / ToMChallenges

asaparov / prontoqa

yaelmd / TFG

andyzoujm / autocast

lukasberglund / reversal_curse

openai / evals

stanford-crfm / helm

desResLab / LINFA

RyanBurnell / revealing-LLM-capabilities

burghoff / Scientific-Inkscape

LoryPack / LLM-LieDetector

Giskard-AI / giskard

lm-sys / FastChat

gpt-engineer-org / gpt-engineer

probabilists / lampe

probcomp / LLaMPPL

Significant-Gravitas / AutoGPT

Giskard-AI / awesome-ai-safety