Skip to content
View shatu's full-sized avatar

Organizations

@CogComp
Block or Report

Block or report shatu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
532 results for source starred repositories
Clear filter

๐ŸŒ Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL2024

Python 51 3 Updated Aug 2, 2024

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 284 32 Updated Jun 18, 2024

Call for participation in the impact of LLM for scientific discovery

46 6 Updated Apr 11, 2024

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Jupyter Notebook 1,288 150 Updated Aug 6, 2024

๐Ÿค– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป for ๐—ณ๐—ฟ๐—ฒ๐—ฒ how to ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ an end-to-end ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป-๐—ฟ๐—ฒ๐—ฎ๐—ฑ๐˜† ๐—Ÿ๐—Ÿ๐—  & ๐—ฅ๐—”๐—š ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ using ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€ best practices: ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + 12 ๐˜ฉ๐˜ข๐˜ฏ๐˜ฅ๐˜ด-๐˜ฐ๐˜ฏ ๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ฐ๐˜ฏ๐˜ด

Python 2,148 340 Updated Aug 3, 2024

The full dataset behind paperswithcode.com

311 32 Updated Oct 8, 2021

๐ŸŒŸ The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 42,236 5,036 Updated Aug 9, 2024

An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.

Python 1,404 154 Updated Aug 7, 2024

Formal to Formal Mathematics Benchmark

Objective-C++ 288 42 Updated Aug 16, 2023

Reasoning by Communicating with Agents

Python 19 3 Updated Jul 24, 2024

Gorilla: An API store for LLMs

Python 11,017 886 Updated Aug 10, 2024
Python 1,022 75 Updated Mar 12, 2024

Learn System Design concepts and prepare for interviews using free resources.

Java 14,740 3,645 Updated Jul 31, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 5,496 319 Updated Jul 5, 2024
Jupyter Notebook 318 17 Updated Oct 4, 2023

Solve puzzles. Improve your pytorch.

Jupyter Notebook 2,983 243 Updated Jul 15, 2024

Machine Learning Engineering Open Book

Python 10,411 628 Updated Aug 9, 2024

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Jupyter Notebook 29,329 4,276 Updated Aug 10, 2024

[IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.

Python 1,126 146 Updated Apr 16, 2024

Tutorial on neural theorem proving

Jupyter Notebook 147 14 Updated Jan 5, 2024

A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.

TypeScript 7,261 671 Updated Jul 3, 2024

[ACL 2023] Reasoning with Language Model Prompting: A Survey

844 65 Updated Jul 19, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,369 87 Updated Jun 1, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,474 2,211 Updated Jul 29, 2024

The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset

Python 153 9 Updated Apr 23, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,473 421 Updated Jun 22, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,434 1,019 Updated Aug 10, 2024

Supporting code for ReCEval paper

Python 25 5 Updated May 14, 2023

Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"

Python 86 12 Updated Jan 21, 2024
Next