Stars
Source code for a video on computing Fibonacci numbers efficiently
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
[NeurIPS2021] A plant image dataset with high label ambiguity and a long-tailed distribution
Distribute and run LLMs with a single file.
Curated list of all the easter eggs and hidden jokes in Python
UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.
RETVec is an efficient, multilingual, and adversarially-robust text vectorizer.
Ghidra is a software reverse engineering (SRE) framework
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Learning to Brachiate via Simplified Model Imitation
An Open-Ended Embodied Agent with Large Language Models
QLoRA: Efficient Finetuning of Quantized LLMs
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Takagi and Nishimoto, CVPR 2023
Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community
This repo includes ChatGPT prompt curation to use ChatGPT better.
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
High-Resolution Image Synthesis with Latent Diffusion Models
A Haskell kernel for the Jupyter project.