Stars
Therapeutics Commons (TDC-2): Multimodal Foundation for Therapeutic Science
A cloud-native vector database, storage for next generation AI applications
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
VectorizeDB is a database for vectorized data and metadata, allowing for fast similarity search and retrieval.
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
Implementation of Alpha Fold 3 from the paper: "Accurate structure prediction of biomolecular interactions with AlphaFold3" in PyTorch
AlphaFold Meets Flow Matching for Generating Protein Ensembles
Scan your AI/ML models for problems before you put them into production.
An open library for the analysis of molecular dynamics trajectories
Memly is an extensible analysis tool for lipid bilayer simulations.
EigenFold: Generative Protein Structure Prediction with Diffusion Models
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
Evolutionary Scale Modeling (esm): Pretrained language models for proteins
LlamaIndex is a data framework for your LLM applications
Hallucination evaluation for Large Language Models
High accuracy RAG for answering questions from scientific documents with citations
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
ImageBind One Embedding Space to Bind Them All
Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Inference code and configs for the ReplitLM model family
StableLM: Stability AI Language Models
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.