- Futuretalk Inc
- futuretalk.ca
- @realsammyt
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
ReVersion: Diffusion-Based Relation Inversion from Images
This script automates the video stylization task using StableDiffusion and ControlNet.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
An open-source framework for training large multimodal models.
Curated list of useful LLM / Analytics / Datascience resources
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
🦜🔗 Build context-aware reasoning applications
A multimodal inference pipeline that integrates InstructBLIP with textgen-webui for Vicuna and related models.
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, B…
Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…
Meta-Transformer for Unified Multimodal Learning
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
A collection of papers, codes, projects, tutorials ... for Knowledge Graph and other NLP methods
WavJourney: Compositional Audio Creation with LLMs
Python library for designing and training your own Diffusion Models with PyTorch.
PALLAIDIUM - a generative AI movie studio integrated in the Blender video editor.
Magick is a cutting-edge toolkit for a new kind of AI builder. Make Magick with us!
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.
Official implementation of SEED-LLaMA (ICLR 2024).
A framework to enable multimodal models to operate a computer.
Label Studio is a multi-type data labeling and annotation tool with a standardized output format.
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…