-
Better Solar LLC
- Orlando, FL
- https://joefioresi718.github.io/
Highlights
- Pro
Stars
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
MOSSBench: A webpage for an oversensitivity benchmark
A high-throughput and memory-efficient inference and serving engine for LLMs
A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
Open-source and strong foundation image recognition models.
[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds
Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
Schedule-Free Optimization in PyTorch
PyTorch code and models for V-JEPA self-supervised learning from video.
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
Official webpage for TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection, accepted at ICCV '23.
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models - CVPR 2024 - Official Repo
Official implementation of the paper "MotionCrafter: One-Shot Motion Customization of Diffusion Models"
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Code for the paper "A Whac-A-Mole Dilemma Shortcuts Come in Multiples Where Mitigating One Amplifies Others"
Pytorch I3D implmentation on Toyota Smarthome Dataset
Official repository for the "Big Transfer (BiT): General Visual Representation Learning" paper.
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
A natural language interface for computers