Stars
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation
Python code for CIDEr - Consensus-based Image Caption Evaluation
Hackable and optimized Transformers building blocks, supporting a composable construction.
LAVIS - A One-stop Library for Language-Vision Intelligence
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment" and our MICCAI 2023 paper "Encoding Surgical Videos as Spa…
Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment
Aligning pretrained language models with instruction data generated by themselves.
This is a Phi-3 book for getting started with Phi-3. Phi-3 is a family of open AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) avai…
Official implementation of Masked-Attention Transformers for Surgical Instrument Segmentation
[TMI'22] Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024)
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
[NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation
TransNet: A deep network for fast detection of common shot transitions
Official repository for "Self-Supervised Video Transformer" (CVPR'22)
[ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models