Skip to content
View jihwanp's full-sized avatar

Highlights

  • Pro

Block or report jihwanp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 380 34 Updated Oct 15, 2024

Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".

16 Updated Jun 20, 2023

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,773 958 Updated Oct 11, 2024

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 7,803 1,075 Updated Sep 10, 2024

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

JavaScript 2,762 241 Updated Oct 14, 2024

An open source implementation of CLIP.

Python 10,008 964 Updated Oct 9, 2024
5 1 Updated Jun 27, 2024

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

Python 604 57 Updated Mar 1, 2023
Python 2 Updated Aug 18, 2023

A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...

258 8 Updated Oct 10, 2024

【NeurIPS 2024】Dense Connector for MLLMs

Python 118 4 Updated Oct 14, 2024

[TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces

Python 39 13 Updated Jun 29, 2022
Python 133 18 Updated Mar 23, 2021

[ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions

Python 27 2 Updated Feb 28, 2023
Python 127 15 Updated Aug 18, 2022

This is the pytorch implementation of some representative action recognition approaches including I3D, S3D, TSN and TAM.

Python 243 46 Updated Oct 8, 2021

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,634 251 Updated Aug 9, 2024

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

360 26 Updated Jan 21, 2024

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

391 26 Updated Sep 26, 2024

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Python 58 5 Updated Apr 4, 2024

This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions", which is accepted by ACL 2024 (Findings).

17 Updated May 21, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 925 27 Updated Jul 31, 2024

Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021

Python 64 1 Updated May 26, 2022

[ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces

Python 235 15 Updated Jan 10, 2024

A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating Object Detection with Flexible Expressions" (NeurIPS 2023).

Python 105 7 Updated Mar 20, 2024

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

188 15 Updated Aug 17, 2024

[NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding

Python 44 1 Updated Mar 5, 2024

[ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

Python 486 20 Updated Jun 24, 2024

A curated list of foundation models for vision and language tasks

800 35 Updated Oct 14, 2024
Next