sithu31296

Focusing

sithu3 sithu31296

Focusing

Computer Vision Researcher

96 followers · 57 following

Seoul, Korea
https://sithu31296.github.io/

Achievements

Highlights

Developer Program Member

Block or Report

Block or report sithu31296

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

borglab / gtsfm

End-to-end SFM framework based on GTSAM

Jupyter Notebook 396 49 Updated Jun 30, 2024

cvg / sfm-disambiguation-colmap

Making Structure-from-Motion (COLMAP) more robust to symmetries and duplicated structures

Python 273 27 Updated Jun 3, 2024

huggingface / knockknock

🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

Python 2,771 230 Updated Jun 23, 2023

dcharatan / flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Python 835 80 Updated Jun 13, 2024

sithu31296 / magicnerf

A user-friendly and high-performance implementation of neural radiance fields (NeRF)

Python 1 Updated Mar 29, 2024

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,556 449 Updated Jul 11, 2024

ilaria-manco / muscall

Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)

Python 100 9 Updated Jan 7, 2023

iejMac / clip-video-encode

Easily compute clip embeddings from video frames

Python 132 19 Updated Oct 31, 2023

MasterBin-IIAU / Unicorn

[ECCV'22 Oral] Towards Grand Unification of Object Tracking

Python 950 87 Updated Oct 17, 2022

michaelgutmann / ml-pen-and-paper-exercises

Pen and paper exercises in machine learning

TeX 1,810 136 Updated May 21, 2024

IDEA-Research / MaskDINO

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,107 100 Updated Dec 20, 2023

dingjiansw101 / ZegFormer

Official code for "Decoupling Zero-Shot Semantic Segmentation"

Python 160 4 Updated Nov 30, 2022

NaiyuGao / PanopticDepth

PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation （CVPR2022）

Python 102 4 Updated Jun 2, 2022

yancie-yjr / StreamYOLO

Real-time Object Detection for Streaming Perception, CVPR 2022

Python 298 40 Updated Sep 21, 2022

NVlabs / Bongard-HOI

[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning

Python 61 6 Updated Nov 7, 2022

fnzhan / Generative-AI

[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era

TeX 773 60 Updated Nov 21, 2023

zju3dv / OnePose

Code for "OnePose: One-Shot Object Pose Estimation without CAD Models", CVPR 2022

Python 921 79 Updated Jan 6, 2023

sithu31296 / EasyFace

Easy-to-use Face Analysis Tool

Python 32 8 Updated Jun 20, 2022

dair-ai / Mathematics-for-ML

🧮 A collection of resources to learn mathematics for machine learning

4,309 395 Updated Jan 24, 2023

MCG-NJU / VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,273 128 Updated Dec 8, 2023

lukemelas / deep-spectral-segmentation

[CVPR 2022] Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization

Python 225 42 Updated Feb 20, 2023

bethgelab / model-vs-human

Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)

Python 329 45 Updated Feb 9, 2024

cliport / cliport

CLIPort: What and Where Pathways for Robotic Manipulation

Jupyter Notebook 437 81 Updated Nov 2, 2023

luping-liu / PNDM

The official implementation for Pseudo Numerical Methods for Diffusion Models on Manifolds (PNDM, PLMS | ICLR2022)

Python 313 31 Updated Apr 25, 2023

SkyeSong38 / CSWinTT

Transformer Tracking with Cyclic Shifting Window Attention (CSWinTT)

Python 69 6 Updated May 10, 2022

yxuansu / MAGIC

Language Models Can See: Plugging Visual Controls in Text Generation

Python 252 27 Updated Jun 1, 2022

microsoft / SwinBERT

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

Python 235 33 Updated May 26, 2022

keonlee9420 / Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ulti…

Python 142 19 Updated Jun 6, 2022

yoyo-nb / Thin-Plate-Spline-Motion-Model

[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.

Jupyter Notebook 3,393 556 Updated Feb 10, 2024

facebookresearch / metaseq

Repo for external large-scale work

Python 6,438 722 Updated Apr 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sithu3 sithu31296

Achievements

Achievements

Highlights

Block or report sithu31296

Stars

borglab / gtsfm

cvg / sfm-disambiguation-colmap

huggingface / knockknock

dcharatan / flowmap

sithu31296 / magicnerf

clovaai / donut

ilaria-manco / muscall

iejMac / clip-video-encode

MasterBin-IIAU / Unicorn

michaelgutmann / ml-pen-and-paper-exercises

IDEA-Research / MaskDINO

dingjiansw101 / ZegFormer

NaiyuGao / PanopticDepth

yancie-yjr / StreamYOLO

NVlabs / Bongard-HOI

fnzhan / Generative-AI

zju3dv / OnePose

sithu31296 / EasyFace

dair-ai / Mathematics-for-ML

MCG-NJU / VideoMAE

lukemelas / deep-spectral-segmentation

bethgelab / model-vs-human

cliport / cliport

luping-liu / PNDM

SkyeSong38 / CSWinTT

yxuansu / MAGIC

microsoft / SwinBERT

keonlee9420 / Comprehensive-E2E-TTS

yoyo-nb / Thin-Plate-Spline-Motion-Model

facebookresearch / metaseq