💻
Never underestimate the power of more data

Organizations

@necla-ml


Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 5,115 416 Updated Oct 2, 2024

Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)

Python 172 10 Updated Apr 1, 2024

[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper

Python 122 7 Updated May 7, 2024

Official implementation of the NeurIPS 2023 paper "Self-supervised Object-Centric Learning for Videos"

Python 21 Updated Feb 6, 2024

[CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"

Python 73 3 Updated Jan 20, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,432 479 Updated May 31, 2024

Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"

Python 18 Updated Apr 20, 2023

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 3,463 286 Updated Aug 14, 2024

[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset

Python 45 Updated Sep 3, 2024

Official code implementation of the paper "AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?"

Python 19 2 Updated Sep 23, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,052 304 Updated Oct 6, 2024

The official Meta Llama 3 GitHub site

Python 26,549 3,001 Updated Aug 12, 2024

[ICCV 2023] ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking

Python 137 15 Updated Mar 27, 2024

Port of OpenAI's Whisper model in C/C++

C 34,911 3,559 Updated Oct 8, 2024

[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challen…

Python 13,422 1,344 Updated Oct 9, 2024

⛄ Possibly the smallest compiler ever

JavaScript 27,879 2,853 Updated Feb 19, 2024

High-efficiency floating-point neural network inference operators for mobile, server, and Web

C 1,836 354 Updated Oct 9, 2024

Inference Vision Transformer (ViT) in plain C/C++ with ggml

C++ 223 17 Updated Apr 11, 2024

A collection of learning resources for curious software engineers

Python 46,532 3,709 Updated Oct 7, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 49,216 4,776 Updated Sep 19, 2024

Master programming by recreating your favorite technologies from scratch.

Markdown 303,689 28,477 Updated Sep 3, 2024

NVIDIA's Deep Imagination Team's PyTorch Library

Python 3,999 447 Updated Nov 29, 2022

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,072 160 Updated Dec 22, 2022

Code for Diffusion Action Segmentation (ICCV 2023)

Python 52 4 Updated Aug 16, 2023

[ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"

32 Updated Mar 30, 2023

RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2

Python 250 51 Updated Aug 20, 2024

Open-Set Grounded Text-to-Image Generation

Python 1,988 148 Updated Mar 6, 2024
Shell 20 1 Updated Nov 6, 2023

Inference code for Persimmon-8B

Python 416 23 Updated Sep 9, 2023