Skip to content
View sjYoondeltar's full-sized avatar

Block or report sjYoondeltar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

inspire

100 repositories

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Python 175 23 Updated May 23, 2023

Pytorch package to compute Chamfer distance between point sets (pointclouds).

Cuda 295 50 Updated Apr 10, 2024

Startup Funding Simulator

Svelte 317 22 Updated Jan 26, 2024

[CoRL 2023] Robot Parkour Learning

Python 516 92 Updated Jul 19, 2024

Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)

Python 48 5 Updated Dec 28, 2023

a simple and scalable agent for training adaptive policies with sequence-based RL

Python 77 2 Updated Sep 2, 2024

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Python 501 65 Updated Jul 14, 2024

The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"

Python 437 59 Updated Jan 8, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,140 642 Updated Aug 12, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,294 379 Updated Aug 19, 2024

[ICLR 2024] DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models

Python 206 15 Updated Feb 26, 2024
Python 334 12 Updated Jul 29, 2024

This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models.

339 23 Updated Jan 21, 2024

An open-source framework for training large multimodal models.

Python 3,631 278 Updated Aug 31, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,323 423 Updated Sep 2, 2024

PyTorch implementation of over 30 realtime semantic segmentations models, e.g. BiSeNetv1, BiSeNetv2, CGNet, ContextNet, DABNet, DDRNet, EDANet, ENet, ERFNet, ESPNet, ESPNetv2, FastSCNN, ICNet, LEDN…

Python 96 18 Updated Aug 7, 2024
Python 205 10 Updated Jun 28, 2024

Fast Diffusion Models with Transformers

Python 663 88 Updated Oct 7, 2023

EfficientViT is a new family of vision models for efficient high-resolution vision.

Python 1,739 159 Updated Aug 9, 2024

[NeurIPS 2023] Latent Exploration for Reinforcement Learning

Python 26 2 Updated Feb 23, 2024

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Python 158 9 Updated Feb 5, 2024

tiny vision language model

Jupyter Notebook 4,841 431 Updated Aug 27, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 782 41 Updated Aug 2, 2024

[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of the Open World"

Python 440 14 Updated Aug 9, 2024

This repository contains code for object detection and tracking in videos using the YOLOv9 object detection model and the DeepSORT algorithm.

Jupyter Notebook 67 15 Updated Mar 3, 2024

Social Ways: Learning Multi-Modal Distributions of Pedestrian Trajectories with GANs (CVPR 2019)

Python 121 46 Updated Mar 27, 2020

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Python 2,440 371 Updated Jul 29, 2024

NumPy & SciPy for GPU

Python 8,083 806 Updated Aug 30, 2024

JARVIS-1: Open-world Multi-task Agents with Memory-Augmented Multimodal Language Models

Java 328 15 Updated Apr 8, 2024