Skip to content
View 1jsingh's full-sized avatar
:electron:
Working on RL research
:electron:
Working on RL research

Block or report 1jsingh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 85 1 Updated Nov 4, 2024
Python 894 91 Updated Nov 6, 2024

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 2,544 178 Updated Nov 1, 2024

Enhancing AI Software Engineering with Repository-level Code Graph

Python 89 12 Updated Aug 25, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 2,298 184 Updated Nov 3, 2024

CLIP+MLP Aesthetic Score Predictor

Python 898 89 Updated Jul 1, 2024

Agentless🐱: an agentless approach to automatically solve software development problems

Python 706 84 Updated Oct 29, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

325 18 Updated Oct 19, 2024

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 283 10 Updated Oct 16, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 8,455 807 Updated Nov 6, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 772 67 Updated Nov 4, 2024

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Python 1,279 97 Updated Nov 6, 2024

Evaluating text-to-image/video/3D models with VQAScore

Python 217 20 Updated Sep 9, 2024

Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"

Python 243 24 Updated Jan 8, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 681 24 Updated Nov 5, 2024

Mora: More like Sora for Generalist Video Generation

Python 1,512 97 Updated Oct 10, 2024

Official inference repo for FLUX.1 models

Python 15,655 1,122 Updated Oct 8, 2024

SEED-Story: Multimodal Long Story Generation with Large Language Model

Python 734 56 Updated Oct 11, 2024

LLM101n: Let's build a Storyteller

29,619 1,620 Updated Aug 1, 2024
Python 37 3 Updated Jul 18, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 5,918 595 Updated Sep 26, 2024

Deep Contextual Video Compression

Python 397 65 Updated Feb 28, 2024
Python 15 2 Updated Jul 15, 2024

Implementation of MagViT2 Tokenizer in Pytorch

Python 560 34 Updated Oct 14, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 684 28 Updated Sep 27, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,148 1,104 Updated Oct 14, 2024
Python 215 15 Updated Apr 10, 2024

Your image is almost there!

Python 7,314 418 Updated Jul 26, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,228 337 Updated Jun 28, 2024
Next