Stars
A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
[ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model
Transformer implementation for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
[RSS24] AdaptiGraph: Material-Adaptive Graph-Based Neural Dynamics for Robotic Manipulation
Swap audio from video1 into video2 via a replicate cog
[ECCV 2024] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Wrapper to use DynamiCrafter models in ComfyUI
FILM: Frame Interpolation for Large Motion, in ECCV 2022.
This repository shows how to solve the ONNX export issue in the Segment Anything model
A proposal for a web API for prompting browser-provided language models
The official implementation of the ChordMixer architecture.
Use an image classifier to predict audio file labels.
Modular node graph based noise generation library using SIMD, C++17 and templates
🤖🖌️ Generate photo-realistic textures based on source images or (soon) PBR materials. Remix, remake, mashup! Useful if you want to create variations on a theme or elaborate on an existing texture.
Fine-Grained Open Domain Image Animation with Motion Guidance
Envision3D: One Image to 3D with Anchor Views Interpolation
Open-Sora: Democratizing Efficient Video Production for All
StyleGAN-Human: A Data-Centric Odyssey of Human Generation
Code release for "Segment Anything without Supervision"
BertaQA: How Much Do Language Models Know About Local Culture?
A repo to hold some notes and demos.
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."