-
Stability.ai, Eleuther.ai
- Seattle, WA
- http:https://dmarx.github.io
- @DigThatData
Block or Report
Block or report dmarx
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseprompting
[CVPR 2024] Official Implementation of "Seamless Human Motion Composition with Blended Positional Encodings".
A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
Official implementation of "ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models".
Improved AnimateAnyone implementation that allows you to use the opse image sequence and reference image to generate stylized video
🔥 [ECCV2024] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"
proof of concept prototype for generating and querying against an ever-expanding knowledge graph with ai
Representation Engineering: A Top-Down Approach to AI Transparency
ComfyUI Version of "Visual Style Prompting with Swapping Self-Attention"
Official Pytorch implementation of "Visual Style Prompting with Swapping Self-Attention"
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
CustomDiffusion360: Customizing Text-to-Image Diffusion with Camera Viewpoint Control
About Official code for TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts (Siggraph 2024 & TOG)
Word2World is an LLM-based PCG system that creates playable 2D world from stories
MasterWeaver: Taming Editability and Face Identity for Personalized Text-to-Image Generation (ECCV 2024)
Extension for A1111's Stable Diffusion Webui. Controls amount of detail.
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
a research paper for generative cartoon interpolation