-
https://blog.guanghan.info
- United States
- https://guanghan.info/projects/
Stars
llama3 implementation one matrix multiplication at a time
Open-Sora: Democratizing Efficient Video Production for All
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Hackable and optimized Transformers building blocks, supporting a composable construction.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Ongoing research training transformer models at scale
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
A collaboration friendly studio for NeRFs
A General NeRF Acceleration Toolbox in PyTorch.
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Stable Diffusion built-in to Blender
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. Th…
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
A deep learning library for video understanding research.
Scenic: A Jax Library for Computer Vision Research and Beyond
A latent text-to-image diffusion model
KeypointNeRF Generalizing Image-based Volumetric Avatars using Relative Spatial Encoding of Keypoints
A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.