-
Futuretalk Inc
- futuretalk.ca
- @realsammyt
Block or Report
Block or report realsammyt
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (32)
Sort Name ascending (A-Z)
360
3D
audio
auth
user management, logins, authorizationavatar
Decentralization
Dev AI Tools
Gaussian Splatter
graphs
human
Image Extrapolation
Learn
LLM
multimodal
music
nerf
neural
node editors
Point cloud
SD
Search
shaders
text
Text and story
tools
training
tts
Text to Speechunity
Unreal
User
video
WebXR
Stars
Language: Jupyter Notebook
Sort by: Most stars
🔊 Text-Prompted Generative Audio Model
Google Research
A guidance language for controlling large language models.
Code release for NeRF (Neural Radiance Fields)
Sweep: open-source AI-powered Software Developer for small features and bug fixes.
Using Low-rank adaptation to quickly fine-tune diffusion models.
Overview and tutorial of the LangChain Library
Inpaint anything using Segment Anything and inpainting models.
Taming Transformers for High-Resolution Image Synthesis
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
Automatic Generation of Visualizations and Infographics using Large Language Models
Let us democratise high-resolution generation! (CVPR 2024)
Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, R…
Discovering Interpretable GAN Controls [NeurIPS 2020]
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Deterministic LLMs Outputs for AI Applications and AI Agents
Diffusion attentive attribution maps for interpreting Stable Diffusion.
Course content and resources for the AIAIART course.
[SIGGRAPH Asia 2022] Text2Light: Zero-Shot Text-Driven HDR Panorama Generation
code and resources used in the Going Meta sessions
UI interface for experimenting with multimodal (text, image) models (stable diffusion).
This is the official repository for the LENS (Large Language Models Enhanced to See) system.
Symphony Generation with Permutation Invariant Language Model
AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE