FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,067 289 Updated Oct 7, 2024

OpenGVLab / unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Python 286 15 Updated May 27, 2024

ytongbai / LVM

Python 1,750 54 Updated Jun 28, 2024

apple / ml-ferret

Python 8,363 490 Updated Oct 9, 2024

lucidrains / perceiver-pytorch

Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch

Python 1,081 134 Updated Aug 22, 2023

mlfoundations / open_flamingo

An open-source framework for training large multimodal models.

Python 3,690 280 Updated Aug 31, 2024

stoneMo / DeepAVFusion

Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

Python 16 1 Updated Aug 2, 2024

yaohungt / Multimodal-Transformer

[ACL'19] [PyTorch] Multimodal Transformer

Python 805 150 Updated Sep 12, 2022

drscotthawley / fad_pytorch

Frechet Audio Distance evaluation in PyTorch

Python 34 3 Updated Jun 9, 2023

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

12,097 774 Updated Oct 9, 2024

lucidrains / multistream-transformers

Implementation of Multistream Transformers in Pytorch

Python 54 3 Updated Jul 31, 2021

ollama / ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,741 7,314 Updated Oct 10, 2024

JHLew / Learnable-Fourier-Features

Unofficial pytorch implementation of the paper "Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding", NeurIPS 2021.

Python 13 1 Updated Apr 24, 2024

willGuimont / learnable_fourier_positional_encoding

Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding

Python 42 9 Updated Sep 30, 2024

riggraz / no-style-please

A (nearly) no-CSS, fast, minimalist Jekyll theme.

HTML 1,096 546 Updated Aug 5, 2024

lfwa / carbontracker

Track and predict the energy consumption and carbon footprint of training deep learning models.

Python 374 27 Updated Sep 20, 2024

VatsaDev / nanoChatGPT

nanogpt turned into a chat model

Python 61 11 Updated Aug 30, 2023

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,725 2,112 Updated Jul 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ilpo Viertola ilpoviertola

Achievements

Achievements

Highlights

Block or report ilpoviertola

Stars

GeWu-Lab / awesome-audiovisual-learning

Breakthrough / PySceneDetect

OpenGVLab / InternVideo

LAION-AI / aesthetic-predictor

xiaobai1217 / Awesome-Video-Datasets

FoundationVision / LlamaGen

awesome-mlss / awesome-mlss

ga642381 / speech-trident

metavoiceio / metavoice-src

Alokia / Idempotent-Generative-Network

facebookresearch / jepa

v-iashin / Synchformer

FurkanGozukara / Stable-Diffusion