
babyLM WhisBERT code

Jupyter Notebook 11 1 Updated May 27, 2024

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,596 161 Updated Jul 12, 2024

Implementation of the RWKV language model in pure WebGPU/Rust.

Rust 217 15 Updated Jul 3, 2024
C 268 28 Updated Jul 30, 2024
Python 2,022 390 Updated Apr 29, 2022

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

498 10 Updated Jul 22, 2024
Python 477 28 Updated Feb 13, 2024

Fine-Tuning of a multi-language transformer model on Nvidia GPUs.

Jupyter Notebook 1 Updated Jul 25, 2023

Implements VAR+CLIP for image generation

23 Updated Jul 25, 2024
Python 174 18 Updated May 26, 2023

Run Gemini Nano locally on Chrome

HTML 18 Updated Jun 27, 2024

Following master Karpathy with a GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish

Jupyter Notebook 158 5 Updated Jul 30, 2024

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 210 15 Updated May 19, 2024

A Framework of Small-scale Large Multimodal Models

Python 523 48 Updated Jul 30, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,091 39 Updated Jul 14, 2024

A base64 encoder/decoder with gzip or deflate abilities.

JavaScript 27 Updated Jun 12, 2024

Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python 75 5 Updated Mar 26, 2024

MobiLlama: Small Language Model tailored for edge devices

Python 569 41 Updated Mar 3, 2024

This plugin embeds a video player. When you upload a video, it appears in the content as a player rather than as a link. Accepted video formats: MP4, WEBM, MOV, OGV.

JavaScript 1 1 Updated Jun 30, 2021

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 1,945 80 Updated Jul 23, 2024

Examples of Pages using WebM files with Encrypted Media Extensions

JavaScript 8 5 Updated Oct 28, 2018

A simple MP3 and AAC Decoder (not only) for Arduino based on libhelix

C 65 21 Updated Jul 3, 2024

A video player on the M5Stack Core2.

C++ 4 Updated Sep 24, 2023

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 676 50 Updated Jul 30, 2024

ESP32 library that generates composite video signal for PAL, SECAM and NTSC.

C 174 6 Updated May 3, 2022

Build Android apps without any Java, entirely in C and Make

C 2,721 196 Updated May 9, 2024

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.

Jupyter Notebook 19 1 Updated Oct 2, 2023

GPT in TensorFlow.js

JavaScript 27 5 Updated Oct 16, 2023

tiny vision language model

Jupyter Notebook 4,639 413 Updated Jul 30, 2024