
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,571 96 Updated Jul 6, 2024
Python 12 Updated Jun 7, 2024

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Python 8,019 563 Updated Jul 19, 2024
Python 1,307 71 Updated Jul 19, 2024

The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024

Python 11 Updated May 11, 2024

Multimodal language model benchmark, featuring challenging examples

Python 139 6 Updated May 14, 2024

Data Toolkit for Sailor Language Models

Python 68 6 Updated Jul 11, 2024

Sailor: Open Language Models for South-East Asia

Python 84 7 Updated Jul 11, 2024

Unsupervised text tokenizer focused on computational efficiency

C++ 950 97 Updated Mar 29, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 10,530 1,503 Updated Jul 21, 2024

The official Meta Llama 3 GitHub site

Python 23,414 2,519 Updated Jul 17, 2024

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

279 10 Updated Apr 18, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,862 294 Updated Jul 16, 2024

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 223 7 Updated Jun 25, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,102 277 Updated May 4, 2024
Python 4 Updated Mar 29, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 1,891 180 Updated Apr 24, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,122 63 Updated Jul 20, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 10,936 975 Updated Jul 21, 2024

distributed trainer for LLMs

Python 503 73 Updated May 20, 2024

The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".

480 19 Updated Mar 21, 2024

The startup template for Chirpy.

Ruby 506 251 Updated Jun 25, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,774 800 Updated Jul 1, 2024

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python 67 8 Updated Jul 15, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 352 119 Updated Jul 4, 2024

Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)

Python 633 31 Updated Apr 2, 2024

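OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and supports one-click deployment that works out of the box. OpenAI key management & redistributi…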

JavaScript 16,569 3,816 Updated Jul 21, 2024

DeepSeek Coder: Let the Code Write Itself

Python 6,143 441 Updated May 21, 2024

DeepSeek LLM: Let there be answers

Makefile 1,338 88 Updated Feb 4, 2024