-
Harbin Institute of Technology
- Singapore
-
22:02
(UTC +08:00) - https://looperxx.github.io/
- @looperxx27
Highlights
- Pro
Block or Report
Block or report LooperXX
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
Multimodal language model benchmark, featuring challenging examples
Sailor: Open Language Models for South-East Asia
Unsupervised text tokenizer focused on computational efficiency
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Accelerating the development of large multimodal models (LMMs) with lmms-eval
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
中文Mixtral-8x7B(Chinese-Mixtral-8x7B)
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistributi…
DeepSeek Coder: Let the Code Write Itself