- Shanghai
-
07:40
(UTC +08:00) - https://www.zhihu.com/people/lmj-75-77
- https://percent4.github.io/
Block or Report
Block or report percent4
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
中华古诗文数据库和API。包含10000首古文(诗、词、歌、赋以及其它形式的文言文),近4000名作者,10000名句
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)
VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
Image Captioning using Transformers in PyTorch
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
Large Action Model framework to develop AI Web Agents
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
A algebraic word problem dataset, with multiple choice questions annotated with rationales.
PyTorch Tutorial for Deep Learning Researchers
Label Studio is a multi-type data labeling and annotation tool with standardized output format
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
lightweight, standalone C++ inference engine for Google's Gemma models.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
A Native-PyTorch Library for LLM Fine-tuning