Skip to content
View kamuyix's full-sized avatar

Block or report kamuyix

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 51 4 Updated Aug 7, 2024

(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Python 294 8 Updated Jul 11, 2024

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python 1,510 191 Updated Aug 12, 2020

PyTorch implementation of AirFormer, AAAI-23

Python 96 22 Updated Dec 29, 2022

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 8,700 848 Updated Aug 11, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 9,453 943 Updated Aug 23, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 455 28 Updated May 20, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 63,414 7,856 Updated Aug 21, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 392 156 Updated Jul 4, 2024

VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.

Python 164 12 Updated Aug 23, 2024

A collection of visual instruction tuning datasets.

Python 73 3 Updated Mar 14, 2024
Python 104 11 Updated Apr 16, 2024

Multilingual Sentence & Image Embeddings with BERT

Python 14,708 2,425 Updated Aug 24, 2024

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,649 460 Updated Jul 11, 2024

Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".

Python 39 1 Updated Jul 16, 2024

Convert avro files to parquet, csv and json format

Python 21 2 Updated Jun 30, 2021

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 18,882 2,070 Updated Aug 12, 2024

✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.

TypeScript 29,692 1,778 Updated Aug 22, 2024

Grok open release

Python 49,372 8,326 Updated Aug 7, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,135 834 Updated Aug 13, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 4,213 439 Updated Aug 6, 2024

Yet another Ph.D. adventure.

15 4 Updated Apr 15, 2024

The source code of IJCAI2020 paper "Unsupervised Monocular Visual-inertial Odometry Network".

Python 51 6 Updated Aug 23, 2023

Code for "Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection", ECCV 2022

Python 97 18 Updated Oct 19, 2022

Deep learning algorithms source code for beginners

Python 1,191 988 Updated Aug 13, 2020

Top Deep Learning Projects based on their Stars!

Python 412 117 Updated Feb 17, 2024

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 23,467 3,077 Updated Aug 14, 2024

Curated list of project-based tutorials

193,426 25,210 Updated Aug 15, 2024