Skip to content
View SparksJoe's full-sized avatar
  • 20:37 (UTC +08:00)

Highlights

  • Pro
Block or Report

Block or report SparksJoe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).

Python 105 2 Updated Jun 27, 2024

A Framework for Decoupling and Assessing the Capabilities of VLMs

Python 27 1 Updated Jun 28, 2024

[ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset

64 1 Updated Jul 3, 2024

The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"

Python 45 2 Updated Apr 22, 2024

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

295 8 Updated Mar 22, 2024

Open-source evaluation toolkit of large vision-language models (LVLMs), support GPT-4v, Gemini, QwenVLPlus, 50+ HF models, 20+ benchmarks

Python 677 80 Updated Jul 11, 2024

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

Python 764 80 Updated Dec 15, 2023

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,245 343 Updated Jul 11, 2024