Skip to content
View Giruvegan's full-sized avatar

Block or report Giruvegan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,762 115 Updated Oct 30, 2024

Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-V…

Python 4,271 377 Updated Nov 18, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 3,138 193 Updated Oct 4, 2024

Official inference framework for 1-bit LLMs

C++ 11,214 760 Updated Nov 11, 2024

Korean Sentence Embedding Model Performance Benchmark for RAG

Jupyter Notebook 44 4 Updated Apr 27, 2024

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 807 33 Updated Nov 5, 2024

KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch

95 13 Updated May 3, 2022

Utilities intended for use with Llama models.

Python 4,829 831 Updated Nov 9, 2024

📖 Korean NLU Benchmark

565 57 Updated Jul 6, 2022

AnyLoc: Universal Visual Place Recognition (RA-L 2023)

Python 470 43 Updated Mar 13, 2024

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Python 3,413 336 Updated Jun 20, 2024

Doppelgangers: Learning to Disambiguate Images of Similar Structures

Jupyter Notebook 179 24 Updated Mar 1, 2024

A personal list of papers and resources of image matching and pose estimation, including perspective images and panoramas.

259 29 Updated Sep 9, 2024

Code release for CVPR'24 submission 'OmniGlue'

Python 573 50 Updated Aug 12, 2024

Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!

Jupyter Notebook 994 111 Updated Oct 27, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,284 95 Updated Oct 8, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 6,041 471 Updated Nov 18, 2024

Korean Sentence Embedding Repository

Python 202 17 Updated Jan 23, 2024

MetaFormer Baselines for Vision (TPAMI 2024)

Python 420 28 Updated Jun 1, 2024

[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.

Python 618 51 Updated Oct 23, 2024

MTEB: Massive Text Embedding Benchmark

Jupyter Notebook 1,954 273 Updated Nov 18, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 47,331 5,784 Updated Nov 18, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 4,187 397 Updated Nov 18, 2024

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python 376 50 Updated Jul 26, 2024

Efficient vision foundation models for high-resolution generation and perception.

Python 2,353 189 Updated Nov 12, 2024

Korean Visual Question Answering

57 5 Updated Feb 18, 2020

MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingual text perception and comprehension capabilities across nine…

Python 45 2 Updated Sep 29, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,284 436 Updated Nov 14, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,058 385 Updated Aug 7, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,625 889 Updated Oct 22, 2024
Next