Starred repositories
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Use PEFT or full-parameter training to fine-tune 400+ LLMs or 100+ MLLMs. (LLMs: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLMs: Qwen2-VL, Qwen2-Audio, Llama3.2-V…
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
Korean Sentence Embedding Model Performance Benchmark for RAG
Official repository for "AM-RADIO: Reduce All Domains Into One"
KoRean based SBERT pre-trained models (KR-SBERT) for PyTorch
Utilities intended for use with Llama models.
AnyLoc: Universal Visual Place Recognition (RA-L 2023)
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Doppelgangers: Learning to Disambiguate Images of Similar Structures
A personal list of papers and resources of image matching and pose estimation, including perspective images and panoramas.
Code release for CVPR'24 submission 'OmniGlue'
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Korean Sentence Embedding Repository
[CVPR 2024] RoMa: Robust Dense Feature Matching, a dense feature matcher that estimates pixel-dense warps and reliable certainties for almost any image pair.
MTEB: Massive Text Embedding Benchmark (see the embedding-evaluation sketch after this list)
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Democratization of RT-2, the model described in "RT-2: New model translates vision and language into action".
Efficient vision foundation models for high-resolution generation and perception.
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of the multilingual text perception and comprehension capabilities of multimodal large models across nine…
GLM-4 series: Open Multilingual Multimodal Chat LMs
The official repo of Qwen-VL (通义千问-VL), the chat and pretrained large vision-language models proposed by Alibaba Cloud.
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
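Several of the starred repositories above cover sentence-embedding models and their evaluation (Korean Sentence Embedding, KR-SBERT, MTEB). The snippet below is a minimal sketch of scoring such a model with MTEB, assuming the `mteb` and `sentence-transformers` packages are installed; the model id and task name are illustrative placeholders, not prescribed by any of the repos listed.

```python
# Minimal sketch: evaluate a sentence-embedding model on an MTEB task.
# Assumptions: `pip install mteb sentence-transformers`; the model id below is the
# KR-SBERT checkpoint on the Hugging Face Hub, and the task name is illustrative only.
from mteb import MTEB
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("snunlp/KR-SBERT-V40K-klueNLI-augSTS")  # KR-SBERT (listed above)
evaluation = MTEB(tasks=["STSBenchmark"])  # swap in the tasks/languages you care about
evaluation.run(model, output_folder="results/kr-sbert")  # writes per-task JSON scores
```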