Skip to content
View akashmjn's full-sized avatar

Block or report akashmjn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Vision Document Retrieval (ViDoRe): Benchmark 👀. Evaluation code for the "ColPali: Efficient Document Retrieval with Vision Language Models" paper.

Python 77 5 Updated Sep 1, 2024

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 331 24 Updated Jun 4, 2024

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,590 169 Updated Sep 1, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 4,822 324 Updated Sep 2, 2024

Tevatron - A flexible toolkit for neural retrieval research and development.

Python 455 90 Updated Aug 20, 2024

The official Meta Llama 3 GitHub site

Python 25,876 2,887 Updated Aug 12, 2024

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 42,269 7,644 Updated Sep 2, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,185 170 Updated Sep 1, 2024

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Python 140 10 Updated Apr 3, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,455 5,509 Updated Jun 24, 2024

A Repo For Document AI

Python 2,467 127 Updated Aug 26, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,045 2,086 Updated Aug 12, 2024

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,725 629 Updated Aug 26, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 515 36 Updated Aug 23, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,994 824 Updated Jul 1, 2024

Redis Python client

Python 12,518 2,496 Updated Sep 2, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,259 188 Updated Sep 2, 2024

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,686 87 Updated Jan 21, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,320 4,470 Updated Sep 2, 2024

This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and…

Jupyter Notebook 23 4 Updated Dec 23, 2023

MLX: An array framework for Apple silicon

C++ 16,321 929 Updated Sep 2, 2024

All things prompt engineering

Python 5,332 292 Updated Jun 4, 2024
Python 285 11 Updated Jun 21, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 677 39 Updated May 30, 2024

A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB team.

Python 246 37 Updated Dec 15, 2023

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,683 209 Updated Sep 1, 2024

A New Tamil Large Language Model (LLM) Based on Llama 2

Python 251 33 Updated Apr 5, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 30,279 3,541 Updated Sep 1, 2024

Train transformer language models with reinforcement learning.

Python 9,181 1,149 Updated Sep 2, 2024

Robust recipes to align language models with human and AI preferences

Python 4,418 384 Updated Aug 20, 2024
Next