Starred repositories

Improving Text Embedding of Language Models Using Contrastive Fine-tuning

Python 16 1 Updated Aug 2, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 730 39 Updated Jul 31, 2024

Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?"

22 Updated Jul 26, 2024
Python 195 22 Updated Aug 2, 2024

A Blazing Fast AI Gateway. Route to 200+ LLMs with 1 fast & friendly API.

TypeScript 5,510 371 Updated Aug 2, 2024
Python 66 4 Updated Aug 2, 2024

Fast inference from large language models via speculative decoding

Python 444 47 Updated Jul 25, 2024

Utilities intended for use with Llama models.

Python 3,070 443 Updated Aug 1, 2024

Blazingly fast neighborhood attention

Python 8 Updated Nov 28, 2023

Implementation of Infini-Transformer in PyTorch

Python 97 1 Updated May 9, 2024

Repo of HawkLlama.

Python 8 Updated Jun 21, 2024

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)

Python 132 5 Updated May 31, 2024

EfficientViT is a new family of vision models for efficient high-resolution vision.

Python 1,680 153 Updated Jul 29, 2024

Work-in-progress vector search SQLite extension that runs anywhere.

C 1,591 30 Updated Aug 1, 2024

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

Python 139 10 Updated Apr 3, 2024

TF-ID: Table/Figure IDentifier for academic papers

Python 1 Updated Jul 11, 2024

Latent Large Language Models

C++ 16 Updated Jul 12, 2024

GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM

Python 120 9 Updated Jul 12, 2024

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 238 9 Updated Jul 24, 2024

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 317 27 Updated Apr 23, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 312 16 Updated Jul 18, 2024

A PyTorch port of Google's RETSim model used in UniSim

Python 3 Updated Mar 25, 2024
Jupyter Notebook 149 22 Updated Jan 4, 2024

This repository combines the CPO and SimPO methods for better reference-free preference learning.

Python 18 3 Updated Jun 22, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,621 99 Updated Jul 26, 2024

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Python 790 45 Updated Aug 2, 2024

Gemma 2B with 10M context length using Infini-attention.

Python 911 57 Updated May 12, 2024

Efficient Infinite Context Transformers with Infini-attention: PyTorch implementation + QwenMoE implementation + training script + 1M context passkey retrieval

Python 54 5 Updated May 9, 2024

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

Python 258 22 Updated May 4, 2024