Skip to content
View nzw0301's full-sized avatar
🐢
I may be slow to respond.
🐢
I may be slow to respond.
  • Preferred Networks, Inc. / Preferred Elements, Inc.
  • Japan
  • 02:34 (UTC +09:00)

Organizations

@apache @optuna

Block or report nzw0301

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Long Context Extension and Generalization in LLMs

Python 31 2 Updated Sep 21, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 21,178 622 Updated Sep 27, 2024
Python 5,683 420 Updated Sep 27, 2024

An Experiment on Dynamic NTK Scaling RoPE

Python 59 3 Updated Nov 26, 2023

Helpful tools and examples for working with flex-attention

Python 363 15 Updated Aug 17, 2024

Whisperのデコーダをllm-jp-1.3b-v1.0に置き換えた音声認識モデルを学習させるためのコード

Python 5 Updated Sep 7, 2024

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,315 115 Updated Apr 17, 2024

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 132 8 Updated Aug 31, 2024

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 85 8 Updated Aug 23, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,474 152 Updated Aug 17, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 529 21 Updated Sep 21, 2024
Python 39 5 Updated Jun 28, 2024

Efficient Triton Kernels for LLM Training

Python 3,080 158 Updated Sep 25, 2024

LaTeX style file for Transactions on Machine Learning Research

TeX 8 3 Updated Jun 30, 2023

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

Python 5,592 649 Updated Sep 26, 2024

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 147 10 Updated Jul 12, 2024

PyTorch video decoding

Python 56 7 Updated Sep 26, 2024

Official pytorch implementation of "Unsqueeze [CLS] Bottleneck to Learn Rich Representations " (ECCV 2024)

Python 5 Updated Sep 26, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,188 196 Updated Sep 27, 2024

Retrying library for Python

Python 6,583 281 Updated Sep 1, 2024

Maestro: Netflix’s Workflow Orchestrator

Java 3,255 199 Updated Aug 9, 2024

Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.

Python 2,112 164 Updated Sep 27, 2024

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 208 3 Updated Jul 20, 2024

Minimalist ML framework for Rust

Rust 15,296 894 Updated Sep 26, 2024

PyTorch native quantization and sparsity for training and inference

Python 834 100 Updated Sep 27, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,507 4,048 Updated Sep 27, 2024

Fast inference from large lauguage models via speculative decoding

Python 522 51 Updated Aug 22, 2024
Next