nzw0301

🐢

I may be slow to respond.

Kento Nozawa nzw0301

🐢

I may be slow to respond.

181 followers · 127 following

Preferred Networks, Inc. / Preferred Elements, Inc.
Japan
02:34 (UTC +09:00)
nzw0301.github.io

Achievements

x4 x3 x3 x2

Achievements

x4 x3 x3 x2

Organizations

Lists (4)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

Leooyii / LCEG

Long Context Extension and Generalization in LLMs

Python 31 2 Updated Sep 21, 2024

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 21,178 622 Updated Sep 27, 2024

kyutai-labs / moshi

Python 5,683 420 Updated Sep 27, 2024

NormXU / Consistent-DynamicNTKRoPE

An Experiment on Dynamic NTK Scaling RoPE

Python 59 3 Updated Nov 26, 2023

google-deepmind / pg19

226 18 Updated Feb 25, 2020

pytorch-labs / attention-gym

Helpful tools and examples for working with flex-attention

Python 363 15 Updated Aug 17, 2024

apple / ml-sigmoid-attention

Python 212 9 Updated Sep 9, 2024

tosiyuki / llm-jp-asr

Whisperのデコーダをllm-jp-1.3b-v1.0に置き換えた音声認識モデルを学習させるためのコード

Python 5 Updated Sep 7, 2024

jquesnelle / yarn

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,315 115 Updated Apr 17, 2024

google-deepmind / loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 132 8 Updated Aug 31, 2024

microsoft / LongRoPE

LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 85 8 Updated Aug 23, 2024

gkamradt / LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 1,474 152 Updated Aug 17, 2024

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Cuda 529 21 Updated Sep 21, 2024

pfnet-research / plamo-examples

18 1 Updated Sep 11, 2024

xincanfeng / vitsGPT

Python 39 5 Updated Jun 28, 2024

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 3,080 158 Updated Sep 25, 2024

JmlrOrg / tmlr-style-file

LaTeX style file for Transactions on Machine Learning Research

TeX 8 3 Updated Jun 30, 2023

allegroai / clearml

ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

Python 5,592 649 Updated Sep 26, 2024

mct10 / RepCodec

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 147 10 Updated Jul 12, 2024

pytorch / torchcodec

PyTorch video decoding

Python 56 7 Updated Sep 26, 2024

QingSuML / udi

Official pytorch implementation of "Unsqueeze [CLS] Bottleneck to Learn Rich Representations " (ECCV 2024)

Python 5 Updated Sep 26, 2024

pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,188 196 Updated Sep 27, 2024

jd / tenacity

Retrying library for Python

Python 6,583 281 Updated Sep 1, 2024

Netflix / maestro

Maestro: Netflix’s Workflow Orchestrator

Java 3,255 199 Updated Aug 9, 2024

astanin / python-tabulate

Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.

Python 2,112 164 Updated Sep 27, 2024

baaivision / EVE

[NeurIPS'24 Spotlight] EVE: Encoder-Free Vision-Language Models

Python 208 3 Updated Jul 20, 2024

huggingface / candle

Minimalist ML framework for Rust

Rust 15,296 894 Updated Sep 26, 2024

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 834 100 Updated Sep 27, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,507 4,048 Updated Sep 27, 2024

feifeibear / LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Python 522 51 Updated Aug 22, 2024

Kento Nozawa nzw0301

Organizations

Lists (4)

datasets

resources

self-sup

tools

Stars