Verification of the effect of speculative decoding in Japanese.

Python 2 Updated Mar 4, 2024

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 656 25 Updated May 13, 2024

Retrying library for Python

Python 6,336 276 Updated Jul 8, 2024

Integrate GraphQL with your Pydantic models

Python 223 45 Updated Jun 11, 2024
Python 194 14 Updated Apr 10, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,210 5,487 Updated Jul 24, 2024

Evans: more expressive universal gRPC client

Go 4,181 187 Updated Dec 26, 2023

A library for squeakily cleaning and filtering language datasets.

Jupyter Notebook 45 9 Updated Jul 10, 2023

Checkpointable dataset utilities for foundation model training

Python 31 5 Updated Jan 29, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,798 112 Updated Jul 22, 2024

Few Shot Text Classification with Large Language Models

Jupyter Notebook 6 2 Updated Oct 16, 2023

Classify data instantly using an LLM

Python 218 20 Updated Jun 18, 2024

Curate better data for LLMs

Python 909 81 Updated Mar 19, 2024

NLG and NLU for dialogue processing

Python 43 10 Updated Jun 17, 2023

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 157 16 Updated Jun 10, 2024

This repository is built in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…

208 18 Updated Jan 10, 2022

Support for continual pre-training and instruction tuning, forked from llama-recipes

Python 31 5 Updated Feb 17, 2024

DagStream is a Python package for managing relationships between functions, especially data-preprocessing functions for machine learning applications.

Python 7 Updated May 27, 2024

Library for Textless Spoken Language Processing

Python 514 50 Updated Aug 29, 2023

The most powerful and modular stable diffusion GUI, API, and backend with a graph/nodes interface.

Python 43,105 4,546 Updated Jul 24, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,530 240 Updated Dec 12, 2023

Libraries for efficient and scalable group-structured dataset pipelines.

Python 22 2 Updated Apr 11, 2024

Real-time lossless audio compression in Python

C 126 7 Updated Apr 16, 2024

A collection of useful audio datasets and transforms for PyTorch.

Python 128 22 Updated Feb 11, 2023

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,044 2,307 Updated Jul 24, 2024

Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety.

Python 123 6 Updated Jun 24, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 1,799 124 Updated Jul 24, 2024

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Python 11,676 2,443 Updated Jul 23, 2024

Easily create large video datasets from video URLs

Python 513 59 Updated Jul 18, 2024

A deep learning library for video understanding research.

Python 3,237 397 Updated Mar 3, 2024