shibuiwilliam
877 results for source starred repositories
Extract frames and motion vectors from H.264 and MPEG-4 encoded video.

C 298 59 Updated Nov 1, 2024

A utility tool to create a tarball of existing objects in Amazon S3

Go 171 12 Updated Oct 23, 2024

Any Install makes installer shell scripts easier to maintain through its own manifest DSL.

TypeScript 2 Updated Nov 3, 2024

Verification of the effect of speculative decoding in Japanese.

Python 2 Updated Mar 4, 2024

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 991 38 Updated May 13, 2024

Retrying library for Python

Python 6,718 281 Updated Nov 1, 2024

Integrate GraphQL with your Pydantic models

Python 229 44 Updated Jun 11, 2024
Python 215 15 Updated Apr 10, 2024

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,764 5,744 Updated Nov 6, 2024

Evans: more expressive universal gRPC client

Go 4,273 188 Updated Dec 26, 2023

A library for squeakily cleaning and filtering language datasets.

Jupyter Notebook 45 9 Updated Jul 10, 2023

Checkpointable dataset utilities for foundation model training

Python 31 5 Updated Jan 29, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,023 144 Updated Oct 31, 2024

Few Shot Text Classification with Large Language Models

Jupyter Notebook 9 3 Updated Oct 16, 2023

Curate better data for LLMs

Python 953 89 Updated Mar 19, 2024

NLG and NLU for dialogue processing

Python 43 10 Updated Jun 17, 2023

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 188 20 Updated Sep 9, 2024

This repository is built in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…

249 18 Updated Jan 10, 2022

Support for continual pre-training and instruction tuning, forked from llama-recipes

Python 32 4 Updated Feb 17, 2024

DagStream is a Python package for managing relationships between functions, especially data-preprocessing functions for machine learning applications.

Python 8 Updated May 27, 2024

The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.

Python 55,735 5,882 Updated Nov 5, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., LoRA, P-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,609 246 Updated Dec 12, 2023

Libraries for efficient and scalable group-structured dataset pipelines.

Python 23 3 Updated Oct 2, 2024

Real-time lossless audio compression in Python

C 131 7 Updated Apr 16, 2024

A collection of useful audio datasets and transforms for PyTorch.

Python 132 22 Updated Feb 11, 2023

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 12,032 2,506 Updated Nov 6, 2024

Official GitHub repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]

Python 153 6 Updated Jun 24, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 2,856 174 Updated Nov 6, 2024

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube videos.

Python 12,277 2,522 Updated Aug 15, 2024

Easily create large video datasets from video URLs

Python 545 65 Updated Jul 30, 2024