shibuiwilliam

shibuiwilliam shibuiwilliam

Software engineer for backend, cloud, container, machine learning, LLM and AR. MENSA. https://amzn.to/43hVwtC https://amzn.to/3Uz5AdP

184 followers · 24 following

http:https://qiita.com/cvusk

Achievements

x3 x4 x3

Achievements

x3 x4 x3

Block or Report

Block or report shibuiwilliam

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

u-hyszk / japanese-speculative-decoding

Verification of the effect of speculative decoding in Japanese.

Python 2 Updated Mar 4, 2024

XiongjieDai / GPU-Benchmarks-on-LLM-Inference

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 656 25 Updated May 13, 2024

jd / tenacity

Retrying library for Python

Python 6,336 276 Updated Jul 8, 2024

graphql-python / graphene-pydantic

Integrate GraphQL with your Pydantic models

Python 223 45 Updated Jun 11, 2024

ali-vilab / Ranni

Python 194 14 Updated Apr 10, 2024

ray-project / ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32,210 5,487 Updated Jul 24, 2024

ktr0731 / evans

Evans: more expressive universal gRPC client

Go 4,181 187 Updated Dec 26, 2023

CarperAI / squeakily

A library for squeakily cleaning and filtering language datasets.

Jupyter Notebook 45 9 Updated Jul 10, 2023

iwiwi / epochraft

Checkpointable dataset utilities for foundation model training

Python 31 5 Updated Jan 29, 2024

huggingface / datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 1,798 112 Updated Jul 22, 2024

botelhoa / llm-classifier

Few Shot Text Classification with Large Language Models

Jupyter Notebook 6 2 Updated Oct 16, 2023

lamini-ai / llm-classifier

Classify data instantly using an LLM

Python 218 20 Updated Jun 18, 2024

lilacai / lilac

Curate better data for LLMs

Python 909 81 Updated Mar 19, 2024

ZHAOTING / dialog-processing

NLG and NLU for dialogue processing

Python 43 10 Updated Jun 17, 2023

allenai / wimbd

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 157 16 Updated Jun 10, 2024

drmuskangarg / Multimodal-datasets

This repository is build in association with our position paper on "Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers". As a part of this release we share the informati…

208 18 Updated Jan 10, 2022

kotoba-tech / kotoba-recipes

Support Continual pre-training & Instruction Tuning forked from llama-recipes

Python 31 5 Updated Feb 17, 2024

ricosjp / dagstream

DagStream is the Python package in order to manage relationship between functions, especially for data-preprocessing functions for machine learning applications.

Python 7 Updated May 27, 2024

facebookresearch / textlesslib

Library for Textless Spoken Language Processing

Python 514 50 Updated Aug 29, 2023

comfyanonymous / ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Python 43,105 4,546 Updated Jul 24, 2024

PhoebusSi / Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,530 240 Updated Dec 12, 2023

google-parfait / dataset_grouper

Libraries for efficient and scalable group-structured dataset pipelines.

Python 22 2 Updated Apr 11, 2024

sonos / pyFLAC

Real-time lossless audio compression in Python

C 126 7 Updated Apr 16, 2024

archinetai / audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

Python 128 22 Updated Feb 11, 2023

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,044 2,307 Updated Jul 24, 2024

thu-coai / SafetyBench

Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety.

Python 123 6 Updated Jun 24, 2024

modelscope / data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据！

Python 1,799 124 Updated Jul 24, 2024

pytube / pytube

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Python 11,676 2,443 Updated Jul 23, 2024

iejMac / video2dataset

Easily create large video dataset from video urls

Python 513 59 Updated Jul 18, 2024

facebookresearch / pytorchvideo

A deep learning library for video understanding research.

Python 3,237 397 Updated Mar 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

shibuiwilliam shibuiwilliam

Achievements

Achievements

Block or report shibuiwilliam

Stars

u-hyszk / japanese-speculative-decoding

XiongjieDai / GPU-Benchmarks-on-LLM-Inference

jd / tenacity

graphql-python / graphene-pydantic

ali-vilab / Ranni

ray-project / ray

ktr0731 / evans

CarperAI / squeakily

iwiwi / epochraft

huggingface / datatrove

botelhoa / llm-classifier

lamini-ai / llm-classifier

lilacai / lilac

ZHAOTING / dialog-processing

allenai / wimbd

drmuskangarg / Multimodal-datasets

kotoba-tech / kotoba-recipes

ricosjp / dagstream

facebookresearch / textlesslib

comfyanonymous / ComfyUI

PhoebusSi / Alpaca-CoT

google-parfait / dataset_grouper

sonos / pyFLAC

archinetai / audio-data-pytorch

NVIDIA / NeMo

thu-coai / SafetyBench

modelscope / data-juicer

pytube / pytube

iejMac / video2dataset

facebookresearch / pytorchvideo