Skip to content
View huiruru's full-sized avatar

Block or report huiruru

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

a self-hosted webui for 30+ generative ai

Python 417 39 Updated Sep 1, 2024

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,715 629 Updated Aug 26, 2024

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Svelte 37,503 4,320 Updated Sep 1, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,255 268 Updated Aug 7, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 16,269 1,258 Updated Sep 1, 2024

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

2,121 311 Updated Jun 30, 2024

OpenAI Whisper Container (GPU and CPU) and Lambda (CPU) - speech recognition model

Shell 41 5 Updated May 9, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 66,910 7,886 Updated Aug 19, 2024

SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open sour…

TypeScript 18,221 1,157 Updated Sep 2, 2024

A curated list of awesome blogs, videos, tools and resources about Data Contracts

157 20 Updated Aug 12, 2024

Automatically upgrade your Django projects.

Python 963 62 Updated Aug 26, 2024

Scratch is a swiss army knife for big data.

Go 1,075 53 Updated Jul 19, 2024

A tool for running on-premises large language models with non-public data

Jupyter Notebook 683 32 Updated Aug 23, 2024

the AI-native open-source embedding database

Rust 14,340 1,197 Updated Sep 2, 2024

Integrate cutting-edge LLM technology quickly and easily into your apps

C# 21,195 3,105 Updated Sep 2, 2024

Fancy stream processing made operationally mundane

Go 8,077 811 Updated Aug 30, 2024

For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR

Python 65 29 Updated Jan 2, 2022

CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code

Go 12,344 1,628 Updated Aug 26, 2024

Template for a data contract used in a data mesh.

460 85 Updated Mar 13, 2024

StableLM: Stability AI Language Models

Jupyter Notebook 15,849 1,034 Updated Apr 8, 2024

A collection of postmortems. Sorry for the delay in merging PRs!

11,210 435 Updated Jul 24, 2024

A publicly compiled list of tech conferences that offer childcare. Special thanks to all the people who contributed to this by responding to me on twitter. Contribute or make improvements if you fe…

13 6 Updated Jun 9, 2022

The Internals of Spark on Kubernetes

71 18 Updated May 9, 2022

Serverless Python

Python 11,884 1,204 Updated Mar 23, 2023

Boundary enables identity-based access management for dynamic infrastructure.

Go 3,828 281 Updated Sep 1, 2024

Prefect is a workflow orchestration framework for building resilient data pipelines in Python.

Python 15,748 1,535 Updated Sep 2, 2024

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…

Python 3,875 683 Updated Sep 2, 2024

Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment

Python 2,743 1,244 Updated Aug 7, 2024

NLP, before and after spaCy

Python 2,200 247 Updated Sep 22, 2023

Distributed PostgreSQL as an extension

C 10,323 655 Updated Aug 23, 2024
Next