Starred repositories
A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse.
The Network UPS Tools repository. UPS management protocol published by IETF as Informational RFC 9271 (https://www.rfc-editor.org/info/rfc9271). Please star NUT on GitHub; this helps with sponsorships!
The ClickHouse plugin for dbt (data build tool)
🆔 Command line tool for deduplicating CSV files
🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.
OpenAPI Generator allows generation of API client libraries (SDK generation), server stubs, documentation and configuration automatically given an OpenAPI Spec (v2, v3)
swagger-codegen contains a template-driven engine to generate documentation, API clients and server stubs in different languages by parsing your OpenAPI / Swagger definition.
🔥 Highlighting the top ML papers every week.
📺 Discover the latest machine learning / AI courses on YouTube.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
"Probabilistic Machine Learning" - a book series by Kevin Murphy
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
All Algorithms implemented in Python
Modelio is a modeling solution offering a wide range of functionalities based on the main standards of enterprise architecture, software development and systems engineering.
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
A high-performance, zero-overhead, extensible Python compiler using LLVM
Companion webpage to the book "Mathematics For Machine Learning"
A curated list to learn about distributed systems
A curated and opinionated list of resources for Chief Technology Officers, with an emphasis on startups
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Apache Superset is a Data Visualization and Data Exploration Platform
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
A curated list of awesome ETL frameworks, libraries, and software.
A list of cool features of Git and GitHub.
A semantic diff utility and library for tree-like files such as JSON, JSON5, XML, HTML, YAML, and CSV.
Semantic Highlighting for Vim
Terminal stock ticker with live updates and position tracking