大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
-
Updated
Jul 7, 2024 - Java
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批流,私域营销等模块
Roadmap for Data Engineering
High Performance Tensorflow Data Pipeline with State of Art Augmentations and low level optimizations.
Simple stream processing pipeline
Tensorflow 2 Tutorials (use tensorflow and keras in a better way!)
Terraform module designed to easily backup EFS filesystems to S3 using DataPipeline
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Building Json data pipeline within Snowflake using Streams and Tasks
kedro cli plugin for generating a static kedro viz site (html, css, js) that can be deployed on many serverless tools.
Ethereum client written in Go, modified for full-hierarchy data exports and block specimen production
Domain-specific language to help build and maintain AWS Data Pipelines
Awesome list for datapipeline
A GitHub Action to lint, test, build-docs, package, and run your kedro pipelines. Supports any Python version you'll give it (that is also supported by pyenv).
Go library that provides easy-to-use interfaces and tools for TensorFlow users, in particular allowing to train existing TF models on .tar and .tgz datasets
Материалы для курса Введение в Data Engineering: дата пайплайны
High speed message passing between various queues and services
Modeling tool like DBT to use SQL Alchemy core with a DataFrame interface like
Global Tree Cover Loss Analysis using Geotrellis and SPARK
Simple Airflow on Kubernetes (GKE)
Add a description, image, and links to the datapipeline topic page so that developers can more easily learn about it.
To associate your repository with the datapipeline topic, visit your repo's landing page and select "manage topics."