🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
-
Updated
Apr 7, 2021 - Shell
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
A GitHub action to build data science environment images with repo2docker and push them to registries.
A handbook for those who want to start coordinating Hacky Hour events in their University/Institute
Snorkel - Bootstrap your Data Science
Dockerize Data Science
Presentations from the GDG Cloud Chennai meetup event.
K3ai plugins Repo is the place where we host all the optional capabilites of k3ai. The main goal of the repo is to mantainer k3ai simple and lightweight while adding capabilites in the form of manifests or helm charts.
Environment for Data Workshop
Easily Bootstrap an AI/Data Science/Deep Learning DevBox for Ubuntu Desktop 18.04 LTS
This is a quick-and-dirty data analytics platform based on Spark, Hadoop and Jupyterhub. All this tools are deployed automatically with docker and docker-compose.
Dockerfile setup for Spark set-up, imbued with varying degree of Python data science packages
Explore and demo label-studio on OpenShift
Infrastructure as a code for reproducible data science pipelines using CWL and Airflow
Template for running Python 3.x shell scripts and notebooks in a Docker container for isolation, security, and portability
Package for conducting PCA incl. scree-plot and bi-plot
This is a project where I created classes in C++ for doubly linked lists and nodes in order to learn data structures. Once main.cc is ran, the program prompts the user to create different types of data structures, each having their own unique functions.
Service for automatic matching two data sets without mapping
Add a description, image, and links to the datascience topic page so that developers can more easily learn about it.
To associate your repository with the datascience topic, visit your repo's landing page and select "manage topics."