Skip to content

Docker image with zeppelin; spark and the python libs needed for Data Science

License

BSD-3-Clause, BSD-3-Clause licenses found

Licenses found

BSD-3-Clause
LICENSE
BSD-3-Clause
LICENSE.txt
Notifications You must be signed in to change notification settings

supergarotinho/docker-zeppelin

Docker image of zeppelin notebook   Tweet

Fun data science with zeppelin and docker

Build Status Docker Stars Docker Pulls Price License: BSD-3 Contributions welcome

Built with ❤︎ by Anderson Santos and contributors

DockerHub repository: https://hub.docker.com/r/supergarotinho/zeppelin/

Table of contents

Features

  • Spark - 2.1.1
  • zeppelin - 0.7.1
    • spark
    • shell
    • angular
    • markdown
    • postgresql
    • jdbc
    • python
    • hbase
    • elasticsearch
  • Python libs:
    • Python 3.5
    • Data
      • NumPy
      • pandas
      • PandaSQL
    • ML and Math
      • sklearn
      • SciPy
    • Visualization
      • matplotlib
      • seaborn
      • folium (GeoVisualization)
      • wordcloud
    • Util
      • ijson
      • datetime
      • tweepy
    • NLP
      • nltk
        • punkt - sentence segmentation
        • stopwords
        • rslp - lemmatizer da Viviane Orengo
        • floresta - Corpus Floresta Sint?tica for PT_BR
      • gensim (Topic and language modelling)
    • Graphs
      • networkx
      • igraph

Getting Started

You can enter at the directory that you want to save your notebooks and run:

docker run --rm -d -p 8080:8080 -v $PWD:/notebook -e ZEPPELIN_NOTEBOOK_DIR='/notebook' supergarotinho/zeppelin

Built With

  • Zeppelin - The notebook
  • Spark - The distributed in-memory framework

Contributing

Build and push:

docker build -t supergarotinho/zeppelin .
docker push supergarotinho/zeppelin

Authors

See also the list of contributors who participated in this project.

License

This project is licensed under the BSD-3 License - see the LICENSE.md file for details

Acknowledgments

  • From zeppelin official image: apache/zeppelin:0.7.2

About

Docker image with zeppelin; spark and the python libs needed for Data Science

Resources

License

BSD-3-Clause, BSD-3-Clause licenses found

Licenses found

BSD-3-Clause
LICENSE
BSD-3-Clause
LICENSE.txt

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published