Skip to content

shihabuddinbuet/machine-learning-spark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

machine-learning-spark

Apache spark is an open source popular big data processing engine. It has provided machine learning library to make machine learning scalable and easy. The aim of the project is to show how to use apache spark machine learning library.

Getting Started

If you are new with apache spark, please learn basic about the tool and see some example codes before getting started.

Prerequesite

The following tools need to installed before using the project

  • Java sdk >=7.0
  • Maven >=3.0.0
  • Scala sdk >=2.11
  • Apache spark >=2.0.0 (if you want to submit in spark local machine)

Installing

  1. Install java from here if java is not installed
  2. Install Maven by following the instructions
  3. Download and install apache spark
  4. Open any terminal
  5. Clone project git clone [email protected]:shihabuddinbuet/machine-learning-spark.git
  6. Run cd machine-learning-spark
  7. Run mvn clean -DskipTests package to build the project
  8. Submit the jar in spark for any of the main apps

Authors

See also the list of contributors who participated in this project.

About

use of spark machine learning library

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages