Spark.jl

Not under current development

This repository was part of a project for MIT's Distributed Systems class and is no longer being developed. Another implementation of Spark in Julia appears to be actively maintained:

https://github.com/dfdx/Spark.jl

Usage

This is a basic implementation of Apache Spark for Julia. Usage examples can be found in the test directory. List the workers in a JSON file such as default_workers.json, then run julia start_worker.jl on each worker machine, specifying the correct port.

On the local machine, start a Spark.Master instance and use it to create and manipulate resilient distributed datasets (RDDs):

using Spark
master = Spark.Master("127.0.0.1", 3333)
# fill in master.workers
Spark.load(master, "default_workers.json")
# start master listener
Spark.initserver(master)

# Create and manipulate RDDs
rdd = Spark.input(master, "RDDA.txt", "int_reader")
filtered_rdd = Spark.filter(master, rdd, "number_filter")
results = Spark.collect(master, filtered_rdd)
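
To make the filter and collect steps above more concrete, here is a minimal sketch in plain Julia that does not call this package at all. It assumes that an RDD behaves like a list of partitions, one per worker, and it uses a hypothetical predicate is_even, since this README does not show what "number_filter" actually tests. The names partitions, is_even, filtered, and results are illustrative only.

```julia
# Plain-Julia stand-in for the pipeline above (no Spark.jl calls).
# Each inner vector plays the role of one partition held by one worker.
partitions = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]

# Hypothetical predicate; the real "number_filter" is not shown in this README.
is_even(x) = x % 2 == 0

# filter: applied to every partition independently, so no data moves between them.
filtered = [filter(is_even, p) for p in partitions]

# collect: gather the filtered partitions into a single local array.
results = reduce(vcat, filtered)

println(results)  # prints [2, 4, 6, 8]
```

In the distributed version, the per-partition filter would presumably run on the workers and only the collect step would bring data back to the master.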
