map-reduce

Star

Here are 347 public repositories matching this topic...

Qihoo360 / poseidon

Star

A search engine which can hold 100 trillion lines of log data.

golang search-engine big-data map-reduce poseidon

Updated May 22, 2017
Go

chrislusf / gleam

Sponsor

Star

Fast, efficient, and scalable distributed map/reduce system, DAG execution, in memory or on disk, written in pure Go, runs standalone or distributedly.

golang distributed-systems distributed-computing map-reduce

Updated Jun 11, 2024
Go

tirthajyoti / Spark-with-Python

Star

Fundamentals of Spark with Python (using PySpark), code examples

python machine-learning sql database big-data spark apache-spark hadoop analytics parallel-computing distributed-computing apache map-reduce pyspark hdfs dataframe mlib

Updated Oct 29, 2022
Jupyter Notebook

phelps-sg / python-bigdata

Sponsor

Star

Data science and Big Data with Python

python data-science spark numpy hbase map-reduce numerical-methods notebook-jupyter

Updated Aug 27, 2023
Jupyter Notebook

numaproj / numaflow

Star

Kubernetes-native platform to run massively parallel data/streaming jobs

kubernetes pipeline stream-processing map-reduce k8s data-processing hacktoberfest

Updated Sep 27, 2024
Go

commoncrawl / cc-mrjob

Star

Demonstration of using Python to process the Common Crawl dataset with the mrjob framework

python hadoop map-reduce commoncrawl

Updated Apr 1, 2022
Python

JuliaFolds / Transducers.jl

Star

Efficient transducers for Julia

high-performance julia parallel distributed-computing map-reduce iterators transducers

Updated Sep 26, 2024
Julia

nglthu / infoRetrieval

Star

Inverted Indexer, web crawler, sort, search and poster steamer written using Python for information retrieval.

information-retrieval python3 map-reduce tokens inverted-index terms webcrawler heaps stemming-algorithm

Updated Apr 1, 2019
HTML

xarray-contrib / flox

Star

Fast & furious GroupBy operations for dask.array

xarray map-reduce dask

Updated Sep 21, 2024
Python

daleroberts / pypar

Star

Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.

python big-data mpi map-reduce

Updated Nov 11, 2016
Python

imehrdadmahdavi / map-reduce-inverted-index

Star

Creating an Inverted Index of words occurring in a large set of documents extracted from web pages using Hadoop MapReduce and Google Dataproc

search-engine information-retrieval big-data hadoop clustering bigdata gcp map-reduce inverted-index mapreduce googlecloud dataprocessing dataproc

Updated Oct 28, 2019
Java

RedisGears / redisgears-py

Star

RedisGears python client

redis python-client stream-processing map-reduce redisgears

Updated Jun 19, 2023
Python

tkf / ThreadsX.jl

Star

Parallelized Base functions

high-performance julia parallel map-reduce sorting-algorithms transducers

Updated Sep 26, 2024
Julia

rvantonder / hack_parallel

Star

The core parallel and shared memory library used by Hack, Flow, and Pyre

ocaml parallel map-reduce shared-memory

Updated Feb 27, 2021
OCaml

Cheng-Lin-Li / Spark

Star

There are Python 2.7 codes and learning notes for Spark 2.1.1

spark map-reduce minhash tf-idf kmeans als cosine-similarity python27 kmeans-clustering minhash-lsh-algorithm apriori-algorithm alternating-least-squares uv-decomposition savasere-omiecinski-and-navathe apriori-son

Updated Aug 21, 2018
Python

kalmyk / fox-wamp

Star

Web Application Message Async Server and WAMP/MQTT bridge

mqtt iot websocket stream-processing map-reduce wamp-router async-storage

Updated Sep 19, 2024
JavaScript

fangvv / EdgeLD

Star

Code for paper "Locally Distributed Deep Learning Inference on Edge Device Clusters"

deep-learning cluster parallel-computing distributed-computing inference dnn map-reduce vggnet parallel-algorithm speedup workload edge-computing

Updated May 20, 2023
Python

manuparra / TallerH2S

Star

Taller HDFS, Hadoop y Spark para el Master Profesional de Ingeniería Informática - Universidad de Granada

python java spark hadoop map-reduce hdfs wordcount

Updated May 20, 2019
R

JuliaFolds / data-parallelism

Star

high-performance julia parallel distributed-computing map-reduce iterators transducers franklin

Updated Jan 20, 2022
Julia

gwr3n / jsdp

Star

A Java Stochastic Dynamic Programming Library

java control programming stream dynamic lambda-calculus inventory parallel uncertainty map-reduce object-oriented stochastic optimal maintenance

Updated May 2, 2024
Java

Improve this page

Add a description, image, and links to the map-reduce topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the map-reduce topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

map-reduce

Here are 347 public repositories matching this topic...

Qihoo360 / poseidon

chrislusf / gleam

tirthajyoti / Spark-with-Python

phelps-sg / python-bigdata

numaproj / numaflow

commoncrawl / cc-mrjob

JuliaFolds / Transducers.jl

nglthu / infoRetrieval

xarray-contrib / flox

daleroberts / pypar

imehrdadmahdavi / map-reduce-inverted-index

RedisGears / redisgears-py

tkf / ThreadsX.jl

rvantonder / hack_parallel

Cheng-Lin-Li / Spark

kalmyk / fox-wamp

fangvv / EdgeLD

manuparra / TallerH2S

JuliaFolds / data-parallelism

gwr3n / jsdp

Improve this page

Add this topic to your repo