map-reduce

Here are 337 public repositories matching this topic...

This project implements a distributed K-means clustering algorithm on a custom-built MapReduce framework. It is designed to handle potentially large datasets by distributing the clustering workload across multiple processes or machines. It uses gRPC for communication between the mapper, reducer, and master.

  • Updated Apr 24, 2024
  • Python
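
As a rough illustration of how K-means can be phrased as map and reduce steps, here is a minimal single-process sketch. It is not the repository's code: the function names (`map_point`, `reduce_cluster`, `kmeans`) are hypothetical, and the gRPC communication and master coordination mentioned in the description are left out entirely.

```python
from collections import defaultdict
from math import dist


def map_point(point, centroids):
    """Map step: emit (index of the nearest centroid, point)."""
    nearest = min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))
    return nearest, point


def reduce_cluster(points):
    """Reduce step: average all points assigned to one centroid."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def kmeans_iteration(points, centroids):
    """Run one map/shuffle/reduce round and return the updated centroids."""
    groups = defaultdict(list)
    for p in points:
        key, value = map_point(p, centroids)
        groups[key].append(value)  # "shuffle": group values by centroid index
    # A centroid with no assigned points keeps its previous position.
    return [
        reduce_cluster(groups[i]) if i in groups else centroids[i]
        for i in range(len(centroids))
    ]


def kmeans(points, centroids, max_iters=20):
    """Repeat rounds until the centroids stop changing or max_iters is hit."""
    for _ in range(max_iters):
        updated = kmeans_iteration(points, centroids)
        if updated == centroids:
            break
        centroids = updated
    return centroids
```

In a distributed setting, each mapper would presumably run `map_point` over its own shard of the data and each reducer would handle one or more centroid indices, with a master broadcasting the updated centroids between rounds.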

This repository provides a Python analysis tool that uses MRJob and MongoDB to identify top-selling music artists by decade. It is aimed at big data processing for music industry analytics, such as studying historical and current musical trends.

  • Updated Apr 13, 2024
  • Python
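
The "top-selling artists by decade" idea can be sketched as a map step keyed on (decade, artist) followed by a summing reduce step. The sketch below is plain Python and deliberately does not use the MRJob or MongoDB APIs; the record fields (`artist`, `year`, `units`) and function names are assumptions made for illustration, not the repository's schema.

```python
from collections import defaultdict


def map_sale(record):
    """Map step: emit ((decade, artist), units) for one sales record."""
    decade = record["year"] // 10 * 10  # e.g. 1987 -> 1980
    return (decade, record["artist"]), record["units"]


def reduce_sales(pairs):
    """Reduce step: sum the units for each (decade, artist) key."""
    totals = defaultdict(int)
    for key, units in pairs:
        totals[key] += units
    return totals


def top_artist_by_decade(records):
    """Pick the artist with the highest summed units in each decade."""
    totals = reduce_sales(map_sale(r) for r in records)
    best = {}
    for (decade, artist), units in totals.items():
        if decade not in best or units > best[decade][1]:
            best[decade] = (artist, units)
    return best
```

With a real MapReduce library, the final "pick the best per decade" pass would typically be a second reduce step keyed on the decade alone.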
