GitHub - xxling/mammoth: A memory-centric MapReduce based on hadoop-1.0.1.

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
src/org/apache/hadoop/mapred		src/org/apache/hadoop/mapred
LICENSE.txt		LICENSE.txt
README.txt		README.txt
hadoop-core-1.0.1-mammoth-0.9.0.jar		hadoop-core-1.0.1-mammoth-0.9.0.jar

Repository files navigation

How to use: 
Mammoth is a memory-centric MapReduce based on hadoop-1.0.1 aiming to solve the problem of I/O bottleneck in 
data-intensive applications. About how to use hadoop, you can refer to its homepage: https://hadoop.apache.org/. 
In the following part of this document, we assume you are familiar with hadoop.
You can just replace the "hadoop-core-1.0.1.jar" under $HADOOP_HOME with the compiled "hadoop-core-1.0.1-mammoth-0.9.0.jar".
After that you can use mammoth just in the same way with original hadoop.
Mammoth is developed with 64-bit jdk7, and you are suggested to use the same.
You must specify the child jvm options before running your job, eg:
<property>
   <name>mapred.job.child.java.opts</name>
   <value>-d64 -Xmx8000M -Xms8000M</value>
</property>
This parameter is the only one required to be manually specified because Mammoth can maximize the usage of memory 
in runtime using a rule-based heuristic. You can learn more about the Mammoth on the following page: 
https://grid.hust.edu.cn/xhshi/projects/mammoth.htm.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

License

xxling/mammoth

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages