Skip to content
This repository has been archived by the owner on Nov 21, 2018. It is now read-only.

OutOfMemoryError #7

Open
carschno opened this issue Jul 24, 2014 · 0 comments
Open

OutOfMemoryError #7

carschno opened this issue Jul 24, 2014 · 0 comments

Comments

@carschno
Copy link

Hi,
I run Mr.LDA VariationalInference on a data set with the following basic parameters:
14/07/24 11:43:58 INFO mrlda.VariationalInference: Total number of documents is: 605830 14/07/24 11:43:58 INFO mrlda.VariationalInference: Total number of terms is: 2573070

In Iteration 2, all map jobs fail due to OutOfMemoryError exceptions:

14/07/24 11:59:01 INFO mapreduce.Job: Task Id : attempt_1403876159389_0503_m_00 0041_2, Status : FAILED Error: java.lang.RuntimeException: Error in configuring object at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109) at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java: 75) at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133) at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:426) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342) at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:168) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1548) at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:163) Caused by: java.lang.reflect.InvocationTargetException at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAcce ssorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106) ... 9 more Caused by: java.lang.RuntimeException: Error in configuring object at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:109) at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:75) at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:133) at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:38) ... 14 more Caused by: java.lang.reflect.InvocationTargetException at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:606) at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:106) ... 17 more Caused by: java.lang.OutOfMemoryError: Java heap space at cc.mrlda.DocumentMapper.importBeta(DocumentMapper.java:514) at cc.mrlda.DocumentMapper.configure(DocumentMapper.java:116) ... 22 more

This is re-producibly with varying parameters regarding corpus size, number of mapper, heap size (tried up to 32GB).
Our Hadoop cluster comprises 24 nodes with a total of 504 CPU cores and 1.1TB RAM.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant