Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

F14Table rehashimpl assert failure #2215

Open
xingnailu opened this issue May 27, 2024 · 1 comment
Open

F14Table rehashimpl assert failure #2215

xingnailu opened this issue May 27, 2024 · 1 comment

Comments

@xingnailu
Copy link

I am using Gluten + Velox + folly, while container running with reading s3 data, throw this exception:

[2024-05-27 01:45:46.687]Container exited with a non-zero exit code 134. Error file: prelaunch.err.
Last 4096 bytes of prelaunch.err :
/bin/bash: line 1: 2322107 Aborted (core dumped) /usr/lib/jvm/java-1.8.0/bin/java -server -Xmx4096m '-Djava.net.preferIPv6Addresses=false' '-XX:+IgnoreUnrecognizedVMOptions' '--add-opens=java.base/java.lang=ALL-UNNAMED' '--add-opens=java.base/java.lang.invoke=ALL-UNNAMED' '--add-opens=java.base/java.lang.reflect=ALL-UNNAMED' '--add-opens=java.base/java.io=ALL-UNNAMED' '--add-opens=java.base/java.net=ALL-UNNAMED' '--add-opens=java.base/java.nio=ALL-UNNAMED' '--add-opens=java.base/java.util=ALL-UNNAMED' '--add-opens=java.base/java.util.concurrent=ALL-UNNAMED' '--add-opens=java.base/java.util.concurrent.atomic=ALL-UNNAMED' '--add-opens=java.base/jdk.internal.ref=ALL-UNNAMED' '--add-opens=java.base/sun.nio.ch=ALL-UNNAMED' '--add-opens=java.base/sun.nio.cs=ALL-UNNAMED' '--add-opens=java.base/sun.security.action=ALL-UNNAMED' '--add-opens=java.base/sun.util.calendar=ALL-UNNAMED' '--add-opens=java.security.jgss/sun.security.krb5=ALL-UNNAMED' '-Djdk.reflect.useDirectMethodHandle=false' '-Ddubbo.application.qos.enable=false' '-Duser.timeZone=GMT+08' '-Dcom.amazonaws.services.s3.enableV4=true' '-Djava.net.preferIPv4Stack=true' '-XX:MetaspaceSize=512m' '-XX:MaxMetaspaceSize=512m' '-XX:MaxDirectMemorySize=1g' '-XX:+UseCompressedOops' '-XX:ParallelGCThreads=8' '-XX:ConcGCThreads=4' '-XX:+UseG1GC' '-XX:SoftRefLRUPolicyMSPerMB=0' '-XX:OnOutOfMemoryError=kill -9 %p' '-verbose:gc' '-XX:+PrintGCDetails' '-XX:+PrintGCTimeStamps' '-XX:+PrintGCDateStamps' '-XX:+PrintHeapAtGC' '-Xloggc:/data/data1/yarn/container-logs/application_1693383838041_3359264/container_e32_1693383838041_3359264_01_000003/gc.log' '-XX:MaxDirectMemorySize=3686m' -Djava.io.tmpdir=/data/data1/yarn/nm/usercache/hive/appcache/application_1693383838041_3359264/container_e32_1693383838041_3359264_01_000003/tmp '-Dspark.network.timeout=120s' '-Dspark.driver.port=44823' '-Dspark.ui.port=0' -Dspark.yarn.app.container.log.dir=/data/data1/yarn/container-logs/application_1693383838041_3359264/container_e32_1693383838041_3359264_01_000003 org.apache.spark.executor.YarnCoarseGrainedExecutorBackend --driver-url spark:https://[email protected]:44823 --executor-id 2 --hostname xx.bigdata.pod.cluster.local --cores 4 --app-id application_1693383838041_3359264 --resourceProfileId 0 > /data/data1/yarn/container-logs/application_1693383838041_3359264/container_e32_1693383838041_3359264_01_000003/stdout 2> /data/data1/yarn/container-logs/application_1693383838041_3359264/container_e32_1693383838041_3359264_01_000003/stderr
Last 4096 bytes of stderr :
bytes in memory (estimated size 203.7 KiB, free 5.2 GiB)
24/05/27 01:45:35 INFO TorrentBroadcast: Reading broadcast variable 4 took 81 ms
24/05/27 01:45:35 INFO MemoryStore: Block broadcast_4 stored as values in memory (estimated size 612.5 KiB, free 5.2 GiB)
24/05/27 01:45:36 INFO deprecation: mapred.max.split.size is deprecated. Instead, use mapreduce.input.fileinputformat.split.maxsize

24/05/27 01:45:39 INFO BaseAllocator: Debug mode disabled. Enable with the VM option -Darrow.memory.debug.allocator=true.
24/05/27 01:45:39 INFO DefaultAllocationManagerOption: allocation manager type not specified, using netty as the default type
24/05/27 01:45:39 INFO CheckAllocator: Using DefaultAllocationManager at memory/DefaultAllocationManagerFactory.class
24/05/27 01:45:40 INFO TorrentBroadcast: Started reading broadcast variable 1 with 1 pieces (estimated total size 4.0 MiB)
24/05/27 01:45:40 INFO MemoryStore: Block broadcast_1_piece0 stored as bytes in memory (estimated size 33.3 KiB, free 5.2 GiB)
24/05/27 01:45:40 INFO TorrentBroadcast: Reading broadcast variable 1 took 9 ms
24/05/27 01:45:40 INFO MemoryStore: Block broadcast_1 stored as values in memory (estimated size 48.1 KiB, free 5.2 GiB)

Assertion failure: hp.second == srcChunk->tag(srcI)
Message:
File: /usr/local/include/folly/container/detail/F14Table.h
Line: 2064
Function: rehashImpl

@xingnailu
Copy link
Author

@Orvid Please take a look

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant