[VL] libgluten.so crash while building gluten velox #6088

Open
Au-Miner opened this issue Jun 14, 2024 · 3 comments
@Au-Miner

Problem description

Backend
Velox

Bug description
I set up Gluten with the Velox backend following the official documentation, but the JVM crashes during spark-shell startup with the problematic frame 'C [libgluten.so+0x317353]'.

Reproduction
Create the docker container

docker pull ubuntu:22.04
docker run -itd --name ubuntu2204 ubuntu:22.04 /bin/bash
docker attach ubuntu2204

apt-get update
apt install software-properties-common
apt install maven build-essential cmake libssl-dev libre2-dev libcurl4-openssl-dev clang lldb lld libz-dev git ninja-build uuid-dev autoconf-archive curl zip unzip tar pkg-config bison libtool flex vim
apt install sudo
apt purge libjemalloc-dev libjemalloc2 librust-jemalloc-sys-dev
apt install -y openjdk-8-jdk
export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
export PATH=$JAVA_HOME/bin:$PATH

<download spark/spark-3.2.0-bin-hadoop3.2.tgz>
<download gluten-velox-bundle-spark3.2_2.12-1.1.1.jar>

/spark/spark/bin/spark-shell --name run_gluten \
 --master local --deploy-mode client \
 --conf spark.plugins=io.glutenproject.GlutenPlugin \
 --conf spark.memory.offHeap.enabled=true \
 --conf spark.memory.offHeap.size=20g \
 --jars /spark/gluten-velox-bundle-spark3.2_2.12-1.1.1.jar \
 --conf spark.shuffle.manager=org.apache.spark.shuffle.sort.ColumnarShuffleManager

error message

root@8c55a15ec775:/spark/spark# /spark/spark/bin/spark-shell --name run_gluten \
 --master local --deploy-mode client \
 --conf spark.plugins=io.glutenproject.GlutenPlugin \
 --conf spark.memory.offHeap.enabled=true \
 --conf spark.memory.offHeap.size=20g \
 --jars /spark/gluten-velox-bundle-spark3.2_2.12-1.1.1.jar \
 --conf spark.shuffle.manager=org.apache.spark.shuffle.sort.ColumnarShuffleManager
24/06/14 01:39:04 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
Setting default log level to "WARN".
To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel).
OpenJDK 64-Bit Server VM warning: You have loaded library /tmp/gluten-7cca4e91-463a-4240-b9e4-36f67b147160/jni/11f04b97-1abd-4656-aeeb-0bedfe813394/gluten-984801035636929938/libvelox.so which might have disabled stack guard. The VM will try to fix the stack guard now.
It's highly recommended that you fix the library with 'execstack -c <libfile>', or link it with '-z noexecstack'.
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGILL (0x4) at pc=0x00007f2841964353, pid=42017, tid=0x00007f28da678640
#
# JRE version: OpenJDK Runtime Environment (8.0_412-b08) (build 1.8.0_412-8u412-ga-1~22.04.1-b08)
# Java VM: OpenJDK 64-Bit Server VM (25.412-b08 mixed mode linux-amd64 compressed oops)
# Problematic frame:
# C  [libgluten.so+0x317353]  gluten::Runtime::registerFactory(std::string const&, std::function<gluten::Runtime* (std::unordered_map<std::string, std::string, std::hash<std::string>, std::equal_to<std::string>, std::allocator<std::pair<std::string const, std::string> > > const&)>)+0x23
#
# Core dump written. Default location: /spark/spark/core or core.42017
#
# An error report file with more information is saved as:
# /spark/spark/hs_err_pid42017.log
#
# If you would like to submit a bug report, please visit:
#   http://bugreport.java.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#
/spark/spark/bin/spark-shell: line 47: 42017 Aborted                 (core dumped) "${SPARK_HOME}"/bin/spark-submit --class org.apache.spark.repl.Main --name "Spark shell" "$@"

What's the problem?

System information

server: centos7
container: ubuntu22.04

CMake log

No response

@PHILO-HE changed the title from "Libgluten.so crash while building gluten velox" to "[VL] libgluten.so crash while building gluten velox" on Jun 14, 2024
@weiting-chen
Contributor

Since the source has been transferred to Apache, if you are using the latest source code to compile your jar, please use "--conf spark.plugins=org.apache.gluten.GlutenPlugin" instead of "spark.plugins=io.glutenproject.GlutenPlugin".
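
To avoid guessing which plugin class a given jar expects, one option is to look inside the jar. The sketch below is not part of Gluten: pick_gluten_plugin is a hypothetical helper name, and it assumes the plugin class file sits at the path implied by its package (io/glutenproject/GlutenPlugin.class for older releases, org/apache/gluten/GlutenPlugin.class after the move to Apache). It reads a jar listing on stdin and prints the matching value for spark.plugins.

```shell
# Hypothetical helper (not shipped with Gluten): reads a jar listing on stdin,
# one entry per line (for example from `jar tf <jar>`), and prints the plugin
# class name that matches the package found in the jar.
pick_gluten_plugin() {
  found_apache=""
  found_legacy=""
  while IFS= read -r entry; do
    case "$entry" in
      org/apache/gluten/GlutenPlugin.class) found_apache="yes" ;;
      io/glutenproject/GlutenPlugin.class) found_legacy="yes" ;;
    esac
  done
  if [ -n "$found_apache" ]; then
    # Jar built from the Apache-era source tree.
    echo "org.apache.gluten.GlutenPlugin"
  elif [ -n "$found_legacy" ]; then
    # Jar built before the package rename.
    echo "io.glutenproject.GlutenPlugin"
  else
    echo "no GlutenPlugin class found in jar listing" >&2
    return 1
  fi
}

# Usage (the jar path is the one from the reproduction above):
#   plugin=$(jar tf /spark/gluten-velox-bundle-spark3.2_2.12-1.1.1.jar | pick_gluten_plugin) &&
#     /spark/spark/bin/spark-shell --conf spark.plugins="$plugin" ...
```

This only confirms that the class name passed to spark.plugins matches the jar. It does not by itself explain a SIGILL inside libgluten.so.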

@ArnavBalyan

Hi @weiting-chen @PHILO-HE, this issue has been reported in multiple places: #5327 and #6088. It occurs when using the released jar with spark.plugins=io.glutenproject.GlutenPlugin. Could you please take a look?

@Au-Miner
Author

Au-Miner commented Jul 1, 2024

thanks
