MDEV-33543 Server hang caused by InnoDB change buffer #3234

mariadb-DebarunBanerjee · 2024-04-30T11:39:11Z

The Jira issue number for this PR is: MDEV-33543

Description

Issue: When getting a page (buf_page_get_gen) with no latch option (RW_NO_LATCH), the caller is not expected to follow the B-tree latching order. However in buf_page_get_low we try to acquire shared page latch unconditionally to wait for a page that is being loaded by another thread concurrently. In general it could lead to latch order violation and deadlock.

Currently it affects the change buffer insert path btr_latch_prev() which tries to load the previous page out of order with RW_NO_LATCH and two concurrent inserts into IBUF tree cause deadlock. This problem is introduced in 10.6 by MDEV-27058.

Fix: While trying to latch a page with RW_NO_LATCH, always use the "*lock_try" interface and retry operation on failure after unfixing the page.

Release Notes

MDEV-27058 had introduced this regression in 10.6.6. which could hang the server under load If innodb_change_buffering is enabled and user has tables with non unique secondary index.

How can this PR be tested?

It is difficult to maintain an automated mtr test in our regular test suite. The RQG test and the attached code/test patch in MDEV can repeat the issue.

The MDEV has a repeatable test and steps for manual reproduction of the issue.

Basing the PR against the correct MariaDB version

This is a new feature and the PR is based against the latest MariaDB development branch.
This is a bug fix and the PR is based against the earliest maintained branch in which the bug can be reproduced.

PR quality check

I checked the CODING_STANDARDS.md file and my PR conforms to this where appropriate.
For any trivial modifications to the PR, I am ok with the reviewer making the changes themselves.

CLAassistant · 2024-04-30T11:39:17Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

dr-m

Thank you. This looks correct to me. In the commit message, please refer to the exact commit that you think introduced this bug, using a wording such as the following:

commit aaef2e1d8c843d1e40b1ce0c5199c3abb3c5da28 (MDEV-27058)

I was thinking if there could be a simpler hang scenario:

A page is being read into the buffer pool.
The page is being requested by another thread buf_page_get_gen() with RW_NO_LATCH.
On read completion, the page turns out to be corrupted, and buf_pool_t::corrupted_evict() will be invoked on it.

This patch does not affect that scenario, because in that case, we would notice that the block state was changed to FREED and release our S-latch and buffer-fix. However, while thinking about this scenario, I noticed that there is a small race condition with buf_pool_t::corrupted_evict(), which would be fixed by releasing the buffer-fix a little later in the error handling code path:

diff --git a/storage/innobase/buf/buf0buf.cc b/storage/innobase/buf/buf0buf.cc
index dc90160a668..12a11aefc12 100644
--- a/storage/innobase/buf/buf0buf.cc
+++ b/storage/innobase/buf/buf0buf.cc
@@ -2587,15 +2587,15 @@ buf_page_get_low(
 		block->page.lock.s_unlock();
 
 		if (UNIV_UNLIKELY(state < buf_page_t::UNFIXED)) {
-			block->page.unfix();
 			if (UNIV_UNLIKELY(id == page_id)) {
 				/* The page read was completed, and
 				another thread marked the page as free
 				while we were waiting. */
-				goto ignore_unfixed;
+				goto ignore;
 			}
 
 			ut_ad(id == page_id_t{~0ULL});
+			block->page.unfix();
 
 			if (++retries < BUF_PAGE_READ_MAX_RETRIES) {
 				goto loop;

If you agree with this, please include this fix as well.

storage/innobase/buf/buf0buf.cc

Issue: When getting a page (buf_page_get_gen) with no latch option (RW_NO_LATCH), the caller is not expected to follow the B-tree latching order. However in buf_page_get_low we try to acquire shared page latch unconditionally to wait for a page that is being loaded by another thread concurrently. In general it could lead to latch order violation and deadlock. Currently it affects the change buffer insert path btr_latch_prev() which tries to load the previous page out of order with RW_NO_LATCH and two concurrent inserts into IBUF tree cause deadlock. This problem is introduced in 10.6 by following commit. commit 9436c77 (MDEV-27058) Fix: While trying to latch a page with RW_NO_LATCH, always use the "*lock_try" interface and retry operation on failure after unfixing the page.

mariadb-DebarunBanerjee requested a review from dr-m April 30, 2024 11:39

mariadb-DebarunBanerjee self-assigned this Apr 30, 2024

dr-m approved these changes May 2, 2024

View reviewed changes

storage/innobase/buf/buf0buf.cc Outdated Show resolved Hide resolved

mariadb-DebarunBanerjee force-pushed the 10.6-MDEV-33543 branch from 679e6db to 90b95c6 Compare May 2, 2024 11:40

mariadb-DebarunBanerjee merged commit 90b95c6 into 10.6 May 2, 2024
14 of 17 checks passed

mariadb-DebarunBanerjee deleted the 10.6-MDEV-33543 branch May 2, 2024 13:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MDEV-33543 Server hang caused by InnoDB change buffer #3234

MDEV-33543 Server hang caused by InnoDB change buffer #3234

mariadb-DebarunBanerjee commented Apr 30, 2024

CLAassistant commented Apr 30, 2024

dr-m left a comment

MDEV-33543 Server hang caused by InnoDB change buffer #3234

MDEV-33543 Server hang caused by InnoDB change buffer #3234

Conversation

mariadb-DebarunBanerjee commented Apr 30, 2024

Description

Release Notes

How can this PR be tested?

Basing the PR against the correct MariaDB version

PR quality check

CLAassistant commented Apr 30, 2024

dr-m left a comment

Choose a reason for hiding this comment