Remove monitor op from stonith-sbd #83080

Closed
Remove monitor op from stonith-sbd
I removed op monitor interval="15" timeout="15" from the stonith-sbd resource.
With multiple SBD disks, if one disk fails, the monitor operation detects the failure and Pacemaker tries to restart the stonith-sbd resource on all nodes. The start fails and stonith-sbd becomes inactive:
stonith-sbd    (stonith:external/sbd): Stopped

With the monitor operation removed, the system still logs failure messages, but the resource is not restarted:
Oct 25 11:42:38 ora1 sbd[3034]:  warning: inquisitor_child: Servant /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1 is outdated (age: 4)
Oct 25 11:42:38 ora1 sbd[3035]: /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1:    error: header_get: Unable to read header from device 5
Oct 25 11:42:38 ora1 sbd[3035]: /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1:    error: servant_md: No longer found a valid header on /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1
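
For anyone applying this change to an existing cluster, a small check along these lines might help confirm that the monitor operation is really gone from the resource definition. This is only a sketch: has_monitor_op is a hypothetical helper name (not part of crmsh or sbd), and it assumes the definition is passed on stdin in the same text form used by the crm configure primitive command in the doc.

```shell
# Hypothetical helper (not part of crmsh): reads a resource definition on
# stdin and succeeds (exit 0) if it contains an "op monitor" clause.
# Assumes the plain-text crm syntax, e.g. 'op monitor interval="15" timeout="15"'.
has_monitor_op() {
  grep -Eq '(^|[[:space:]])op[[:space:]]+monitor([[:space:]]|$)'
}
```

One possible use, assuming the resource is named stonith-sbd as in the doc: sudo crm configure show stonith-sbd | has_monitor_op && echo "monitor op still configured"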

Also, our HAE documentation does not advise customers to use op monitor for the stonith-sbd resource:
https://documentation.suse.com/sle-ha/15-SP3/single-html/SLE-HA-administration/#book-administration 

Thanks
wtmpx committed Oct 28, 2021
commit 57f794c40f5a6f2698ae88b1fa900d59642bf9bc
@@ -561,8 +561,7 @@ sudo crm resource list
sudo crm resource stop stonith-sbd
sudo crm configure delete <b>stonith-sbd</b>
sudo crm configure primitive <b>stonith-sbd</b> stonith:external/sbd \
-params pcmk_delay_max="15" \
-op monitor interval="15" timeout="15"
+params pcmk_delay_max="15"
</code></pre>

## Create Azure Fence agent STONITH device