
Remove monitor op from stonith-sbd #83080

Closed
wants to merge 1 commit into from

Conversation

wtmpx
Contributor

@wtmpx wtmpx commented Oct 28, 2021

I removed op monitor interval="15" timeout="15" from stonith-sbd.
With multiple SBD disks, if one disk fails, the monitor operation detects the failure and Pacemaker tries to restart the stonith-sbd resource on all nodes. The start then fails and stonith-sbd becomes inactive:
stonith-sbd (stonith:external/sbd): Stopped

With the monitor operation removed, the system still logs the failure messages, but the resource is not restarted:
Oct 25 11:42:38 ora1 sbd[3034]: warning: inquisitor_child: Servant /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1 is outdated (age: 4)
Oct 25 11:42:38 ora1 sbd[3035]: /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1: error: header_get: Unable to read header from device 5
Oct 25 11:42:38 ora1 sbd[3035]: /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1: error: servant_md: No longer found a valid header on /dev/disk/by-path/ip-192.168.0.10:3260-iscsi-iqn.2019-11.sbd.local:ora1-sbd-2-lun-1
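
For reference, the change described above can be sketched as a crm configuration fragment. This is only an illustration: the resource name, agent, and monitor values are taken from this description, while any other parameters the documented primitive carries are omitted here, so the exact surrounding definition is an assumption.

```
# Before: stonith-sbd defined with a recurring monitor operation
# (other parameters of the primitive, if any, are omitted in this sketch)
primitive stonith-sbd stonith:external/sbd \
    op monitor interval="15" timeout="15"

# After: the same resource with the monitor operation removed
primitive stonith-sbd stonith:external/sbd
```

Without the monitor operation, a single failed SBD device is reported only by the sbd daemon in the logs and does not trigger a restart of the fencing resource.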

Also, our HAE documentation does not advise customers to use op monitor for the stonith-sbd resource:
https://documentation.suse.com/sle-ha/15-SP3/single-html/SLE-HA-administration/#book-administration

Thanks

@PRMerger18
Contributor

@wtmpx : Thanks for your contribution! The author(s) have been notified to review your proposed change.

@lauradolan
Member

Thanks for your dedication to our documentation. Unfortunately, at this time we have been unable to review your issue in a timely manner and we sincerely apologize for the delayed response. The requested updates have not been made since the creation of this issue, and the timeline for resolution may vary based on resourcing, so we've created an internal work item to review your suggestions. We are closing this issue for now, but feel free to comment here as necessary. #please-close

@PRMerger18 PRMerger18 closed this Jul 13, 2022

6 participants