Add manage_service settings to get puppet out of the way. #349
Conversation
This still has a problem or two to be worked out, specifically errors against rabbitmq.
    bind_host => $bind_host,
  }
  contain cinder::api
This contain, along with contain cinder::scheduler, caused a dependency cycle error like
Error: Could not apply complete catalog: Found 1 dependency cycle:
(Cinder_config[DEFAULT/glance_api_version] => Service[cinder-api] => Class[Cinder::Api] => Class[Quickstack::Cinder] => Class[Quickstack::Cinder_volume] => Cinder_config[DEFAULT/glance_api_version])
when the parameter $backend_rbd in quickstack::pacemaker::cinder was set to true.
Removing the two contains made the dependency cycle error go away. However, I had to add two extra dependencies to ensure the Services were started before we executed the one-time stop and disable. See
https://github.com/cwolferh/astapor/compare/jguiditta:add_manage_service_ha...cwolferh:service_unmanage_tinkering?expand=1
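A minimal sketch of the change described above, based only on this comment rather than the actual astapor code (class and resource names are illustrative):

```puppet
# Sketch only: drop `contain` in favour of `include` plus an explicit
# ordering edge, so Class[Cinder::Api] is no longer pulled inside this
# class's containment boundary (which is what closed the cycle through
# Cinder_config[DEFAULT/glance_api_version]).
class quickstack::cinder (
  $bind_host = '0.0.0.0',
) {
  class { 'cinder::api':
    bind_host => $bind_host,
  }
  # `contain cinder::api` removed to break the dependency cycle.

  # Instead, order the service explicitly before the one-time
  # stop-and-disable exec (exec name is hypothetical):
  Service['cinder-api'] -> Exec['one-time-cinder-api-disable']
}
```

The trade-off: without `contain`, relationships against `Class[Quickstack::Cinder]` no longer imply ordering against `cinder::api`'s resources, which is why the extra explicit dependencies were needed.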
Results of successful run:
Note that services that should be A/P (active/passive), like neutron (anything but server) and heat engine, are shown as active on only one node and inactive on the others.
Looks good, services have been disabled: http://ur1.ca/i4jhy
@@ -71,6 +78,9 @@
  try_sleep => 10,
  command   => "/tmp/ha-all-in-one-util.bash all_members_include rabbitmq",
} ->
quickstack::pacemaker::manual_service { "rabbitmq-server":
  stop => $_enabled,
} ->
pacemaker did not bring up rabbitmq across all nodes after a couple of fresh attempts. E.g.:
# pcs status
Clone Set: rabbitmq-server-clone [rabbitmq-server]
Started: [ c1a1.example.com c1a2.example.com ]
Stopped: [ c1a3.example.com ]
From the puppet output on the control node:
Debug: Exec[one-time-rabbitmq-server-disable]: Executing '/sbin/chkconfig rabbitmq-server off'
Debug: Executing '/sbin/chkconfig rabbitmq-server off'
Notice: /Stage[main]/Quickstack::Pacemaker::Rabbitmq/Quickstack::Pacemaker::Manual_service[rabbitmq-server]
/Exec[one-time-rabbitmq-server-disable]/returns: executed successfully
Debug: /Stage[main]/Quickstack::Pacemaker::Rabbitmq/Quickstack::Pacemaker::Manual_service[rabbitmq-server]
/Exec[one-time-rabbitmq-server-disable]: The container Quickstack::Pacemaker::Manual_service[rabbitmq-server] will propagate my refresh event
Debug: Quickstack::Pacemaker::Manual_service[rabbitmq-server]: The container
Class[Quickstack::Pacemaker::Rabbitmq] will propagate my refresh event
Debug: /usr/sbin/pcs resource show rabbitmq-server > /dev/null 2>&1
Debug: /usr/sbin/pcs resource create rabbitmq-server systemd:rabbitmq-server op monitor interval=30s start-delay=35s interval=30s --clone
Error: Unable to create resource/fence device
Call cib_create failed (-206): Application of an update diff failed
If I comment out the three lines above (quickstack::pacemaker::manual_service { "rabbitmq-server": ...), rabbitmq comes up fine for me.
Clone Set: rabbitmq-server-clone [rabbitmq-server]
Started: [ c1a1.example.com c1a2.example.com c1a3.example.com ]
The cause isn't clear to me, but I've got the logs handy.
I had success throwing in a "sleep 60" after manual_service { "rabbitmq-server": . I'm not suggesting that as the final solution, but perhaps we need to let systemd finish reloading (if applicable) across nodes before a pacemaker resource is added.
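The diagnostic workaround described here could be sketched like this (the exec name is made up; the placement mirrors the diff hunk above):

```puppet
quickstack::pacemaker::manual_service { 'rabbitmq-server':
  stop => $_enabled,
} ->
# Diagnostic only, not a proposed fix: give systemd time to settle on
# all nodes before the pacemaker clone resource for rabbitmq-server
# is created, to avoid the "Call cib_create failed (-206)" error.
exec { 'wait-for-systemd-settle':
  command => '/bin/sleep 60',
  path    => ['/bin', '/usr/bin'],
}
```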
Hmm, that is odd, since all we are doing is a chkconfig off, not even stopping the service. I wonder if it would be better to move the disable to happen at the very end of the first run?
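One way the suggestion above could be sketched in Puppet (hypothetical class and stage names; the real patch may have taken a different route) is to push the one-time disable into a run stage that comes after `main`:

```puppet
# Sketch: run the one-time chkconfig-off at the very end of the first
# puppet run by placing it in a stage ordered after `main`, so all
# pacemaker resource creation has already happened.
stage { 'disable_services':
  require => Stage['main'],
}

class quickstack::pacemaker::late_disable {
  exec { 'one-time-rabbitmq-server-disable':
    command => '/sbin/chkconfig rabbitmq-server off',
    path    => ['/sbin', '/usr/sbin', '/bin'],
  }
}

class { 'quickstack::pacemaker::late_disable':
  stage => 'disable_services',
}
```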
https://bugzilla.redhat.com/show_bug.cgi?id=1123303 This does not include rabbit, due to unique issues with that service. This patch should make starting services more consistent for the included services: pacemaker will have full control of starting and stopping them, without the chance that puppet has already done so, which could confuse pacemaker.
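The general manage_service pattern the PR title refers to can be sketched as follows (class, service, and parameter names are illustrative, not necessarily the actual astapor code):

```puppet
# Sketch: a manage_service knob that tells puppet to keep its hands
# off the service, leaving start/stop entirely to pacemaker.
class quickstack::example_service (
  $manage_service = true,
) {
  if $manage_service {
    service { 'example-api':
      ensure => running,
      enable => true,
    }
  }
  # When $manage_service is false, puppet declares no Service resource
  # at all, so pacemaker alone controls the service's lifecycle.
}
```

Setting the parameter to false in the HA/pacemaker profiles is what "gets puppet out of the way" without changing behavior for non-HA deployments, where it defaults to true.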
Looks good. Services disabled after 2nd puppet run.
Add manage_service settings to get puppet out of the way.
This should make starting services more consistent, as pacemaker will have full control of starting and stopping the services without the chance of puppet having already done so, which potentially could cause confusion for pacemaker.