neutron: use crm_resource restart for restarting neutron-l3-ha-service #2116

Merged: rsalevsky merged 1 commit into crowbar:master from dirkmueller:neutron_hatool on May 9, 2019

@dirkmueller

Contributor

When we restart via systemd, it can happen that the service has already
crashed, and then 'service neutron-l3-ha-service status' will (correctly)
report that the service isn't running. The restart is then not
executed due to:

  INFO: Ignoring restart action for neutron-l3-ha-service service since not running on this node (d52-54-77-77-01-01)

which will later cause a Pacemaker failcount. The Pacemaker
resource restart hopefully does not have this problem.
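
To make the difference concrete, here is a minimal Python sketch of the two restart paths, not the actual cookbook code. `guarded_service_restart` is a hypothetical stand-in for the check that emits the INFO line above, and the resource name passed to `crm_resource` is an assumption (the real Pacemaker resource id may differ). The command is only built, never executed.

```python
def guarded_service_restart(service, node, is_running):
    """Hypothetical model of the guarded restart: skip if not running."""
    if not is_running:
        # Same wording as the INFO line quoted in the description.
        message = (
            f"INFO: Ignoring restart action for {service} service "
            f"since not running on this node ({node})"
        )
        return False, message
    return True, None


def crm_resource_restart_command(resource):
    """Build (but do not run) a Pacemaker-level restart command.

    Assumption: `resource` is the Pacemaker resource id, which may not
    match the systemd unit name.
    """
    return ["crm_resource", "--resource", resource, "--restart"]


if __name__ == "__main__":
    done, msg = guarded_service_restart(
        "neutron-l3-ha-service", "d52-54-77-77-01-01", is_running=False
    )
    print(done, msg)
    print(" ".join(crm_resource_restart_command("neutron-l3-ha-service")))
```

The point of the sketch is only that the first path depends on the service status at the moment of the restart, while the second asks Pacemaker to restart the resource.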

@skazi0

skazi0 commented May 7, 2019

Member

@dirkmueller the gating job still fails with the same symptoms :o(

@sjamgade

sjamgade commented May 8, 2019

Contributor

neutron-l3-ha-service.log:

  [2019-05-07T16:35:25.390173 #23874]  INFO -- : error counter: bumped to 11

chef logs:

  [2019-05-07T16:38:49+00:00] INFO: Ignoring restart action for neutron-l3-ha-service service since not running on this node (d52-54-77-77-01-01)

The service was running on d52-54-77-77-01-01.

By default, the neutron-ha-tool restart is subscribed as delayed, so all restarts are queued and the actual restart happens only after Chef is done.

So maybe changing it to non-delayed would help.
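
The timing argument can be illustrated with a toy Python model, under stated assumptions: `simulate_run`, `Service`, and the crash step are hypothetical and only mimic a queued (delayed) versus an immediate restart. This is not how Chef is implemented.

```python
from dataclasses import dataclass


@dataclass
class Service:
    name: str
    running: bool = True
    restarts: int = 0


def guarded_restart(svc, log):
    """Restart only if the service is running, otherwise log and skip."""
    if not svc.running:
        log.append(
            f"Ignoring restart action for {svc.name} service "
            "since not running on this node"
        )
        return
    svc.restarts += 1


def simulate_run(timing, crash_during_run=True):
    """Toy model of one run; `timing` is 'delayed' or 'immediate'."""
    svc = Service("neutron-l3-ha-service")
    log, queued = [], []

    # Step 1: a change requests a restart of the service.
    if timing == "immediate":
        guarded_restart(svc, log)
    else:
        queued.append(lambda: guarded_restart(svc, log))

    # Step 2: the rest of the run; assume the service crashes meanwhile.
    if crash_during_run:
        svc.running = False

    # Step 3: end of the run, queued restarts fire now.
    for action in queued:
        action()
    return svc.restarts, log


if __name__ == "__main__":
    print(simulate_run("delayed"))
    print(simulate_run("immediate"))
```

In this model the delayed restart is skipped because the status check happens after the crash, while the immediate one runs before it. Whether that matches the gating failure is exactly the open question in this thread.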


rsalevsky merged commit 797307d into crowbar:master on May 9, 2019