neutron: restart neutron-ha-tool when the config file changes #2112

Merged
dirkmueller merged 1 commit into crowbar:master from dirkmueller:neutron_hatool on Apr 26, 2019
Conversation

@dirkmueller

Contributor

When we toggle SSL/non-SSL in keystone, neutron-ha-tool
gets a new config file but doesn't get restarted, so it
repeatedly crashes until the pacemaker failcount is exceeded,
and then it's dead. We should do better.

@dirkmueller

Contributor Author

(needs testing, and probably does not work as is)

aspiers previously approved these changes Apr 17, 2019

@aspiers aspiers left a comment

Member

LGTM. CI failure is unrelated, something to do with missing iso?

@dirkmueller

Contributor Author

Yeah, I'm not sure about that problem; it has been happening quite frequently lately.

@cmurphy

cmurphy commented Apr 17, 2019

Contributor

I would like to see the HA job passing as this is supposed to be fixing that job.

Comment thread chef/cookbooks/neutron/recipes/network_agents_ha.rb Outdated

@cmurphy cmurphy left a comment

Contributor

I don't think this will work, see inline comment

Comment thread chef/cookbooks/neutron/recipes/network_agents_ha.rb Outdated
@sjamgade

Contributor

From the CI logs: Error executing action `restart` on resource 'pacemaker_primitive[neutron-l3-ha-service]'

@dirkmueller

Contributor Author

Yes, that's why there is "do not merge" and "wip" and "untested, probably does not work" on the PR.

@vuntz

vuntz commented Apr 18, 2019

Member

Ah, I hadn't realized. In case it's not already solved on your hard disk: the restart action needs to be on a service resource, not a pacemaker resource. So we need to define a service for neutron-l3-ha-service, make sure the service uses the pacemaker provider, and that should do it.
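
For readers following along, the suggestion above could be sketched roughly as follows. This is not the merged change, only an illustration under stated assumptions: the config file path, the template source name, and the provider name `Chef::Provider::CrowbarPacemakerService` are all assumed, since the thread names none of them. To keep the sketch runnable under plain Ruby, a small stand-in for the Chef DSL records the declared resources; in a real cookbook only the two resource declarations at the bottom would appear, and `provider` would be given the provider class rather than a string.

```ruby
# Stand-in for the Chef recipe DSL so this sketch runs outside Chef.
# It only records what each resource declares; it does not manage anything.
class ResourceSketch
  attr_reader :type, :name, :settings

  def initialize(type, name)
    @type = type
    @name = name
    @settings = {}
  end

  # Record the handful of properties used below.
  %i[source provider action subscribes].each do |property|
    define_method(property) do |*args|
      @settings[property] = args.length == 1 ? args.first : args
    end
  end
end

RESOURCES = []

def declare(type, name, &block)
  resource = ResourceSketch.new(type, name)
  resource.instance_eval(&block) if block
  RESOURCES << resource
  resource
end

def template(name, &block)
  declare(:template, name, &block)
end

def service(name, &block)
  declare(:service, name, &block)
end

# --- Recipe fragment (illustrative) ---

# Hypothetical path and source name; the thread does not give them.
template "/etc/neutron/neutron-ha-tool.conf" do
  source "neutron-ha-tool.conf.erb"
end

# A service resource (not a pacemaker_primitive) carries the restart action.
service "neutron-l3-ha-service" do
  # Assumed name of the pacemaker-backed service provider; a real recipe
  # would pass the provider class itself, not a string.
  provider "Chef::Provider::CrowbarPacemakerService"
  # Do nothing on a normal run; restart only when the config file changes.
  action :nothing
  subscribes :restart, "template[/etc/neutron/neutron-ha-tool.conf]"
end
```

The key point is that `restart` is an action of a service resource, so the subscription hangs off `service[neutron-l3-ha-service]` while the pacemaker provider decides how the restart is actually carried out.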

@dirkmueller

Contributor Author

@vuntz thanks, that did it

@dirkmueller dirkmueller merged commit b865297 into crowbar:master Apr 26, 2019