Skip to content

Troubleshooting Elasticsearch Upgrades#6396

Merged
stefnestor merged 23 commits into
elastic:mainfrom
stefnestor:stef_esUpgradeIssues
Jun 6, 2026
Merged

Troubleshooting Elasticsearch Upgrades#6396
stefnestor merged 23 commits into
elastic:mainfrom
stefnestor:stef_esUpgradeIssues

Conversation

@stefnestor

@stefnestor stefnestor commented May 9, 2026

Copy link
Copy Markdown
Member

Summary

👋 Follow-up of #5848 to create separate upgrade troubleshooting page.

This assumes but does not breaking rely on errors WIP documenting in

It will hopefully be less necessary after elastic/kibana#266733

There's various page cross-links/cross-reference between troubleshooting "rolling upgrading Elasticsearch" and the "upgrade assistant" before it, so starting together

Non-exhaustive related links

Generative AI disclosure

  1. Did you use a generative AI (GenAI) tool to assist in creating this contribution?
  • Yes
  • No

@stefnestor stefnestor requested review from a team as code owners May 9, 2026 15:03
@stefnestor stefnestor marked this pull request as draft May 9, 2026 15:03
@github-actions

github-actions Bot commented May 9, 2026

Copy link
Copy Markdown
Contributor

Elastic Docs AI PR menu

Check the box to run an AI review for this pull request.

Powered by GitHub Agentic Workflows and docs-actions. For more information, reach out to the docs team.

github-actions[bot]

This comment was marked as outdated.

@github-actions

github-actions Bot commented May 16, 2026

Copy link
Copy Markdown
Contributor

@stefnestor stefnestor added the enhancement New feature or request label May 22, 2026
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrade-assistant.md Outdated
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrades.md Outdated
Co-authored-by: Stef Nestor <26751266+stefnestor@users.noreply.github.com>
@stefnestor stefnestor marked this pull request as ready for review May 22, 2026 23:32
@stefnestor stefnestor enabled auto-merge (squash) May 22, 2026 23:33
@stefnestor

Copy link
Copy Markdown
Member Author

👋 howdy, team!

In famous "many errors reduce down to a handful of core issues", this PR (with it's blossoming sub-PRs) documents the majority of the issues users surface to Support during Elasticsearch upgrade, both during the Upgrade Assistant as well as during actual upgrade.

I did both pages simultaneously mostly as part of process trying to unwind that hairball since majority of errors are cascades of templates or index compatibility due to failing to do the upgrade assistant fully.

🙏 TIA!

@github-actions github-actions Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Docs review summary

Focus areas

  • Style and clarity: Found several issues with phrasing, including redundant wording ("due to due diligence"), awkward constructions ("ensure to"), and overuse of first-person plural ("we"). Present tense should be used more consistently per style guide.
  • Jargon: No unexplained jargon detected. Good use of product names and technical terms with appropriate context.
  • Frontmatter and applies_to:
    • Critical typo: troubleshooting-upgrade-assistant.md has "Assitant" instead of "Assistant" in navigation_title
    • Duplicate descriptions: Both new troubleshooting files use identical description text ("Common upgrade issues and resolutions."), which violates the requirement for unique descriptions
    • applies_to usage appears correct
  • Content type fit: Both new pages are labeled as type: troubleshooting but function more as wayfinding/collection pages rather than dedicated troubleshooting pages. They lack the standard Symptoms and Resolution structure required for dedicated troubleshooting pages. This is acceptable per the content-type guidelines for "generic wayfinding pages" that "organize multiple related troubleshooting topics," but the generic titles and structure should be intentional.
  • Parent issue satisfaction: The PR references #5848 as a follow-up. The changes successfully create separate troubleshooting pages for upgrade issues as intended, moving content from the main upgrade guide into dedicated troubleshooting documentation.

Nits

  • Line 15 of troubleshooting-upgrades.md: "beginning upgrading" could be "beginning the upgrade"
  • Multiple instances could be tightened: "you would commonly filter" → "commonly filter", "you should end up with" → "you should have"
  • The phrase "for your reference" appears multiple times and adds little value

Notes

  • The changes appropriately refactor the "Rolling upgrades considerations" section from elasticsearch.md into a dedicated troubleshooting page, improving discoverability
  • Cross-linking between the upgrade documentation and new troubleshooting pages is well implemented
  • The new tip callouts in elasticsearch.md and upgrade-assistant.md appropriately direct users to the troubleshooting resources

Generated by Docs review agent for issue #6396 · ● 1.3M

Comment thread troubleshoot/elasticsearch/troubleshooting-upgrades.md Outdated
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrades.md Outdated
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrades.md Outdated
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrade-assistant.md Outdated
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrade-assistant.md Outdated

Production environments should have [at least three master-eligible nodes for high availability](/deploy-manage/deploy/elastic-cloud/elastic-cloud-hosted-planning.md). In a testing or development environment with only one or two master-eligible nodes, you cannot avoid stopping half or more of the master-eligible nodes, so the cluster will always [become unavailable](/troubleshoot/elasticsearch/discovery-troubleshooting.md#discovery-no-master) at some point during the upgrade.

You must restart all the stopped master-eligible nodes to allow the cluster to re-form. This might cause a [premature cluster version update](#troubleshooting-upgrades-theory-version).

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unclear connection: The statement "You must restart all the stopped master-eligible nodes to allow the cluster to re-form. This might cause a premature cluster version update." could be clearer about when this causes the problem.

Consider: "You must restart all the stopped master-eligible nodes to allow the cluster to re-form. If those restarted nodes are all running the upgraded version, this will cause a premature cluster version update."

Comment thread troubleshoot/elasticsearch/troubleshooting-upgrade-assistant.md Outdated

This error indicates the [Upgrade Assistant](/deploy-manage/upgrade/prepare-to-upgrade/upgrade-assistant.md) was not fully completed during [upgrade preparation work](/deploy-manage/upgrade/prepare-to-upgrade.md).

You should reset this node's version upgrade, rejoin it to cluster at the earlier version, and complete the [Upgrade Assistant](/deploy-manage/upgrade/prepare-to-upgrade/upgrade-assistant.md) critical items before beginning upgrading. Refer also to [Troubleshoot Upgrade Assistant](/troubleshoot/elasticsearch/troubleshooting-upgrade-assistant.md).

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Clarity: "You should reset this node's version upgrade" is vague. Consider being more specific about what action to take:

"Downgrade this node back to the earlier version, allow it to rejoin the cluster, and complete the Upgrade Assistant critical items before attempting to upgrade again. Refer also to Troubleshoot Upgrade Assistant."

Comment thread troubleshoot/elasticsearch/troubleshooting-upgrades.md Outdated
Comment thread troubleshoot/elasticsearch/troubleshooting-upgrades.md Outdated
stefnestor and others added 2 commits May 23, 2026 10:54
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
@stefnestor

Copy link
Copy Markdown
Member Author

Bumping @elastic/admin-docs 🙏

@stefnestor stefnestor disabled auto-merge June 4, 2026 19:10
@marciw

marciw commented Jun 6, 2026

Copy link
Copy Markdown
Member

@stefnestor I pushed quite a few changes. 🐌 Please take a quick look before merging.

By the way, for all docs-content PRs, please make sure to check the preview links that are automatically generated. Some issues are more apparent that way, like in this screenshot:
Screenshot 2026-06-06 at 11 30 41 AM

@stefnestor stefnestor merged commit 986ac0a into elastic:main Jun 6, 2026
8 checks passed
@stefnestor

Copy link
Copy Markdown
Member Author

Fabulous, cheers! 🙇‍♀️

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants