BB-856 fix #1267

Open
samvit-srivastava wants to merge 1 commit into metabrainz:master from samvit-srivastava:bb-856-accent-search

@samvit-srivastava
Problem

Search queries containing diacritics may not match equivalent entries without diacritics.
For example, Greek words with accents may fail to match their base-letter equivalents.

Solution

Normalize the search input before building the Elasticsearch query.
The input string is normalized to Unicode NFD (canonical decomposition), and the resulting combining marks are then removed, so that accented and unaccented forms of a query reduce to the same string.

This allows queries like:

Γκιλγκαμές
Γκιλγκαμες

to match consistently.
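
A minimal sketch of the normalization step described above, not the code from this PR's diff. The helper name `stripDiacritics` is hypothetical, and the character range assumes the marks of interest fall in the Combining Diacritical Marks block (U+0300 to U+036F), which covers the Greek tonos and dialytika after decomposition.

```typescript
// Hypothetical helper: the name and placement are assumptions for illustration.
function stripDiacritics(input: string): string {
  // NFD splits each precomposed character into a base letter followed by
  // its combining marks, e.g. "έ" (U+03AD) becomes "ε" (U+03B5) + U+0301.
  const decomposed = input.normalize("NFD");
  // Drop the combining marks, leaving only the base letters.
  return decomposed.replace(/[\u0300-\u036f]/g, "");
}

// Both spellings from the example reduce to the same string.
const accented = stripDiacritics("Γκιλγκαμές");
const plain = stripDiacritics("Γκιλγκαμες");
console.log(accented === plain);
```

This only covers the query side. Whether the indexed text is folded the same way is not stated in this description, so a match against an accented stored value may still depend on the index analyzer.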

  • I have run the code and manually tested the changes

AI usage

  • I did not use any AI
  • I have used AI in this PR (add more details below)

If you did use AI:

  • I used AI tools for communication
  • I used AI tools for coding
  • I understand all the changes made in this PR
