Description
wildcard pre/postfixing combined with stemming is not always possible to work correctly in Lucene. Currently the problem that when for example indexing the word 'Plattenbandförderer' (mind the ö accent) and then quering in the CMS you get:
'Plattenbandförd' DOES have a hit (cms does auto postfix wildcards)
'Plattenbandfö' DOES NOT have a hit wildcard postfix fails with accents)
'Plattenbandförde' DOES NOT have a hit wildcard postfix fails with accents)
Plattenbandförderer DOES have a hit
this is because we use an analyzer by default that removes diacritics. Hence, it is better to remove diacritics from the input query before doing a search with it
Attachments
Issue Links
- relates to
-
HSTTWO-2456 remove diacritics in SearchInputParsingUtils as well
- Closed