Description
See attached document. When uploaded in the cms, the document is not found in the site when searching for "banaan" or "tomaat". When searching for "banaan*" it does. Also when searching for "banaankerstomaat4711huiswerkziekenhuis". The separators in the pdf are not recognized and all words are stored as one term.
Attachments
Issue Links
- relates to
-
CMS-6735 bump tika from 0.8 to 0.9 due to linebreak extraction bug
- Closed