@magibney magibney commented Jul 1, 2021

See: LUCENE-9177

Incremental processing does not need to wait for a normalization-inert
character to serve as a boundary. It is sufficient to check `hasBoundaryAfter`
and `hasBoundaryBefore`, which substantially improves worst-case performance.
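
For illustration only, here is a minimal sketch of boundary-based chunking. It is not the code in this patch. To stay self-contained it uses the JDK's `java.text.Normalizer` and a hypothetical `hasBoundaryBeforeApprox` helper that treats any non-mark code point as a safe chunk start. That helper is an assumption standing in for ICU's `Normalizer2.hasBoundaryBefore(int)`, which is data-driven and stricter (the approximation would be wrong for conjoining Hangul jamo, for example). The class and method names are made up for this example.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.List;
import java.util.function.IntPredicate;

final class BoundaryChunkSketch {

    // ASSUMPTION: simplified stand-in for ICU's Normalizer2.hasBoundaryBefore(int).
    // Any code point that is not a combining/enclosing mark is treated as a
    // position where normalization can safely restart. The real ICU check
    // covers more cases (e.g. Hangul jamo that compose with a preceding starter).
    static boolean hasBoundaryBeforeApprox(int cp) {
        int type = Character.getType(cp);
        return type != Character.NON_SPACING_MARK
                && type != Character.COMBINING_SPACING_MARK
                && type != Character.ENCLOSING_MARK;
    }

    // Splits the text into chunks, starting a new chunk at every code point
    // for which the predicate reports a boundary before it.
    static List<String> splitAtBoundaries(String text, IntPredicate boundaryBefore) {
        List<String> chunks = new ArrayList<>();
        int start = 0;
        int i = 0;
        while (i < text.length()) {
            int cp = text.codePointAt(i);
            if (i > start && boundaryBefore.test(cp)) {
                chunks.add(text.substring(start, i));
                start = i;
            }
            i += Character.charCount(cp);
        }
        if (start < text.length()) {
            chunks.add(text.substring(start));
        }
        return chunks;
    }

    // Normalizes each chunk independently and concatenates the results.
    // No inert character is needed: a boundary check alone ends each chunk.
    static String normalizeIncrementally(String text) {
        StringBuilder out = new StringBuilder();
        for (String chunk : splitAtBoundaries(text, BoundaryChunkSketch::hasBoundaryBeforeApprox)) {
            out.append(Normalizer.normalize(chunk, Normalizer.Form.NFC));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Two decomposed "e + combining acute" sequences, no inert separator.
        String input = "e\u0301e\u0301";
        String whole = Normalizer.normalize(input, Normalizer.Form.NFC);
        String incremental = normalizeIncrementally(input);
        System.out.println("chunks: " + splitAtBoundaries(input, BoundaryChunkSketch::hasBoundaryBeforeApprox).size());
        System.out.println("matches whole-string normalization: " + whole.equals(incremental));
    }
}
```

The point of the sketch is that the buffer can be flushed at each boundary rather than growing until an inert character shows up, so input with few or no inert characters no longer forces the whole text to be buffered.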

…presence of normalization-inert characters
@rmuir rmuir merged commit c3482c9 into apache:main Jul 14, 2021
mikemccand pushed a commit to mikemccand/lucene that referenced this pull request Sep 3, 2021
…presence of normalization-inert characters (apache#199)