Skip to content

Add N-Gram analyzer components#228

Merged
alanw merged 1 commit into
luceneplusplus:masterfrom
Johnson-zs:ngram
May 25, 2026
Merged

Add N-Gram analyzer components#228
alanw merged 1 commit into
luceneplusplus:masterfrom
Johnson-zs:ngram

Conversation

@Johnson-zs
Copy link
Copy Markdown
Contributor

  1. Added NGramAnalyzer, NGramTokenFilter, and NGramTokenizer classes for n-gram text analysis
  2. Implemented configurable min/max gram sizes with validation
  3. Added preserve original token option to NGramTokenFilter
  4. Included comprehensive test cases for all new components

1. Added NGramAnalyzer, NGramTokenFilter, and NGramTokenizer classes for
n-gram text analysis
2. Implemented configurable min/max gram sizes with validation
3. Added preserve original token option to NGramTokenFilter
4. Included comprehensive test cases for all new components
Copy link
Copy Markdown
Collaborator

@alanw alanw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fantastic contribution, thanks @Johnson-zs!

@alanw alanw merged commit 48040f7 into luceneplusplus:master May 25, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants