You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Background
Default ngram analyzer removes punctuation for the default analyzer. Furthermore, there should be code written out to do the same for @mentions and hashtags, but this will be implemented later on as an n-gram customization
Problems
Punctuation, @mentions and hashtags are quite dominant in certain ngram analysis output, we need to allow this to be customizable.
Desired Outcome
Not splitting tokens based on punctuation.
Tasks
Punctuation functionality
Punctuation implemented in default ngram analyzer
@mention filtering functionality
hashtag filtering functionality
The text was updated successfully, but these errors were encountered:
Background
Default ngram analyzer removes punctuation for the default analyzer. Furthermore, there should be code written out to do the same for @mentions and hashtags, but this will be implemented later on as an n-gram customization
Problems
Punctuation, @mentions and hashtags are quite dominant in certain ngram analysis output, we need to allow this to be customizable.
Desired Outcome
Not splitting tokens based on punctuation.
Tasks
The text was updated successfully, but these errors were encountered: