To configure the segmentation rules from AnalyzeAssist, use the Tools >> Preferences menu (recommended).
You can also directly edit the "segrules.txt" file in the application directory. This file contains the rules for segmenting text. The defaults are as follwos:
StopChars = .!?。.!? SegControlChars = true IgnoreNumberSegs = true
The meanings of each setting are as follows.
- StopChars
- These are the characters that mark the end of a segment (typically a sentence).
- SegControlChars
- Whether control characters (such as tabs) mark the end of a segment. Set to false if segments contain control characters. true/false
- IgnoreNumberSegs
- Ignore segments consisting entirely of numbers. true/false