The file "extensions.txt" in the application directory contains the rules for associating file extensions to segmentation engines. The default settings are as follows:
WordFiles = *.doc;*.rtf TextFiles = *.txt PptFiles = *.ppt HtmlFiles = *.html;*.htm;*.shtml;*.mht ExcelFiles = *.xls;*.csv XmlFiles = *.xml
You can set which segmentation engines are used for which file extensions by adding, changing, or removing extensions. Extension definitions should be one to a line, and extensions should be in the form "*.extension" (without the quotes). Separate extensions with semicolons (;).