The indexing engine expects filtered text to be in the database character set. When you use the AUTO_FILTER filter type, formatted documents are converted to text in the database character set.
If your source is text and your document character set is not the database character set, then you can use the AUTO_FILTER filter type to convert your text for indexing.
When the CHARSET column is set to AUTO, the AUTO_FILTER filter detects the character set of the document and converts it from the detected character set to the database character set, if there is a difference.
If your document set contains documents with different character sets, such as JA16EUC and JA16SJIS, you can index the documents provided you create a charset column. You populate this column with the name of the document character set for each row. You name the column in the PARAMETERS clause of the CREATE INDEX statement.
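The following is a minimal sketch of the charset column approach, not a definitive setup. The table name mydocs, the column names id, cset, and text, and the index name mydocs_idx are hypothetical and chosen only for illustration. The sketch also assumes that the predefined CTXSYS.AUTO_FILTER preference is available in your installation.

```sql
-- Hypothetical table: one document per row, with a charset column (cset)
-- that names the character set of the document stored in that row.
CREATE TABLE mydocs (
  id   NUMBER PRIMARY KEY,
  cset VARCHAR2(30),
  text BLOB
);

-- Populate the charset column for each row after loading the documents.
-- The row ids used here are illustrative.
UPDATE mydocs SET cset = 'JA16EUC'  WHERE id = 1;
UPDATE mydocs SET cset = 'JA16SJIS' WHERE id = 2;
-- AUTO asks the AUTO_FILTER filter to detect the character set itself.
UPDATE mydocs SET cset = 'AUTO'     WHERE id = 3;

-- Name the charset column in the PARAMETERS clause of CREATE INDEX.
CREATE INDEX mydocs_idx ON mydocs(text)
  INDEXTYPE IS CTXSYS.CONTEXT
  PARAMETERS ('FILTER CTXSYS.AUTO_FILTER CHARSET COLUMN cset');
```

With a definition along these lines, each document is converted from the character set named in its cset value to the database character set at indexing time.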