Section searching is enabled by defining section groups. You use one of the system-defined section groups to create an instance of a section group. Choose a section group appropriate for your document collection.
You use section groups to specify the type of document set you have and implicitly indicate the tag structure. For instance, to index HTML tagged documents, you use the HTML_SECTION_GROUP
. Likewise, to index XML tagged documents, you can use the XML_SECTION_GROUP
.
Table 8-1 lists the different types of section groups you can use:
Table 8-1 Types of Section Groups
Section Group Preference | Description |
---|---|
|
This is the default. Use this group type when you define no sections or when you define only |
|
Use this group type for defining sections where the start and end tags are of the form Note: This group type dopes not support input such as unbalanced parentheses, comments tags, and attributes. Use |
|
Use this group type for indexing HTML documents and for defining sections in HTML documents. |
|
Use this group type for indexing XML documents and for defining sections in XML documents. |
|
Use this group type to automatically create a zone section for each start-tag/end-tag pair in an XML document. The section names derived from XML tags are case-sensitive as in XML. Attribute sections are created automatically for XML tags that have attributes. Attribute sections are named in the form tag@attribute. Stop sections, empty tags, processing instructions, and comments are not indexed. The following limitations apply to automatic section groups:
|
|
Use this group type to index XML documents. Behaves like the The difference is that with this section group you can do path searching with the |
|
Use this group for defining sections in newsgroup formatted documents according to RFC 1036. |
Documents sent to the HTML
, XML
, AUTO
and PATH
sectioners must begin with \s*<
, where \s*
represents zero or more whitespace characters. Otherwise, the document is treated as a plaintext document, and no sections are recognized.
You use the CTX_DDL
package to create section groups and define sections as part of section groups. For example, to index HTML documents, create a section group with HTML_SECTION_GROUP
:
begin ctx_ddl.create_section_group('htmgroup', 'HTML_SECTION_GROUP'); end;