To index a set of HTML documents located by URLs, you can specify the system-defined preference CTXSYS.NULL_FILTER in the CREATE INDEX statement. You can also specify your own section group, htmgroup, which uses HTML_SECTION_GROUP, and your own datastore, my_url, which uses URL_DATASTORE, as follows:
-- Datastore preference: fetch each document from the URL stored in the column
begin
  ctx_ddl.create_preference('my_url','URL_DATASTORE');
  ctx_ddl.set_attribute('my_url','HTTP_PROXY','www-proxy.us.example.com');
  ctx_ddl.set_attribute('my_url','NO_PROXY','us.example.com');
  ctx_ddl.set_attribute('my_url','TIMEOUT','300');
end;
/

-- Section group: define a zone section named heading for the H1 tag
begin
  ctx_ddl.create_section_group('htmgroup', 'HTML_SECTION_GROUP');
  ctx_ddl.add_zone_section('htmgroup', 'heading', 'H1');
end;
/
You can then index your documents as follows:
CREATE INDEX myindex ON docs(htmlfile)
  INDEXTYPE IS ctxsys.context
  PARAMETERS ('datastore my_url filter ctxsys.null_filter section group htmgroup');

See "Creating Preferences" for more examples of creating a custom CONTEXT index.
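Once myindex has been built, the heading zone section defined in htmgroup can be used to restrict a query to text inside H1 tags. The following is a minimal sketch rather than part of the original example: the id column and the search term oracle are assumptions made for illustration, since only the htmlfile column of docs is named above. The second statement reads the CTX_USER_INDEX_ERRORS view, which is a reasonable place to look when some URLs could not be retrieved during indexing.

```sql
-- Hypothetical query: rows whose H1 heading contains the word "oracle".
-- Assumes docs has an id column; only htmlfile appears in the example above.
SELECT id, SCORE(1) AS relevance
FROM docs
WHERE CONTAINS(htmlfile, 'oracle WITHIN heading', 1) > 0
ORDER BY SCORE(1) DESC;

-- Per-row indexing errors (for example, a URL that timed out).
-- err_textkey holds the ROWID of the row that failed.
SELECT err_textkey, err_text
FROM ctx_user_index_errors
WHERE err_index_name = 'MYINDEX'
ORDER BY err_timestamp DESC;
```

Rows listed in the error view are not searchable until the underlying problem is fixed and the index is synchronized or rebuilt.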