diff --git a/htroot/CrawlStart_p.html b/htroot/CrawlStart_p.html index 98257c0e2..63c84919a 100644 --- a/htroot/CrawlStart_p.html +++ b/htroot/CrawlStart_p.html @@ -114,8 +114,9 @@ Restrict to sub-path - The filter is an emacs-like regular expression that must match with the URLs which are used to be crawled; - default is 'catch all'. + The filter is a regular expression + that must match with the URLs which are used to be crawled; default is 'catch all'. + Example: to allow only urls that contain the word 'science', set the filter to '.*science.*'. You can also use an automatic domain-restriction to fully crawl a single domain.