diff --git a/htroot/CrawlStartExpert.html b/htroot/CrawlStartExpert.html index 227e978ad..6ae89a7a1 100644 --- a/htroot/CrawlStartExpert.html +++ b/htroot/CrawlStartExpert.html @@ -308,6 +308,7 @@ The filter is a regular expression. Example: to allow only urls that contain the word 'science', set the must-match filter to '.*science.*'. You can also use an automatic domain-restriction to fully crawl a single domain. + Attention: you can test the functionality of your regular expressions using the Regular Expression Tester within YaCy. @@ -346,6 +347,7 @@ info The filter is a regular expression that must not match with the URLs to allow that the content of the url is indexed. + Attention: you can test the functionality of your regular expressions using the Regular Expression Tester within YaCy.
must-match
must-match(must not be empty)