From f642cfbe3035ec09bbb4257aa62eafedb1ff20ca Mon Sep 17 00:00:00 2001 From: orbiter Date: Wed, 27 Aug 2014 18:40:20 +0200 Subject: [PATCH] added hint to the regular expression tester --- htroot/CrawlStartExpert.html | 2 ++ 1 file changed, 2 insertions(+) diff --git a/htroot/CrawlStartExpert.html b/htroot/CrawlStartExpert.html index 227e978ad..6ae89a7a1 100644 --- a/htroot/CrawlStartExpert.html +++ b/htroot/CrawlStartExpert.html @@ -308,6 +308,7 @@ The filter is a regular expression. Example: to allow only urls that contain the word 'science', set the must-match filter to '.*science.*'. You can also use an automatic domain-restriction to fully crawl a single domain. + Attention: you can test the functionality of your regular expressions using the Regular Expression Tester within YaCy. @@ -346,6 +347,7 @@ info The filter is a regular expression that must not match with the URLs to allow that the content of the url is indexed. + Attention: you can test the functionality of your regular expressions using the Regular Expression Tester within YaCy.
must-match
must-match(must not be empty)