added a hint about regular expressions in crawl start

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6021 6c8d7289-2bf4-0310-a012-ef5d649a1542
pull/1/head
orbiter 16 years ago
parent 9bfd22f65d
commit 94f3d90af2

@ -114,8 +114,9 @@
<input type="radio" name="range" value="subpath" />Restrict to sub-path <input type="radio" name="range" value="subpath" />Restrict to sub-path
</td> </td>
<td> <td>
The filter is an emacs-like regular expression that must match with the URLs which are used to be crawled; The filter is a <a href="http://java.sun.com/j2se/1.5.0/docs/api/java/util/regex/Pattern.html">regular expression</a>
default is 'catch all'. that must match with the URLs which are used to be crawled; default is 'catch all'.
Example: to allow only urls that contain the word 'science', set the filter to '.*science.*'.
You can also use an automatic domain-restriction to fully crawl a single domain. You can also use an automatic domain-restriction to fully crawl a single domain.
</td> </td>
</tr> </tr>

Loading…
Cancel
Save