*) changed regex that is created for crawling filter (see http://forum.yacy-websuche.de/viewtopic.php?t=83)

git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3945 6c8d7289-2bf4-0310-a012-ef5d649a1542
pull/1/head
low012 18 years ago
parent 28baecd41b
commit 51800539b2

@ -71,10 +71,10 @@
</tr> </tr>
<tr valign="top" class="TableCellDark"> <tr valign="top" class="TableCellDark">
<td><label for="crawlingFilter">Crawling Filter</label>:</td> <td><label for="crawlingFilter">Crawling Filter</label>:</td>
<td> <td>
<input name="crawlingFilter" id="crawlingFilter" type="text" size="20" maxlength="100" value="#[crawlingFilter]#" /> <input name="crawlingFilter" id="crawlingFilter" type="text" size="20" maxlength="100" value="#[crawlingFilter]#" />
<input type="button" value="only one domain" onclick='var domain=prompt("please enter domain without \"www.\" (i.e. \"yacy.net\").");if(domain!=null){document.getElementById("crawlingFilter").value=".*\\."+domain.replace("\.", "\\.")+"/.*"}' /> <input type="button" value="only one domain" onclick='var domain=prompt("please enter domain without \"www.\" (i.e. \"yacy.net\").");if(domain!=null){document.getElementById("crawlingFilter").value="(^|.*\\.)"+domain.replace("\.", "\\.")+"/.*"}' />
</td> </td>
<td> <td>
This is an emacs-like regular expression that must match with the URLs which are used to be crawled. This is an emacs-like regular expression that must match with the URLs which are used to be crawled.
Use this i.e. to crawl a single domain. If you set this filter it makes sense to increase Use this i.e. to crawl a single domain. If you set this filter it makes sense to increase
@ -216,7 +216,7 @@
<td colspan="5"><input type="submit" name="crawlingstart" value="Start New Crawl" /></td> <td colspan="5"><input type="submit" name="crawlingstart" value="Start New Crawl" /></td>
</tr> </tr>
</table> </table>
</form> </form>
#%env/templates/footer.template%# #%env/templates/footer.template%#
</body> </body>

Loading…
Cancel
Save