yacy_search_server/source/net/yacy/crawler
reger 379e9b330d
use supplied url port to get robots.txt in crawlers hostqueue
9 years ago
data use supplied url port to get robots.txt in crawlers hostqueue 9 years ago
retrieval result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader method. 9 years ago
robots more ipv6 bugfixes 10 years ago
Balancer.java removed warnings 10 years ago
CrawlStacker.java enhanced timezone management for indexed data: 10 years ago
CrawlSwitchboard.java Add the Autocrawl thread 9 years ago
HarvestProcess.java fix for wrong display of error urls in HostBrowser 12 years ago
HostBalancer.java use supplied url port to get robots.txt in crawlers hostqueue 9 years ago
HostQueue.java use supplied url port to get robots.txt in crawlers hostqueue 9 years ago
LegacyBalancer.java use supplied url port to get robots.txt in crawlers hostqueue 9 years ago
RecrawlBusyThread.java init Recrawl job chunk size to max crawl loader during job start, to use some system preferences 9 years ago