You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/crawler
luccioman fb3032c530
Added a crawl filtering possibility on documents Media Type (MIME)
7 years ago
..
data Added a crawl filtering possibility on documents Media Type (MIME) 7 years ago
retrieval Added support for enclosures (media links) to the RSS loader 7 years ago
robots Do locale independant case conversion on hosts, schemes, and file exts. 7 years ago
Balancer.java Fixed display of crawler pending URLs counts in HostBrowser.html page. 8 years ago
CrawlStacker.java Removed unncessary reflection usage for workflow tasks. 7 years ago
CrawlStarterFromScraper.java Updated a license header typo. 7 years ago
CrawlSwitchboard.java Added new recrawl job profile to the list of default crawl profiles 7 years ago
FileCrawlStarterTask.java fix typo 7 years ago
HarvestProcess.java fix for wrong display of error urls in HostBrowser 12 years ago
HostBalancer.java Removed time condition on HostBalancer initialization in JUnit test. 7 years ago
HostQueue.java to prevent crawler to concurrently access and alter same crawl queue 9 years ago
IllegalCrawlProfileException.java Crawl from local file : faster task end when manually terminating crawl. 8 years ago
LegacyBalancer.java use supplied url port to get robots.txt in crawlers hostqueue 9 years ago
RecrawlBusyThread.java Create recrawl requests with the relevant crawl profile. 7 years ago