Commit Graph

17 Commits (b0cab1e731f7d7f141cd9b2a18cd0680c7b63ddc)

Author SHA1 Message Date
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
19 years ago
theli fded1f4a5d *) better handling of maximum file size limit in crawler
19 years ago
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
19 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling
19 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file
19 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem
19 years ago
theli 7930839594 *) URL.java: userinfo was not taken over when generating a new url from a base url and a rel. path
19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
19 years ago
theli ab5a9bee66 *) adding some copyright headers
19 years ago
theli fce9e7741b *) next step of restructuring for new crawlers
19 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers
19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
19 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
19 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
19 years ago