Commit Graph

46 Commits (69521d92e5f75355ad668712c56b0374f39253ec)

Author SHA1 Message Date
orbiter af10f729df fixed image search and favicon loading
17 years ago
orbiter 6eaa5a0e64 enhanced local search speed. The ranking process is now 6 times faster that before.
17 years ago
orbiter 55c87b3b12 changed behavior of crawl stacker
18 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
18 years ago
orbiter c1440d2241 fixed problem with redirection: redirected URLs had not been tested with the double-check
18 years ago
orbiter 01e0669264 re-designed some parts of DHT position calculation (effect is the same as before)
18 years ago
orbiter 842308ea97 - redesigned crawl start menu, integrated monitoring pages
18 years ago
orbiter 2f1ff048ba some fixes to socket connection time-out
18 years ago
orbiter 11b4f80bde - fixed non-closing client connections
18 years ago
orbiter 1488769e1f cleanup of unmaintained and outdated performance methods:
18 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation:
18 years ago
orbiter bb426565f0 added new yacy protocol for mass url-pull for better remote crawling distribution
18 years ago
orbiter f890cc86aa inserted forwarding patch from fuchs
18 years ago
orbiter b5346141b3 made the plasmaHTCache static (there is only one internet, so we need only one cache)
18 years ago
orbiter 57a5b6fa71 some generalization of remote proxy configuration and setting handling in httpc
18 years ago
orbiter 40b0547611 - documentaton changes (removed old forum links)
18 years ago
rramthun 18a5380ee3 *) situation-dependent lock-buttons for search-page
18 years ago
orbiter 861f41e67e redesigned NURL-handling:
18 years ago
theli d157201e08 *) IfesL for "Unexpected end of ZLIB" error message
18 years ago
orbiter 109ed0a0bb - cleaned up code; removed methods to write the old data structures
18 years ago
orbiter 30888e7a2f implementation of search constraints
18 years ago
orbiter 497428c8ec refactoring
18 years ago
orbiter 76fceb9997 refactoring
18 years ago
orbiter bb7d4b5d5e refactoring to prepare new RWI entry object
18 years ago
theli a5b9b514c1 *) retry crawling without content-encoding if the content-encoding header was not correct
19 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
19 years ago
theli f17ce28b6d *) plasmaHTCache:
19 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
19 years ago
orbiter df1629b05a - code cleanup
19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
19 years ago
theli fded1f4a5d *) better handling of maximum file size limit in crawler
19 years ago
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
19 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling
19 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file
19 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem
19 years ago
theli 7930839594 *) URL.java: userinfo was not taken over when generating a new url from a base url and a rel. path
19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
19 years ago
theli ab5a9bee66 *) adding some copyright headers
19 years ago
theli fce9e7741b *) next step of restructuring for new crawlers
19 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers
19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
19 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
19 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
19 years ago