Commit Graph

613 Commits (449e69743623cc73a694449890ed55d322bb9c70)

Author SHA1 Message Date
orbiter d376d81fc4 replaced busy thread control of crawl stacker by blocking threads
16 years ago
orbiter 7535fd7447 - refactoring of CrawlEntry and CrawlStacker
16 years ago
orbiter 2802138787 - refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck)
16 years ago
orbiter 4a2dac659e more speed hacks:
16 years ago
orbiter b0f2003792 fast database initialization and fast start-up of yacy:
16 years ago
orbiter 867d0f2f56 removed some unnecessary pause delays
16 years ago
orbiter 8c96bc2ac1 do not use proxy caching rules for crawling
16 years ago
orbiter dba7ef5144 extended crawling constraints:
16 years ago
orbiter 96174b2b56 more debugging / better result status logging for parser/caching errors
16 years ago
orbiter ef66438662 - more space in error db to store larger error messages
16 years ago
orbiter 674ad2d55b different handling of error cases that occur during loading files with http or ftp:
16 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target-computing methods
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off
16 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer
17 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage:
17 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache:
17 years ago
orbiter ce4715e305 removed indexing of anchor links and tagging such words as part of urls (that was wrong)
17 years ago
orbiter ce57de6cb3 - fixed re-setting of DHT Send/Receive settings
17 years ago
f1ori 7afa084207 * add native java trayicon, using reflection
17 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch
17 years ago
orbiter 7b35d54c6c fixed some problems with network switching (was not completely 'clean')
17 years ago
orbiter 2b13705839 fixed a mistake in indexing queue processing: documents were parsed before it was checked whether they should be indexed. Parsing was not necessary for this check, so the check was moved ahead of document parsing in the queue
17 years ago
orbiter 1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction
17 years ago
orbiter 670244849d fix for http://forum.yacy-websuche.de/viewtopic.php?p=9835#p9835
17 years ago
orbiter 1fb1665e71 increased dht interval to avoid peer selection failure
17 years ago
lotus b68d06a6e8 performance settings based on network's remote crawl speed
17 years ago
orbiter 3c6e8d2015 set default ppm when network is switched
17 years ago
orbiter 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM
17 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls
17 years ago
danielr 9ff4fc11da partial fix (images,audio,video) for proxy and content-type problem http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374
17 years ago
lotus d9d9c522a1 addendum to last commit
17 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
17 years ago
danielr 753a1ae430 - changed default browser from netscape to firefox
17 years ago
orbiter 7989335ed6 Preparations to replace the HTCache with a new storage data structure:
17 years ago
orbiter bdae051d9a - extended new performance graph (better timing)
17 years ago
danielr a087090bbb fixed: starting a crawl results in "No parser available to parse mimetype 'application/octet-stream'"
17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
orbiter ebb40d324b enhanced memory chart: now also shows the size of the word cache as a third vector.
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
orbiter 50ef5c406f - refactoring of robots parser (removed opaque Objects[] result vector)
17 years ago
lotus 62afea0c9f some improvements for yacyTray
17 years ago
lotus fa695c2d9f tray is now only shown on Windows and doesn't block on Linux
17 years ago
lotus d77ed28e2f temporarily disabled tray because of flaws on shell-only Linux
17 years ago
lotus f8a1e3175e new yacyTray
17 years ago
orbiter 1e6d12f146 Major update to BLOB data structures:
17 years ago
orbiter 1400cdc91e - refactoring of resourceObserver (moved it to crawler)
17 years ago
orbiter a6719dfd2b - refactoring of robots parser
17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections.
17 years ago