Commit Graph

3479 Commits (9559bc23fd96db7313dac65b2092fdf572fa3cf9)

Author SHA1 Message Date
orbiter 513179f404 changed interface to collectionIndex and adopted all implementing classes: 17 years ago
orbiter 9d64693cfb reverting again the changes to new concurrent chunkIterator 17 years ago
orbiter 45ad1c3dd5 - re-activated concurrent iterator for EcoFiles 17 years ago
orbiter 2e2120046f speed enhancement for BLOBHeap opening process 17 years ago
orbiter fa26a8f25a fix for deadlock-like behavior in balancer 17 years ago
orbiter 1918a0173e added more exception handling during crawling 17 years ago
orbiter 10f5ec1040 reverted last commit (more testing needed) 17 years ago
f1ori 5af8923f37 * distribute forgotten jar-file in parser 17 years ago
orbiter b0f2003792 fast database initialization and fast start-up of yacy: 17 years ago
orbiter 0ca4bc7b79 - added reader and visualization for mediawiki-export files: 17 years ago
danielr 2e63f03ca5 forgot copy&paste :/ 17 years ago
danielr cd8082b4e3 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1111#p11166 17 years ago
lotus 4f996a7651 fix for logparser pattern 17 years ago
f1ori d18c18971e * dirlisting in UTF-8 encoding 17 years ago
orbiter 867d0f2f56 removed some unnecessary pause delays 17 years ago
f1ori d49ffcd818 * files distributed by yacy are utf-8, files from repository use the system default charset 17 years ago
orbiter 8c96bc2ac1 do not use proxy caching rules for crawling 17 years ago
orbiter dba7ef5144 extended crawling constraints: 17 years ago
orbiter 96174b2b56 more debugging / better result status logging for parser/caching errors 17 years ago
f1ori 90e78b2cf6 * improve encoding detection of http service 17 years ago
orbiter ef66438662 - more space in error db to store larger error messages 17 years ago
orbiter 674ad2d55b different handling of error cases that occur during loading files with http or ftp: 17 years ago
danielr 538359a0ff simple fix to get DHT working again (maybe something more has to be done ;) 17 years ago
f1ori 7e1fe05e3c * added utf8-encoding to many getBytes-calls 17 years ago
lotus fad044fb54 update to snippet marker: 17 years ago
lotus 16723d0fa6 ask another peer if crawljob loading fails 17 years ago
orbiter 1b18d4bcf3 enhancement to crawling and remote crawling: 17 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target - computing methods 17 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation: 17 years ago
orbiter dd27ce7216 added control logic to ECO tables that deletes ram copies of the tables if they get too large 17 years ago
orbiter 38e6ba5d00 forgot to re-rename commonsPath 17 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off 17 years ago
f1ori 4b4ce75396 * http-server: submit charset from html metatags 17 years ago
f1ori 69e695bd4b * detect charset for directory index 17 years ago
f1ori 340ecd919d * include non ascii characters in visible characters 17 years ago
lotus 5cf0cbb47e javadoc 17 years ago
lotus 8d07607d1d update to resource observer: 17 years ago
f1ori d0543a7c39 * fix the debug ant-target 17 years ago
low012 baae3d91b1 *) fixed warning when compiling listManager 17 years ago
danielr a4fb76e93c undo r5300 (not fixed as seen after longer run) 17 years ago
low012 a99a629ed4 *) quick fix to prevent comments for blog entries which don't exist (http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1554) 17 years ago
low012 00e27e5050 *) fixed bug which made it possible to write files outside of the DATA/LIST directory when creating a new blacklist 17 years ago
danielr 0f9c0bd0d5 fix for ConcurrentModificationException at de.anomic.index.indexContainerHeap$heapCacheIterator.next(indexContainerHeap.java:324) 17 years ago
danielr 103ad2a437 some javadoc 17 years ago
orbiter b098522977 some very small advances to index utf-8 (not working yet), inserted also debugging code 17 years ago
orbiter 2f49666908 integrated the character decoding into the parser, removed old code 17 years ago
orbiter 49293c1358 fix for deadlock in new encoder :-( 17 years ago
orbiter 0edec2b760 FULL redesign of algorithms in htmlTools to encode/decode strings from/to unicode and html. 17 years ago
orbiter 958ec20cd0 removed specialized umlaut handling in html parser. This has to be replaced by something that is able to convert all possible html encodings into utf-8. Please see SVN 5293 for test cases. 17 years ago
f1ori 2e53cbc66a should compile now 17 years ago
f1ori f3bf2e379e should compile again 17 years ago
f1ori dd8441f102 fix bug: data from plasmaParser is already converted to UTF-8 17 years ago
orbiter 6941bf42b1 performance hacks 17 years ago
orbiter 9b0c4b1063 redesign of parts of the new BLOB buffer 17 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer 17 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage: 17 years ago
orbiter 382226da94 fix for bug introduced in SVN 5281: parameters were switched 17 years ago
danielr f2fd043797 refactoring (moved duplicate code into methods) 17 years ago
danielr c612046e5e r5278 java 1.5 compatible 17 years ago
f1ori af71ec93bf oops, forgot to import something 17 years ago
f1ori 9e65e9141c * always use UTF-8 for encoding hashes 17 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache: 17 years ago
danielr f095137238 - respecting httpdMaxBusySessions (refusing new connections if limit is hit) 17 years ago
orbiter 8ba33f104e fix for npe 17 years ago
orbiter 998861acfd - some refactoring in BLOBHeap to enable more gap processing functions 17 years ago
lotus 9d50bfd0b3 fix for npe: http://forum.yacy-websuche.de/viewtopic.php?p=10562 17 years ago
orbiter 766cad6e93 enhancement in memory management of BLOB Heap files / merging of deleted entries 17 years ago
orbiter 7860d5d632 fix for bug in seed list management (cause was bad class overloading, only visual effects!) 17 years ago
orbiter ffed5fc415 fixed problem with lost peers in database 17 years ago
orbiter 6fb865fbdc - fix of bug in iterator in kelondroBLOBHeap which caused bug in crawl profile listing 17 years ago
orbiter 2d65887723 - fix for bug in new profile handling 17 years ago
orbiter ff68f394dd fix for problem with balancer and lost crawl profiles: 17 years ago
lotus fb8d9850ea fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1462 17 years ago
lotus 0d1a2f6183 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1461 17 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions 17 years ago
orbiter 820a03f9d6 - removed some warnings 17 years ago
lotus fe2792e9ce use accept-language header instead of user agent for language detection 17 years ago
orbiter c8bdd965ec - larger update time for status page 17 years ago
lotus dda771db9d - search result layout 17 years ago
orbiter ce4715e305 removed indexing of anchor links and tagging such words as part of urls (that was wrong) 17 years ago
orbiter ce57de6cb3 - fixed re-setting of DHT Send/Receive settings 17 years ago
lotus 31c31e54e4 new tray icon image for different icon sizes (e.g. linux) 17 years ago
f1ori 9589dfe080 * removed trayicon popupmenu title 17 years ago
lotus 5a637f004d localized tray 17 years ago
lotus 9d4f0325e1 - removed shutdown from search page (we have it in tray now!) 17 years ago
lotus 214277dad6 - revert r5202 17 years ago
f1ori 7afa084207 * add native java trayicon, using reflection 17 years ago
apfelmaennchen b97ff24b43 bookmarksDB / xbel.xml: 17 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch 17 years ago
lotus 0a0cc3bf67 added missing classes to build target "run" 17 years ago
orbiter 7b35d54c6c fixed some problems with network switching (was not completely 'clean') 17 years ago
orbiter f0b42e5a98 fixed NPE 17 years ago
orbiter 8e0de7f180 update to language statistic evaluation: 17 years ago
orbiter 1198eeecc7 added language selection to search query: 17 years ago
orbiter 00c1535f84 added ranking and evaluation of language type in a search 17 years ago
lotus a81cb78211 finally some putHTML on htroot/xml/ 17 years ago
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information 17 years ago
apfelmaennchen 5e8bd0f29c small fixes to getpageinfo_p.xml and htmlFilterContentScraper.java with respect to keyword extraction 17 years ago
apfelmaennchen 5b2a57bfd0 - /xml/util/getpageinfo_p.xml added <desc> and <lang> tags 17 years ago
orbiter e1f67262f7 - added and removed some debugging output 17 years ago
orbiter ce2a7ed116 integrated language detection classes into condenser environment 17 years ago
orbiter 2b13705839 fixed a mistake in indexing queue processing: documents had been parsed before it was checked if they should be indexed or not. parsing was not necessary for this check, so the check was moved in the queue in front of the document parsing 17 years ago
orbiter 21dbb39afa switched two balancer cases 17 years ago
orbiter 1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction 17 years ago
orbiter ddcf285499 - fixed a bug in performance setting (did not work with german translation) 17 years ago
orbiter 0cd0fee546 fixed bug with wrong proxy result enqueueing. See: 17 years ago
orbiter 670244849d fix for http://forum.yacy-websuche.de/viewtopic.php?p=9835#p9835 17 years ago
lotus fd9233244e configurable free disk space via disk.free 17 years ago
orbiter 25a62cdc3f small fixes 17 years ago
lotus 73f233bb11 * set resource observer to 1000MB 17 years ago
orbiter 5fbccfd75e fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1366&p=9348#p9348 17 years ago
orbiter a28faabfd2 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1351&p=9242#p9242 17 years ago
apfelmaennchen 7b63c66a08 - bugfix in bookmarksDB.Tag.hasPublicItems() 17 years ago
orbiter 1fb1665e71 increased dht interval to avoid peer selection failure 17 years ago
orbiter 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced 17 years ago
f1ori ba76995d2c * fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1415 17 years ago
f1ori bea6c13139 * with r5137 robotParser didn't work at all -> fix 17 years ago
lotus 3ded1efe84 kelondroExceptionCounter didn't work 17 years ago
f1ori ae677e1738 * fix problem in robotparser, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1421&p=9742 17 years ago
lotus 383d89481e count errors before deleting collection.index 17 years ago
lotus 0bb4fbc403 delete corrupted collection.index on exit for rebuild on next start 17 years ago
lotus b68d06a6e8 performance settings based on network's remote crawl speed 17 years ago
danielr d60b2b198d proxy fixed 'not modified' http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1419 17 years ago
f1ori bd0318ba81 * YaCy only supports gzip-encoding, so remove any other encoding from request 17 years ago
orbiter bb5c898441 enhancements to localsearch behavior 17 years ago
orbiter 42e2d195ac added hint from http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1294 17 years ago
orbiter 39964e88fa fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1329#p9121 17 years ago
orbiter 3f3673b6e5 extended balancer: 17 years ago
orbiter 3c6e8d2015 set default ppm when network is switched 17 years ago
orbiter 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM 17 years ago
lotus 5ce9a100bb fix(2) for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1416 17 years ago
danielr cf29ca19d4 possible fix for POST character encoding http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374 17 years ago
danielr a2eeb6138c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1416 17 years ago
orbiter d09ddabd09 corrected a design mistake (5-byte hashes not necessary) 17 years ago
orbiter c97d0fcee7 modified the domain list export function: 17 years ago
orbiter 77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet 17 years ago
orbiter 80a7bc93d6 - added statistical evaluation about domains that appear during crawling 17 years ago
orbiter 4fbee21cea - added fetch-ahead again (had been removed in last commit) 17 years ago
lotus 423a89ebe8 * fix if yacy was installed to a path with whitespace 17 years ago
orbiter fc03b0437a fixed an error case where a second search after a first search with a different search word failed 17 years ago
orbiter eca171ba2e fix for case where javascript was not filtered by the html parser 17 years ago
lotus e645bae29f display table in log 17 years ago
orbiter ead39064c5 fixed problem with wrong result number calculation 17 years ago
hermens 2437beb96c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1360&p=9321#p9321 17 years ago
orbiter 7b12e77a63 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1393&hilit=&p=9655#p9655 17 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls 17 years ago
orbiter d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before) 17 years ago
lotus 3fbfd5a78b * fix for non-changing offset on new search term 17 years ago
danielr 219b93df6a - fixed internal error after receiving chunked POST 17 years ago
lotus c245c7a45e delete index.dhtin/out.heap if restore fails 17 years ago
danielr cd19d0aee6 - added warnings for failed transferRWI (dht-in) 17 years ago
orbiter df4ff423c4 added additional properties to query id's to distinguish search events better 17 years ago
danielr d6d9b0f14a fixed transferRWI.html 'Read timed out' 17 years ago
danielr e503158527 Proxy: fix for never ending loading after POST 17 years ago
danielr 1a1d57e449 Proxy: added binary passthrough for POST 17 years ago
apfelmaennchen aa6ae77e5e - autoReCrawl: fix for filter settings 17 years ago
apfelmaennchen 8ae29bad57 - fix to previous change of Crawl Profile Names 17 years ago
apfelmaennchen 434104e4a0 - change Crawl profile name for autoreCrawl 17 years ago
danielr 9ff4fc11da partial fix (images,audio,video) for proxy and content-type problem http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374 17 years ago
lotus 0df2e47012 changed auto recrawl to comply with new date format 17 years ago
lotus d9d9c522a1 addendum to last commit 17 years ago
lotus 480497f7c9 changed recrawl 17 years ago
orbiter da1b0b2fc6 added two new classes that will be used for the new htcache 17 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once: 17 years ago
borg-0300 08cdf6db8a fix for wrong "VegaYacyB" peers 17 years ago
danielr 4d937f6b21 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1396 17 years ago
apfelmaennchen bd931a82f7 - added dynamic filters to autoReCrawl.conf 17 years ago
apfelmaennchen b3fc5e96a3 - removed unused import from bookmarksDB 17 years ago
apfelmaennchen bc048db7b6 - bugfix for bookmarksDB's rebuildDates() 17 years ago
danielr 3c68905540 remove redundant null checks 17 years ago
danielr 753a1ae430 - changed default browser from netscape to firefox 17 years ago
orbiter 7989335ed6 Preparations to replace the HTCache with a new storage data structure: 17 years ago
danielr be28af50f5 - fixed "yacy2yacy no proxy"-problem 17 years ago
f1ori f99c307eff * correct debian build dependencies 17 years ago
orbiter bdae051d9a - extended new performance graph (better timing) 17 years ago
danielr d9cea5ff23 removed annotations which broke the build with java 1.5 17 years ago
danielr a087090bbb fixed: starting a crawl resulted in "No parser available to parse mimetype 'application/octet-stream'" 17 years ago
danielr 7e7e6a099a undo 5044 17 years ago
danielr f2d0bd7790 fix for NPE in JakartaHttpClient.setProxy 17 years ago
danielr bb6a6fc233 fixed 'FileUploadException Stream ended unexpectedly' 17 years ago
danielr 8422ee5ec4 - fixed UnsupportedEncoding (in proxy) using defaultCharset if no characterEncoding can be determined 17 years ago
hermens 3ac1988059 Add some sanity checks for invalid seeds 17 years ago
hermens cff4393f0c Fix HTCache so oldest Files get deleted first 17 years ago
danielr 31d97f2b9f replaced httpd.parseMultipart() by a 'right' implementation 17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net) 17 years ago
apfelmaennchen 0500b1179e added a 2 min start up delay to serverBusyThread autoReCrawl to avoid a Null Pointer Exception... 17 years ago
apfelmaennchen e1574fe02e - added autoReCrawl folders to bookmarks (DATA/SETTINGS/autoReCrawl.conf) 17 years ago
orbiter ebb40d324b enhanced memory chart: now also shows the size of the word cache as third vector. 17 years ago
danielr 17b7845eb5 * refactoring 17 years ago
danielr 3bb870bfcd added final where possible 17 years ago
lotus 7e92484400 fix for open browser on windows 2000 17 years ago
f1ori b0724e5ec0 * add config option to disable cookie monitoring (disabled by default) 17 years ago
lotus 0b2f67577e Index Transfer: 17 years ago
lotus 694084c570 fix for NPE on shutdown 17 years ago
lotus 5f77f55ed7 possible fix for negative speed values 17 years ago
orbiter 50ef5c406f - refactoring of robots parser (removed opaque Objects[] result vector) 17 years ago
danielr 7913bdb75b Flextable: filename in errormessage if inconsistent 17 years ago
lotus d42eae25f8 yacyTray: 17 years ago
orbiter c3d461d191 - removed superfluous copyright statement 17 years ago
orbiter 3ca98fee42 removed superfluous copyright statement 17 years ago