327 Commits (0f10bdde22dc256b09fe1eb10ae967a3a021a3c6)

Author SHA1 Message Date
orbiter 0f10bdde22 more generic cache methods
19 years ago
hermens 440c6ee657 Implement alternative htcache layout
19 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
19 years ago
theli a9a0f51303 *) suppressing InterruptedException error message
19 years ago
theli f17ce28b6d *) plasmaHTCache:
19 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
19 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
19 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
19 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
19 years ago
theli cd5f349666 *) Better handling of large files during parsing
19 years ago
orbiter df1629b05a - code cleanup
19 years ago
hermens 3f5a4153a0 Make Peers more receptive to transferred indexes
19 years ago
theli b6c7b91582 *) Parser now throws a ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
19 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
19 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
19 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
19 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
19 years ago
theli d0a5a53789 *) changes needed for multi-language support
19 years ago
theli b0e8ff6eda *) some TODO markers for UTF-8 problem
19 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
19 years ago
orbiter 75b198bc02 - updated references to indexContainer
19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
19 years ago
orbiter 64bed59ee8 enhancements to ranking
19 years ago
orbiter a8bc768206 enhancements to ranking evaluation
19 years ago
orbiter 96c6e4e322 - enhancements to detailed search page
19 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs
19 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
19 years ago
orbiter 4866868c0e added write cache for LURLs
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
19 years ago
theli ffbf416e76 *) direct access to requestheader of htCache.Entry removed to make it more http independent
19 years ago
theli 3870d615e3 *) setting htCache.Entry fields to private
19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
19 years ago
theli ab5a9bee66 *) adding some copyright headers
19 years ago
theli 9ded4e8d5a *) Bugfix for name resolution in proxy mode
19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
19 years ago
theli b4acbdaa97 *) better handling of server shutdown
19 years ago
theli f3ac4dbbb9 *) better handling of server shutdown
19 years ago
orbiter 18b6876860 new cache flush configuration settings
19 years ago
orbiter 985dcbde7f changed some parameters that may cause better memory usage and more indexing speed
19 years ago
orbiter b7f4a1521b added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL
19 years ago
orbiter c26da4893b turned back NURL usage of kelondroTree, kelondroFlexTable has still problems with deleted entries
19 years ago
theli f80f776b89 *) Trying to solve NullpointerException problem in function addURLtoErrorDB
19 years ago
orbiter 1ce3c22761 better memory control:
19 years ago
orbiter 39b4c26bdc more memory control:
19 years ago
orbiter eb633c0a4f server threads must now supply a method that can be called in case
19 years ago
orbiter 8418af141a added several consistency checks and small changes
19 years ago
theli eee44be602 *) adding an interface for customized blacklist classes
19 years ago
theli d2e8e76218 *) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler
19 years ago
orbiter abf22f6e60 removed url normalform computation from htmlFilterContentScraper.
19 years ago