Commit Graph

387 Commits (304412a049435ccfcdabf3623aa4a0b244da4c4f)

Author SHA1 Message Date
hydrox 7e8669b15c *) added possibility to "recycle" a DHTChunk that failed to transfer.
19 years ago
auron_x 194d42b6a7 *) changed PPM-calculation to be more accurate
19 years ago
orbiter 2a9d868f6d - removed object cache from kelondroTree
19 years ago
orbiter 06854988da - full integration of new LURL database in INDEX
19 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
19 years ago
theli 3d152bfe43 *) Logging message added
19 years ago
orbiter 77a59a115d refactoring of indexing methods
19 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
19 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db
19 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
19 years ago
orbiter 0f10bdde22 more generic cache methods
19 years ago
hermens 440c6ee657 Implement alternative htcache layout
19 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
19 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
19 years ago
theli f17ce28b6d *) plasmaHTCache:
19 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
19 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
19 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
19 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
19 years ago
theli cd5f349666 *) Better handling of large files during parsing
19 years ago
orbiter df1629b05a - code cleanup
19 years ago
hermens 3f5a4153a0 Make Peers more receptible to transferred indexes
19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
19 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
19 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
19 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
19 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
19 years ago
theli d0a5a53789 *) changes needed for multi-language support
19 years ago
theli b0e8ff6eda *) some TODO makers for UTF-8 problem
19 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
19 years ago
orbiter 75b198bc02 - updated references to indexContainer
19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
19 years ago
orbiter 64bed59ee8 enhancements to ranking
19 years ago
orbiter a8bc768206 enhancements to ranking evaluation
19 years ago
orbiter 96c6e4e322 - enhancements to detailed search page
19 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs
19 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
19 years ago
orbiter 4866868c0e added write cache for LURLs
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
19 years ago
theli ffbf416e76 *) direct access to requestheader of htCache.Entry removed to make it more http independent
19 years ago
theli 3870d615e3 *) setting htCache.Entry fields to private
19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
19 years ago
theli ab5a9bee66 *) adding some copyright headers
19 years ago
theli 9ded4e8d5a *) Bugfix for name resolution in proxy mode
19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
19 years ago
theli b4acbdaa97 *) better handling of server shutdown
19 years ago
theli f3ac4dbbb9 *) better handling of server shutdown
19 years ago
orbiter 18b6876860 new cache flush configuration settings
19 years ago
orbiter 985dcbde7f changed some parameters that may cause better memory usage and more indexing speed
19 years ago
orbiter b7f4a1521b added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL
19 years ago
orbiter c26da4893b turned back NURL usage of kelondroTree, kelondroFlexTable has still problems with deleted entries
19 years ago
theli f80f776b89 *) Trying to solve NullpointerException problem in function addURLtoErrorDB
19 years ago
orbiter 1ce3c22761 better memory control:
19 years ago
orbiter 39b4c26bdc more memory control:
19 years ago
orbiter eb633c0a4f server threads must now supply a method that can be called in case
19 years ago
orbiter 8418af141a added several consistency checks and small changes
19 years ago
theli eee44be602 *) adding an interface for customized blacklist classes
19 years ago
theli d2e8e76218 *) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler
19 years ago
orbiter abf22f6e60 removed url normalform computation from htmlFilterContentScraper.
19 years ago
orbiter 314021453f * more logging
19 years ago
orbiter 80b6c90d54 enhancements to prevent blocking during dht transfer receive
19 years ago
theli 9f298083cd *) adding more urls to the error url
19 years ago
orbiter 279b1d969d Integrated new indexing data structure 'collections' into the main class
19 years ago
orbiter ebc2233092 * implemented (finished) class indexRowSetContainer
19 years ago
orbiter 9183d21f25 renamed new index class to old name
19 years ago
orbiter c4e922885a replaced indexURLEntry by new class that uses a kelondroRow.Entry object
19 years ago
orbiter e357599f92 * fixed problem with indexContainer iteration from RAM:
19 years ago
orbiter 5f72be2a95 some redesign of EURL storage
19 years ago
orbiter e4f1820b58 protection against too long authentication strings in switchboard
19 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL
19 years ago
orbiter 671fd9a5c9 work towards new indexing database structure
19 years ago
orbiter 92f4cb4d73 added option to configure the start-up delay time for kelondro database files.
19 years ago
orbiter 66964dc015 removed high/med/low from kelondroRecords cache control.
19 years ago
allo 67a8c74be3 Fix for dynamic login with static password.
19 years ago
allo ef9eb50c3c fix for adminlogin
19 years ago
allo 6fe2fed87e cookieauth works with static Admin.
19 years ago
theli 4ca0857c0c *) Index transfer now considers the pause time send by busy peers during
19 years ago
orbiter c75cacda95 added a flex-width-array: this is a table where it is
19 years ago
orbiter 5041d330ce refactoring
19 years ago
orbiter bd057b44dd - automatic setting of peer-does-not-accept-remote-crawl
19 years ago
orbiter cda087f43b - integrated cache miss storage into object cache
19 years ago
theli 61078b3885 *) adding support for delayed shutdown
19 years ago
orbiter 90d569d70f refactoring of index management:
19 years ago
orbiter a930be4ba3 refactoring of index management:
19 years ago
hermens df7e1d9df3 Changes to plasmaURL and subclasses:
19 years ago
orbiter a474669338 start with refactoring of index management
19 years ago
theli f331def5d8 *) Bugfix for distribution. Incorrect behavior if peerCount == selectedCount
19 years ago
theli bcc950c533 *) Bugfix for Index Transfer
19 years ago
orbiter 461548698c configuration of index transfer chunk size
19 years ago
hermens 51e3bb576f Don't increase dhtTransferIndexCount when the last transferred index was smaller
19 years ago
hermens a0ca4c5fb8 Remove a possible race condition between DHT transfer and deQueue
19 years ago
orbiter 60e5aff9fc some enhancements to the remote crawl trigger
19 years ago
orbiter 14d6e476c9 tried to solve some problems with new picture viewer
19 years ago
orbiter f0833b0328 introduced simple search interface
19 years ago
orbiter 83e0e765ec redesigned some parts of the html scanner & parser
19 years ago
orbiter e2e8d0c188 some kind of refactoring of yacysearch:
19 years ago
rramthun 250864406f ...
19 years ago
orbiter 63f39ac7b5 added 3 new crawling steering options:
19 years ago
orbiter 1fc3b34be6 some pre-work (without function yet) to implement:
19 years ago