Commit Graph

1818 Commits (db3a06dd81322297d84a38eb63a6e55ca99840f0)

Author SHA1 Message Date
orbiter 029495e64d fixed bug introduced in SVN 5756 in EcoTable.put()
16 years ago
orbiter 96eaecda3e - added migration class to go from index collections to the index cell data structure.
16 years ago
orbiter 0f0b4aec75 better index cell merge logic
16 years ago
orbiter 832fef670f migration of urls-files into subdirectory METADATA
16 years ago
orbiter 37f892b988 added new concurrent merger class for IndexCell RWI data
16 years ago
orbiter b3f75e48fa - enhanced balancer: auto-solving of waiting-deadlocks
16 years ago
orbiter 9a90ea05e0 added a merge operation for IndexCell data structures
16 years ago
orbiter a9cea419ef Integration of the new index data structure IndexCell
16 years ago
orbiter 83792d9233 more refactoring
16 years ago
orbiter 474aac65af more refactoring
16 years ago
orbiter 209f25f5f5 refactoring to integrate indexCell data structures
16 years ago
orbiter 7dff1cba62 removed option to use different primary keys in kelondro tables
16 years ago
orbiter 7f67238f8b refactoring of plasmaWordIndex: less methods in the class, separated the index to CachedIndexCollection
16 years ago
orbiter 14a1c33823 refactoring of wordIndex class
16 years ago
orbiter e2e7949feb replaced old PPM computation with a better one that simply sums up events that had been stored in the profiling table.
16 years ago
lotus 9f7e62e900 refactoring
16 years ago
orbiter 100247bdda added also an export and delete-feature to the URLAnalysis. This completes the clean-up feature for URLs. To do a complete clean-up of the url database, start the following:
16 years ago
orbiter 60078cf322 added next tool for url analysis: check for references, that occur in the URL-DB but not in the RICOLLECTIONS
16 years ago
orbiter b1ddc4a83f do not merge collections if ram == false
16 years ago
orbiter b80db04667 - refactoring of IntegerHandleIndex and LongHandleIndex (better method names)
16 years ago
orbiter efcd95dc37 simplification of (internal) query process / refactoring
16 years ago
orbiter aa44d9bad9 more refactoring of kelondro.text / deleted de.anomic.index
16 years ago
orbiter 6ffc6e3389 more refactoring of indexer and kelondro classes;
16 years ago
orbiter 404bc21da9 simplification of (internal) query process / refactoring
16 years ago
orbiter 76ef5f0f14 refactoring of index package: better names for the classes (to be continued)
16 years ago
orbiter 2df57b1fd1 refactoring of index collection class
16 years ago
lotus 39a177649b * added upnp listener for devices that do not respond to discovery but advertise themselves
16 years ago
orbiter c12bb8a6d0 - refactoring of the http client
16 years ago
orbiter 62505bb3cb more bugfixes as recommendet by findbugs
16 years ago
orbiter 4db80065ac select more
16 years ago
orbiter 94c42691d8 - reject less transmissions as transmission receiver
16 years ago
orbiter 59427064fb first part of 'doubles' fix (not fully ready yet)
16 years ago
orbiter 26978b2a25 - better memory protection in kelondro caches: computation of needed memory for cache grow
16 years ago
lotus e9e2fff47a better scaling on performance graph
16 years ago
lotus 4aad461100 added UPnP support
16 years ago
orbiter be0c492ae5 fix for memory leak bug in new dht transmissions
16 years ago
orbiter 40d9849aa4 - better control of chunk size in dht selection
16 years ago
orbiter 411f2212f2 more memory leak fixing hacks
16 years ago
orbiter 985d421f91 found and fixed some memory leaks
16 years ago
orbiter 333489420b - fix for NPE when loading the cytag image
16 years ago
orbiter 6a32193916 - refactoring of cache naming in web index cache (no more dht semantics there)
16 years ago
orbiter 6c627dbdff update to the server core
16 years ago
orbiter 5393f356aa fix for termination problem
16 years ago
orbiter 6a876ecb88 first fixes to the DHT transmission process
16 years ago
orbiter c25c334b75 replaced old DHT transmission method with new method. Many things have changed! some of them:
16 years ago
orbiter 01b97ef3f8 added new cybertag-tracking feature that was inspired by itgrl
16 years ago
orbiter b57c9da1f8 - fixes to doc, ppt, xls parser: better title
16 years ago
orbiter 9d282d2c16 - renamed interactivesearch to yacyinteractive
16 years ago
orbiter d3e33fd6c1 removed strange retry logic from DHT transfer
16 years ago
orbiter ef82cced01 removed default line 'P2P WEB SEARCH' if no line is given
16 years ago
orbiter 94110df85a moved logging partially to kelondro
16 years ago
orbiter 024da2916b refactoring of logging
16 years ago
orbiter 83ce65707a (almost) completed partition of classes in kelondro
16 years ago
orbiter 7ee494fde5 more refactoring of kelondro:
16 years ago
lotus d4281b78da dynamic memory scale
16 years ago
orbiter bf93767ec6 refactoring of kelondro database classes
16 years ago
orbiter fc27bf8c4c refactoring of kelondro classes:
16 years ago
orbiter 419469ac27 added more methods to control the vertical DHT (not yet active .. )
16 years ago
orbiter dedfc7df7f removed distinction between DHT-in and DHT-out. This is necessary to make room for the new cell data structure, which cannot use this this distinction in the first place, but will enable the same meaning with different mechanisms (segments, later)
16 years ago
orbiter b74159feb8 preparations to integrate the new 'cell' index data structure
16 years ago
orbiter d1bace5e4d enhanced cleanup function
16 years ago
orbiter ff41da613e removed exception printout during load of snippets
16 years ago
orbiter bed38a5f8c fix for uncaught exception in RSSReader
16 years ago
orbiter a6b29cf72c reverted change of search event processing in SVN 5460. The new code did not work properly,
16 years ago
orbiter 9ef77d57f5 added an access control to the search interface using white/blacklists:
16 years ago
orbiter efe801173c better dht-in cache flush. see also:
16 years ago
orbiter e948df68ac longer timeout for queues during shutdown
16 years ago
orbiter b2a8c653ee small fixes
16 years ago
orbiter 4f45605f04 small update for timing in search result processing
16 years ago
orbiter b2b7edae18 fixed interactive search
16 years ago
lotus 2be119f0df adjusted big peer to 28M links
16 years ago
orbiter c6880ce28b removed the permanent cache flush and replaced it with a periodic cache flush
16 years ago
orbiter 6c7e83909b - refactoring of data access methods to be prepared for new cell data structure
16 years ago
orbiter c4c4c223b9 fixed a problem with attribute flags on RWI entries that prevented proper selection of index-of constraint
16 years ago
orbiter 6072831235 no cr transmission for robinson peers
16 years ago
orbiter be4c458951 refactoring (implemented Iterable in kelondroRowCollection)
16 years ago
orbiter b6bba18c37 replaced the storing procedure for the index ram cache with a method that generates BLOBHeap-compatible dumps
16 years ago
f1ori 025094675f * remove empty directory
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
f1ori 963da8c3f9 * updated tm-extractors to new version 1.0
16 years ago
orbiter e34ac22fbd - added new monitoring servlet at
16 years ago
orbiter d376d81fc4 replaced busy thread control of crawl stacker by blocking threads
16 years ago
orbiter 8cb7170b75 - set status of kelondroTree, kelondroBLOBTree and kelondroFlexTable to deprecated
16 years ago
orbiter 7535fd7447 - refactoring of CrawlEntry and CrawlStacker
16 years ago
orbiter 2802138787 - refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck)
16 years ago
orbiter 1779c3c507 - added a read cache to the RAFile interface to RandomAccessFile
16 years ago
orbiter 4a2dac659e more speed hacks:
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
lotus 1951d30a62 addendum to last commit
16 years ago
lotus 325ba7bfb8 only query words with length > 2
16 years ago
f1ori 5af8923f37 * distribute forgotten jar-file in parser
16 years ago
orbiter b0f2003792 fast database initialization and fast start.up of yacy:
16 years ago
orbiter 867d0f2f56 removed some unnecessary pause delays
16 years ago
orbiter 8c96bc2ac1 do not use proxy caching rules for crawling
16 years ago
orbiter dba7ef5144 extended crawling constraints:
16 years ago
orbiter 96174b2b56 more debugging / better result status logging for parser/caching errors
16 years ago
f1ori 90e78b2cf6 * improve encoding detection of http service
16 years ago
orbiter ef66438662 - more space in error db to store larger error messages
16 years ago
orbiter 674ad2d55b different handling of error cases that occur during loading files with http or ftp:
16 years ago
f1ori 7e1fe05e3c * added utf8-encoding to many getBytes-calls
16 years ago
lotus fad044fb54 update to snippet marker:
16 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target - computing methods
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off
16 years ago
f1ori 340ecd919d * include non ascii characters in visible characters
16 years ago
low012 00e27e5050 *) fixed bug which made it possible to write files outside of the DATA/LIST directory when creating a new blacklist
17 years ago
orbiter b098522977 some very small advances to index utf-8 (not working yet), inserted also debugging code
17 years ago
orbiter 2f49666908 integrated the character decoding into the parser, removed old code
17 years ago
orbiter 0edec2b760 FULL redesign of algorithms in htmlTools to encode/decode strings from/to unicode and html.
17 years ago
f1ori 2e53cbc66a should compile now
17 years ago
f1ori f3bf2e379e should compile again
17 years ago
f1ori dd8441f102 fix bug: data from plasmaParser is allready converted to UTF-8
17 years ago
orbiter 6941bf42b1 performance hacks
17 years ago
orbiter 9b0c4b1063 redesign of parts of the new BLOB buffer
17 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer
17 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage:
17 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache:
17 years ago
orbiter ffed5fc415 fixed problem with lost peers in database
17 years ago
orbiter 2d65887723 - fix for bug in new profile handling
17 years ago
orbiter ff68f394dd fix for problem with balancer and lost crawl profiles:
17 years ago
lotus fb8d9850ea fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1462
17 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions
17 years ago
orbiter 820a03f9d6 - removed some warnings
17 years ago
orbiter c8bdd965ec - larger update time for status page
17 years ago
orbiter ce4715e305 removed indexing of anchor links and tagging such words as part of urls (that was wrong)
17 years ago
orbiter ce57de6cb3 - fixed re-setting of DHT Send/Receive settings
17 years ago
f1ori 7afa084207 * add nativ java trayicon, using reflections
17 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch
17 years ago
orbiter 7b35d54c6c fixed some problems with network switching (was not completely 'clean')
17 years ago
orbiter f0b42e5a98 fixed NPE
17 years ago
orbiter 8e0de7f180 update to language statistic evaluation:
17 years ago
orbiter 1198eeecc7 added language selection to search query:
17 years ago
orbiter 00c1535f84 added ranking and evaluation of language type in a search
17 years ago
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information
17 years ago
orbiter e1f67262f7 - added and removed some debugging output
17 years ago
orbiter ce2a7ed116 integrated language detection classes into condenser environment
17 years ago
orbiter 2b13705839 fixed a mistake in indexing queue processing: documents had been parsed before it was checked if they should be indexed or not. parsing was not necessary for this check, so the check was moved in the queue in front of the document parsing
17 years ago
orbiter 1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction
17 years ago
orbiter 0cd0fee546 fixed bug with wrong proxy result enqueueing. See:
17 years ago
orbiter 670244849d fix for http://forum.yacy-websuche.de/viewtopic.php?p=9835#p9835
17 years ago
orbiter 5fbccfd75e fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1366&p=9348#p9348
17 years ago
orbiter 1fb1665e71 increased dht interval to avoid peer selection failure
17 years ago
orbiter 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced
17 years ago
lotus 3ded1efe84 kelondroExceptionCounter didn't work
17 years ago
lotus 383d89481e count errors before deleting collection.index
17 years ago
lotus 0bb4fbc403 delete corrupted collecion.index on exit for rebuild on next start
17 years ago
lotus b68d06a6e8 performance settings based on network's remote crawl speed
17 years ago
orbiter bb5c898441 enhancements to localsearch behavior
17 years ago
orbiter 3c6e8d2015 set default ppm when network is switched
17 years ago
orbiter 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM
17 years ago