Commit Graph

868 Commits (a564df3984087c18cc3cdb49da92b93d0b874237)

Author SHA1 Message Date
orbiter 66f78d67e0 bad idea. Concurrency in index management will be done differently
16 years ago
orbiter 7dff1cba62 removed option to use different primary keys in kelondro tables
16 years ago
orbiter 7f67238f8b refactoring of plasmaWordIndex: less methods in the class, separated the index to CachedIndexCollection
16 years ago
orbiter d49238a637 more performance hacks: better default values for scaling, less memory usage
16 years ago
orbiter 39644dc14e performance hacks to compare methods in database core
16 years ago
orbiter e2e7949feb replaced old PPM computation with a better one that simply sums up events that had been stored in the profiling table.
16 years ago
orbiter f6d989aa04 added new class RowSetArray which arranges RowSet objects like Elements in a hashtable, but still provides the functionality of sorted enumeration. The new class is now integrated into the ObjectIndexCache, which is the core class to provide index functions to all database files. The new index access is about twice as fast as before. This has strong speed enhancement effects on all parts of YaCy.
16 years ago
orbiter 6958eff196 removed unnecessary exceptions, extended testing in IntegerHandleIndex
16 years ago
orbiter 13c666adef performance hack to ObjectIndex put() method:
16 years ago
orbiter 1f1be1518c added stub for another performance hack: concurrent indexes
16 years ago
orbiter 3e4c28e188 enhanced count feature for kelondroRowSet. This is about twice as fast as before. Should speed up the collection analysis (half time!)
16 years ago
orbiter 84e37387a2 fix for last commit and more testing stub
16 years ago
orbiter ca006c506d stub for performance enhancements for RowSet (no functional change yet)
16 years ago
orbiter 100247bdda added also an export and delete-feature to the URLAnalysis. This completes the clean-up feature for URLs. To do a complete clean-up of the url database, start the following:
16 years ago
orbiter 60078cf322 added next tool for url analysis: check for references, that occur in the URL-DB but not in the RICOLLECTIONS
16 years ago
orbiter dbdd10da84 better logging and startup behaviour for referenceHash computation
16 years ago
orbiter d64836c34f added statistical analysis of URL reference
16 years ago
orbiter 3b28daab40 code-beautification (to be consistent with external documentation paper)
16 years ago
orbiter 485c9406e5 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1915&hilit=&p=13249#p13249
16 years ago
orbiter b80db04667 - refactoring of IntegerHandleIndex and LongHandleIndex (better method names)
16 years ago
orbiter 16f5c6a85e fixed merge method initialization in ReferenceContainer
16 years ago
orbiter d7a493b4f5 added experimental timeline api
16 years ago
orbiter efcd95dc37 simplification of (internal) query process / refactoring
16 years ago
orbiter d4b56d5819 added more asserts to BLOBHeap.flushBuffer() to fix the problem described in
16 years ago
orbiter aa44d9bad9 more refactoring of kelondro.text / deleted de.anomic.index
16 years ago
orbiter 6ffc6e3389 more refactoring of indexer and kelondro classes;
16 years ago
orbiter 2df57b1fd1 refactoring of index collection class
16 years ago
orbiter 8444357291 added new row iterator in kelondro table files that enumerates rows
16 years ago
orbiter 62505bb3cb more bugfixes as recommended by findbugs
16 years ago
orbiter 6b450d09ca some fixes recommended by findbugs
16 years ago
orbiter e04a0e05c3 fix for last commit
16 years ago
orbiter a9ad863686 second part of 'doubles' fix - better handling of doubles in RAMIndex. More logging.
16 years ago
orbiter 59427064fb first part of 'doubles' fix (not fully ready yet)
16 years ago
orbiter 26978b2a25 - better memory protection in kelondro caches: computation of needed memory for cache grow
16 years ago
hermens 2173865f92 Prevent race condition when switching timezones.
16 years ago
orbiter 30a1de41b3 disabled the BufferedIOChunks, because I consider it broken.
16 years ago
orbiter 411f2212f2 more memory leak fixing hacks
16 years ago
orbiter 333489420b - fix for NPE when loading the cytag image
16 years ago
orbiter c25c334b75 replaced old DHT transmission method with new method. Many things have changed! some of them:
16 years ago
orbiter 01b97ef3f8 added new cybertag-tracking feature that was inspired by itgrl
16 years ago
borg-0300 b19bc611b0 gc: better logging
16 years ago
orbiter b1f9c00118 fix for bug in merge operator initialization
16 years ago
orbiter b57c9da1f8 - fixes to doc, ppt, xls parser: better title
16 years ago
f1ori 7936e58fe7 * sorry, previous version didn't compile
16 years ago
f1ori 76cdc59789 * added some conversions to and from UTF-8
16 years ago
orbiter 94110df85a moved logging partially to kelondro
16 years ago
orbiter 024da2916b refactoring of logging
16 years ago
orbiter 83ce65707a (almost) completed partition of classes in kelondro
16 years ago
orbiter 7ee494fde5 more refactoring of kelondro:
16 years ago
orbiter bf93767ec6 refactoring of kelondro database classes
16 years ago
orbiter fc27bf8c4c refactoring of kelondro classes:
16 years ago
orbiter 6cbca1e508 extended last fix, preventing more sorts
16 years ago
orbiter f9672d3f97 applied fix for inefficient put method as recommended by celle, see
16 years ago
orbiter 3154926311 some better memory protection and OOM prevention in EcoFS
16 years ago
orbiter dedfc7df7f removed distinction between DHT-in and DHT-out. This is necessary to make room for the new cell data structure, which cannot use this distinction in the first place, but will enable the same meaning with different mechanisms (segments, later)
16 years ago
orbiter b74159feb8 preparations to integrate the new 'cell' index data structure
16 years ago
orbiter cb76d9e0e4 more synchronized in BLOBHeap (will not fix problem with Runtime-Error as reported in forum)
16 years ago
orbiter f675d47f86 better protection against database failures
16 years ago
orbiter 4d5b401f00 try to fix some performance problems with the internal index management:
16 years ago
orbiter c6880ce28b removed the permanent cache flush and replaced it with a periodic cache flush
16 years ago
orbiter ef7fe537c5 fixed a cache-bug in cachedFileRA
16 years ago
orbiter 6c7e83909b - refactoring of data access methods to be prepared for new cell data structure
16 years ago
orbiter 07fc115e90 removed active profiling in kelondroRowSet
16 years ago
orbiter be4c458951 refactoring (implemented Iterable in kelondroRowCollection)
16 years ago
orbiter b6bba18c37 replaced the storing procedure for the index ram cache with a method that generates BLOBHeap-compatible dumps
16 years ago
orbiter 3567c58b18 added another information field for BLOBHeap dumps: the gaps
16 years ago
orbiter abdd4aa414 added an index dump for blob heaps:
16 years ago
orbiter 8c3205b62e fix for OOB Exception
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
orbiter fc8189f3fb better self-healing of corrupted databases
16 years ago
orbiter f29b48d9ff patch for IndexOutOfBoundsException
16 years ago
orbiter 8cb7170b75 - set status of kelondroTree, kelondroBLOBTree and kelondroFlexTable to deprecated
16 years ago
orbiter 7cd08bd5fb fix for NPE in BLOBCompressor
16 years ago
orbiter 5b94498643 fine-tuning of cache usage from SVN 5386 and a bug fix for overflow in available() method
16 years ago
orbiter 1779c3c507 - added a read cache to the RAFile interface to RandomAccessFile
16 years ago
orbiter e1acdb952c fix for problem with userDB and bookmarksDB which was caused by changes in kelondroRA in SVN 5376
16 years ago
orbiter 4a2dac659e more speed hacks:
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
orbiter 759cef23dd fix for bug in kelondroAbstractRA.readFully
16 years ago
orbiter d39d420b39 performance hacks
16 years ago
orbiter 513179f404 changed interface to colletctionIndex and adapted all implementing classes:
16 years ago
orbiter 9d64693cfb reverting again the changes to new concurrent chunkIterator
16 years ago
orbiter 45ad1c3dd5 - re-activated concurrent iterator for EcoFiles
16 years ago
orbiter 2e2120046f speed enhancement for BLOBHeap opening process
16 years ago
orbiter 10f5ec1040 reverted last commit (more testing needed)
16 years ago
orbiter b0f2003792 fast database initialization and fast start-up of yacy:
16 years ago
orbiter ef66438662 - more space in error db to store larger error messages
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter dd27ce7216 added control logic to ECO tables that deletes ram copies of the tables if they get too large
16 years ago
orbiter 38e6ba5d00 forgot to re-rename commonsPath
16 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off
16 years ago
danielr 103ad2a437 some javadoc
16 years ago
orbiter 6941bf42b1 performance hacks
17 years ago
orbiter 9b0c4b1063 redesign of parts of the new BLOB buffer
17 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer
17 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage:
17 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache:
17 years ago
orbiter 998861acfd - some refactoring in BLOBHeap to enable more gap processing functions
17 years ago
orbiter 766cad6e93 enhancement in memory management of BLOB Heap files / merging of deleted entries
17 years ago
orbiter 7860d5d632 fix for bug in seed list management (cause was bad class overloading, only visual effects!)
17 years ago
orbiter ffed5fc415 fixed problem with lost peers in database
17 years ago
orbiter 6fb865fbdc - fix of bug in iterator in kelondroBLOBHeap which caused bug in crawl profile listing
17 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions
17 years ago
orbiter e1f67262f7 - added and removed some debugging output
17 years ago
orbiter 25a62cdc3f small fixes
17 years ago
orbiter 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced
17 years ago
lotus 0bb4fbc403 delete corrupted collecion.index on exit for rebuild on next start
17 years ago
orbiter 77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet
17 years ago
lotus e645bae29f display table in log
17 years ago
orbiter ead39064c5 fixed problem with wrong result number calculation
17 years ago
hermens 2437beb96c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1360&p=9321#p9321
17 years ago
orbiter 7b12e77a63 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1393&hilit=&p=9655#p9655
17 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls
17 years ago
orbiter d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before)
17 years ago
danielr 3c68905540 remove redundant null checks
17 years ago
danielr 753a1ae430 - changed default browser from netscape to firefox
17 years ago
orbiter 7989335ed6 Preparations to replace the HTCache with a new storage data structure:
17 years ago
orbiter bdae051d9a - extended new performance graph (better timing)
17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
danielr 7913bdb75b Flextable: filename in errormessage if inconsistent
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
orbiter 606b323a2d fixed bug that appeared when a new crawl is started
17 years ago
orbiter 38eb5bd1ee fixed a bug in kelondroBLOBHeap. The following files are probably inconsistent and should be deleted:
17 years ago
orbiter 28d5703f8a - fixed a bug in Robots.txt loader which could have caused robots.txt files to be loaded from the same domain more than once
17 years ago
orbiter 7b1c9e6aee discovered and removed a (possibly large) memory leak:
17 years ago
orbiter 0f5fe8cc53 refactoring of method calling for objects from kelondroMapDataMining
17 years ago
orbiter 4acf0a61cd refactoring of kelondroObjects (mainly renaming to kelondroMap)
17 years ago
orbiter 1e6d12f146 Major update to BLOB data structures:
17 years ago
orbiter 81f75f5056 - removed unnecessary classes (these objects are much easier to handle using generics)
17 years ago
orbiter b38f467e3c better SRU compliance
17 years ago
orbiter a6719dfd2b - refactoring of robots parser
17 years ago
orbiter 474e29ce4a added options to configure the 'corporate identity'-icons, the home page link and the greeting line from
17 years ago
orbiter 474659a71f - modified and enhanced the crawl balancer: better list export, fixing of damaged crawl queue at start-up, re-sorting at start-up to enhance domain order
17 years ago
danielr dba7ba079e fixed NPE seen with queues_p.xml (serverClassLoader finds already loaded classes)
17 years ago
orbiter 6bdd99e065 - more asserts to solve the ooB-problem
17 years ago
orbiter b928ae492a some code-cleanup and possible speed enhancements in different core methods
17 years ago
orbiter c998dc6556 - added security functions to flush url and search caches in case that memory is full
17 years ago
orbiter f4ae8082c3 - better error analysis for ooRange Exception in kelondroBase64Ordering
17 years ago
orbiter 84cbe75005 more asserts
17 years ago
orbiter e269c12710 small changes in partition routine
17 years ago
orbiter 31efb8fbee - fix for LOG path generation when the DATA/LOG does not exists (fix for bug introduced in SVN 4923)
17 years ago
orbiter 21c87c36e3 added a log line
17 years ago
danielr 68c38c2d34 - WatchCrawler shows status without JavaScript
17 years ago
orbiter 3330181aa0 refactoring:
17 years ago
orbiter 9a9737a54e fix for "no more elements available" exception
17 years ago
danielr 7feae906aa - organize imports
17 years ago
orbiter e91bf4c8cc - fix for bad reset of index / bad index location after deletion
17 years ago
orbiter 25192e0d36 added a deletion button to indexControlRWIs that deletes the complete web index
17 years ago
orbiter cfe6790498 - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),
17 years ago
orbiter 5fde679acb - fixed problem in performance configuration
17 years ago
orbiter db032fb6de - added RWI transmissions to the event terminal
17 years ago
danielr 0d3808bd9e minor refactoring
17 years ago
danielr d4bce6affd refactoring (initialized static fields, removed empty if/else, serialized some fields in serializable classes)
17 years ago
orbiter d0678f7ab9 refactoring as result of
17 years ago
orbiter 32b5b057b9 - modified, simplified old kelondroHTCache object; I believe it should be replaced by something completely new
17 years ago
orbiter b9a2a2d287 more search performance hacks
17 years ago
orbiter ff755fb858 small corrections and enhancements after search timing profiling
17 years ago
danielr 48ffd61e6a changed "patched wrong" to warning, so it goes to the logfile
17 years ago
orbiter 2f629d20a7 - tried to fix the '4217666-problem'
17 years ago
orbiter 45ae3da7e7 another patch to prevent NPE in EcoTable
17 years ago
orbiter 93376acdca fixed a bad chunkcache limit check which could have caused ArrayIndexOutOfBoundsExceptions
17 years ago
orbiter 1cab240198 patch for possible NPE in EcoTable iterator
17 years ago
orbiter 8fe39ebd74 -fixed file transmission with POST. The only usage was in ranking transmission, therefore:
17 years ago
orbiter 444dce7e81 more performance hacks
17 years ago
orbiter 2c2dcd12a2 - enhanced performance of Eco-Tables: less time-consuming size() - operations
17 years ago
orbiter 14404d31a8 - enhanced performance graph (more info)
17 years ago
orbiter 696b8ee3f5 fix for http://forum.yacy-websuche.de/viewtopic.php?p=6806#p6806
17 years ago
orbiter 225f9fd429 various fixes
17 years ago
orbiter 6e36c156e8 added more logging to EcoFS
17 years ago
orbiter 319144f4b2 fix for outofbounds-exception in EcoFS chunk iterator
17 years ago
orbiter a9cf6cf2f4 generalization of index container-heap class.
17 years ago
orbiter 5e4fddc1e6 more logging for new EcoFS.ChunkIterator to find bug for
17 years ago
orbiter 117ae78001 speed enhancement for reading of eco-table indexes
17 years ago
danielr 5c3c1fdf41 replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;)
17 years ago
orbiter 783a4c9edb strong speed enhancements for the index cache dump and restore:
17 years ago
orbiter d2f4926951 - more logging for balancer to get a hint where the problem is
17 years ago
orbiter 20dadba426 - added a deadlock prevention function in cache flushing
17 years ago
orbiter 764a40e37d speed enhancements for crawler and url retrieval (affects also search speed)
17 years ago
orbiter 3ce3a4a3a1 added stub for new index container heap data structure (purpose: index folding)
17 years ago
orbiter 968c775025 - preparation of parsing/indexing queue for concurrent execution
17 years ago
orbiter d6050b9ffb - separated the LURL data storage and Crawl result stack for process supervision.
17 years ago
orbiter fba46c51d7 fixed non-termination bug in qsort
17 years ago
orbiter 541b817502 refactoring of switchboard queueing
17 years ago
orbiter fc94fbe224 another improvement to the collection sorting
17 years ago
orbiter 11270d450e better quicksort-pivot computation: 30% faster (measured with test program)
17 years ago
orbiter 3e44293f07 - fixed a problem with thread pools in row collection
17 years ago
danielr e43051b125 - fixed Threaddump output (html-escaped ie. <init>)
17 years ago
orbiter 433ff855f7 - fixed another concurrency problem in collection sorting
17 years ago
orbiter 19286fa2d1 tried to fix seed2.old.db-problem
17 years ago
orbiter f3996e63b8 tried to fix more deadlocks:
17 years ago
orbiter fa1090113d - next try to fix the networking problem:
17 years ago
orbiter 65785da8f2 new method for best hash computation
17 years ago
orbiter 9eddc1506b - one try to fix the httpd problem
17 years ago
orbiter 7cc4ff05c9 some code enhancements and bugfixes
17 years ago
orbiter 275a226cc5 refactoring
17 years ago
danielr fbe335db73 consistent use of de.anomic.server.serverMemory to get information about memory statistics
17 years ago
orbiter 4fdf695064 - fixed a bug in remote search that prevented any results from being generated (!)
17 years ago
orbiter 1dce2f1079 more multithreading support:
17 years ago