Commit Graph

922 Commits (f1bde59c508d4b952baca3cfe178726b0276d77c)

Author SHA1 Message Date
orbiter 1d8d51075c refactoring:
16 years ago
orbiter 5bb8074150 removed the indexing queue. This queue was superfluous since the introduction of the blocking queues last year, where documents are parsed, analysed and stored in the index with concurrency.
16 years ago
orbiter b332dfad67 - inserted request object into response object which carries this now instead generating new objects
16 years ago
orbiter ca72ed7526 -removed superfluous crawl cache
16 years ago
orbiter 3f113f38a8 removed unused imports
16 years ago
f1ori f814e0fa81 enable warnings and fix most of it
16 years ago
orbiter 21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted into Idiom, Parser and Classification classes
16 years ago
orbiter 0e8647d62f refactoring of search classes
16 years ago
orbiter dafffd0153 refactoring of parsers and document processing
16 years ago
orbiter 77d2a3782c removed strange debugging strings
16 years ago
orbiter 16efcd0366 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2252&hilit=&p=16389#p16389
16 years ago
orbiter 24cb6d68bc - renamed Stack to RecordStack to avoid name confusion with new classes
16 years ago
orbiter 409538e17a code cleanup and code simplifcation
16 years ago
orbiter 1f1399e5c5 extending visibility of objects and methods to avoid synthetic accessor methods and increase performance
16 years ago
orbiter 222850414e simplification of the code: removed unused classes, methods and variables
16 years ago
orbiter 93dfb51fd4 problems with code style
16 years ago
orbiter adf01c676e reduce lookup time when merging a large number of BLOBs
16 years ago
orbiter 9a674d8047 - After the removal of the Tree class some code simplifications are possible. This affects mostly the Records class, which can be refactored and the result of the refactoring results in a reduced number of classes.
16 years ago
orbiter c5122d6836 completed migration of BLOBTree to BLOBHeaps:
16 years ago
orbiter 6b307d6d59 more tolerance for corrupted index entries in exported row sets
16 years ago
orbiter 33aafa9b4b better logging when writing merged dumps
16 years ago
orbiter 4d29e90708 uaeh
16 years ago
orbiter 3c3e6499ae added more logging for merge operation
16 years ago
orbiter 15180fc95e - patch for future computation in SplitTable
16 years ago
orbiter 9a5ec20b3c avoid merge during startup
16 years ago
orbiter ae015e8e98 refactoring of blob package classes
16 years ago
orbiter be1c7ddc64 refactoring of search classes -- moved Ranking Profile to search package
16 years ago
orbiter 5a7fd6b4c8 just some comment lines
16 years ago
orbiter ce1adf9955 serialized all logging using concurrency:
16 years ago
orbiter bc6dd8194b refactoring: moved search query class to new search package
16 years ago
orbiter b8e738a7be a collection of
16 years ago
orbiter 945777aa80 replaced rwi term counting method by one that computes the maximum of the blobs that contibute to the RWI. An addition of the blob sizes is wrong/incorrect and does not reflect the real size. Truncation the size operation to the maximum of all blobs is also incorrect, but not as wrong as the sum of all blob sizes wich double-counts many rwi entries.
16 years ago
orbiter 733385cdd7 enahnced database access times by removal of unnecessary synchronization.
16 years ago
orbiter 1c54ae4a63 some small changes in HandleMap Testing
16 years ago
orbiter 2c5554c912 small enhancements in search result computation speed
16 years ago
orbiter 27fa6a66ad - completed the author navigation
16 years ago
orbiter c079b18ee7 - refactoring of IntegerHandleIndex and LongHandleIndex: both classes had been merged into the new HandleMap class, which handles (key<byte[]>,n-byte-long) pairs with arbitraty key and value length. This will be useful to get a memory-enhanced/minimized database table indexing.
16 years ago
orbiter bead0006da replaced tmp file extensions by prt
16 years ago
orbiter a704d82280 patch for problem with digest
16 years ago
orbiter 3029ef6eb3 fixed a bug that was recently inserted which caused that no idx and gap files were written.
16 years ago
orbiter 95e8cbd1c3 new fully redesigned balancer and bugfixes regarding lost profile handles and killed crawls
16 years ago
orbiter 42ae40b9f6 some bugfixes to database close() methods
16 years ago
orbiter 9bfd22f65d fix for http://forum.yacy-websuche.de/viewtopic.php?p=15523#p15523
16 years ago
orbiter cc49aedf12 - fixed problem with remote search NPE
16 years ago
orbiter c38c852090 modified access method to get index entries out of a array of BLOBs:
16 years ago
orbiter a5d481eab1 enhanced navigation
16 years ago
orbiter 1c69d9b8b6 more refactoring of the index classes
16 years ago
orbiter 88426912ad more refactoring to make the segment object easier to use and to be prepared to integrate author navigation
16 years ago
lotus d813fd26ed reset sent/received counters on index delete
16 years ago
orbiter 99bf0b8e41 refactoring of plasmaWordIndex:
16 years ago
orbiter 876746602d catch problems of file hash computation, see also:
16 years ago
orbiter fec6f9054f some refactoring of search methods
16 years ago
orbiter 3d4b826ca5 migration of all databases that use the deprecated BLOBTree format into the BLOBHeap format. Old databases are migrated automatically.
16 years ago
orbiter 26a46b5521 increased default maximum file size for database files to 2GB
16 years ago
orbiter addecdb18c simplified code, removed one unused method in all implementing classes
16 years ago
borg-0300 47fce9020c small change (Orbiter's wish)
16 years ago
borg-0300 e07b14e5d7 finally a working fix for 5960
16 years ago
borg-0300 3ebb904d2c fix for 5960, http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2119
16 years ago
orbiter e005cfea37 fix for bug in -incell option of URLAnalysis
16 years ago
orbiter a7e392f31b The collection index will not be supported any more.
16 years ago
orbiter a2f48863fc - added prototype for navigation index
16 years ago
orbiter b7457d3807 patch for http://forum.yacy-websuche.de/viewtopic.php?p=14720#p14720
16 years ago
orbiter f133d6065c fix for http://forum.yacy-websuche.de/viewtopic.php?p=14955#p14955
16 years ago
orbiter ad9762746d no exception in case of uniq() time-out, see also
16 years ago
orbiter f2e4d156e8 removed debug messages
16 years ago
orbiter c1e5fad9a7 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14767#p14767
16 years ago
orbiter 8ee3a94e82 fix for non-caching of sitehash, see http://forum.yacy-websuche.de/viewtopic.php?p=14440#p14440
16 years ago
borg-0300 21930d05ed fix for [B@...
16 years ago
orbiter b6ba387e01 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14751#p14751
16 years ago
low012 d164b42604 *) cosmetics
16 years ago
orbiter 5fb77116c6 added a submenu to index administration to import a wikimedia dump (i.e. a dump from wikipedia) into the YaCy index: see
16 years ago
hermens df733af4fa Try not to loose content from ram during IndexCell.delete by moving ram.delete after the dangerous operations on the array (array.get and array.delete)
16 years ago
hermens ac72005f2f Let IndexCell.remove remove entries from the ram portion of the DB as well.
16 years ago
orbiter 8ba7ff5353 a fix and another speed enhancement for the RWI cache
16 years ago
orbiter 05f077e85f added stack trace output to solve problem in
16 years ago
orbiter 71a4cadf31 better and more performant synchronization in SimpleARC, the caching object for word hashes. Speeds up indexing.
16 years ago
orbiter e6773cbb33 better handling of RWI cache for concurrency and less overhead when writing new entries -> even more indexing speed
16 years ago
orbiter c097531e3d added a catch Exception to all thread to check if any of them silently dies without any other notification
16 years ago
orbiter 083533e5ec fix for bugs in IODispatcher
16 years ago
orbiter 21fbca0410 better scaling of HEAP dump writer for small memory configurations;
16 years ago
orbiter 6e0b57284d better care for states of the IODispatcher
16 years ago
orbiter 1db9cdd4e4 fixed bug in writing of robots.txt entries in case that host names exceeded 64 characters and some other problems
16 years ago
orbiter 057ce14c8e more fixes (character encoding, parser exceptions, http client failure, blob writing)
16 years ago
orbiter d2ac0aa682 - fixed possible bugs in Stack (may affect Crawler reset) and RandomAccess handling
16 years ago
orbiter 8d6212233b fix for IODispatcher
16 years ago
orbiter 07f09742bb set of small fixes and comments
16 years ago
orbiter 9e4db75aac reduced internal logging and reduced memory that internal logging can use
16 years ago
orbiter c10c257255 attempt to fix a deadlock situation where the IODispatcher did not work.
16 years ago
orbiter fe51f4d668 less synchronization may help to prevent deadlocks
16 years ago
orbiter 138422990a - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated
16 years ago
orbiter 16baa7ad24 To translate a mediawiki dump into the YaCy surrogate format do the following:
16 years ago
orbiter 5195c94838 two patches for performance enhancements of the index handover process from documents to the index cache:
16 years ago
orbiter 9416f5c26f more speed test cases: kelondro provides map functions that are more than 20% faster than standard java classes and use less than halve of the memory of java classes:
16 years ago
orbiter b53790abb1 more performance hacks: 10% more speed for Base64.compare() which is really often used in YaCy code
16 years ago
orbiter 8ffb9889e1 some fixes and performance hacks
16 years ago
orbiter dfb96ecb72 more fixes
16 years ago
orbiter 1b8d346b4c fixes in connection with transiton to byte[] hashes
16 years ago
orbiter 996572de95 quickfix
16 years ago
orbiter 380ed2dac0 performance and debugging additions
16 years ago
f1ori 76af84d732 * add custom comparator to ScoreCluster for byte[]
16 years ago