Commit Graph

3898 Commits (c4ae2cd03fa8638a5fdc54a6d97907c6960fafc6)

Author SHA1 Message Date
orbiter ce1adf9955 serialized all logging using concurrency:
16 years ago
lotus aec3e7995a autoconfig.pac can be used to browse .yacy-domains only
16 years ago
orbiter bc6dd8194b refactoring: moved search query class to new search package
16 years ago
orbiter a4805defdd added stub for new search process
16 years ago
orbiter b8e738a7be a collection of
16 years ago
apfelmaennchen 39779e4796 DidYouMean: as I moved to only 8 consumer and 4 producer threads, I removed poison pills as it does not make sense anymore - threads are interrupted directly. Having a consumer thread per test case just didn't make sense either (see svn 6070) due to the massive overhead.
16 years ago
apfelmaennchen c3c4dd0933 DidYouMean - changed to much simpler LinkedBlockingQueue
16 years ago
apfelmaennchen 01ac1b5d7e - blocking queue implementation of DidYouMean
16 years ago
orbiter b8bb1bb364 join with a timeout does not cause that the corresponding thread is stopped after the time-out. It does only cause that the waiting is stopped. Here we need additionally a signal to the thread to stop after we finished waiting.
16 years ago
orbiter b69f22e9ca mistake in last commit: computation of loops in ReversingTwoConsecutiveLetters
16 years ago
orbiter 3130334932 - start first with threads that run more loops
16 years ago
apfelmaennchen 6cde7ebf16 DidYouMean
16 years ago
orbiter f348190566 tried to insert a database dump import method to the phpBB3 import function. Reason: imports or large database dumps are cannot be handled with phpMyAdmin and this should be an easy way to the database dumps into a mySQL database where it can be exported again with the phpBB3 content integration adapter. Completion or removal of this function stub will follow before next main release.
16 years ago
orbiter 945777aa80 replaced rwi term counting method by one that computes the maximum of the blobs that contibute to the RWI. An addition of the blob sizes is wrong/incorrect and does not reflect the real size. Truncation the size operation to the maximum of all blobs is also incorrect, but not as wrong as the sum of all blob sizes wich double-counts many rwi entries.
16 years ago
orbiter 7c4d1d471c hand-over of more specific object
16 years ago
apfelmaennchen 09acfa66d1 - improved "did you mean"
16 years ago
apfelmaennchen da6ce37f7b - fixed encoding problem
16 years ago
apfelmaennchen 54a48b4184 - added "did you mean" to search page
16 years ago
orbiter 550312ac85 added new command script to do a auto-Update from command line. this will make it easy to do mass-auto-updates in private yacy clusters
16 years ago
orbiter 0fc1168554 - reduced time-out for socket-connection communication from 20 seconds to 5 seconds. This is a test to find out if the time-out was a cause for problems in metager environments
16 years ago
orbiter 28b86385cd patch for bad behaving swf parser
16 years ago
orbiter d58b395993 fix for http://forum.yacy-websuche.de/viewtopic.php?p=15693#p15693
16 years ago
orbiter 733385cdd7 enahnced database access times by removal of unnecessary synchronization.
16 years ago
orbiter 398e210fef removed synchronization in logging that causes deadlocks in high-performance environments
16 years ago
orbiter db3a06dd81 removed cookie handling in httpc:
16 years ago
orbiter 1c54ae4a63 some small changes in HandleMap Testing
16 years ago
orbiter 2c5554c912 small enhancements in search result computation speed
16 years ago
orbiter e0b3984805 added navigation keys for site and author facets to remote search interface
16 years ago
orbiter 27fa6a66ad - completed the author navigation
16 years ago
orbiter a9a8b8d161 - added display of author navigation (usage of that navigator not yet implemented
16 years ago
orbiter c879783008 added steering of navigator computation:
16 years ago
orbiter c079b18ee7 - refactoring of IntegerHandleIndex and LongHandleIndex: both classes had been merged into the new HandleMap class, which handles (key<byte[]>,n-byte-long) pairs with arbitraty key and value length. This will be useful to get a memory-enhanced/minimized database table indexing.
16 years ago
orbiter bead0006da replaced tmp file extensions by prt
16 years ago
orbiter 3189f9cd39 fixed problem with DCEntry initialization
16 years ago
orbiter a704d82280 patch for problem with digest
16 years ago
orbiter 3029ef6eb3 fixed a bug that was recently inserted which caused that no idx and gap files were written.
16 years ago
orbiter b6e274f211 omit most of forced crawl delays by using a separat delay table which flushes delayed URLs at the correct time
16 years ago
orbiter d50be59088 - added a automatic re-construction of the domain stack after 10 minutes. this includes then urls to the domain stack that were left over in case of stack size limitations when the domain stack was created the last time
16 years ago
orbiter 5fdba0fa51 - fixed a not working selection rule in balancer
16 years ago
orbiter f5602404d5 another speed boost for the balancer
16 years ago
orbiter 95e8cbd1c3 new fully redesigned balancer and bugfixes regarding lost profile handles and killed crawls
16 years ago
orbiter c062385552 fix for http://forum.yacy-websuche.de/viewtopic.php?p=15555#p15555
16 years ago
orbiter 42ae40b9f6 some bugfixes to database close() methods
16 years ago
orbiter a0c53abbe1 - wait until local results are computed during search, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2167&hilit=&p=15521#p15521
16 years ago
orbiter 9bfd22f65d fix for http://forum.yacy-websuche.de/viewtopic.php?p=15523#p15523
16 years ago
orbiter 1c77db670f re-designed response format for navigation:
16 years ago
orbiter 15fad767c0 some refactoring of topic generation
16 years ago
orbiter cc49aedf12 - fixed problem with remote search NPE
16 years ago
f1ori 9e18abc2ac * fix charset detection, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2137
16 years ago
orbiter c38c852090 modified access method to get index entries out of a array of BLOBs:
16 years ago
orbiter ab06a6edd2 renamed topwords to topics and enhanced computation methods of topics
16 years ago
orbiter a5d481eab1 enhanced navigation
16 years ago
orbiter 7639ec2f38 - fixed letter case bug for dc record creation
16 years ago
orbiter 4522c13ee7 added option for a table prefix when importing phpbb3
16 years ago
orbiter 1c69d9b8b6 more refactoring of the index classes
16 years ago
orbiter 3d5f2ff544 - added new servlets to support search portal administrators for the integration of yacy search fields in their web pages
16 years ago
orbiter 4d4315687f fix for problem with concurrency in host navigator, bug reported by wsb
16 years ago
orbiter 88426912ad more refactoring to make the segment object easier to use and to be prepared to integrate author navigation
16 years ago
lotus d813fd26ed reset sent/received counters on index delete
16 years ago
orbiter 99bf0b8e41 refactoring of plasmaWordIndex:
16 years ago
orbiter 876746602d catch problems of file hash computation, see also:
16 years ago
orbiter fec6f9054f some refactoring of search methods
16 years ago
orbiter 3d4b826ca5 migration of all databases that use the deprecated BLOBTree format into the BLOBHeap format. Old databases are migrated automatically.
16 years ago
orbiter 4b4bddca00 added new submenu to crawler menu: import of phpbb3 forum postings from mysql
16 years ago
orbiter d8284046b0 enhanced speed of site navigation computation
16 years ago
orbiter c72a5cf326 added stub for PHPBB3 extraction code using direct access to mySQL
16 years ago
orbiter e735d3a69f fix for http://forum.yacy-websuche.de/viewtopic.php?p=15175#p15175
16 years ago
orbiter 63a0255166 - refactoring: added new content package, which will contain connector classes for different types of data sources to import texts into the YaCy index
16 years ago
orbiter f246928c20 first attempt to add 'real' Navigation to yacy search results: host navigation
16 years ago
orbiter 54b9e99c01 - more information about peer tags
16 years ago
orbiter 26a46b5521 increased default maximum file size for database files to 2GB
16 years ago
orbiter addecdb18c simplified code, removed one unused method in all implementing classes
16 years ago
borg-0300 47fce9020c small change (Orbiter's wish)
16 years ago
borg-0300 e07b14e5d7 finally a working fix for 5960
16 years ago
borg-0300 3ebb904d2c fix for 5960, http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2119
16 years ago
lotus 734680dc70 initialize the ResourceObsever in own thread
16 years ago
orbiter e005cfea37 fix for bug in -incell option of URLAnalysis
16 years ago
orbiter a7e392f31b The collection index will not be supported any more.
16 years ago
orbiter a2f48863fc - added prototype for navigation index
16 years ago
lotus 47fd226bdb proper parsing of sentences
16 years ago
orbiter 27eb8d62cb - new development cycle
16 years ago
orbiter b7457d3807 patch for http://forum.yacy-websuche.de/viewtopic.php?p=14720#p14720
16 years ago
orbiter bffbe43e09 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14522#p14522
16 years ago
orbiter f133d6065c fix for http://forum.yacy-websuche.de/viewtopic.php?p=14955#p14955
16 years ago
lotus 82af994041 added missing loglevel
16 years ago
orbiter ad9762746d no exception in case of uniq() time-out, see also
16 years ago
orbiter 1efe686e3f fix for http://forum.yacy-websuche.de/viewtopic.php?p=13960#p13960
16 years ago
lotus 13fb84ab81 you can define your default number of search results displayed by search.items
16 years ago
orbiter f2e4d156e8 removed debug messages
16 years ago
orbiter 709bfc2cd4 added a memory check in http post protocol
16 years ago
orbiter c01d6f43e1 - fixed problem with thread dump if no arguments are given
16 years ago
orbiter a49edd9415 fix for bug in search with site: constraint
16 years ago
orbiter c1e5fad9a7 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14767#p14767
16 years ago
orbiter 8ee3a94e82 fix for non-caching of sitehash, see http://forum.yacy-websuche.de/viewtopic.php?p=14440#p14440
16 years ago
borg-0300 21930d05ed fix for [B@...
16 years ago
orbiter b6ba387e01 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14751#p14751
16 years ago
orbiter 4338dcf936 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2093&hilit=
16 years ago
lotus bad7ce9286 experimental option trayIcon.force for unsupported platforms. java 1.6 needed
16 years ago
low012 ea27853c59 *) some refactoring
16 years ago
low012 d164b42604 *) cosmetics
16 years ago
orbiter 17150b2950 fixed bug in snippet computation
16 years ago
orbiter 89aeb318d3 enhanced the wikimedia dump import process
16 years ago
orbiter 5fb77116c6 added a submenu to index administration to import a wikimedia dump (i.e. a dump from wikipedia) into the YaCy index: see
16 years ago
hermens df733af4fa Try not to loose content from ram during IndexCell.delete by moving ram.delete after the dangerous operations on the array (array.get and array.delete)
16 years ago
hermens ac72005f2f Let IndexCell.remove remove entries from the ram portion of the DB as well.
16 years ago
orbiter 8ba7ff5353 a fix and another speed enhancement for the RWI cache
16 years ago
orbiter 05f077e85f added stack trace output to solve problem in
16 years ago
orbiter 71a4cadf31 better and more performant synchronization in SimpleARC, the caching object for word hashes. Speeds up indexing.
16 years ago
orbiter e6773cbb33 better handling of RWI cache for concurrency and less overhead when writing new entries -> even more indexing speed
16 years ago
orbiter c097531e3d added a catch Exception to all thread to check if any of them silently dies without any other notification
16 years ago
orbiter 083533e5ec fix for bugs in IODispatcher
16 years ago
orbiter 21fbca0410 better scaling of HEAP dump writer for small memory configurations;
16 years ago
orbiter 6e0b57284d better care for states of the IODispatcher
16 years ago
orbiter 1db9cdd4e4 fixed bug in writing of robots.txt entries in case that host names exceeded 64 characters and some other problems
16 years ago
f1ori bde88b684a * splitt off yacyRelease from yacyVersion
16 years ago
orbiter 057ce14c8e more fixes (character encoding, parser exceptions, http client failure, blob writing)
16 years ago
orbiter d2ac0aa682 - fixed possible bugs in Stack (may affect Crawler reset) and RandomAccess handling
16 years ago
lotus 1351d903a1 don't follow links like mailto:
16 years ago
orbiter e88a66bcae temporary disabling computation of all sublinks (check needed)
16 years ago
low012 ff5f82d780 *) removed description of removed commands from wikiHelp ([= =])
16 years ago
orbiter eacf95213a fix for crawling of mailto-links
16 years ago
orbiter 9c6ac43f66 fixes for wiki parser
16 years ago
orbiter 3a64c9d02f - fix for problem with concurrency when computing word hashes
16 years ago
orbiter d3f8aa5a2a set of small fixes
16 years ago
low012 78ffb61297 *) got rid of unnecessary variable which might also fix IndexOutOfBoundsException
16 years ago
orbiter d31e6f9c14 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14457#p14457
16 years ago
orbiter 8d6212233b fix for IODispatcher
16 years ago
orbiter f678472f46 fix for quote problem in json output
16 years ago
orbiter d079d6dfdb small changes in surrogate reader, wiki code and portal test
16 years ago
orbiter 07f09742bb set of small fixes and comments
16 years ago
borg-0300 06ed4ef7b3 * better picture handling
16 years ago
orbiter 5a634cab23 removed generation of anchor link sets in document types that describe container formats.
16 years ago
low012 f1244264b8 *) hopefully fixed bug reported in http://forum.yacy-websuche.de/viewtopic.php?t=2057
16 years ago
orbiter 2e3186189b fix for mediawikiIndex surrogate producer + added concurrency
16 years ago
apfelmaennchen 6f5ea7b1a8 small fix for previous post
16 years ago
apfelmaennchen 138a0747e3 added serverObjects.putJSON as JSON has very particulare encoding requirements
16 years ago
orbiter d977dd9a96 fix for surrogate loader
16 years ago
orbiter 9cb68353da fix for bug in ProfilingGraph for ppm >> 10000 ppm (!)
16 years ago
orbiter 9e4db75aac reduced internal logging and reduced memory that internal logging can use
16 years ago
orbiter c10c257255 attempt to fix a deadlock situation where the IODispatcher did not work.
16 years ago
orbiter 09987e93fd fixed some more bad handling of byte[]
16 years ago
orbiter 1bcc1450cb more explaining error message in case of IOExceptions during html parsing
16 years ago
orbiter fe51f4d668 less synchronization may help to prevent deadlocks
16 years ago
orbiter 58802e4201 added missing success test in storeDocumentIndex,
16 years ago
orbiter 171e62bee5 addition to the fix from last commit (which did not work)
16 years ago
orbiter 059949a0d1 tried to fix problem with snippet fetch for second search page when verify=false
16 years ago
lotus b08991e278 moved some constants, rename of Tray class
16 years ago
orbiter 138422990a - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated
16 years ago
orbiter 1b9e532c87 some concurrency for wikipedia dump reader
16 years ago
lotus 25d2160288 small fix
16 years ago