Commit Graph

4385 Commits (7bcfa033c93da424d5f535bc1a62bdb1a0990602)

Author SHA1 Message Date
orbiter 154bbc3364 code cleanup: call of static methods directly to the class
16 years ago
orbiter 222850414e simplification of the code: removed unused classes, methods and variables
16 years ago
orbiter 93dfb51fd4 problems with code style
16 years ago
orbiter adf01c676e reduce lookup time when merging a large number of BLOBs
16 years ago
orbiter 9a674d8047 - After the removal of the Tree class some code simplifications are possible. This affects mostly the Records class, which can be refactored and the result of the refactoring results in a reduced number of classes.
16 years ago
orbiter c5122d6836 completed migration of BLOBTree to BLOBHeaps:
16 years ago
orbiter d1083a6913 maybe we have less problems with open connections to the server if we don't do BF forced sleeps (just a test)
16 years ago
low012 ebe6c823ac *) changed svn properties agains (hopefully doing it right this time)
16 years ago
low012 a80ac3a415 *) fixed wrong parser descriptions
16 years ago
low012 457b6c0d6d *) updated Apache POI library to be able to parse Visio files
16 years ago
apfelmaennchen a10c8022d1 DidYouMean:
16 years ago
f1ori 7eb3bff5b3 * workaround for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2220&hilit=#p16128
16 years ago
orbiter 99fa265e1d fix for search bug caused by tenant patch
16 years ago
orbiter 79875782af be a bit more lazy when removing domain navigation entries
16 years ago
orbiter 57af311627 fix for wrong urls in navigator when a tenant is used
16 years ago
lotus 76b96337e2 just some chatty code
16 years ago
low012 91785d895c *) minor changes in comments
16 years ago
orbiter bdda140c02 fix for json output (no doubleqotes any more, doublequote quoting did not work)
16 years ago
orbiter 2f84736120 ignore signature files that cannot be downloaded because of failed encoding
16 years ago
orbiter 041d9c253e some refactoring and more error-awareness in LogalizeHandler
16 years ago
orbiter 6b307d6d59 more tolerance for corrupted index entries in exported row sets
16 years ago
orbiter 33aafa9b4b better logging when writing merged dumps
16 years ago
lotus db70badcf0 possibility to set remote host on upnp device
16 years ago
orbiter 4d29e90708 uaeh
16 years ago
orbiter 3c3e6499ae added more logging for merge operation
16 years ago
orbiter 15180fc95e - patch for future computation in SplitTable
16 years ago
orbiter 9a5ec20b3c avoid merge during startup
16 years ago
lotus bf6b92343c try to avoid stuck pdf parser
16 years ago
lotus c695c7f512 try to remove hung swf parser from queue
16 years ago
orbiter fc69a76197 update to web structure picture:
16 years ago
orbiter ae015e8e98 refactoring of blob package classes
16 years ago
orbiter 8b8877c233 moved image collector
16 years ago
orbiter be1c7ddc64 refactoring of search classes -- moved Ranking Profile to search package
16 years ago
orbiter fd31a3616a - more logging in server process
16 years ago
orbiter 5a7fd6b4c8 just some comment lines
16 years ago
orbiter 31f60a3b3e when doing searches, also apply a online caution to DHT transmission and stop transmissions while heavy load caused by searching. This omits the many requests to the URL database that are needed for DHT transfer and it avoids collisions with URL retrieval needed for search results.
16 years ago
orbiter 17dc6d4be5 small fix for new Logger
16 years ago
orbiter ce1adf9955 serialized all logging using concurrency:
16 years ago
lotus aec3e7995a autoconfig.pac can be used to browse .yacy-domains only
16 years ago
orbiter bc6dd8194b refactoring: moved search query class to new search package
16 years ago
orbiter a4805defdd added stub for new search process
16 years ago
orbiter b8e738a7be a collection of
16 years ago
apfelmaennchen 39779e4796 DidYouMean: as I moved to only 8 consumer and 4 producer threads, I removed poison pills as it does not make sense anymore - threads are interrupted directly. Having a consumer thread per test case just didn't make sense either (see svn 6070) due to the massive overhead.
16 years ago
apfelmaennchen c3c4dd0933 DidYouMean - changed to much simpler LinkedBlockingQueue
16 years ago
apfelmaennchen 01ac1b5d7e - blocking queue implementation of DidYouMean
16 years ago
orbiter b8bb1bb364 join with a timeout does not cause that the corresponding thread is stopped after the time-out. It does only cause that the waiting is stopped. Here we need additionally a signal to the thread to stop after we finished waiting.
16 years ago
orbiter b69f22e9ca mistake in last commit: computation of loops in ReversingTwoConsecutiveLetters
16 years ago
orbiter 3130334932 - start first with threads that run more loops
16 years ago
apfelmaennchen 6cde7ebf16 DidYouMean
16 years ago
orbiter f348190566 tried to insert a database dump import method to the phpBB3 import function. Reason: imports or large database dumps are cannot be handled with phpMyAdmin and this should be an easy way to the database dumps into a mySQL database where it can be exported again with the phpBB3 content integration adapter. Completion or removal of this function stub will follow before next main release.
16 years ago
orbiter 945777aa80 replaced rwi term counting method by one that computes the maximum of the blobs that contibute to the RWI. An addition of the blob sizes is wrong/incorrect and does not reflect the real size. Truncation the size operation to the maximum of all blobs is also incorrect, but not as wrong as the sum of all blob sizes wich double-counts many rwi entries.
16 years ago
orbiter 7c4d1d471c hand-over of more specific object
16 years ago
apfelmaennchen 09acfa66d1 - improved "did you mean"
16 years ago
apfelmaennchen da6ce37f7b - fixed encoding problem
16 years ago
apfelmaennchen 54a48b4184 - added "did you mean" to search page
16 years ago
orbiter 550312ac85 added new command script to do a auto-Update from command line. this will make it easy to do mass-auto-updates in private yacy clusters
16 years ago
orbiter 0fc1168554 - reduced time-out for socket-connection communication from 20 seconds to 5 seconds. This is a test to find out if the time-out was a cause for problems in metager environments
16 years ago
orbiter 28b86385cd patch for bad behaving swf parser
16 years ago
orbiter d58b395993 fix for http://forum.yacy-websuche.de/viewtopic.php?p=15693#p15693
16 years ago
orbiter 733385cdd7 enahnced database access times by removal of unnecessary synchronization.
16 years ago
orbiter 398e210fef removed synchronization in logging that causes deadlocks in high-performance environments
16 years ago
orbiter db3a06dd81 removed cookie handling in httpc:
16 years ago
orbiter 1c54ae4a63 some small changes in HandleMap Testing
16 years ago
orbiter 2c5554c912 small enhancements in search result computation speed
16 years ago
orbiter e0b3984805 added navigation keys for site and author facets to remote search interface
16 years ago
orbiter 27fa6a66ad - completed the author navigation
16 years ago
orbiter a9a8b8d161 - added display of author navigation (usage of that navigator not yet implemented
16 years ago
orbiter c879783008 added steering of navigator computation:
16 years ago
orbiter c079b18ee7 - refactoring of IntegerHandleIndex and LongHandleIndex: both classes had been merged into the new HandleMap class, which handles (key<byte[]>,n-byte-long) pairs with arbitraty key and value length. This will be useful to get a memory-enhanced/minimized database table indexing.
16 years ago
orbiter bead0006da replaced tmp file extensions by prt
16 years ago
orbiter 3189f9cd39 fixed problem with DCEntry initialization
16 years ago
orbiter a704d82280 patch for problem with digest
16 years ago
orbiter 3029ef6eb3 fixed a bug that was recently inserted which caused that no idx and gap files were written.
16 years ago
orbiter b6e274f211 omit most of forced crawl delays by using a separat delay table which flushes delayed URLs at the correct time
16 years ago
orbiter d50be59088 - added a automatic re-construction of the domain stack after 10 minutes. this includes then urls to the domain stack that were left over in case of stack size limitations when the domain stack was created the last time
16 years ago
orbiter 5fdba0fa51 - fixed a not working selection rule in balancer
16 years ago
orbiter f5602404d5 another speed boost for the balancer
16 years ago
orbiter 95e8cbd1c3 new fully redesigned balancer and bugfixes regarding lost profile handles and killed crawls
16 years ago
orbiter c062385552 fix for http://forum.yacy-websuche.de/viewtopic.php?p=15555#p15555
16 years ago
orbiter 42ae40b9f6 some bugfixes to database close() methods
16 years ago
orbiter a0c53abbe1 - wait until local results are computed during search, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2167&hilit=&p=15521#p15521
16 years ago
orbiter 9bfd22f65d fix for http://forum.yacy-websuche.de/viewtopic.php?p=15523#p15523
16 years ago
orbiter 1c77db670f re-designed response format for navigation:
16 years ago
orbiter 15fad767c0 some refactoring of topic generation
16 years ago
orbiter cc49aedf12 - fixed problem with remote search NPE
16 years ago
f1ori 9e18abc2ac * fix charset detection, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2137
16 years ago
orbiter c38c852090 modified access method to get index entries out of a array of BLOBs:
16 years ago
orbiter ab06a6edd2 renamed topwords to topics and enhanced computation methods of topics
16 years ago
orbiter a5d481eab1 enhanced navigation
16 years ago
orbiter 7639ec2f38 - fixed letter case bug for dc record creation
16 years ago
orbiter 4522c13ee7 added option for a table prefix when importing phpbb3
16 years ago
orbiter 1c69d9b8b6 more refactoring of the index classes
16 years ago
orbiter 3d5f2ff544 - added new servlets to support search portal administrators for the integration of yacy search fields in their web pages
16 years ago
orbiter 4d4315687f fix for problem with concurrency in host navigator, bug reported by wsb
16 years ago
orbiter 88426912ad more refactoring to make the segment object easier to use and to be prepared to integrate author navigation
16 years ago
lotus d813fd26ed reset sent/received counters on index delete
16 years ago
orbiter 99bf0b8e41 refactoring of plasmaWordIndex:
16 years ago
orbiter 876746602d catch problems of file hash computation, see also:
16 years ago
orbiter fec6f9054f some refactoring of search methods
16 years ago
orbiter 3d4b826ca5 migration of all databases that use the deprecated BLOBTree format into the BLOBHeap format. Old databases are migrated automatically.
16 years ago
orbiter 4b4bddca00 added new submenu to crawler menu: import of phpbb3 forum postings from mysql
16 years ago
orbiter d8284046b0 enhanced speed of site navigation computation
16 years ago
orbiter c72a5cf326 added stub for PHPBB3 extraction code using direct access to mySQL
16 years ago
orbiter e735d3a69f fix for http://forum.yacy-websuche.de/viewtopic.php?p=15175#p15175
16 years ago
orbiter 63a0255166 - refactoring: added new content package, which will contain connector classes for different types of data sources to import texts into the YaCy index
16 years ago
orbiter f246928c20 first attempt to add 'real' Navigation to yacy search results: host navigation
16 years ago
orbiter 54b9e99c01 - more information about peer tags
16 years ago
orbiter 26a46b5521 increased default maximum file size for database files to 2GB
16 years ago
orbiter addecdb18c simplified code, removed one unused method in all implementing classes
16 years ago
borg-0300 47fce9020c small change (Orbiter's wish)
16 years ago
borg-0300 e07b14e5d7 finally a working fix for 5960
16 years ago
borg-0300 3ebb904d2c fix for 5960, http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2119
16 years ago
lotus 734680dc70 initialize the ResourceObsever in own thread
16 years ago
orbiter e005cfea37 fix for bug in -incell option of URLAnalysis
16 years ago
orbiter a7e392f31b The collection index will not be supported any more.
16 years ago
orbiter a2f48863fc - added prototype for navigation index
16 years ago
lotus 47fd226bdb proper parsing of sentences
16 years ago
orbiter 27eb8d62cb - new development cycle
16 years ago
orbiter b7457d3807 patch for http://forum.yacy-websuche.de/viewtopic.php?p=14720#p14720
16 years ago
orbiter bffbe43e09 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14522#p14522
16 years ago
orbiter f133d6065c fix for http://forum.yacy-websuche.de/viewtopic.php?p=14955#p14955
16 years ago
lotus 82af994041 added missing loglevel
16 years ago
orbiter ad9762746d no exception in case of uniq() time-out, see also
16 years ago
orbiter 1efe686e3f fix for http://forum.yacy-websuche.de/viewtopic.php?p=13960#p13960
16 years ago
lotus 13fb84ab81 you can define your default number of search results displayed by search.items
16 years ago
orbiter f2e4d156e8 removed debug messages
16 years ago
orbiter 709bfc2cd4 added a memory check in http post protocol
16 years ago
orbiter c01d6f43e1 - fixed problem with thread dump if no arguments are given
16 years ago
orbiter a49edd9415 fix for bug in search with site: constraint
16 years ago
orbiter c1e5fad9a7 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14767#p14767
16 years ago
orbiter 8ee3a94e82 fix for non-caching of sitehash, see http://forum.yacy-websuche.de/viewtopic.php?p=14440#p14440
16 years ago
borg-0300 21930d05ed fix for [B@...
16 years ago
orbiter b6ba387e01 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14751#p14751
16 years ago
orbiter 4338dcf936 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2093&hilit=
16 years ago
lotus bad7ce9286 experimental option trayIcon.force for unsupported platforms. java 1.6 needed
16 years ago
low012 ea27853c59 *) some refactoring
16 years ago
low012 d164b42604 *) cosmetics
16 years ago
orbiter 17150b2950 fixed bug in snippet computation
16 years ago
orbiter 89aeb318d3 enhanced the wikimedia dump import process
16 years ago
orbiter 5fb77116c6 added a submenu to index administration to import a wikimedia dump (i.e. a dump from wikipedia) into the YaCy index: see
16 years ago
hermens df733af4fa Try not to loose content from ram during IndexCell.delete by moving ram.delete after the dangerous operations on the array (array.get and array.delete)
16 years ago
hermens ac72005f2f Let IndexCell.remove remove entries from the ram portion of the DB as well.
16 years ago
orbiter 8ba7ff5353 a fix and another speed enhancement for the RWI cache
16 years ago
orbiter 05f077e85f added stack trace output to solve problem in
16 years ago
orbiter 71a4cadf31 better and more performant synchronization in SimpleARC, the caching object for word hashes. Speeds up indexing.
16 years ago
orbiter e6773cbb33 better handling of RWI cache for concurrency and less overhead when writing new entries -> even more indexing speed
16 years ago
orbiter c097531e3d added a catch Exception to all thread to check if any of them silently dies without any other notification
16 years ago
orbiter 083533e5ec fix for bugs in IODispatcher
16 years ago
orbiter 21fbca0410 better scaling of HEAP dump writer for small memory configurations;
16 years ago
orbiter 6e0b57284d better care for states of the IODispatcher
16 years ago
orbiter 1db9cdd4e4 fixed bug in writing of robots.txt entries in case that host names exceeded 64 characters and some other problems
16 years ago
f1ori bde88b684a * splitt off yacyRelease from yacyVersion
16 years ago
orbiter 057ce14c8e more fixes (character encoding, parser exceptions, http client failure, blob writing)
16 years ago
orbiter d2ac0aa682 - fixed possible bugs in Stack (may affect Crawler reset) and RandomAccess handling
16 years ago
lotus 1351d903a1 don't follow links like mailto:
16 years ago
orbiter e88a66bcae temporary disabling computation of all sublinks (check needed)
16 years ago
low012 ff5f82d780 *) removed description of removed commands from wikiHelp ([= =])
16 years ago
orbiter eacf95213a fix for crawling of mailto-links
16 years ago
orbiter 9c6ac43f66 fixes for wiki parser
16 years ago
orbiter 3a64c9d02f - fix for problem with concurrency when computing word hashes
16 years ago
orbiter d3f8aa5a2a set of small fixes
16 years ago
low012 78ffb61297 *) got rid of unnecessary variable which might also fix IndexOutOfBoundsException
16 years ago
orbiter d31e6f9c14 fix for http://forum.yacy-websuche.de/viewtopic.php?p=14457#p14457
16 years ago
orbiter 8d6212233b fix for IODispatcher
16 years ago
orbiter f678472f46 fix for quote problem in json output
16 years ago
orbiter d079d6dfdb small changes in surrogate reader, wiki code and portal test
16 years ago
orbiter 07f09742bb set of small fixes and comments
16 years ago
borg-0300 06ed4ef7b3 * better picture handling
16 years ago
orbiter 5a634cab23 removed generation of anchor link sets in document types that describe container formats.
16 years ago
low012 f1244264b8 *) hopefully fixed bug reported in http://forum.yacy-websuche.de/viewtopic.php?t=2057
16 years ago
orbiter 2e3186189b fix for mediawikiIndex surrogate producer + added concurrency
16 years ago
apfelmaennchen 6f5ea7b1a8 small fix for previous post
16 years ago
apfelmaennchen 138a0747e3 added serverObjects.putJSON as JSON has very particulare encoding requirements
16 years ago
orbiter d977dd9a96 fix for surrogate loader
16 years ago
orbiter 9cb68353da fix for bug in ProfilingGraph for ppm >> 10000 ppm (!)
16 years ago
orbiter 9e4db75aac reduced internal logging and reduced memory that internal logging can use
16 years ago
orbiter c10c257255 attempt to fix a deadlock situation where the IODispatcher did not work.
16 years ago
orbiter 09987e93fd fixed some more bad handling of byte[]
16 years ago
orbiter 1bcc1450cb more explaining error message in case of IOExceptions during html parsing
16 years ago
orbiter fe51f4d668 less synchronization may help to prevent deadlocks
16 years ago
orbiter 58802e4201 added missing success test in storeDocumentIndex,
16 years ago
orbiter 171e62bee5 addition to the fix from last commit (which did not work)
16 years ago
orbiter 059949a0d1 tried to fix problem with snippet fetch for second search page when verify=false
16 years ago
lotus b08991e278 moved some constants, rename of Tray class
16 years ago
orbiter 138422990a - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated
16 years ago
orbiter 1b9e532c87 some concurrency for wikipedia dump reader
16 years ago
lotus 25d2160288 small fix
16 years ago
orbiter 16baa7ad24 To translate a mediawiki dump into the YaCy surrogate format do the following:
16 years ago
orbiter 0b2c98edc9 some more work on the wikipedia-dump exporter (not finished yet)
16 years ago
orbiter 5195c94838 two patches for performance enhancements of the index handover process from documents to the index cache:
16 years ago
orbiter 9416f5c26f more speed test cases: kelondro provides map functions that are more than 20% faster than standard java classes and use less than halve of the memory of java classes:
16 years ago
orbiter b53790abb1 more performance hacks: 10% more speed for Base64.compare() which is really often used in YaCy code
16 years ago
orbiter 8ffb9889e1 some fixes and performance hacks
16 years ago
orbiter dfb96ecb72 more fixes
16 years ago
orbiter 1b8d346b4c fixes in connection with transiton to byte[] hashes
16 years ago
f1ori 0b0a46d35a * fix transferRWI as suggested by celle (thanks!)
16 years ago
orbiter 996572de95 quickfix
16 years ago
orbiter 380ed2dac0 performance and debugging additions
16 years ago
lotus 635b0a9da7 code-split
16 years ago
orbiter fa3adbbfc6 added domain checks to surrogate reader and RWI transfer receiver to prevent spaming using surrogates
16 years ago
f1ori 76af84d732 * add custom comparator to ScoreCluster for byte[]
16 years ago
lotus ab0030d7a7 allow dht-out for remote-crawl processing peers on default settings
16 years ago
low012 d1116c049f *) added new method "contains()" to Blacklist interface
16 years ago
f1ori 08445e42f0 * don't throw exception, in case of bad charset in http-header
16 years ago
f1ori 2f860a2564 * convert byte[] hashes to string for log output
16 years ago
f1ori d93a2a6552 * ignore whitespaces so you can copy&paste signatures better
16 years ago
orbiter fbcbcc5bdb export of yacy document objects as dublin core record in xml
16 years ago
orbiter d7cbf4cdd4 more performance hacks: less overhead in word hash computation
16 years ago
orbiter 29e96c1a60 bugfixes and performance hacks
16 years ago
orbiter 4e97a31009 corrections in dublin core syntax
16 years ago
f1ori 44daec7936 * introduce signatures to autoupdate
16 years ago
orbiter 538e375901 replaced old caching method for computed word hashes with a better method. The word hash computation is a new performance bottleneck (after the IO bottleneck was removed with the IndexCell data structure) and a better caching for word hashes was necessary.
16 years ago
orbiter 9e853e1977 partly reverting SVN 5818: identical comparator required for join operator
16 years ago
orbiter e16c25ddf7 (peak-) performance hacks
16 years ago
orbiter 63cd152969 fixes
16 years ago
orbiter 7dfe7e7cc6 fixed some problems with surrogate reader. This is now ready for testing.
16 years ago
orbiter 3a1364ed5c removed example lines from SurrogateReader sources; added additional example file
16 years ago
orbiter 9050a3c4c5 alpha version of surrogate reading and indexing.
16 years ago
orbiter b15b059c0d fix for latest commit
16 years ago
orbiter c8624903c6 full redesign of index access data model:
16 years ago
f1ori dd6b5005ff * fix missing charset handling in getpageinfo_p
16 years ago
orbiter bd5f4c78d8 - added default profile for surrogate indexing
16 years ago
orbiter ad78e3a59f - less lines in rssTerminal
16 years ago
orbiter bc80dc913a added new surrogate reader (surrogates are parsed documents on batches)
16 years ago
orbiter 12d81e98eb - fixed bad search results when searching for empty string
16 years ago
orbiter 8a24350036 - fix for join method with new generalized RWI data structure (caused by latest commit)
16 years ago
orbiter e58320a507 added more info in log fore debugging
16 years ago
orbiter 89ec3acb3e - full abstraction of index content type: the kelondro full text index may now also contain indexes about other content than text, i.e. navigation indexes or reverse linking indexes.
16 years ago
borg-0300 7a48090fcf - fix for "uk" language
16 years ago
orbiter dc2af61bc9 allow up to 50 results from remote peers
16 years ago
orbiter c0e8ed5461 fixed problem with not http client
16 years ago
orbiter 8862a2fed0 ups
16 years ago
orbiter de68948bc5 better handling of free memory computation and emrgency cache flush for index cell
16 years ago
f1ori fcb77c3140 * added .im (Isle of Man) to TLD-list
16 years ago
orbiter b81c7467d8 protection against too many files in RICELL in case of massive emergency dumps caused by low memory
16 years ago
orbiter d4d87d90c4 - extended experimental wikipedia dump parser
16 years ago
orbiter c3aff2521e fix for NPE
16 years ago
orbiter 57c00dd8c9 fix for bad filtering of common http error
16 years ago
orbiter 14361f1ca4 added log message for index generation in HeapReader
16 years ago
orbiter c08f9b36a4 refactoring of wiki parser.
16 years ago
orbiter 44e01afa5b - refactoring
16 years ago
orbiter 82fb60a720 increased memory limit for emergency cache flush
16 years ago
low012 9180617dd9 *) Classes to handle import of lists (especially blacklists) from XML files, not used yet, but will be used soon.
16 years ago
lotus 596e6215dc fix in case of white space in path name
16 years ago
orbiter b887f4a116 keep more free mem
16 years ago
orbiter c2359f20dd refactoring: better abstraction of reference and metadata prototypes.
16 years ago
orbiter ab656687d7 more strict BLOB initialization .. may also help to save some ram
16 years ago
orbiter 5b138ada16 fixes to web structure reference collection and url construction
16 years ago
orbiter a29a11e526 added evaluation of incoming links in webstructure api
16 years ago
orbiter f6691411b5 - migration of files from SplitTable (which are used for the URL-DB) to a different file name format.
16 years ago
shostakovich 1f37cc6107 Robots.txt is now reused after one day. See forum-topic:
16 years ago
orbiter f21a8c9e9c a different naming scheme for BLOBArray files. This may be necessary if blobs are written more often than once in a second.
16 years ago
orbiter 7ba078daa1 - added fast site-operator
16 years ago
orbiter b4126432bc hardening of index dump write process
16 years ago
orbiter 9bfb2641db - removed deprecated threads
16 years ago
orbiter 293290c317 fix for bad assert in last commit
16 years ago
orbiter bd409fb7ba added web structure analysis for a special domain that can be requested from the api.
16 years ago
orbiter b6c2167143 - patch for bad web structure dumps
16 years ago
orbiter 0139988c04 - added writing of temporary file names and renaming to final file name when index dump/merge are done. Interrupted merges can be cleaned up.
16 years ago
orbiter 3621aa96ab - added a memory protection for the IndexCell migration
16 years ago
orbiter 568e8f1741 fix in unmountBLOB
16 years ago
orbiter 9da69d6b68 - better selection of files to be merged
16 years ago
orbiter d39a5b42ca more care about open file handles. Now files also close on windows and can be deleted afterwards.
16 years ago
orbiter 029495e64d fixed bug introduced in SVN 5756 in EcoTable.put()
16 years ago
orbiter 587838bd09 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5758 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter d2e2420a68 - added another file selection method for index cell merge
16 years ago
orbiter 96eaecda3e - added migration class to go from index collections to the index cell data structure.
16 years ago
orbiter 0f0b4aec75 better index cell merge logic
16 years ago
orbiter 832fef670f migration of urls-files into subdirectory METADATA
16 years ago
orbiter fa07234d4e fix for clear method: now deletes files
16 years ago
lulabad df87e4dbf6 missing count of send Index and URLs
16 years ago
borg-0300 c450e3746b svn attributes added
16 years ago
orbiter 37f892b988 added new concurrent merger class for IndexCell RWI data
16 years ago
borg-0300 8c494afcfe svn attributes added
16 years ago
orbiter 67aaffc0a2 - added Latency control to the crawler:
16 years ago
orbiter 0926310461 another performance hack
16 years ago
orbiter ebe5d69d14 performance hacks
16 years ago
orbiter 61f9dbf0cc - fixed a display problem in watch crawler
16 years ago
orbiter b3f75e48fa - enhanced balancer: auto-solving of waiting-deadlocks
16 years ago
orbiter 9a90ea05e0 added a merge operation for IndexCell data structures
16 years ago
orbiter d99ff745aa fix for http://forum.yacy-websuche.de/viewtopic.php?p=13378#p13378
16 years ago
orbiter 0c3ab291c4 fix for http://forum.yacy-websuche.de/viewtopic.php?p=13354#p13354
16 years ago
orbiter a9cea419ef Integration of the new index data structure IndexCell
16 years ago
borg-0300 fd0976c0a7 refactoring
16 years ago
orbiter 83792d9233 more refactoring
16 years ago
borg-0300 ce79239322 "typo"
16 years ago
borg-0300 cdbdc731c5 small updates: unescape, isCGI
16 years ago
orbiter 474aac65af more refactoring
16 years ago
orbiter 209f25f5f5 refactoring to integrate indexCell data structures
16 years ago
borg-0300 359a238acf faster isCGI()
16 years ago
borg-0300 f75628e53b some corrections
16 years ago
orbiter b7138e5fcb even more efficient comparator calls (less System.arraycopy for primary keys)
16 years ago
orbiter 65784eb656 - more efficient comparator calls
16 years ago
orbiter 44874cb550 added a deleteOnExit for blob file deletion in case that a deletion is not successful.
16 years ago
orbiter 66f78d67e0 bad idea. Concurrency in index management will be done differently
16 years ago
orbiter 7dff1cba62 removed option to use different primary keys in kelondro tables
16 years ago
orbiter 7f67238f8b refactoring of plasmaWordIndex: less methods in the class, separated the index to CachedIndexCollection
16 years ago
orbiter 14a1c33823 refactoring of wordIndex class
16 years ago
orbiter d49238a637 more performance hacks: better default values for scaling, less memory usage
16 years ago
orbiter 39644dc14e performance hacks to compare methods in database core
16 years ago
orbiter e2e7949feb replaced old PPM computation with a better one that simply sums up events that had been stored in the profiling table.
16 years ago
orbiter f6d989aa04 added new class RowSetArray which arranges RowSet objects like Elements in a hashtable, but still provides the functionality of sorted enumeration. The new class is now integrated into the ObjectIndexCache, which is the core class to provide index functions to all database files. The new index access is about twice as fast as before. This has strong speed enhancement effects on all parts of YaCy.
16 years ago
borg-0300 0a2fabeef3 static TMPDIR
16 years ago
lotus 9f7e62e900 refactoring
16 years ago
lotus f35dc11dc4 allow crawl start from pages with script tags
16 years ago
orbiter 6958eff196 removed unnecessary exceptions, extended testing in IntegerHandleIndex
16 years ago
orbiter 13c666adef performance hack to ObjectIndex put() method:
16 years ago
orbiter 1f1be1518c added stub for another performance hack: concurrent indexes
16 years ago
orbiter 3e4c28e188 enhanced count feature for kelondroRowSet. This is about twice as fast as before. Should speed up the collection analysis (half time!)
16 years ago
orbiter 84e37387a2 fix for last commit and more testing stub
16 years ago
orbiter ca006c506d stub for performance enhancements for RowSet (no functional change yet)
16 years ago
orbiter d988204875 better shutdown of tools
16 years ago
orbiter 100247bdda added also an export and delete-feature to the URLAnalysis. This completes the clean-up feature for URLs. To do a complete clean-up of the url database, start the following:
16 years ago
hermens 8c60d6d117 In DHT selection delete only those references that were actually selected
16 years ago
orbiter 60078cf322 added next tool for url analysis: check for references, that occur in the URL-DB but not in the RICOLLECTIONS
16 years ago
orbiter b1ddc4a83f do not merge collections if ram == false
16 years ago
orbiter dbdd10da84 better logging and startup behaviour for referenceHash computation
16 years ago
orbiter d64836c34f added statistical analysis of URL reference
16 years ago
orbiter 3b28daab40 code-beautification (to be consistent with external documentation paper)
16 years ago
orbiter 485c9406e5 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1915&hilit=&p=13249#p13249
16 years ago
orbiter 858f800a07 more logging in httpd to detect shutdown cause. See also:
16 years ago
orbiter b80db04667 - refactoring of IntegerHandleIndex and LongHandleIndex (better method names)
16 years ago
lotus 8ee946bf1d show upnp status
16 years ago
orbiter 16f5c6a85e fixed merge method initialization in ReferenceContainer
16 years ago
orbiter d7a493b4f5 added experimental timeline api
16 years ago
orbiter efcd95dc37 simplification of (internal) query process / refactoring
16 years ago
orbiter f1b712c29a small corrections to image loading methods in result presentation
16 years ago
orbiter d4b56d5819 added more asserts to BLOBHeap.flushBuffer() to fix the problem described in
16 years ago
f1ori c545fcb9fa * add class to handle keys and signatures
16 years ago
orbiter aa44d9bad9 more refactoring of kelondro.text / deleted de.anomic.index
16 years ago
orbiter 6ffc6e3389 more refactoring of indexer and kelondro classes;
16 years ago
orbiter 404bc21da9 simplification of (internal) query process / refactoring
16 years ago
orbiter 76ef5f0f14 refactoring of index package: better names for the classes (to be continued)
16 years ago
orbiter 2df57b1fd1 refactoring of index collection class
16 years ago
lotus 39a177649b * added upnp listener for devices that do not respond to discovery but advertise themselves
16 years ago
orbiter d1d9fbae5c enabling the URLAnalysis to operate on multime input files, just use a wild card when calling the class from the command line
16 years ago
orbiter c728879ab8 fixes to yacyURL - more exceptions in case that urls are strange
16 years ago
orbiter 7542336ae5 performance enhancement to yacyURL: omit second processing of resolveBackpath. This method is already applied during initialization of the object and was called a second time when the url was exportet.
16 years ago
orbiter 7ea53fe47b added another url list transformation option:
16 years ago
orbiter e521e81148 bugfix in yacyURL (for latest performance hack)
16 years ago
orbiter 54625360f7 performance update
16 years ago
orbiter d884c4718a added gzip support for URLAnalysis:
16 years ago
orbiter 46632f4385 performance update to yacyURL
16 years ago
orbiter cf9b74e6e3 added another method to process url lists: extract hosts only
16 years ago
orbiter 89d8e824ed memory protection for URLAnalysis
16 years ago
orbiter 0f6fa804ff performance update to URLAnalysis
16 years ago
orbiter 8444357291 added new row interator in kelondro tables files that enumerates rows
16 years ago
orbiter e8f5f2f612 added tool to analyse url strings
16 years ago
lotus 6117e083e5 option to customize tray label (tooltip) with tray.label
16 years ago
orbiter b8c3803bfc don't panic when canceling server sessions
16 years ago
orbiter de714783b1 - added host, path, filename to search result
16 years ago
lotus 9519d84372 changed "dooble" variable to "browserintegration" to be less specific
16 years ago
lotus 8429083972 adjusted tray for dooble:
16 years ago
orbiter ef62ec635e removed overwriting of logging config
16 years ago
orbiter c852d2d70e - reject too old seeds
16 years ago
orbiter aca973e2d9 catch more exceptions
16 years ago
orbiter 9559bc23fd automatic clean-up of dead connections
16 years ago
hermens 02dfd6183b Fix logging in serverCore
16 years ago
hermens d30456e2c8 Fix logging in serverCore
16 years ago
orbiter 4f9dae2571 remove reference in crawl entries
16 years ago
orbiter 1ba4301920 automated interruption of dead incoming connections, if they are there for more than one minute
16 years ago
orbiter c12bb8a6d0 - refactoring of the http client
16 years ago
orbiter 5d3983faae the soLinger parameter was wrong.
16 years ago
orbiter 62505bb3cb more bugfixes as recommendet by findbugs
16 years ago
orbiter 6b450d09ca some fixes recommended by findbugs
16 years ago
orbiter 4db80065ac select more
16 years ago
orbiter 94c42691d8 - reject less transmissions as transmission receiver
16 years ago
orbiter f887fc159f try to reduce the large number of unclosed incoming connections
16 years ago
orbiter e04a0e05c3 fix for last commit
16 years ago
orbiter a9ad863686 second part of 'doubles' fix - better handling of doubles in RAMIndex. More logging.
16 years ago
orbiter 59427064fb first part of 'doubles' fix (not fully ready yet)
16 years ago
orbiter 26978b2a25 - better memory protection in kelondro caches: computation of needed memory for cache grow
16 years ago
lotus e9e2fff47a better scaling on performance graph
16 years ago
lotus 4aad461100 added UPnP support
16 years ago
orbiter 99b9788e54 fix for possible 100% CPU caused by concurrent access of HashMap
16 years ago
orbiter be0c492ae5 fix for memory leak bug in new dht transmissions
16 years ago
hermens 2173865f92 Prevent race condition when switching timezones.
16 years ago
orbiter 40d9849aa4 - better control of chunk size in dht selection
16 years ago
orbiter 30a1de41b3 disabled the BufferedIOChunks, because I consider it as broken.
16 years ago
orbiter 411f2212f2 more memory leak fixing hacks
16 years ago
orbiter 985d421f91 found and fixed some memory leaks
16 years ago
orbiter 333489420b - fix for NPE when loading the cytag image
16 years ago
orbiter 6a32193916 - refactoring of cache naming in web index cache (no more dht semantics there)
16 years ago
orbiter 6c627dbdff update to the server core
16 years ago
orbiter 5393f356aa fix for termination problem
16 years ago
orbiter 6a876ecb88 first fixes to the DHT transmission process
16 years ago
orbiter c25c334b75 replaced old DHT transmission method with new method. Many things have changed! some of them:
16 years ago
orbiter e9a4182e6a using a concurrent hash map for the template cache
16 years ago
lotus e8ae2599fd * some refactoring/moves to consoleInterface
16 years ago
orbiter 01b97ef3f8 added new cybertag-tracking feature that was inspired by itgrl
16 years ago
borg-0300 b19bc611b0 gc: better logging
16 years ago
orbiter b1f9c00118 fix for bug in merge operator initialization
16 years ago
orbiter b57c9da1f8 - fixes to doc, ppt, xls parser: better title
16 years ago
f1ori 7936e58fe7 * sorry,previous version didn't compile
16 years ago
f1ori 76cdc59789 * added some convertions to and from UTF-8
16 years ago
orbiter 65a1de6c05 longer timeout for remote crawl queries
16 years ago
orbiter 072dd01ac8 more logging for RSS parser (to fix the remote crawl problem)
16 years ago
orbiter 9d282d2c16 - renamed interactivesearch to yacyinteractive
16 years ago
orbiter 180fe81ef7 quick hack to copy new log configuration over old one
16 years ago
orbiter d3e33fd6c1 removed strange retry logic from DHT transfer
16 years ago
orbiter db510b5d52 more exception logging
16 years ago
orbiter ef82cced01 removed default line 'P2P WEB SEARCH' if no line is given
16 years ago
orbiter 78b7361937 fixed problem with logging
16 years ago
low012 f136ddcfd4 *) this change is supposed to prevent the creation of temporary files by Apache Commons Fileupload library in cases where it is not necessary (as proposed by thq in http://forum.yacy-websuche.de/viewtopic.php?f=8&t=1806)
16 years ago
orbiter 94110df85a moved logging partially to kelondro
16 years ago
orbiter 024da2916b refactoring of logging
16 years ago
orbiter 83ce65707a (almost) completed partition of classes in kelondro
16 years ago
orbiter 7ee494fde5 more refactoring of kelondro:
16 years ago
lotus d4281b78da dynamic memory scale
16 years ago
orbiter bf93767ec6 refactoring of kelondro database classes
16 years ago
orbiter fc27bf8c4c refactoring of kelondro classes:
16 years ago
orbiter fe77fc3d62 - added new property setting 'repositoryPath'
16 years ago
orbiter 6cbca1e508 extended last fix, preventing more sorts
16 years ago
orbiter f9672d3f97 applied fix for inefficient put method as recommended by celle, see
16 years ago
apfelmaennchen 3484e55be4 - small fix for bookmarksDB
16 years ago
apfelmaennchen 6dd52422ea - added two dialogs to manage bookmark tags in YaCy-UI
16 years ago
orbiter 3154926311 some better memory protection and OOM prevention in EcoFS
16 years ago
f1ori aaafe05c02 * revert debug change
16 years ago
f1ori 5570fa817b * remove & from openBrowser command (see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1728&hilit=#p12321)
16 years ago
orbiter 419469ac27 added more methods to control the vertical DHT (not yet active .. )
16 years ago
lotus 4ef6b15eb8 limit -Xmx setting to 1999m on win32. bigger values would never work.
16 years ago
orbiter dedfc7df7f removed distinction between DHT-in and DHT-out. This is necessary to make room for the new cell data structure, which cannot use this this distinction in the first place, but will enable the same meaning with different mechanisms (segments, later)
16 years ago
orbiter b74159feb8 preparations to integrate the new 'cell' index data structure
16 years ago
orbiter d399444e49 added debug information to class loader
16 years ago
orbiter 5080fc33bf fix for http://forum.yacy-websuche.de/viewtopic.php?p=12247#p12247
16 years ago
orbiter 335d6ce8fc fix for class loading problem
16 years ago
low012 78778df464 *) this should adjust the Dev/Main detection of the updater to the new version numbers (0.7x is Dev, if x != 0)
16 years ago
orbiter b423d0a036 moved all servlets from htroot/xml to htroot/api
16 years ago
orbiter 91af105373 last changes before release
16 years ago
f1ori 7eade3f181 * fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1728
16 years ago
orbiter d1bace5e4d enhanced cleanup function
16 years ago
orbiter cb76d9e0e4 more synchronized in BLOBHeap (will not fix problem with Runtime-Error as reported in forum)
16 years ago
orbiter ff41da613e removed exception printout during load of snippets
16 years ago
orbiter 814a28775f removed thread dump writing in case of invocation target exception in httpd (looked bad, not serious)
16 years ago
orbiter bed38a5f8c fix for uncaught exception in RSSReader
16 years ago
orbiter 05c235de32 fix for npe
16 years ago
low012 7608944081 *) bugfix for REMOTE_HOST environment variable in CGI code (shows hostname of client instead of hostname of YaCy peer now)
16 years ago
orbiter a6b29cf72c reverted change of search event processing in SVN 5460. The new code did not work properly,
16 years ago
orbiter 9ef77d57f5 added an access control to the search interface using white/blacklists:
16 years ago
orbiter efe801173c better dht-in cache flush. see also:
16 years ago
orbiter 941ab78d9b better termination for blocking threads
16 years ago
apfelmaennchen 3dc208fad0 bugfix: bookmarks can now handle folder names like /news and /newspaper without getting confused...
16 years ago
orbiter e948df68ac longer timeout for queues during shutdown
16 years ago
orbiter 2b32248079 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1516&p=10545#p10545
16 years ago
low012 c1330f5743 *) added environment variable DOCUMENT_ROOT
16 years ago
low012 f26b8fcb1b *) comment mode is 'moderated' instead of 'activated' by default now (to avoid spam being visible)
16 years ago
orbiter b2a8c653ee small fixes
16 years ago
orbiter f675d47f86 better protection against database failures
16 years ago
orbiter 4f45605f04 small update for timing in search result processing
16 years ago
orbiter 9d119c6b61 migration of auto-update rules to new release strategy:
16 years ago
orbiter 4d5b401f00 try to fix some performance problems with the internal index management:
16 years ago
borg-0300 a0605325bb fixed a NullPointer Exception
16 years ago
orbiter b2b7edae18 fixed interactive search
16 years ago
lotus 2be119f0df adjusted big peer to 28M links
16 years ago
orbiter c6880ce28b removed the permanent cache flush and replaced it with a periodic cache flush
16 years ago
orbiter ef7fe537c5 fixed a cache-bug in cachedFileRA
16 years ago
orbiter 6c7e83909b - refactoring of data access methods to be prepared for new cell data structure
16 years ago
lotus c8451614f3 fix for overflow
16 years ago
orbiter c4c4c223b9 fixed a problem with attribute flags on RWI entries that prevented proper selection of index-of constraint
16 years ago
orbiter 6072831235 no cr transmission for robinson peers
16 years ago
low012 afe98bc11c *) added changes as proposed by Halborinda in http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1674
16 years ago
orbiter 07fc115e90 removed active profiling in kelondroRowSet
16 years ago
orbiter be4c458951 refactoring (implemented Iterable in kelondroRowCollection)
16 years ago
low012 bb5c2cd12e *) ISINDEX parameters will not be put on commandline anymore to prevent possible security hazards (better safe than sorry). Parmeters will have to be read from QUERY_STRING in ISINDEX case too which does not seem to be uncommon behaviour for web servers: http://vms.pdv-systeme.de/users/martinv/cgi_basics/cgi_basics.html#Datenuebergabe
16 years ago
orbiter b6bba18c37 replaced the storing procedure for the index ram cache with a method that generates BLOBHeap-compatible dumps
16 years ago
low012 db1cfae3e7 *) cleaning up after myself
16 years ago
low012 f547f9a78c *) added CGI capabilities (run Perl scripts and other software via HTTP GET and POST)
16 years ago
f1ori bdc380cd84 * add lastModified to templateCache
16 years ago
f1ori 025094675f * remove empty directory
16 years ago
f1ori c5691180cb * skip style-tags in HTML-files
16 years ago
orbiter 3567c58b18 added another filed information for BLOBHeap dumps: the gaps
16 years ago
orbiter abdd4aa414 added a index dump for blob heaps:
16 years ago
orbiter 8c3205b62e fix for OOB Exception
16 years ago
orbiter 78c568331e added test channel to /xml/feed.rss
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
f1ori 2d2ce24011 * remove all encoding-stuff from proxy
16 years ago
f1ori 73c8a0839c * abort download, when proxy connection is closed
16 years ago
orbiter bb935fdbb0 less organization overhead for DNS caching and prefetching
16 years ago
f1ori 4907697cfa * make fileuploads through proxy bigger than 65500 bytes possible
16 years ago
orbiter fc8189f3fb better self-healing of corrupted databases
16 years ago
f1ori 963da8c3f9 * updated tm-extractors to new version 1.0
16 years ago
orbiter e34ac22fbd - added new monitoring servlet at
16 years ago
lotus 449e697436 fix for null-seed in seedfile
16 years ago
orbiter d376d81fc4 replaced busy thread control of crawl stacker by blocking threads
16 years ago
orbiter f29b48d9ff patch for IndexOutOfBoundsException
16 years ago
f1ori 0881190b19 * Robots.txt: don't interpret Crawl-Delays for other robots
16 years ago
orbiter 243e73f53b removed unnecessary usage of kelondroBLOBTree
16 years ago
orbiter 8cb7170b75 - set status of kelondroTree, kelondroBLOBTree and kelondroFlexTable to deprecated
16 years ago
orbiter 7535fd7447 - refactoring of CrawlEntry and CrawlStacker
16 years ago
lotus 18513e2ee2 npe fix: http://forum.yacy-websuche.de/viewtopic.php?t=1646
16 years ago
orbiter 2802138787 - refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck)
16 years ago
orbiter db6b3bf5a3 speed enhancement for integrated http server:
16 years ago
orbiter 7cd08bd5fb fix for NPE in BLOBCompressor
16 years ago
orbiter 5b94498643 fine-tuning of cache usage from SVN 5386 and a bug fix for overflow in available() method
16 years ago
orbiter 1779c3c507 - added a read cache to the RAFile interface to RandomAccessFile
16 years ago
orbiter e1acdb952c fix for problem with userDB and bookmarksDB which was caused by changes in kelondroRA in SVN 5376
16 years ago
orbiter 4a2dac659e more speed hacks:
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
orbiter 759cef23dd fix for bug in kelondroAbstractRA.readFully
16 years ago
orbiter d39d420b39 performance hacks
16 years ago
orbiter 0b4808ba3d added new interactive search feature:
16 years ago
orbiter 74a3d86114 fixed a error response that might present classified information
16 years ago
orbiter c6525ab75f fix for NPE in seed handling
16 years ago
lotus 1951d30a62 addendum to last commit
16 years ago
lotus 325ba7bfb8 only query words with length > 2
16 years ago
low012 e423fa9846 *) added method to only get file names in directory listing which match a filter
16 years ago
orbiter 513179f404 changed interface to colletctionIndex and adopted all implementing classes:
16 years ago
orbiter 9d64693cfb reverting again the changes to new concurrent chunkIterator
16 years ago
orbiter 45ad1c3dd5 - re-activated concurrent iterator for EcoFiles
16 years ago
orbiter 2e2120046f speed enhancement for BLOBHeap opening process
16 years ago
orbiter fa26a8f25a fix for deadlock-like behavior in balancer
16 years ago
orbiter 1918a0173e added more exception handling during crawling
16 years ago
orbiter 10f5ec1040 reverted last commit (more testing needed)
16 years ago
f1ori 5af8923f37 * distribute forgotten jar-file in parser
16 years ago
orbiter b0f2003792 fast database initialization and fast start.up of yacy:
16 years ago
orbiter 0ca4bc7b79 - added reader and visualization for mediawiki-export files:
16 years ago
danielr 2e63f03ca5 copy&paste vergessen :/
16 years ago
danielr cd8082b4e3 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1111#p11166
16 years ago
lotus 4f996a7651 fix for logparser pattern
16 years ago
f1ori d18c18971e * dirlisting in UTF-8 encoding
16 years ago
orbiter 867d0f2f56 removed some unnecessary pause delays
16 years ago
f1ori d49ffcd818 * files distributed by yacy are utf-8, files from repository use the system default charset
16 years ago
orbiter 8c96bc2ac1 do not use proxy caching rules for crawling
16 years ago
orbiter dba7ef5144 extended crawling constraints:
16 years ago
orbiter 96174b2b56 more debugging / better result status logging for parser/caching errors
16 years ago
f1ori 90e78b2cf6 * improve encoding detection of http service
16 years ago
orbiter ef66438662 - more space in error db to store larger error messages
16 years ago
orbiter 674ad2d55b different handling of error cases that occur during loading files with http or ftp:
16 years ago
danielr 538359a0ff simple fix to get DHT working again (maybe something more has to be done ;)
16 years ago
f1ori 7e1fe05e3c * added utf8-encoding to many getBytes-calls
16 years ago
lotus fad044fb54 update to snippet marker:
16 years ago
lotus 16723d0fa6 ask another peer if crawljob loading fails
16 years ago
orbiter 1b18d4bcf3 enhancement to crawling and remote crawling:
16 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target - computing methods
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter dd27ce7216 added control logic to ECO tables that deletes ram copies of the tables if they get too large
16 years ago
orbiter 38e6ba5d00 forgot to re-rename commonsPath
16 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off
16 years ago
f1ori 4b4ce75396 * http-server: submit charset from html metatags
16 years ago
f1ori 69e695bd4b * detect charset for directory index
16 years ago
f1ori 340ecd919d * include non ascii characters in visible characters
16 years ago
lotus 5cf0cbb47e javadoc
16 years ago
lotus 8d07607d1d update to resource observer:
16 years ago
f1ori d0543a7c39 * fix the debug ant-target
17 years ago
low012 baae3d91b1 *) fixed warning when compiling listManager
17 years ago
danielr a4fb76e93c undo r5300 (not fixed as seen after longer run)
17 years ago
low012 a99a629ed4 *) quick fix to prevent comments for blog entries which don't exist (http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1554)
17 years ago
low012 00e27e5050 *) fixed bug which made it possible to write files outside of the DATA/LIST directory when creating a new blacklist
17 years ago
danielr 0f9c0bd0d5 fix for ConcurrentModificationException at de.anomic.index.indexContainerHeap$heapCacheIterator.next(indexContainerHeap.java:324)
17 years ago
danielr 103ad2a437 some javadoc
17 years ago
orbiter b098522977 some very small advances to index utf-8 (not working yet), inserted also debugging code
17 years ago
orbiter 2f49666908 integrated the character decoding into the parser, removed old code
17 years ago
orbiter 49293c1358 fix for deadlock in new encoder :-(
17 years ago
orbiter 0edec2b760 FULL redesign of algorithms in htmlTools to encode/decode strings from/to unicode and html.
17 years ago
orbiter 958ec20cd0 removed specialized umlaute-handling in html parser. This has to be replaced by something that is able to transfer all possible html encodings into utf-8. Please see SVN 5293 for test cases.
17 years ago
f1ori 2e53cbc66a should compile now
17 years ago
f1ori f3bf2e379e should compile again
17 years ago
f1ori dd8441f102 fix bug: data from plasmaParser is allready converted to UTF-8
17 years ago
orbiter 6941bf42b1 performance hacks
17 years ago
orbiter 9b0c4b1063 redesign of parts of the new BLOB buffer
17 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer
17 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage:
17 years ago
orbiter 382226da94 fix for bug introduced in SVN 5281: parameters were switched
17 years ago
danielr f2fd043797 refactoring (moved duplicate code into methods)
17 years ago
danielr c612046e5e r5278 java 1.5 compatible
17 years ago
f1ori af71ec93bf ops, forgot to import something
17 years ago
f1ori 9e65e9141c * always use UTF-8 for encoding hashes
17 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache:
17 years ago
danielr f095137238 - respecting httpdMaxBusySessions (refusing new connections if limit is hit)
17 years ago
orbiter 8ba33f104e fix for npe
17 years ago
orbiter 998861acfd - some refactoring in BLOBHeap to enable more gap processing functions
17 years ago
lotus 9d50bfd0b3 fix for npe: http://forum.yacy-websuche.de/viewtopic.php?p=10562
17 years ago
orbiter 766cad6e93 enhancement in memory management of BLOB Heap files / merging of deleted entries
17 years ago
orbiter 7860d5d632 fix for bug in seed list management (cause was bad class overloading, only visual effects!)
17 years ago
orbiter ffed5fc415 fixed problem with lost peers in database
17 years ago
orbiter 6fb865fbdc - fix of bug in iterator in kelondroBLOBHeap which caused bug in crawl profile listing
17 years ago
orbiter 2d65887723 - fix for bug in new profile handling
17 years ago
orbiter ff68f394dd fix for problem with balancer and lost crawl profiles:
17 years ago
lotus fb8d9850ea fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1462
17 years ago
lotus 0d1a2f6183 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1461
17 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions
17 years ago
orbiter 820a03f9d6 - removed some warnings
17 years ago
lotus fe2792e9ce use accept-language header instead of user agent for language detection
17 years ago
orbiter c8bdd965ec - larger update time for status page
17 years ago
lotus dda771db9d - search result layout
17 years ago
orbiter ce4715e305 removed indexing of anchor links and tagging such words as part of urls (that was wrong)
17 years ago
orbiter ce57de6cb3 - fixed re-setting of DHT Send/Receive settings
17 years ago
lotus 31c31e54e4 new tray icon image for different icon sizes (e.g. linux)
17 years ago
f1ori 9589dfe080 * removed trayicon popupmenu title
17 years ago
lotus 5a637f004d localized tray
17 years ago
lotus 9d4f0325e1 - removed shutdown from search page (we have it in tray now!)
17 years ago
lotus 214277dad6 - revert r5202
17 years ago
f1ori 7afa084207 * add nativ java trayicon, using reflections
17 years ago
apfelmaennchen b97ff24b43 bookmarksDB / xbel.xml:
17 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch
17 years ago
lotus 0a0cc3bf67 added missing classes to build target "run"
17 years ago
orbiter 7b35d54c6c fixed some problems with network switching (was not completely 'clean')
17 years ago
orbiter f0b42e5a98 fixed NPE
17 years ago
orbiter 8e0de7f180 update to language statistic evaluation:
17 years ago
orbiter 1198eeecc7 added language selection to search query:
17 years ago