Commit Graph

936 Commits (62ad1476ac0eb02b20bb7ae4d5604fd65ec1c3dd)

Author SHA1 Message Date
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
19 years ago
orbiter 94d7ced900 fix for last ranking commit
19 years ago
orbiter 03835c2ee8 enhanced search result computation
19 years ago
orbiter ac3419b65f better debugging for indexOutOfBoundException bug
19 years ago
orbiter a8bc768206 enhancements to ranking evaluation
19 years ago
theli 33898ae7e9 *) ResourceInfoFactory.java: Bugfix for classNotFoundException
19 years ago
theli 406e170e25 *) more verbose error message
19 years ago
theli b298474e22 *) Bugfix needed because of changed plasmaCrawlLURL.load behavior
19 years ago
orbiter 96c6e4e322 - enhancements to detailed search page
19 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs
19 years ago
theli a5ed86105b *) bugfix for handling of ResourceInfo object in proxy
19 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
19 years ago
hermens 7aeadbe7cc another NullPointerException in http.ResourceInfo
19 years ago
orbiter 141f9e5bb4 fix for new plasmaCrawlLURL.load behavior
19 years ago
hermens 087f7511f8 prevent NullPointerException in http.ResourceInfo
19 years ago
orbiter a2525072f2 bugfix for kelondroRow - property generation
19 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling
19 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file
19 years ago
theli 4ae0f122f8 *) ResourceInfo.java: License header added
19 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added
19 years ago
orbiter 4866868c0e added write cache for LURLs
19 years ago
orbiter 8a0e35618b enhancements to search result preparation
19 years ago
theli 5c1bb53d2a Missing description for last commit
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem
19 years ago
theli 7930839594 *) URL.java: userinfo was not taken over when generating a new url from a base url and a rel. path
19 years ago
theli 7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
19 years ago
theli ffbf416e76 *) direct access to requestheader of htCache.Entry removed to make it more http independent
19 years ago
theli 3870d615e3 *) setting htCache.Entry fields to private
19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
19 years ago
theli ab5a9bee66 *) adding some copyright headers
19 years ago
theli 5847492537 *) next step of restructuring for new crawlers
19 years ago
theli fce9e7741b *) next step of restructuring for new crawlers
19 years ago
theli e3f0136606 *) next step of restructuring for new crawlers
19 years ago
theli 9ded4e8d5a *) Bugfix for name resolution in proxy mode
19 years ago
theli 1c8300fcec *) Bugfix for name resolution in proxy mode
19 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers
19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
19 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
19 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
19 years ago
theli b4acbdaa97 *) better handling of server shutdown
19 years ago
theli f3ac4dbbb9 *) better handling of server shutdown
19 years ago
theli 959b779aba *) avoid performance loss if log level is greater than 'fine'
19 years ago
orbiter 18b6876860 new cache flush configuration settings
19 years ago
hermens f0278b4092 Bugfix for / by zero when the AssortmentCluster is empty
19 years ago
orbiter 14e0bb0dcf allow more references per word for new db
19 years ago
orbiter 985dcbde7f changed some parameters that may cause better memory usage and more indexing speed
19 years ago
orbiter b7f4a1521b added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL
19 years ago
orbiter c26da4893b turned back NURL usage of kelondroTree, kelondroFlexTable has still problems with deleted entries
19 years ago
orbiter db1eae0227 * simplified initialization of database objects
19 years ago
hermens 0b73f2b132 Repair DNS prefetch during cacheScan
19 years ago
orbiter 27a159b401 * documentation update
19 years ago
theli f80f776b89 *) Trying to solve NullpointerException problem in function addURLtoErrorDB
19 years ago
hydrox 1c99b5a484 *)fixed logging for urldbcleanup
19 years ago
orbiter 8f3f4ab0eb enhanced synchronisation in plasmaWordIndex
19 years ago
orbiter 23dd972608 fixed memory calculation in performanceMemory web page
19 years ago
orbiter 1ce3c22761 better memory control:
19 years ago
orbiter 39b4c26bdc more memory control:
19 years ago
orbiter 3e9d509c39 some small fixes
19 years ago
orbiter eb633c0a4f server threads must now supply a method that can be called in case
19 years ago
orbiter f5720cb2fa removed most synchronization in wordIndex (for testing)
19 years ago
orbiter 0187c60010 because of a bug in the JRE 1.4.2 there was no memory protection
19 years ago
orbiter cfb51fdef1 less synchronization in plasmaWordIndex
19 years ago
orbiter d6a928c2da quickfix for http://www.yacy-forum.de/viewtopic.php?t=2705
19 years ago
orbiter 6ad471ef96 * applied many compiler warning recommendations
19 years ago
hydrox 9da3aa74d3 silly me, fix for the fix as advised by theli
19 years ago
hydrox bb3d9a5582 *) e.getMessage().indexOf() can only be used if there is actually an ExceptionMessage.
19 years ago
hydrox 7a54010a9c *) Iterators can't be casted to IndexContainer
19 years ago
orbiter cd5f7e137c fixed problem with NURL-generation upon first startup
19 years ago
orbiter 8418af141a added several consistency checks and small changes
19 years ago
theli 9d13aeca13 *) removing class. does not work so far
19 years ago
theli 95a84ae469 *) adding missing classes
19 years ago
theli eee44be602 *) adding an interface for customized blacklist classes
19 years ago
orbiter 6d2f15971a there is a very strange error that causes that the kelondroRecords structure
19 years ago
theli d2e8e76218 *) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler
19 years ago
orbiter 9ae9062bd3 * disabled new kelondroFlex table for NURLs
19 years ago
orbiter 689bbcf9cd replaced kelondroTree db for NURLs by new kelondroFlexTable
19 years ago
orbiter 7fbba41962 synchronization fixes
19 years ago
orbiter 328f9859a5 more synchronization in plasmaWordIndex
19 years ago
orbiter 130e6d4719 generalized index object for eurl, nurl and lurl to prepare move
19 years ago
orbiter acdf24877f more synchronization against outOfMemoryError in wordIndex
19 years ago
orbiter 95160d7f2c fixed size computation of index elements from the collection index
19 years ago
orbiter 26116cabde added missing rowdef assignment
19 years ago
orbiter abf22f6e60 removed url normalform computation from htmlFilterContentScraper.
19 years ago
orbiter 740d49751d * strict type and size check in kelondroRow handling
19 years ago
orbiter 314021453f * more logging
19 years ago
orbiter 61b151b083 * added another auto-fix for collection index inconsitency check
19 years ago
orbiter f58283def2 better control of index flush
19 years ago
orbiter 4be21a3cab ups
19 years ago
orbiter 80b6c90d54 enhancements to prevent blocking during dht transfer receive
19 years ago
theli 9f298083cd *) adding more urls to the error url
19 years ago
hermens d56f06401e - Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible
19 years ago
theli c09f734d06 *) offer router configuration on ConfigBasic.html
19 years ago
hermens dcbb4d0a6b Display the size of HashBlacklistedCache on PerformanceMemory page.
19 years ago
orbiter d799622da1 better flush limit for index collections
19 years ago
orbiter 279b1d969d Integrated new indexing data structure 'collections' into the main class
19 years ago
orbiter 4ff742e42d implemented indexCollectionRI
19 years ago
orbiter 01f95eccd3 re-write of kelondroCollectionIndex. This is the data structure that
19 years ago
orbiter ebc2233092 * implemented (finished) class indexRowSetContainer
19 years ago
orbiter 9183d21f25 renamed new index class to old name
19 years ago
orbiter c4e922885a replaced indexURLEntry by new class that uses a kelondroRow.Entry object
19 years ago
orbiter e357599f92 * fixed problem with indexContainer iteration from RAM:
19 years ago
orbiter 8b77afd72c some fixes to new container merger
19 years ago
orbiter 417ed5102e redesign of database iterators:
19 years ago
orbiter ad692fc6c7 implemented option to extract nurls from the database
19 years ago
orbiter 7fd90ca7c8 * strict handling of NURL entry element generation, storage and stacking
19 years ago
orbiter 5f72be2a95 some redesign of EURL storage
19 years ago
orbiter 1ed3e2daef added option to extract domains and/or urls from the eurl database
19 years ago
orbiter 58df8b7bbf a large collection of different changes
19 years ago
orbiter e4f1820b58 protection against too long authentication strings in switchboard
19 years ago
theli b3c569f706 *) renaming of function getTransferedEntitySpeed to getTransferedEntrySpeed to avoid confusion
19 years ago
orbiter 5214f571cd simplified method call in balancer
19 years ago
orbiter 7935f27038 enhanced synchronization in balancer
19 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL
19 years ago
orbiter 07900366ac deactivated cache-initialization for file-indexes (files in WORDS)
19 years ago
orbiter 40aa735520 fixe timing problem causing too long delay during initialization of kelondroTree objects
19 years ago
theli 24a02cbeef *) Bugfix for not parsable application/xhtml+xml resources if
19 years ago
orbiter b0ca5fa784 some correction algorithm for preload time computation during assortment open
19 years ago
orbiter e22cbaee97 - extended logging for preload
19 years ago
orbiter 671fd9a5c9 work towards new indexing database structure
19 years ago
orbiter 92f4cb4d73 added option to configure the start-up delay time for kelondro database files.
19 years ago
orbiter 6643da3fbd bugfix for http://www.yacy-forum.de/viewtopic.php?p=23463#23463
19 years ago
hydrox 8ba8e2b7d9 *) added cache for blacklists urlhashs recieved by DHT. DHT does not request URLs listed in this cache.
19 years ago
hermens 53cbcc6d6e Implement emergency break in index receive when the limit of the ramCache is exceeded by more than cacheLimit
19 years ago
orbiter 66964dc015 removed high/med/low from kelondroRecords cache control.
19 years ago
borg-0300 4c6083b264 network picture;
19 years ago
borg-0300 955915385a network picture;
19 years ago
borg-0300 027fa8ab1c network picture;
19 years ago
theli b20496e42b *) make DHT DoS check configurable (requested by KoH)
19 years ago
orbiter 12af69dd86 cosmetics
19 years ago
allo 67a8c74be3 Fix for dynamic login with static password.
19 years ago
allo ef9eb50c3c fix for adminlogin
19 years ago
allo 6fe2fed87e cookieauth works with static Admin.
19 years ago
theli 45b39ee1be *) solving unpacking problems with to long filename by
19 years ago
theli fb090652df *) use a more compact for plasmaWordIndexAssortmentImporter.java because the long name
19 years ago
theli 4ca0857c0c *) Index transfer now considers the pause time send by busy peers during
19 years ago
orbiter 75ed507d39 some debugging of new kelondroFlexTable class
19 years ago
orbiter 370c481fa7 bugfixes
19 years ago
orbiter c36e9fc8d3 full integration of kelondroRow
19 years ago
orbiter c75cacda95 added a flex-width-array: this is a table where it is
19 years ago
orbiter 4a907a570f 1st step to migrate kelondroTree to usage of kelondroRow instead of byte[][]
19 years ago
orbiter 09f780df27 more bugfixes for the new row/stack handling changes
19 years ago
orbiter 3c3c047d0a integrated kelondroRow into kelondroStack
19 years ago
orbiter 5bb565944f integration of new kelondroRow into some parts of kelondro,
19 years ago
orbiter eaa6f012f0 refactoring: better naming for classic DB (files in WORDS)
19 years ago
orbiter 5041d330ce refactoring
19 years ago
orbiter 7b3b12888c refactoring: integrated indexContainer abstraction layer
19 years ago
orbiter cb295fbbdc refactoring
19 years ago
rramthun bc94a714b2 Better explanation for the auto-dom-filter.
19 years ago
orbiter 196b8abb30 refactoring
19 years ago