Commit Graph

1486 Commits (fbb712c669dc001230cbfac24065e9b91dd52c19)

Author SHA1 Message Date
orbiter 62c947b4aa next try to fix deadlock in plasmaWordIndex 18 years ago
orbiter 871ee1ce0f one step closer to automatic updates: 18 years ago
theli 2399ed817c *) robots.txt parser now extracts the sitemap-URL (will be used later) 18 years ago
orbiter fa012789b2 tried to fix a deadlock problem durin shutdown 18 years ago
orbiter e192f616a2 collection of small bugfixes 18 years ago
orbiter f8de19fb2f robinson cluster: added client-side protocol implementation 18 years ago
(no author) 4f4d3d71dd *) Faster appearance of ConfigBasic by bypassing UPNP-scan in case of existing external connects 18 years ago
orbiter 657585fe0d network functions for robinson peers: server-side protection 18 years ago
orbiter 89c1511738 - added new Network Configuration menu, can be found in basic settings 18 years ago
orbiter 62b79aa0a9 bugfix for http://www.yacy-forum.de/viewtopic.php?p=34558#34558 18 years ago
orbiter 2f3b518169 temporary patch for startup-problem: 18 years ago
rramthun e6fb6426a3 *) Some cosmetical changes and corrections 18 years ago
orbiter 595ee10468 fixed datatabase inconsistency bugs 18 years ago
orbiter ca79362b9d disabling auto-setting of remote crawl performance 18 years ago
orbiter 7a7a1c7c29 fight against problems with remove-methods and synchronization 18 years ago
orbiter 063063aa0c fix for 100% cpu bug during dht selection 18 years ago
michitux 4990909178 Some bugfixes, new layout/style for image search results: 18 years ago
orbiter 78d04bcbcf fixed bug in search statistics 18 years ago
orbiter b79b4082e2 completed search exclusion: 18 years ago
orbiter 06a7978730 moved url pattern matching for search to better place 18 years ago
orbiter 159bd0cab5 diverses; b.o. fix for http://www.yacy-forum.de/viewtopic.php?p=33914#33914 18 years ago
orbiter 40c14a4f0e - better implementation of search query properties 18 years ago
orbiter 6e7340ef52 added exclusion search 18 years ago
orbiter e4734a8b6b fix for fix in SVN 3537 18 years ago
orbiter 356033aceb fixed bug with continuous reset of balancer file index 18 years ago
orbiter ba2c307ab3 optimized memory allocation in kelondroRow.Entry 18 years ago
theli 24ea4ca631 *) adding first version of postscript parser 18 years ago
orbiter 6488ec8a80 no deletions in index in case that snippet-loading fails and there is no network connection 18 years ago
orbiter 5c3afb3202 added option to configure a path to a secondary index location. 18 years ago
orbiter 242c19b480 completed TLD categorization 18 years ago
theli 75d90834a2 *) adding additional file extension for powerpoint 18 years ago
orbiter 2cb16824e3 removed support for old database structures. 18 years ago
orbiter 3688ec33e5 release 0.51 18 years ago
theli 1f61c13697 *) RSS-parser extracts the author tags now 18 years ago
theli b374812f01 *) adding rpm packager as author 18 years ago
orbiter 6b9eea3932 - removed differentiation between longTitle and shortTitle; this cannot be used for search results, 18 years ago
orbiter a738b57b31 added author tag to indexing content 18 years ago
orbiter 6be57983a8 another update to the crawl balancer 18 years ago
orbiter 4783a30910 - fixed a flush problem in balancer 18 years ago
orbiter 861f41e67e redesigned NURL-handling: 18 years ago
orbiter 581db87237 more debug code for 18 years ago
orbiter 81c4cc6bf7 better debugging of balancer failure 18 years ago
orbiter 96b79bf86d redesigned remove method in kelondroRowSet 18 years ago
orbiter 9f929b5438 better snippet handling in case of snippet load fail 18 years ago
auron_x d451ad48d3 *) improved peerloadgraphic: 18 years ago
orbiter a5d668c0c6 added speed-buttons for easy performance setting 18 years ago
orbiter 5b0a84ce09 fix for synchronization deadlock with flushMissNameCache. 18 years ago
karlchenofhell e2ac5f62bd - Code hübscher machen [von NNs TODO] 18 years ago
allo f04097c3dd integrated tor-patch for crawling, if yacyDebugMode is set. 18 years ago
auron_x 22fe14f292 *) first version of Peerload-graphic 18 years ago
orbiter 432d7d4e9c better catch 18 years ago
orbiter 8f7e8b6ee2 auto-delete for not-fixable db error in crawl stacker. 18 years ago
orbiter 7a52b07fcc better memory protection during freemen cycle 18 years ago
orbiter 6faa262259 fix for NURL-fix 18 years ago
orbiter 243a2f831b fixed problem with not found NURL-hashes 18 years ago
orbiter 6ad39bae1e fixed shutdown problem 18 years ago
orbiter 38b93f8cb8 bugfix for my last commit: 18 years ago
orbiter d755a8026d - better OOM protection 18 years ago
orbiter 33f97cff7a changed startup initialization sequence slightly 18 years ago
orbiter 4e8eb1dbe3 some minor changes here and there 18 years ago
karlchenofhell 03c5906ae7 - minor bugfixes for url-fetcher & http://www.yacy-forum.de/viewtopic.php?t=3646 18 years ago
orbiter 313f6a7680 fix for http://www.yacy-forum.de/viewtopic.php?p=31553#31553 18 years ago
orbiter 958ebea5c5 fix for http://www.yacy-forum.de/viewtopic.php?p=32470#32470 18 years ago
orbiter 1cba31de43 redesigned ram organization for database caches 18 years ago
orbiter db235f2d61 added some memory protection in collection index multiple merge 18 years ago
orbiter b466baa574 added some memory protection 18 years ago
low012 ce360ef43e *) no more HTML in plasmaCrawlProfile.java anymore 18 years ago
karlchenofhell 88245e44d8 - improved version of robots.txt (delete your old htroot/robots.txt before updating): 18 years ago
orbiter 51e12049fa third generation of R/W head path optimization 18 years ago
orbiter 10a3c20b8d some more enhancements to R/W Head path optimization 18 years ago
orbiter f4cfd19835 second Generation of collection R/W head path optimization: 18 years ago
orbiter 1fda50fd3c correct R/W head positioning in kelondroFlex 18 years ago
orbiter 304412a049 first generation of collection index R/W head path optimization 18 years ago
hydrox cb89c74d52 *) added blog-comments 18 years ago
karlchenofhell 6fbe31425a - some code-cleanup (no more syntax-warnings here) 18 years ago
orbiter e3480d4ad3 fix for warning in crawl balancer 18 years ago
karlchenofhell 619653c054 - fix for last commit 18 years ago
karlchenofhell 26f5757b40 - added support for multiple paths per domain to default-blacklist 18 years ago
orbiter f7803a6ce4 enhanced crawl balancer 18 years ago
orbiter c3e8c23f5d fix for 'CANNOT FETCH ENTRY: hash is null' bug 18 years ago
orbiter dc0c06e43d PLEASE MAKE A BACK-UP OF YOUR COMPLETE DATA DIRECTORY BEFORE USING THIS 18 years ago
karlchenofhell d114a0136e - crawl profile: don't add null-values 18 years ago
theli e1edb23689 *) Bugfix for IllegalMonitorStateException 18 years ago
orbiter a15963ff98 better balancing: if element from top would force a busy waiting, 18 years ago
orbiter dda24fcb85 ups 18 years ago
orbiter 8c1d2e0227 protection against crawl balancer failure: 18 years ago
orbiter 30d79d69a6 fix for wrong display of search statistics 18 years ago
orbiter daf2e15f59 some storage process enhancements (write without preceding read) 18 years ago
orbiter d25caa07bf redesigned some parts of http authentication 18 years ago
orbiter b2f4087400 redesign of last-seen fieln inside seed: 18 years ago
orbiter 819ff21c92 fixed QPM output 18 years ago
auron_x 89e7af037a *) used more switchboard-vars instead of config-vars 18 years ago
orbiter 306c50ac40 QPM (queries per minute) statistic stub 18 years ago
karlchenofhell 9f74b128dd - added many more commented constants (please use constants rather than i.e. config-setting strings directly) 18 years ago
orbiter 9c05e2a820 re-design ob kelondroMap 18 years ago
orbiter f25c0e98d1 - replaced String by StringBuffer in condenser 18 years ago
karlchenofhell d311e258f8 - adjusted LogStatistics to nano-seconds 18 years ago
orbiter f3f99b19c6 extended search statistics 18 years ago
orbiter c0851ee943 refactoring: moved and renamed de.anomic.data.searchResults to plasma package 18 years ago
allo c39dda2374 finished refactoring of searchtemplates. 18 years ago
allo 35039982da refactoring of search process: store results in a searchResults structure. At the moment, its just stored in it, and read from it again. 18 years ago
orbiter 76fab83395 fixed bugs in seach statistics 18 years ago
orbiter d07b132a0d - fixed colors of network grafic 18 years ago
allo 29aa7031d3 workaround for the snippets 18 years ago
karlchenofhell aea199cb7b - IndexTransfer is working again 18 years ago
orbiter 5515571950 redesign of ymage classes 18 years ago
orbiter 52c6461e6b some bugfix for statistics 18 years ago
(no author) fe72b772cf added a monitor page for search requests 18 years ago
karlchenofhell b873ad51ab - fix for http://www.yacy-forum.de/viewtopic.php?t=3369 18 years ago
borg-0300 1aa74bbd2b update for last commit 18 years ago
borg-0300 23e613b2ab CPU & IO reduce (Index Distribution) 18 years ago
(no author) c67d22116e added exists-check based only on RAM index lookup: 18 years ago
(no author) 37e53b4a6a replaced tree database structure for seed db by flex data structure 18 years ago
karlchenofhell 35fb671721 - updated DetailedSearch and ViewFile 18 years ago
theli d157201e08 *) IfesL for "Unexpected end of ZLIB" error message 18 years ago
hydrox 2c01508ada *) fix for http://www.yacy-forum.de/viewtopic.php?p=29575#29575 18 years ago
borg-0300 d2be3c674d wrong cache values fixed 18 years ago
karlchenofhell df6281ba1f - removed JS from DetailedSearch => valid 18 years ago
hydrox fb1d8b91af *) changed Startpoints of IndexCleaner and IndexTransfer from ------------ to AAAAAAAAAAAA. 18 years ago
orbiter 9b726ac366 release 0.50 18 years ago
orbiter 036a0c828e fix for auto-configuration of crawler thread memory 18 years ago
orbiter a4e90bc1dc fix + debug-code for http://www.yacy-forum.de/viewtopic.php?p=29126#29126 18 years ago
borg-0300 6b5f28b746 answer for last commit: no 18 years ago
borg-0300 d98ba7bc33 fix for memory limit computation ? 18 years ago
orbiter c48374d14a new memory limit computation for indexing queue 18 years ago
orbiter 08ac4c5ed0 bugfix for http://www.yacy-forum.de/viewtopic.php?p=29045#29045 18 years ago
orbiter 8e3bd17554 adopted DetailedSearch page to new ranking options 18 years ago
orbiter 93a7e88245 more ranking parameter usage 18 years ago
orbiter 2dbea612c9 fixed display bug for image search preview 18 years ago
orbiter 0a050bc043 enhanced ranking 18 years ago
orbiter 61798f0ae6 added option to distinguish between text crawl and media crawl 18 years ago
orbiter febe6b114a design update of crawler monitor 18 years ago
allo 782db9099d version independent name for commons-pool lib 18 years ago
orbiter 7ff86d6ba6 - image search now shows thumbnails (in bad order, but it works) 18 years ago
orbiter ee3d91cb6b print-out of links that result from contraint-filtering 18 years ago
orbiter e4570bffaf -implemented a specialized snippet-fetch for media content 18 years ago
low012 694a6e4f44 *) better text snipptes: any possible searchword (welt, linux, tag) in welt-linux-tag will be marked correctly now 18 years ago
orbiter bddc197453 reverted by-mistake removed change from low012/SVN 3068 18 years ago
orbiter 1377c53aa3 extraction of media links from search results 18 years ago
low012 586add4c6c *) Better snippets: words like GNU/Linux will not prevent Linux or GNU from being marked if they are searchword (see http://www.yacy-forum.de/viewtopic.php?t=2891) 18 years ago
borg-0300 8b7c543885 NullPointer fix 19 years ago
orbiter 937ccd4e76 fix for snippet-generation 19 years ago
auron_x c086c71f17 *) fixed ArrayIndexOutOfBoundsException 19 years ago
orbiter c93cfdc23a fix for http://www.yacy-forum.de/viewtopic.php?p=28564#28564 19 years ago
orbiter 93a5ace330 fix for http://www.yacy-forum.de/viewtopic.php?p=28544#28544 19 years ago
orbiter bf0d820659 - added correct flagging of word properties 19 years ago
orbiter 10d888e70c - added a media search for images, audio, video and applications 19 years ago
orbiter a603c4d5e8 more code simplifications 19 years ago
orbiter 9a85f5abc3 cleanup 19 years ago
borg-0300 fbe1ee402b plasmaCrawlLURL$kiter cleanup 19 years ago
orbiter 773ba1e91a - generalized object order handling 19 years ago
borg-0300 15381cbf73 other bugfix 19 years ago
borg-0300 ad65cc9d2f NullPointer fixes 19 years ago
borg-0300 d33745a7ea NullPointer 19 years ago
orbiter 3a4933b63c bugfix for 19 years ago
orbiter 109ed0a0bb - cleaned up code; removed methods to write the old data structures 19 years ago
orbiter 052f28312a removed assortments from indexing data structures 19 years ago
orbiter 2372b4fe0c release 0.49 19 years ago
orbiter f8efb3c948 fixed a null pointer exception problem reported in the forum. 19 years ago
orbiter ad1e4aa88e added selection of audio, video, image and application resources 19 years ago
orbiter 7cc4cec9c9 bugfix for assertion bugs documented in 19 years ago
orbiter 7dbcd358b4 fix for http://www.yacy-forum.de/viewtopic.php?p=28231#28231 19 years ago
orbiter 86394e7a56 fix for cache-delete problem: 19 years ago
orbiter ceb9e3aa17 - enhanced parser: collection of audio, video, image and application links 19 years ago
orbiter 0b9370a9dc fix for http://www.yacy-forum.de/viewtopic.php?p=28108#28108 19 years ago
orbiter b5a29e9651 - fix for snippets that are too short 19 years ago
orbiter f1528672b1 filtering of non-index pages during index-of search 19 years ago
orbiter 8e7215475b - extended ViewFile to use is as debugging-tool: you can now use the 19 years ago
orbiter 30888e7a2f implementation of search constraints 19 years ago
orbiter 49a83f99d9 - fix for wrong DHT ordering in DHT selection 19 years ago
orbiter f4b547dc13 limited index transfer to peer with version 0.486 19 years ago
orbiter 10a4ab5195 disabled some (more) write caches 19 years ago
orbiter 09bcc10344 bugfix for some problems of last change with assortments 19 years ago
orbiter e3d75f42bd final version of collection entry type definition 19 years ago
orbiter c9364246cc introduced new RWI-Object. 19 years ago
orbiter e628d34e16 patches for bad data 19 years ago
orbiter 497428c8ec refactoring 19 years ago
orbiter 76fceb9997 refactoring 19 years ago
orbiter eeda881553 bugfix for last commit 19 years ago
orbiter bb7d4b5d5e refactoring to prepare new RWI entry object 19 years ago
orbiter bdc9216366 - more asserts 19 years ago
orbiter 1751a799ac - deactivated all write buffers 19 years ago
orbiter ba967c4875 - bugfixes and debug code 19 years ago
orbiter ee4715a21c - more asserts 19 years ago
orbiter 114a76a86e - added flag to urlhash that shows that domain is a local domain 19 years ago
orbiter b2d51be33c bugfix for latest changes to entry generalization 19 years ago
hermens 8385557672 Small fix for the Cache Monitor when using proxyCacheLayout=hash 19 years ago
orbiter f1ed55a5fc bugfix for last commit 19 years ago
orbiter 8fdefd5c68 generalization of payload definition of index storage 19 years ago
theli ad248d61ca *) more verbose exception 19 years ago
hydrox 7e8669b15c *) added possibility to "recycle" a DHTChunk that failed to transfer. 19 years ago
low012 4feaa91890 *) Added additional MIME-Type. 19 years ago
low012 89af433879 *) Deleted parts of WebCat that were not needed for parsing SWFs. 19 years ago
orbiter 46a712e195 - more asserts 19 years ago
low012 8c9bc7e341 *) extracting urls works now 19 years ago
low012 493391e42d *) new flash parser, still experimental 19 years ago
orbiter 215c4e65f1 code cleanup 19 years ago
orbiter bd4f43cd66 - fixed a null pointer exception bug 19 years ago
auron_x 194d42b6a7 *) changed PPM-calculation to be more accurate 19 years ago
orbiter fe8afaf426 switched off usage of write cache for imprortant databases 19 years ago
orbiter d3431433b0 more anonymization in logging 19 years ago
orbiter e6044e5198 bugfix for 19 years ago
orbiter 78b7f6f7fd bugfix for index remove bug, 19 years ago
orbiter 147d88cf23 re-design of database caching 19 years ago
orbiter 4e363108e1 - removed bad debug code that caused a large and unnecessary delay during global search 19 years ago
orbiter 2a9d868f6d - removed object cache from kelondroTree 19 years ago
orbiter 3ffc5b8793 fixed problem with serverCharBuffer.append(char) 19 years ago
orbiter 06854988da - full integration of new LURL database in INDEX 19 years ago
octoate e4a3574b77 StringBuffer now resets every time the parser is called 19 years ago
karlchenofhell ce237aefad - assortment-sizes table from PerformanceQueues_p.html is not shown if not used 19 years ago
theli a5b9b514c1 *) retry crawling without content-encoding if the content-encoding header was not correct 19 years ago
theli 92f774edd1 *) Better charset encoding detection 19 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration 19 years ago
octoate cc24dde5e0 First version of a MS Excel parser based on Apache POI 19 years ago
karlchenofhell 4c63129136 - stupid mistake... 19 years ago
karlchenofhell ebf0da2a45 - now the fix http://www.yacy-forum.de/viewtopic.php?t=2974 works 19 years ago
theli 3d152bfe43 *) Logging message added 19 years ago
karlchenofhell b5e40e2fa2 - fix for http://www.yacy-forum.de/viewtopic.php?t=2974 (no cache-sizes for new db) 19 years ago
orbiter 77a59a115d refactoring of indexing methods 19 years ago
theli cbb1e710b9 *) removing old class 19 years ago
orbiter c6d46f7ebd null pointer bugfix 19 years ago
theli decb09df6d *) Trying to be more tolerant against wrong charset names 19 years ago
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names 19 years ago
theli 7526c831a8 *) Suppressing stracktrace 19 years ago
orbiter 50f2578c55 - some bugfixing and code cleanup 19 years ago
orbiter bdf4c7c51e added missing files for last commit 19 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format 19 years ago
octoate 1c4076da8a First version of the MS Powerpoint parser based on Apache POI 19 years ago
theli 5b75d64d7d *) bugfix for last commit 19 years ago
theli 71ed104bc7 *) adding additional rpm mimetype (used by packman) 19 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db 19 years ago
hermens 48f81acc0e reverse SVN 2744, it is not needed 19 years ago
hermens 1da9aece12 Repair DNS prefetch during cacheScan 19 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing 19 years ago
theli a9c7e3f061 *) Bugfix for NoSuchElementException 19 years ago
orbiter c8f3a7d363 added snippet-url re-indexing 19 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now 19 years ago
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories 19 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details) 19 years ago
orbiter 25ae3d3161 generalized definition of hexhash 19 years ago
orbiter f0d747c723 removed deprecated method 19 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method 19 years ago
orbiter 0f10bdde22 more generic cache methods 19 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout 19 years ago
hermens 440c6ee657 Implement alternative htcache layout 19 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot. 19 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things): 19 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly 19 years ago
orbiter db294687ea enhanced logging 19 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage 19 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB 19 years ago
theli 1586d57187 *) odtParser: better handling of large files 19 years ago
theli f17ce28b6d *) plasmaHTCache: 19 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM 19 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy 19 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by 19 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps 19 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls 19 years ago
theli cd5f349666 *) Better handling of large files during parsing 19 years ago
low012 f8ac694e51 *) fixed a bug where searchword in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998) 19 years ago
orbiter df1629b05a - code cleanup 19 years ago
theli b73efd5565 *) missing changes needed because of last commit 19 years ago
orbiter 2463e5624a 'quick' release 0.47 19 years ago
theli 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache 19 years ago
theli 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1 19 years ago
hermens 3f5a4153a0 Make Peers more receptible to transferred indexes 19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher) 19 years ago
theli 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread 19 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString; 19 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui 19 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the 19 years ago
theli 5c6251bced *) some improvements for extended html document charset support 19 years ago
orbiter f453c14b5d removed unreacheable catch blocks and unused imports 19 years ago
theli ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class 19 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets 19 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added 19 years ago
orbiter f644a1c3a7 better evaluation of index abstracts 19 years ago
allo 2fd610b556 http://www.yacy-forum.de/viewtopic.php?p=25611#25611 19 years ago
theli 06fa891152 *) htmlFilterContentScraper.java: using proper charset for document title 19 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser) 19 years ago
theli c5d3020941 *) better errorhandling for last commit 19 years ago
theli d0a5a53789 *) changes needed for multi-language support 19 years ago
orbiter 26ab1fa885 fixed null pointer exception 19 years ago
theli b0e8ff6eda *) some TODO makers for UTF-8 problem 19 years ago
orbiter 41e27b85b7 fix for crawler condition 19 years ago
theli 9ecf7f0da2 *) some TODO makers for UTF-8 problem 19 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache 19 years ago
orbiter 6e2907135a bugfixes for remote search server part 19 years ago
orbiter cf9884e22b first attempt to implement a secondary search 19 years ago
orbiter b251076e64 avoid ConcurrentModificationException 19 years ago
orbiter 75b198bc02 - updated references to indexContainer 19 years ago
orbiter b7e7808ea6 wordmigration now works also for new index database 19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error 19 years ago
orbiter 4f9e42d5ed more changes towards better join-search 19 years ago
orbiter a7281a9b4d fix for last commit 19 years ago
orbiter 82a6054275 - fixed bug with new indexAbstract generation 19 years ago
theli fded1f4a5d *) better handling of maximum file size limit in crawler 19 years ago
orbiter 74d1dea30b changes towards better join-search 19 years ago
orbiter ae4e8ce03e - cut for 'probably last html-interface version': version number update 19 years ago
orbiter 64bed59ee8 enhancements to ranking 19 years ago
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use. 19 years ago
orbiter 94d7ced900 fix for last ranking commit 19 years ago
orbiter 03835c2ee8 enhanced search result computation 19 years ago
orbiter ac3419b65f better debugging for indexOutOfBoundException bug 19 years ago
orbiter a8bc768206 enhancements to ranking evaluation 19 years ago
theli 33898ae7e9 *) ResourceInfoFactory.java: Bugfix for classNotFoundException 19 years ago
theli 406e170e25 *) more verbose error message 19 years ago
theli b298474e22 *) Bugfix needed because of changed plasmaCrawlLURL.load behavior 19 years ago
orbiter 96c6e4e322 - enhancements to detailed search page 19 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs 19 years ago
theli a5ed86105b *) bugfix for handling of ResourceInfo object in proxy 19 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior 19 years ago
hermens 7aeadbe7cc another NullPointerException in http.ResourceInfo 19 years ago
orbiter 141f9e5bb4 fix for new plasmaCrawlLURL.load behavior 19 years ago
hermens 087f7511f8 prevent NullPointerException in http.ResourceInfo 19 years ago
orbiter a2525072f2 bugfix for kelondroRow - property generation 19 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling 19 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file 19 years ago
theli 4ae0f122f8 *) ResourceInfo.java: License header added 19 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added 19 years ago
orbiter 4866868c0e added write cache for LURLs 19 years ago
orbiter 8a0e35618b enhancements to search result preparation 19 years ago
theli 5c1bb53d2a Missing description for last commit 19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542 19 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem 19 years ago
theli 7930839594 *) URL.java: userinfo was not taken over when generating a new url from a base url and a rel. path 19 years ago
theli 7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent 19 years ago
theli ffbf416e76 *) direct access to requestheader of htCache.Entry removed to make it more http independent 19 years ago
theli 3870d615e3 *) setting htCache.Entry fields to private 19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private 19 years ago
theli ab5a9bee66 *) adding some copyright headers 19 years ago
theli 5847492537 *) next step of restructuring for new crawlers 19 years ago
theli fce9e7741b *) next step of restructuring for new crawlers 19 years ago
theli e3f0136606 *) next step of restructuring for new crawlers 19 years ago
theli 9ded4e8d5a *) Bugfix for name resolution in proxy mode 19 years ago
theli 1c8300fcec *) Bugfix for name resolution in proxy mode 19 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers 19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers 19 years ago
theli eb9b138986 *) next step of restructuring for new crawlers 19 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols 19 years ago
theli b4acbdaa97 *) better handling of server shutdown 19 years ago
theli f3ac4dbbb9 *) better handling of server shutdown 19 years ago
theli 959b779aba *) avoid performance loss if log level is greater than 'fine' 19 years ago
orbiter 18b6876860 new cache flush configuration settings 19 years ago
hermens f0278b4092 Bugfix for / by zero when the AssortmentCluster is empty 19 years ago
orbiter 14e0bb0dcf allow more references per word for new db 19 years ago
orbiter 985dcbde7f changed some parameters that may cause better memory usage and more indexing speed 19 years ago
orbiter b7f4a1521b added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL 19 years ago
orbiter c26da4893b turned back NURL usage of kelondroTree, kelondroFlexTable has still problems with deleted entries 19 years ago
orbiter db1eae0227 * simplified initialization of database objects 19 years ago
hermens 0b73f2b132 Repair DNS prefetch during cacheScan 19 years ago
orbiter 27a159b401 * documentation update 19 years ago
theli f80f776b89 *) Trying to solve NullpointerException problem in function addURLtoErrorDB 19 years ago
hydrox 1c99b5a484 *)fixed logging for urldbcleanup 19 years ago
orbiter 8f3f4ab0eb enhanced synchronisation in plasmaWordIndex 19 years ago
orbiter 23dd972608 fixed memory calculation in performanceMemory web page 19 years ago
orbiter 1ce3c22761 better memory control: 19 years ago
orbiter 39b4c26bdc more memory control: 19 years ago
orbiter 3e9d509c39 some small fixes 19 years ago
orbiter eb633c0a4f server threads must now supply a method that can be called in case 19 years ago
orbiter f5720cb2fa removed most synchronization in wordIndex (for testing) 19 years ago
orbiter 0187c60010 because of a bug in the JRE 1.4.2 there was no memory protection 19 years ago
orbiter cfb51fdef1 less synchronization in plasmaWordIndex 19 years ago
orbiter d6a928c2da quickfix for http://www.yacy-forum.de/viewtopic.php?t=2705 19 years ago
orbiter 6ad471ef96 * applied many compiler warning recommendations 19 years ago
hydrox 9da3aa74d3 silly me, fix for the fix as advised by theli 19 years ago
hydrox bb3d9a5582 *) e.getMessage().indexOf() can only be used if there is actually an ExceptionMessage. 19 years ago
hydrox 7a54010a9c *) Iterators can't be casted to IndexContainer 19 years ago
orbiter cd5f7e137c fixed problem with NURL-generation upon first startup 19 years ago
orbiter 8418af141a added several consistency checks and small changes 19 years ago
theli 9d13aeca13 *) removing class. does not work so far 19 years ago
theli 95a84ae469 *) adding missing classes 19 years ago
theli eee44be602 *) adding an interface for customized blacklist classes 19 years ago
orbiter 6d2f15971a there is a very strange error that causes that the kelondroRecords structure 19 years ago
theli d2e8e76218 *) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler 19 years ago
orbiter 9ae9062bd3 * disabled new kelondroFlex table for NURLs 19 years ago
orbiter 689bbcf9cd replaced kelondroTree db for NURLs by new kelondroFlexTable 19 years ago
orbiter 7fbba41962 synchronization fixes 19 years ago
orbiter 328f9859a5 more synchronization in plasmaWordIndex 19 years ago
orbiter 130e6d4719 generalized index object for eurl, nurl and lurl to prepare move 19 years ago
orbiter acdf24877f more synchronization against outOfMemoryError in wordIndex 19 years ago
orbiter 95160d7f2c fixed size computation of index elements from the collection index 19 years ago
orbiter 26116cabde added missing rowdef assignment 19 years ago
orbiter abf22f6e60 removed url normalform computation from htmlFilterContentScraper. 19 years ago
orbiter 740d49751d * strict type and size check in kelondroRow handling 19 years ago
orbiter 314021453f * more logging 19 years ago
orbiter 61b151b083 * added another auto-fix for collection index inconsitency check 19 years ago
orbiter f58283def2 better control of index flush 19 years ago
orbiter 4be21a3cab ups 19 years ago
orbiter 80b6c90d54 enhancements to prevent blocking during dht transfer receive 19 years ago
theli 9f298083cd *) adding more urls to the error url 19 years ago
hermens d56f06401e - Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible 19 years ago
theli c09f734d06 *) offer router configuration on ConfigBasic.html 19 years ago
hermens dcbb4d0a6b Display the size of HashBlacklistedCache on PerformanceMemory page. 19 years ago
orbiter d799622da1 better flush limit for index collections 19 years ago
orbiter 279b1d969d Integrated new indexing data structure 'collections' into the main class 19 years ago
orbiter 4ff742e42d implemented indexCollectionRI 19 years ago
orbiter 01f95eccd3 re-write of kelondroCollectionIndex. This is the data structure that 19 years ago
orbiter ebc2233092 * implemented (finished) class indexRowSetContainer 19 years ago
orbiter 9183d21f25 renamed new index class to old name 19 years ago