Commit Graph

2193 Commits (b6a5f53020fcc1b3e00981822f3d170b47981cc4)

Author SHA1 Message Date
theli f37e2041e8 *) adding soap function to import yacy bookmarks from xml or html (transfered via soap attachments)
18 years ago
orbiter f1ed55a5fc bugfix for last commit
18 years ago
orbiter 8fdefd5c68 generalization of payload definition of index storage
18 years ago
theli 29a1f132ec *) some strings replaced by constants
18 years ago
theli 4a3ec63e34 *) new soap service to manage yacy bookmarks
18 years ago
(no author) 9b3fd2b9e5 *) removing doctype definition to avoid problems with xml parser
18 years ago
(no author) c64d5018b4 *) Bugfix. Problem in XML Parser
18 years ago
theli 5e57e0814d *) new soap function to display log
18 years ago
theli ad248d61ca *) more verbose exception
18 years ago
hydrox 7e8669b15c *) added possibility to "recycle" a DHTChunk that failed to transfer.
18 years ago
low012 4feaa91890 *) Added additional MIME-Type.
18 years ago
low012 89af433879 *) Deleted parts of WebCat that were not needed for parsing SWFs.
18 years ago
orbiter 46a712e195 - more asserts
18 years ago
low012 8c9bc7e341 *) extracting urls works now
18 years ago
orbiter fc2936d500 bugfix for internal index entry generation
18 years ago
low012 493391e42d *) new flash parser, still experimental
18 years ago
orbiter 215c4e65f1 code cleanup
18 years ago
orbiter bd4f43cd66 - fixed a null pointer exception bug
18 years ago
auron_x 194d42b6a7 *) changed PPM-calculation to be more accurate
18 years ago
orbiter fe8afaf426 switched off usage of write cache for imprortant databases
18 years ago
orbiter 985fd807cc bugfixing in collection methods
18 years ago
theli c7bea4addb *) soap api
18 years ago
theli ee4d4e8567 *) Soap-handler: bugfix. wrong content-length was send when using content-encoding
18 years ago
orbiter d3431433b0 more anonymization in logging
18 years ago
orbiter e6044e5198 bugfix for
18 years ago
theli 4d19d94348 *) bugfix for nullpointerexception
18 years ago
theli 532c23b5c7 *) soap handler
18 years ago
orbiter 78b7f6f7fd bugfix for index remove bug,
18 years ago
(no author) 0e79f2fd7e name of the file to tranlate apears ahead its translation
18 years ago
orbiter ebd2d629d8 added missing file for last commit
18 years ago
orbiter 147d88cf23 re-design of database caching
18 years ago
orbiter 4e363108e1 - removed bad debug code that caused a large and unnecessary delay during global search
18 years ago
orbiter f21ede312e bugfixes for internals of database organization
18 years ago
orbiter eb4bfb0e9d fixed problem with cache.profile()
18 years ago
orbiter 2a9d868f6d - removed object cache from kelondroTree
18 years ago
theli 7299dc30e3 *) new soap service to manage the yacy file-share
18 years ago
theli 777e39cea0 *) new template to display the dir-listing in xml format.
18 years ago
theli 9e8942a064 *) adding method to implement blacklist from file
18 years ago
theli 4d1f933ea1 *) avoid reading of content body into memory
18 years ago
theli 88cfdecd38 *) Bugfix: calling close must not close the wrapped input stream, otherwise
18 years ago
theli d38ef0493d *) be more tolerant against missing ports in url
18 years ago
theli cfe54fedc7 *) Bugfix for resolveBackpath problem with tailing /..
18 years ago
orbiter dc056fabf3 small bugfix
18 years ago
orbiter 278d8c3c7e - more asserts
18 years ago
allo 5a6488256d catch the "username too short" exception
18 years ago
orbiter 2d3f1a53fd handling of Missing byte-order mark exception
18 years ago
theli ac13fa763a *) bugfix for blacklist remove (blacklist was not informed about remove)
18 years ago
allo 8a5c2d0a19 fix for supertemplates, too.
18 years ago
allo c35793fb46 fix for last commit
18 years ago
theli 3e0516446b *) new soap function to get the current queue status
18 years ago
allo a831c83025 create servletProperties, with the servlet specific funktions from serverObjects
18 years ago
orbiter 83a0efc65a better assert statements and fixes
18 years ago
karlchenofhell d13b381f83 - added mint-green skin
18 years ago
orbiter 2025e885d6 a fix for problems with remove situations in kelondroFlexSplitTable
18 years ago
theli b12da510f3 *) adding optional libraries for needed for soap attachments
18 years ago
theli 9eecc9a888 *) libs added to classpath
18 years ago
theli a1acc9c389 *) new function to configure distributed crawling
18 years ago
theli 0996e550e7 *) deploy soap peer admin service
18 years ago
orbiter 3ffc5b8793 fixed problem with serverCharBuffer.append(char)
18 years ago
orbiter 8b56887676 removed unused code
18 years ago
orbiter 06854988da - full integration of new LURL database in INDEX
18 years ago
(no author) 02c66c04f2 *) Missing file from last commit
18 years ago
octoate e4a3574b77 StringBuffer now resets every time the parser is called
18 years ago
theli ef912811f1 *) adding new soap service for peer administration
18 years ago
karlchenofhell ce237aefad - assortment-sizes table from PerformanceQueues_p.html is not shown if not used
18 years ago
theli 68204ff729 *) Suppressing for bad client requests.
18 years ago
theli c1dff41f99 *) adding possibility to deploy custom SOAP services
18 years ago
theli df49724f28 *) better error handling for seed upload - test download - problems
18 years ago
theli a5b9b514c1 *) retry crawling without content-encoding if the content-encoding header was not correct
18 years ago
theli 52466067d8 *) Bugfix for ArrayIndexOutOfBoundsExceptions which occure because SimpleDateFormat is not thread-safe
18 years ago
theli b357a13e9a *) adding synchronization block because SimpleDateFormat is not thread-safe
18 years ago
theli 92f774edd1 *) Better charset encoding detection
18 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
18 years ago
octoate cc24dde5e0 First version of a MS Excel parser based on Apache POI
18 years ago
karlchenofhell 4c63129136 - stupid mistake...
18 years ago
karlchenofhell b14a500b88 - removed debug output from PerformanceMemory_p
18 years ago
karlchenofhell ebf0da2a45 - now the fix http://www.yacy-forum.de/viewtopic.php?t=2974 works
18 years ago
theli 09337c9751 *) Bugfix wrong chars in soap search result document
18 years ago
theli 3d152bfe43 *) Logging message added
18 years ago
karlchenofhell b5e40e2fa2 - fix for http://www.yacy-forum.de/viewtopic.php?t=2974 (no cache-sizes for new db)
18 years ago
theli 96f45e9b15 *) Bugfix wrong chars in soap search result document
18 years ago
theli da2ac6fa23 *) adding new ant target to allow generation of client stub classes for yacy soap api
18 years ago
theli a9cc6df21b *) adding wsdl files to generate client stub classes with ant
18 years ago
orbiter 77a59a115d refactoring of indexing methods
18 years ago
orbiter 14490f0a83 added missing flush statement
18 years ago
orbiter 688cbfb776 - bugfixing for flextable bug
18 years ago
allo a29b4d4fb5 extended Supertemplates for Headerincludes.
18 years ago
theli a7e11ada50 *) suppressing stacktrace for "server has closed connection"
18 years ago
theli 5b114249ce *) Bugfix for ViewLog problem with multiline logging messages
18 years ago
theli de5e233766 *) Bugfix for GuiHandler sorting problem
18 years ago
theli fd94aa4bef *) Bugfix for IndexOutOfBound in GuiHandler
18 years ago
orbiter 29a1318ef9 bugfixes for wrong database access that do not consider deleted entries
18 years ago
theli cbb1e710b9 *) removing old class
18 years ago
orbiter c6d46f7ebd null pointer bugfix
18 years ago
theli decb09df6d *) Trying to be more tolerant against wrong charset names
18 years ago
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names
18 years ago
theli 7526c831a8 *) Suppressing stracktrace
18 years ago
orbiter 50f2578c55 - some bugfixing and code cleanup
18 years ago
orbiter bdf4c7c51e added missing files for last commit
18 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
18 years ago
orbiter 130cc76927 loop detection and termination in deletedHandles method
18 years ago
octoate 1c4076da8a First version of the MS Powerpoint parser based on Apache POI
18 years ago
theli 5b75d64d7d *) bugfix for last commit
18 years ago
theli 71ed104bc7 *) adding additional rpm mimetype (used by packman)
18 years ago
borg-0300 76d959122b new constants, finals, Stringbuffer, cleanup
18 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db
18 years ago
hermens 48f81acc0e reverse SVN 2744, it is not needed
18 years ago
hermens 1da9aece12 Repair DNS prefetch during cacheScan
18 years ago
orbiter 918b59dc5e - bugfix for snippet profile (no delete button)
18 years ago
orbiter 2bb529cedb added peer tags for peers in robinson mode
18 years ago
orbiter afbb547f3d extended options for abstracts generation in remote search interface
18 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing
18 years ago
theli a9c7e3f061 *) Bugfix for NoSuchElementException
18 years ago
orbiter f25f61d9d3 documentation of compile problem. See
18 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
18 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now
18 years ago
orbiter b062847797 fix for
18 years ago
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories
18 years ago
orbiter 661f005214 fix for seed upload build script
18 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
18 years ago
orbiter ddf8f220f6 fix for build fail
18 years ago
orbiter 25ae3d3161 generalized definition of hexhash
18 years ago
orbiter 86047f439d removed very bad bug that prevented production of any remote search result
18 years ago
orbiter f0d747c723 removed deprecated method
18 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method
18 years ago
orbiter 0f10bdde22 more generic cache methods
18 years ago
orbiter 72482b1426 fixed scraper
18 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout
18 years ago
hermens 440c6ee657 Implement alternative htcache layout
18 years ago
allo 226f2c5b2c first version, of the Serverlet Debugger
18 years ago
orbiter adf1f74ab2 bugfix for java 1.5 compile problem with serverCharBuffer.append(char)
18 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot.
18 years ago
allo 1d0c0edda3 first version of posts/get from the del.icio.us api
18 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
18 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
18 years ago
orbiter 1dfab1abe3 more control for seed receive
18 years ago
theli 1c0e65f55f *) Bugfix for problems with charset detection
18 years ago
orbiter db294687ea enhanced logging
18 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
18 years ago
theli ce7ee74316 *) better errorhandling in filehandler (try catch block now starts before argument parsing)
18 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
18 years ago
theli 1586d57187 *) odtParser: better handling of large files
18 years ago
theli f17ce28b6d *) plasmaHTCache:
18 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
18 years ago
orbiter bcf2b800b4 applied UTF-8 encoding parameter to yacy-internal protocol communication
18 years ago
orbiter c40fca08a2 fixed bad handling of string separation
18 years ago
orbiter 5a40ea7866 refactoring of wget string list generation
18 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
18 years ago
orbiter d4c239e4be - fixed problem in collection index with deletion of single url references
18 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
18 years ago
orbiter b033a80750 better control of failure in node seek of kelondroTree
18 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
18 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
18 years ago
theli cd5f349666 *) Better handling of large files during parsing
18 years ago
theli 8b2ceddb91 *) Displaying servere and warning logging messages in different colors on ViewLog_p.html
18 years ago
low012 f8ac694e51 *) fixed a bug where searchword in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998)
18 years ago
orbiter df1629b05a - code cleanup
18 years ago
theli c665f6cddb *) handling of quotes in charset string
18 years ago
theli b73efd5565 *) missing changes needed because of last commit
18 years ago
theli 140ddba93f *) adding soap functions to pause and resume the crawler
18 years ago
orbiter 2463e5624a 'quick' release 0.47
18 years ago
theli 49fbb688df *) SOAP: old urlInfo renamed to urlInfoByHash, new urlInfo Function added.
18 years ago
theli 8f143d516b *) make snippet fetcher accessible via soap api
18 years ago
theli 97615af406 *) Restructuring of YaCy SOAP services
18 years ago
theli 241b881560 *) Redesign of YaCy SOAP handler
18 years ago
theli 009a33170b *) Content-Location header added
18 years ago
theli 1aa07a52cd *) Bugfix for UnsupportedEncodingException if the media type contains multiple parameters
18 years ago
theli 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache
18 years ago
theli 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1
18 years ago
hermens 3f5a4153a0 Make Peers more receptible to transferred indexes
18 years ago
theli 57415b6889 *) Bugfix for surftipps UTF-8 problem
18 years ago
allo b0a4fcce8c fix from theli
18 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
18 years ago
orbiter e03427871e enhanced surftipps:
18 years ago
theli 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread
18 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
18 years ago
borg-0300 af1d89e381 check url == null added;
18 years ago
theli cc667b0aa5 *) htmlFilterContentScraper.java: adding support for link tag
18 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui
18 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
18 years ago
borg-0300 f18304ddd3 unused/not needed imports removes;
18 years ago
orbiter ec031eb993 first version of surftipps
18 years ago
borg-0300 b174fbd0ca "import ...*" removed;
18 years ago
orbiter 807756150e patch for strange bug reported by email
18 years ago
theli 5c6251bced *) some improvements for extended html document charset support
18 years ago
theli 33f0f703c0 *) reinserting type cast again
18 years ago
orbiter 8c11a543dc fixed line ending coding
18 years ago
theli b690597275 *) adding casts to avoid compatibility problems between java 1.4 and java 1.5 writer class usage
18 years ago
theli 5afb0cbce8 *) setting default charset (for unkown documents) to iso-8859-1
18 years ago
orbiter f453c14b5d removed unreacheable catch blocks and unused imports
18 years ago
theli ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class
18 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
18 years ago
theli fc594e8eda *) adding httpContentLengthInputStream.java class to allow reading of http response bodies
18 years ago
low012 cd636eb00e *) Fix for the fix...
18 years ago
low012 f9a5b55a9e *) Fixed bug described in http://www.yacy-forum.de/viewtopic.php?p=25448#25448
18 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
18 years ago
low012 8a30c5343d *) Fixed bug where exclamation marks could get lost between [=...=] and <pre>...</pre>
18 years ago
low012 d8f4b17e31 *) Hopefully fixed bug described in http://www.yacy-forum.de/viewtopic.php?t=2825.
18 years ago
theli 0e84a969d6 *) Bugfix for serverCharBuffer read from file operation
18 years ago
theli 90ef19d778 *) first version of a serverCharBuffer
18 years ago
orbiter d374ef2bbe bugfix for tryRemoveURLs
18 years ago
orbiter f644a1c3a7 better evaluation of index abstracts
18 years ago
orbiter 1b48473bc5 bugfix to utf8 recognition
18 years ago
orbiter 90f7241b59 serverByteBuffer.trim() can now recognize utf-8 characters
18 years ago
allo 2fd610b556 http://www.yacy-forum.de/viewtopic.php?p=25611#25611
18 years ago
theli e34d9b3fec *) charset aware headlines (after the serverByteBuffer.trim problem is solved)
18 years ago
theli 8115ac47b5 *) charset aware metadata parsing
18 years ago
theli 3ac30bdf22 *) some todo markers added for additional charset support
18 years ago
theli 06fa891152 *) htmlFilterContentScraper.java: using proper charset for document title
18 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser)
18 years ago
theli c5d3020941 *) better errorhandling for last commit
18 years ago
theli d0a5a53789 *) changes needed for multi-language support
18 years ago
orbiter d82875c72b removed removal of 'funny symbols' that may have caused utf-8 problems
18 years ago
orbiter 26ab1fa885 fixed null pointer exception
18 years ago
theli b0e8ff6eda *) some TODO makers for UTF-8 problem
18 years ago
orbiter 41e27b85b7 fix for crawler condition
18 years ago
orbiter 0ee7e45413 bugfix for merge method (caused by bad refactoring)
18 years ago
orbiter 5c2f30eaca adjustments to dhtInCache write
18 years ago
theli 9ecf7f0da2 *) some TODO makers for UTF-8 problem
18 years ago
theli e2f8339827 *) some bugfixes for UTF-8 related problems
18 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
18 years ago
orbiter 6e2907135a bugfixes for remote search server part
18 years ago
orbiter cf9884e22b first attempt to implement a secondary search
18 years ago
theli 2a06ce5538 *) next bugfix for UTF-8
18 years ago
theli bdc51591ae *) UTF-8 Bug solved (hopefully)
18 years ago
theli ef751b9d33 *) removing all string operations from the template engine
18 years ago
orbiter 7ef80c1026 more debugging
18 years ago
orbiter b251076e64 avoid ConcurrentModificationException
18 years ago
orbiter 75b198bc02 - updated references to indexContainer
18 years ago
orbiter 0bed3b9ac3 removed superfluous interface
18 years ago
orbiter b7e7808ea6 wordmigration now works also for new index database
18 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
18 years ago
orbiter 4f9e42d5ed more changes towards better join-search
18 years ago
orbiter a7281a9b4d fix for last commit
18 years ago
orbiter 82a6054275 - fixed bug with new indexAbstract generation
18 years ago
theli fded1f4a5d *) better handling of maximum file size limit in crawler
18 years ago
orbiter 416b4e5c6b ups
18 years ago
orbiter 309accb983 memory control for ymage generation:
18 years ago
orbiter 74d1dea30b changes towards better join-search
18 years ago
orbiter ae4e8ce03e - cut for 'probably last html-interface version': version number update
18 years ago
orbiter 64bed59ee8 enhancements to ranking
18 years ago
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
18 years ago
auron_x 06b1365066 *) fixed existing protection against divbyzero and removed the new one
18 years ago
orbiter 94d7ced900 fix for last ranking commit
18 years ago
orbiter cc97a3e9c6 fixed possibly bug with indexOutOfBoundsException
18 years ago
orbiter 03835c2ee8 enhanced search result computation
18 years ago
orbiter 809960ddc6 avoid division by zero
18 years ago
orbiter ac3419b65f better debugging for indexOutOfBoundException bug
18 years ago
orbiter 75b03a4580 fix for new ArrayIndexOutOfBoundException
18 years ago
orbiter a8bc768206 enhancements to ranking evaluation
18 years ago
auron_x a82e926c5d *) fix for wrong totalPPM-calculation
18 years ago
theli 33898ae7e9 *) ResourceInfoFactory.java: Bugfix for classNotFoundException
18 years ago
theli 406e170e25 *) more verbose error message
18 years ago
theli b298474e22 *) Bugfix needed because of changed plasmaCrawlLURL.load behavior
18 years ago
orbiter c2e6cc8c6b small part of Bosts patch
18 years ago
orbiter 96c6e4e322 - enhancements to detailed search page
18 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs
18 years ago
theli a5ed86105b *) bugfix for handling of ResourceInfo object in proxy
18 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
18 years ago
hermens 7aeadbe7cc another NullPointerException in http.ResourceInfo
18 years ago
orbiter 141f9e5bb4 fix for new plasmaCrawlLURL.load behavior
18 years ago
orbiter 1e7fd48afd added size method to ftpc
18 years ago
hermens 087f7511f8 prevent NullPointerException in http.ResourceInfo
18 years ago
orbiter a2525072f2 bugfix for kelondroRow - property generation
18 years ago
hydrox 59a5511dbb *) added missing static Strings as requested by theli
18 years ago
theli 6578564c9a *) Ignore more hop by hop http headers
18 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling
18 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file
18 years ago
theli 4ae0f122f8 *) ResourceInfo.java: License header added
18 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added
18 years ago
orbiter 4866868c0e added write cache for LURLs
18 years ago
orbiter 8a0e35618b enhancements to search result preparation
18 years ago
theli 5c1bb53d2a Missing description for last commit
18 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem
18 years ago
orbiter d4c5e2af01 html-dirlist can now also be generated from existing connections
18 years ago
theli 7930839594 *) URL.java: userinfo was not taken over when generating a new url from a base url and a rel. path
18 years ago
orbiter 17ba468165 added html dirlisting generation in ftpc.java:
18 years ago
theli 7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
18 years ago
theli ffbf416e76 *) direct access to requestheader of htCache.Entry removed to make it more http independent
18 years ago
theli 3870d615e3 *) setting htCache.Entry fields to private
18 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
18 years ago
theli ab5a9bee66 *) adding some copyright headers
18 years ago
theli 5847492537 *) next step of restructuring for new crawlers
18 years ago
orbiter 6cce47e217 test of ftp-urls in URL class
18 years ago
theli fce9e7741b *) next step of restructuring for new crawlers
18 years ago
theli e3f0136606 *) next step of restructuring for new crawlers
18 years ago
theli 9ded4e8d5a *) Bugfix for name resolution in proxy mode
18 years ago
theli 1c8300fcec *) Bugfix for name resolution in proxy mode
18 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers
18 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
18 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
18 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
18 years ago
theli b4acbdaa97 *) better handling of server shutdown
18 years ago
theli f3ac4dbbb9 *) better handling of server shutdown
18 years ago
theli 959b779aba *) avoid performance loss if log level is greater than 'fine'
18 years ago
auron_x 57dda1a92c *)again fixing for wrong version display, now totally working with double instead of float
18 years ago
auron_x 479b74e1dd *) fix for stupid mistake in new ppm-calc which caused decimal digits beeing written to seedinfo
18 years ago
auron_x 348258a557 *) changed PPM-calculation to be much more accurate
18 years ago
orbiter 18b6876860 new cache flush configuration settings
18 years ago
hermens f0278b4092 Bugfix for / by zero when the AssortmentCluster is empty
18 years ago
orbiter 14e0bb0dcf allow more references per word for new db
18 years ago
orbiter 985dcbde7f changed some parameters that may cause better memory usage and more indexing speed
18 years ago
orbiter b7f4a1521b added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL
18 years ago
orbiter c26da4893b turned back NURL usage of kelondroTree, kelondroFlexTable has still problems with deleted entries
18 years ago
orbiter db1eae0227 * simplified initialization of database objects
18 years ago
hermens 0b73f2b132 Repair DNS prefetch during cacheScan
18 years ago
orbiter 27a159b401 * documentation update
18 years ago
theli f80f776b89 *) Trying to solve NullpointerException problem in function addURLtoErrorDB
18 years ago
orbiter d78b824e85 fixed problem with default path after first start-up
19 years ago
hydrox 1c99b5a484 *)fixed logging for urldbcleanup
19 years ago
orbiter 135e019883 removed one superfluous line from last commit
19 years ago
orbiter 1591a55963 added object cache miss-cache use for remove method
19 years ago
orbiter 8f3f4ab0eb enhanced synchronisation in plasmaWordIndex
19 years ago
orbiter f933f00f09 another patch to URL protocol handling for 'news', 'nntp' etc:
19 years ago
orbiter 4c6e00d80a more bugfixes for URL class, see:
19 years ago
orbiter 23dd972608 fixed memory calculation in performanceMemory web page
19 years ago
orbiter b7dc251948 fixed bugs in url class:
19 years ago
orbiter 1ce3c22761 better memory control:
19 years ago
orbiter 39b4c26bdc more memory control:
19 years ago
orbiter 3e9d509c39 some small fixes
19 years ago
orbiter 276225d79e fix for URL class
19 years ago
orbiter eb633c0a4f server threads must now supply a method that can be called in case
19 years ago
orbiter f5720cb2fa removed most synchronization in wordIndex (for testing)
19 years ago
orbiter 0187c60010 because of a bug in the JRE 1.4.2 there was no memory protection
19 years ago
auron_x 4eca0f8830 *) fixed PPM calculation for multiple indexer-threads
19 years ago
orbiter cfb51fdef1 less synchronization in plasmaWordIndex
19 years ago
orbiter d6a928c2da quickfix for http://www.yacy-forum.de/viewtopic.php?t=2705
19 years ago
orbiter 6ad471ef96 * applied many compiler warning recommendations
19 years ago
allo cf1186597b utf fix from theli
19 years ago
hydrox 9da3aa74d3 silly me, fix for the fix as advised by theli
19 years ago
hydrox bb3d9a5582 *) e.getMessage().indexOf() can only be used if there is actually an ExceptionMessage.
19 years ago
hydrox 7a54010a9c *) Iterators can't be casted to IndexContainer
19 years ago
theli 5e0b6f8f83 *) sorting peer name list on Blacklist_p.html
19 years ago
orbiter cd5f7e137c fixed problem with NURL-generation upon first startup
19 years ago
orbiter 8418af141a added several consistency checks and small changes
19 years ago
theli 9d13aeca13 *) removing class. does not work so far
19 years ago
theli 95a84ae469 *) adding missing classes
19 years ago
theli eee44be602 *) adding an interface for customized blacklist classes
19 years ago
orbiter 6d2f15971a there is a very strange error that causes that the kelondroRecords structure
19 years ago
theli d2e8e76218 *) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler
19 years ago
orbiter 9ae9062bd3 * disabled new kelondroFlex table for NURLs
19 years ago
orbiter 689bbcf9cd replaced kelondroTree db for NURLs by new kelondroFlexTable
19 years ago
orbiter 7fbba41962 synchronization fixes
19 years ago
orbiter 328f9859a5 more synchronization in plasmaWordIndex
19 years ago
orbiter f43c90fa98 fixed handling of null referer in crawlOrder
19 years ago
orbiter 130e6d4719 generalized index object for eurl, nurl and lurl to prepare move
19 years ago
orbiter acdf24877f more synchronization against outOfMemoryError in wordIndex
19 years ago
orbiter 95160d7f2c fixed size computation of index elements from the collection index
19 years ago
orbiter 26116cabde added missing rowdef assignment
19 years ago
orbiter cfbacbbf08 reverted change in robotsParser
19 years ago
orbiter abf22f6e60 removed url normalform computation from htmlFilterContentScraper.
19 years ago
orbiter 740d49751d * strict type and size check in kelondroRow handling
19 years ago
orbiter 314021453f * more logging
19 years ago
allo a52f36787f better templatedebugging
19 years ago
allo 3480d36417 added some debug code
19 years ago
orbiter 61b151b083 * added another auto-fix for collection index inconsitency check
19 years ago
orbiter 0bbbd129ef small fix for exception message
19 years ago
orbiter 718fbc2dae enhancements in kelondroCollectionIndex:
19 years ago
orbiter f58283def2 better control of index flush
19 years ago
orbiter 4be21a3cab ups
19 years ago
orbiter 80b6c90d54 enhancements to prevent blocking during dht transfer receive
19 years ago
theli 9f298083cd *) adding more urls to the error url
19 years ago
hermens d56f06401e - Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible
19 years ago
theli c09f734d06 *) offer router configuration on ConfigBasic.html
19 years ago
hermens dcbb4d0a6b Display the size of HashBlacklistedCache on PerformanceMemory page.
19 years ago
orbiter d799622da1 better flush limit for index collections
19 years ago
orbiter d468d665c9 some changes that may help to prevent deadlocks that cause an OutOfMemoryError
19 years ago
theli d54767f634 *) last step of removing embedded html from dir class
19 years ago
orbiter 279b1d969d Integrated new indexing data structure 'collections' into the main class
19 years ago
orbiter 4ff742e42d implemented indexCollectionRI
19 years ago
orbiter 01f95eccd3 re-write of kelondroCollectionIndex. This is the data structure that
19 years ago
orbiter ebc2233092 * implemented (finished) class indexRowSetContainer
19 years ago
orbiter 9183d21f25 renamed new index class to old name
19 years ago
orbiter c4e922885a replaced indexURLEntry by new class that uses a kelondroRow.Entry object
19 years ago
orbiter 0b7112f8b2 fix for missing topLevelClone in indexRAMCacheRI.wordContainerIterator
19 years ago
orbiter e357599f92 * fixed problem with indexContainer iteration from RAM:
19 years ago
theli 57fe5cc671 *) code cleanup
19 years ago
allo 4e9f02c8ec integration of Michaels string-extraction.
19 years ago
orbiter 8b77afd72c some fixes to new container merger
19 years ago
orbiter 830167596a bugfix for
19 years ago
theli 839806a775 *) serverPortForwardingUpnp.java: code cleanup, license header added
19 years ago
theli 03230cd887 *) removing old port forwarding classes
19 years ago
theli 6e676224d0 *) adding support for upnp
19 years ago
orbiter 417ed5102e redesign of database iterators:
19 years ago
theli 0db237467f *) bugfix for URL generation from file
19 years ago
orbiter ad692fc6c7 implemented option to extract nurls from the database
19 years ago
orbiter 7fd90ca7c8 * strict handling of NURL entry element generation, storage and stacking
19 years ago
orbiter 5f72be2a95 some redesign of EURL storage
19 years ago
orbiter 1ed3e2daef added option to extract domains and/or urls from the eurl database
19 years ago
orbiter 7e0a130fb5 new indexURLEntry class 'indexURLEntryNew', to replace old class
19 years ago
orbiter 58df8b7bbf a large collection of different changes
19 years ago
orbiter e20ff77c10 another bugfix in new url class
19 years ago
orbiter 685430a1b5 bugfix in new URL class, better loggin for domain extraction
19 years ago
orbiter 79af283f6c better debugging in new URL class for wrong port numbers
19 years ago
allo 1b2ea58ee9 wrong substring invocation.
19 years ago
orbiter e4f1820b58 protection against too long authentication strings in switchboard
19 years ago
orbiter b3f7e62e03 better handling of whitespace
19 years ago
orbiter 4149939c02 better handling of whitespace for gettext quotation
19 years ago
orbiter 97fa6788a1 added gettext support:
19 years ago