Commit Graph

1718 Commits (0f10bdde22dc256b09fe1eb10ae967a3a021a3c6)

Author SHA1 Message Date
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
19 years ago
borg-0300 af1d89e381 check url == null added;
19 years ago
theli cc667b0aa5 *) htmlFilterContentScraper.java: adding support for link tag
19 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui
19 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
19 years ago
borg-0300 f18304ddd3 unused/not needed imports removes;
19 years ago
orbiter ec031eb993 first version of surftipps
19 years ago
borg-0300 b174fbd0ca "import ...*" removed;
19 years ago
orbiter 807756150e patch for strange bug reported by email
19 years ago
theli 5c6251bced *) some improvements for extended html document charset support
19 years ago
theli 33f0f703c0 *) reinserting type cast again
19 years ago
orbiter 8c11a543dc fixed line ending coding
19 years ago
theli b690597275 *) adding casts to avoid compatibility problems between java 1.4 and java 1.5 writer class usage
19 years ago
theli 5afb0cbce8 *) setting default charset (for unkown documents) to iso-8859-1
19 years ago
orbiter f453c14b5d removed unreacheable catch blocks and unused imports
19 years ago
theli ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class
19 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
19 years ago
theli fc594e8eda *) adding httpContentLengthInputStream.java class to allow reading of http response bodies
19 years ago
low012 cd636eb00e *) Fix for the fix...
19 years ago
low012 f9a5b55a9e *) Fixed bug described in http://www.yacy-forum.de/viewtopic.php?p=25448#25448
19 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
19 years ago
low012 8a30c5343d *) Fixed bug where exclamation marks could get lost between [=...=] and <pre>...</pre>
19 years ago
low012 d8f4b17e31 *) Hopefully fixed bug described in http://www.yacy-forum.de/viewtopic.php?t=2825.
19 years ago
theli 0e84a969d6 *) Bugfix for serverCharBuffer read from file operation
19 years ago
theli 90ef19d778 *) first version of a serverCharBuffer
19 years ago
orbiter d374ef2bbe bugfix for tryRemoveURLs
19 years ago
orbiter f644a1c3a7 better evaluation of index abstracts
19 years ago
orbiter 1b48473bc5 bugfix to utf8 recognition
19 years ago
orbiter 90f7241b59 serverByteBuffer.trim() can now recognize utf-8 characters
19 years ago
allo 2fd610b556 http://www.yacy-forum.de/viewtopic.php?p=25611#25611
19 years ago
theli e34d9b3fec *) charset aware headlines (after the serverByteBuffer.trim problem is solved)
19 years ago
theli 8115ac47b5 *) charset aware metadata parsing
19 years ago
theli 3ac30bdf22 *) some todo markers added for additional charset support
19 years ago
theli 06fa891152 *) htmlFilterContentScraper.java: using proper charset for document title
19 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser)
19 years ago
theli c5d3020941 *) better errorhandling for last commit
19 years ago
theli d0a5a53789 *) changes needed for multi-language support
19 years ago
orbiter d82875c72b removed removal of 'funny symbols' that may have caused utf-8 problems
19 years ago
orbiter 26ab1fa885 fixed null pointer exception
19 years ago
theli b0e8ff6eda *) some TODO makers for UTF-8 problem
19 years ago
orbiter 41e27b85b7 fix for crawler condition
19 years ago
orbiter 0ee7e45413 bugfix for merge method (caused by bad refactoring)
19 years ago
orbiter 5c2f30eaca adjustments to dhtInCache write
19 years ago
theli 9ecf7f0da2 *) some TODO makers for UTF-8 problem
19 years ago
theli e2f8339827 *) some bugfixes for UTF-8 related problems
19 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
19 years ago
orbiter 6e2907135a bugfixes for remote search server part
19 years ago
orbiter cf9884e22b first attempt to implement a secondary search
19 years ago
theli 2a06ce5538 *) next bugfix for UTF-8
19 years ago
theli bdc51591ae *) UTF-8 Bug solved (hopefully)
19 years ago
theli ef751b9d33 *) removing all string operations from the template engine
19 years ago
orbiter 7ef80c1026 more debugging
19 years ago
orbiter b251076e64 avoid ConcurrentModificationException
19 years ago
orbiter 75b198bc02 - updated references to indexContainer
19 years ago
orbiter 0bed3b9ac3 removed superfluous interface
19 years ago
orbiter b7e7808ea6 wordmigration now works also for new index database
19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
19 years ago
orbiter 4f9e42d5ed more changes towards better join-search
19 years ago
orbiter a7281a9b4d fix for last commit
19 years ago
orbiter 82a6054275 - fixed bug with new indexAbstract generation
19 years ago
theli fded1f4a5d *) better handling of maximum file size limit in crawler
19 years ago
orbiter 416b4e5c6b ups
19 years ago
orbiter 309accb983 memory control for ymage generation:
19 years ago
orbiter 74d1dea30b changes towards better join-search
19 years ago
orbiter ae4e8ce03e - cut for 'probably last html-interface version': version number update
19 years ago
orbiter 64bed59ee8 enhancements to ranking
19 years ago
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
19 years ago
auron_x 06b1365066 *) fixed existing protection against divbyzero and removed the new one
19 years ago
orbiter 94d7ced900 fix for last ranking commit
19 years ago
orbiter cc97a3e9c6 fixed possibly bug with indexOutOfBoundsException
19 years ago
orbiter 03835c2ee8 enhanced search result computation
19 years ago
orbiter 809960ddc6 avoid division by zero
19 years ago
orbiter ac3419b65f better debugging for indexOutOfBoundException bug
19 years ago
orbiter 75b03a4580 fix for new ArrayIndexOutOfBoundException
19 years ago
orbiter a8bc768206 enhancements to ranking evaluation
19 years ago
auron_x a82e926c5d *) fix for wrong totalPPM-calculation
19 years ago
theli 33898ae7e9 *) ResourceInfoFactory.java: Bugfix for classNotFoundException
19 years ago
theli 406e170e25 *) more verbose error message
19 years ago
theli b298474e22 *) Bugfix needed because of changed plasmaCrawlLURL.load behavior
19 years ago
orbiter c2e6cc8c6b small part of Bosts patch
19 years ago
orbiter 96c6e4e322 - enhancements to detailed search page
19 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs
19 years ago
theli a5ed86105b *) bugfix for handling of ResourceInfo object in proxy
19 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
19 years ago
hermens 7aeadbe7cc another NullPointerException in http.ResourceInfo
19 years ago
orbiter 141f9e5bb4 fix for new plasmaCrawlLURL.load behavior
19 years ago
orbiter 1e7fd48afd added size method to ftpc
19 years ago
hermens 087f7511f8 prevent NullPointerException in http.ResourceInfo
19 years ago
orbiter a2525072f2 bugfix for kelondroRow - property generation
19 years ago
hydrox 59a5511dbb *) added missing static Strings as requested by theli
19 years ago
theli 6578564c9a *) Ignore more hop by hop http headers
19 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling
19 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file
19 years ago
theli 4ae0f122f8 *) ResourceInfo.java: License header added
19 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added
19 years ago
orbiter 4866868c0e added write cache for LURLs
19 years ago
orbiter 8a0e35618b enhancements to search result preparation
19 years ago
theli 5c1bb53d2a Missing description for last commit
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem
19 years ago