Commit Graph

899 Commits (b062847797e89c9b233b440d016c7931f33c9a5d)

Author SHA1 Message Date
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories
19 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
19 years ago
orbiter 25ae3d3161 generalized definition of hexhash
19 years ago
orbiter f0d747c723 removed deprecated method
19 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method
19 years ago
orbiter 0f10bdde22 more generic cache methods
19 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout
19 years ago
hermens 440c6ee657 Implement alternative htcache layout
19 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot.
19 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
19 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
19 years ago
orbiter db294687ea enhanced logging
19 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
19 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
19 years ago
theli 1586d57187 *) odtParser: better handling of large files
19 years ago
theli f17ce28b6d *) plasmaHTCache:
19 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
19 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
19 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
19 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
19 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
19 years ago
theli cd5f349666 *) Better handling of large files during parsing
19 years ago
low012 f8ac694e51 *) fixed a bug where searchword in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998)
19 years ago
orbiter df1629b05a - code cleanup
19 years ago
theli b73efd5565 *) missing changes needed because of last commit
19 years ago
orbiter 2463e5624a 'quick' release 0.47
19 years ago
theli 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache
19 years ago
theli 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1
19 years ago
hermens 3f5a4153a0 Make Peers more receptible to transferred indexes
19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
19 years ago
theli 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread
19 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
19 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui
19 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
19 years ago
theli 5c6251bced *) some improvements for extended html document charset support
19 years ago
orbiter f453c14b5d removed unreacheable catch blocks and unused imports
19 years ago
theli ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class
19 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
19 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
19 years ago
orbiter f644a1c3a7 better evaluation of index abstracts
19 years ago
allo 2fd610b556 http://www.yacy-forum.de/viewtopic.php?p=25611#25611
19 years ago
theli 06fa891152 *) htmlFilterContentScraper.java: using proper charset for document title
19 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser)
19 years ago
theli c5d3020941 *) better errorhandling for last commit
19 years ago
theli d0a5a53789 *) changes needed for multi-language support
19 years ago
orbiter 26ab1fa885 fixed null pointer exception
19 years ago
theli b0e8ff6eda *) some TODO makers for UTF-8 problem
19 years ago
orbiter 41e27b85b7 fix for crawler condition
19 years ago
theli 9ecf7f0da2 *) some TODO makers for UTF-8 problem
19 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
19 years ago