Commit Graph

3751 Commits (da0f4ee5992311061555027cedd368213f1615c3)

Author SHA1 Message Date
reger 3d1d297308 refactor namespace navigator as part of navigatorplugin map, this allows
8 years ago
reger 67f660523b Make navigators underlaying indexfield name accessible in interface
8 years ago
reger 5eb3ee4e20 Add search navigator interface to allow for additional navigators (plugins)
8 years ago
reger fd3f58fcaa improve query modifier parsing of "collection:" and possible collision
8 years ago
reger af39a76bf6 Reduce number of default max. search navigator lines (from 10000)
8 years ago
reger 20a1b29ed3 add simple test case for ReferenceContainer helpful for debugging
8 years ago
reger 3c7220bc7b Refacture rwi reference word position and word distance calculation
8 years ago
luccioman f0639d810c Customized name for Threads still using the default "Thread-n" pattern.
8 years ago
luccioman db3b9db9c2 Crawl from local file : faster task end when manually terminating crawl.
8 years ago
reger 4c67ed3f8d catch rwi ranking div by zero exception
8 years ago
luccioman 47af33a04c Advanced Crawl from local file : better processing of large files.
8 years ago
luccioman ee92082a3b Updated javadocs : warning about closing stream responsibility.
8 years ago
luccioman 6f49ece22f Fixed redirected URLs processing as crawl start point.
8 years ago
reger 68217465fe div by null in word distance calculation
8 years ago
luccioman 7263d17436 Removed mentions of deprecated LURL-db.
9 years ago
reger 8b74a6bf57 fix min/max calculation of WordReferenceVars.distance()
9 years ago
luccioman da362628fb Added fine log level for too long blacklist matching processing.
9 years ago
reger aaae7c6462 adjust ConcurrentScoreMap internal value map to interface and use parameter
9 years ago
reger 31d2a5645e remove obsolete query variable
9 years ago
luccioman a588ed7628 Applied image headers customization to the new ViewFavicon servlet.
9 years ago
luccioman 7717a3d43d Fixed license headers on files created to improve favicon management.
9 years ago
luccioman 6e1959f469 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
reger 685d8e86bf Avoid frequent data type casting (float/long) for rwi score
9 years ago
luccioman 3ccd89e274 Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
9 years ago
luccioman 4b699c469a Blacklist refactoring : extracted a function for easier unit testing
9 years ago
luccioman 54cfcc3f56 CrawlCheck_p.html : also display info about disallowed URLs.
9 years ago
luccioman 8b341e9818 Robots : properly handle URLs including non ASCII characters
9 years ago
reger e68b00678e prevent negative score on URIMetadataNode - in the special case were no
9 years ago
luccioman 242707f9b4 Fixed loadFromCache with strategy IFFRESH.
9 years ago
reger b752bcfecb adjust date in text detection to ignore some program version strings
9 years ago
reger b017e97421 optimize condenser language detection a little.
9 years ago
reger ae3717d087 adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! )
9 years ago
reger 474f0476c6 adjust Tokenizer sentence count on trailing text after last recognized sentence
9 years ago
reger 3861ac9293 upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov
9 years ago
reger 681a61dafb adjust rwi index result word position handling used for rwi ranking
9 years ago
reger 14f7577231 add support for older Word versions (Word6/Word95) to docParser
9 years ago
reger 1a79c64495 generalize DateDetection with holiday date rules readily available in icu
9 years ago
reger 6f68f08354 correct DateDetection Silvester date
9 years ago
reger 32a2e3a22a have RSSFeed.getChannel return empty message on missing channel element,
9 years ago
luccioman 8d57b5b970 Added some javadocs.
9 years ago
luccioman 60df09fff9 Fixed some HTML validation errors : Illegal character in query
9 years ago
reger 862f28eaa6 display number of documents/rss-items for label "docs" in load_rss_p servlet
9 years ago
luccioman dcdea2d02f Fixed shutdown for crawler.MaxActiveThreads value greater than 200
9 years ago
luccioman d286ba2c3e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
luccioman b8f6458152 Prevent yacy main thread from hanging on browser opening process.
9 years ago
reger 70e1eb30a5 prevent StringIndexOutOfBounds in getLocalFile()
9 years ago
luccioman 1bb0b135ac Avoid duplication of various MS Windows file URLs flavors
9 years ago
luccioman b9a8476f02 Removed unused import
9 years ago
reger e73c1eea8c remove unused rootpattern, leftover from commit
9 years ago
reger 6f8c3ccea4 improve url hash computation for file path with mixed java & windows
9 years ago