Commit Graph

3730 Commits (6e1959f469dec4deabe4093c22f2f4dcfb2ff426)

Author SHA1 Message Date
luccioman 6e1959f469 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
reger 685d8e86bf Avoid frequent data type casting (float/long) for rwi score
9 years ago
luccioman 3ccd89e274 Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
9 years ago
luccioman 4b699c469a Blacklist refactoring : extracted a function for easier unit testing
9 years ago
luccioman 54cfcc3f56 CrawlCheck_p.html : also display info about disallowed URLs.
9 years ago
luccioman 8b341e9818 Robots : properly handle URLs including non ASCII characters
9 years ago
reger e68b00678e prevent negative score on URIMetadataNode - in the special case were no
9 years ago
luccioman 242707f9b4 Fixed loadFromCache with strategy IFFRESH.
9 years ago
reger b752bcfecb adjust date in text detection to ignore some program version strings
9 years ago
reger b017e97421 optimize condenser language detection a little.
9 years ago
reger ae3717d087 adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! )
9 years ago
reger 474f0476c6 adjust Tokenizer sentence count on trailing text after last recognized sentence
9 years ago
reger 3861ac9293 upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov
9 years ago
reger 681a61dafb adjust rwi index result word position handling used for rwi ranking
9 years ago
reger 14f7577231 add support for older Word versions (Word6/Word95) to docParser
9 years ago
reger 1a79c64495 generalize DateDetection with holiday date rules readily available in icu
9 years ago
reger 6f68f08354 correct DateDetection Silvester date
9 years ago
reger 32a2e3a22a have RSSFeed.getChannel return empty message on missing channel element,
9 years ago
luccioman 8d57b5b970 Added some javadocs.
9 years ago
luccioman 60df09fff9 Fixed some HTML validation errors : Illegal character in query
9 years ago
reger 862f28eaa6 display number of documents/rss-items for label "docs" in load_rss_p servlet
9 years ago
luccioman dcdea2d02f Fixed shutdown for crawler.MaxActiveThreads value greater than 200
9 years ago
luccioman d286ba2c3e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
luccioman b8f6458152 Prevent yacy main thread from hanging on browser opening process.
9 years ago
reger 70e1eb30a5 prevent StringIndexOutOfBounds in getLocalFile()
9 years ago
luccioman 1bb0b135ac Avoid duplication of various MS Windows file URLs flavors
9 years ago
luccioman b9a8476f02 Removed unused import
9 years ago
reger e73c1eea8c remove unused rootpattern, leftover from commit
9 years ago
reger 6f8c3ccea4 improve url hash computation for file path with mixed java & windows
9 years ago
reger efcb6a1e74 fix supported mime XML -> xml for rssParser (mime normalized to lower case for comparison)
9 years ago
luccioman b3b75b0498 Accessibility : add a customizable alternative text to YaCy log
9 years ago
luccioman f2bc1b268d Updated URL fragment validation rules according to current standards
9 years ago
luccioman b1b8e69da8 Fixed NullPointerException cases
9 years ago
luccioman 3ee4f56c39 Improved ErrorCache behavior when switching networks
9 years ago
luccioman 7d5ba2afa4 Added some JavaDoc and moved crawlStacker close at the right place.
9 years ago
luccioman 8edbcd8ad4 Log eventual Solr instances close errors.
9 years ago
reger 330768c8a2 fix for solr write.lock after mode change http://mantis.tokeek.de/view.php?id=686
9 years ago
reger 585d2a6441 test case: for NewsPool to check the id modificator (for unique id)
9 years ago
luccioman de5c873e38 Removed unused JavaScript file docs.min.js
9 years ago
Michael Peter Christen df51e4ef07 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen e063aaf97f enable fuzzy search, solr style (append a ~ to get a fuzzyness on the
9 years ago
reger ff6589fc0f test case: simulating multi word query for local rwi index
9 years ago
reger e990297d2e avoid NPE on hello message with missing "yourip" key
9 years ago
reger e51ab8c7aa hack to generate a unique message-id for messages created in the same second
9 years ago
Michael Peter Christen b82300358a removed version number check because it does not work any more if
9 years ago
Michael Peter Christen 2107674999 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
Michael Peter Christen 0d28f563f4 fix for java version "9-ea"
9 years ago
reger 3b694b3935 add some javadoc to rwi wordreference distance, position
9 years ago
reger a4465c97d6 as requested, disable/remove old swf parser
9 years ago
reger 7f63fc50f3 prepare a IndexSegment test case for RWI index testing
9 years ago