2795 Commits (504327b15c142a88b12016fb8ee75144b822a1f3)

Author SHA1 Message Date
Michael Peter Christen 504327b15c fix for condition for writing the webgraph 11 years ago
Michael Peter Christen 542c20a597 changed handling of crawl profile field crawlingIfOlder: this should be 11 years ago
Michael Peter Christen 4eec1a7452 refactoring (change Metadata name of load time data structure to avoid 11 years ago
reger c95ba52cf0 improve logexception info 11 years ago
orbiter e441831a24 reverted toString() change in AnchorURL to prevent mistakenly used 11 years ago
reger 47f201a6b8 Add Solr default query fields (&qf) to select servlet 11 years ago
reger f96cfdc84d prevent array out of bound exception on getRankingProfile(x) 11 years ago
reger 5f5fb4ecdc remove unused static (RSS)search from protocol 11 years ago
reger 7c1706d83a use CRLF in generated bat command scripts for windows 11 years ago
reger a2cb366b25 Combine /heuristic search modifier with opensearch configured targets 11 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow" 11 years ago
Michael Peter Christen bf1b6b93e7 do not write CR values to webgraph if no CR values are computed 11 years ago
Michael Peter Christen e039e78210 small bugfixes 11 years ago
Michael Peter Christen 32a2ff925c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 11 years ago
Michael Peter Christen d07cdd8c3b added SolrCloud access mode and configuration 11 years ago
Michael Peter Christen 8514bffc22 enhanced postprocessing status report 11 years ago
reger b24572f304 fix GSA filter query assignment 11 years ago
Michael Peter Christen b5fc2b63ea removed exist() retrieval functions from error cache and replaced it 11 years ago
Michael Peter Christen 62c72360ee cleanup of checkAcceptanceInitially in CrawlStacker, should avoid 11 years ago
Michael Peter Christen dd5cdfe212 reverted filter query hack, it did not work 11 years ago
Michael Peter Christen b5d78ba156 reduced number of solr queries during crawling 11 years ago
Michael Peter Christen 5326970d6c enhanced solr queries for single document extraction 11 years ago
Michael Peter Christen 525575bd97 added debugging of filter queries in thread dump thread names 11 years ago
Michael Peter Christen f319ef268f testing filter queries instead of queries to retrieve documents by id 11 years ago
Michael Peter Christen fd87fa1613 removed more unnecessary exist-checks in ErrorCache 11 years ago
Michael Peter Christen f2b476e08b don't do a double check to solr for failed documents if they are not 11 years ago
Michael Peter Christen 06ab72d1af enhanced crawler host round-robin strategy 11 years ago
orbiter dab9a0786a Merge branch 'master' of git@gitorious.org:yacy/rc1.git 11 years ago
orbiter 51bf5c85b0 Renamed the transmission cloud to buffer in dispatcher since the name 11 years ago
Michael Peter Christen a694b6a8fc another fix for unique field computation 11 years ago
Michael Peter Christen fb3dd56b02 fix for processing of noindex flag in http header 11 years ago
Michael Peter Christen b0d941626f fixed bugs in canonical, robots and title/description unique calculation 11 years ago
reger d9472d043a cleanup older unused classes 11 years ago
reger 665e12f88e move startup time from old serverCore to switchboard (most used here) 11 years ago
reger 336425912a remove unused localSearchThread from SearchEvent 11 years ago
reger 32bd2a61c1 add local ip to AbstractRemoteHandler local hostname cache 11 years ago
Michael Peter Christen f3a6b6e21e fix for bad URL decoding 11 years ago
Michael Peter Christen 1092e798a5 fixed double content postprocessing 11 years ago
Michael Peter Christen aee5b108e5 added linkScraperParser, a parser which ignores the text like the 11 years ago
reger 2b8cc5832c fix seek error for 0 file size records file 11 years ago
reger 2ba394333f fix Crawler HostQueue release of stackfile 11 years ago
reger 40133ba2d0 fix NPE in Condenser, 11 years ago
orbiter 59160984cc timeline performance update 11 years ago
orbiter 54bea96e67 Merge branch 'master' of git@gitorious.org:yacy/rc1.git 11 years ago
Michael Peter Christen 841cc77391 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 11 years ago
Michael Peter Christen e09218129c remove check for local solr. This check was made during a time when Solr 11 years ago
orbiter 2073e69034 fix for long periods in timeline 11 years ago
reger 1f94df29e7 fix NPE in solr rss where snippet contains only the title text 11 years ago
Michael Peter Christen 09dcdb9b19 update to solr 4.9.0 11 years ago
Michael Peter Christen 1cd4b2e8be Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 11 years ago