Commit Graph

918 Commits (8d1b66accca613e0b79f075cda336c95be6bcf57)

Author SHA1 Message Date
reger 5a4995ded3 fill solr rss writer dc:subject tag with keyword content
11 years ago
orbiter c9e593cf78 removed warnings
11 years ago
reger 7584352e7b use more predefined Solr query parameter constants
11 years ago
reger e9eae45b55 simplify rssreader and improve atom feed link extraction
11 years ago
reger a8508417d1 catch NPE during crawl (OAI import)
11 years ago
Michael Peter Christen 3860711aef fix for possible interruption of concurrent queries
11 years ago
Michael Peter Christen 6344718f8b reducing the concurrent query stack size and reduced concurrency of
11 years ago
Michael Peter Christen a0c53174c5 better solr query logging to detect unnecessary sort requests for more
11 years ago
Michael Peter Christen b983e68254 more retries, less sleep
11 years ago
Michael Peter Christen 0ceeceb35e more logic on Solr queries; usage of the query terms in postprocessing,
11 years ago
orbiter 38864ae004 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter 4099296b45 added new classes which shall reduce call overhead to Solr (stub)
11 years ago
reger d0c02e1de7 adjust rss lat/lon to double
11 years ago
Michael Peter Christen 05d58e4df0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 98f45c9032 fix for image alt attachment to AnchorURLs in html parser.
11 years ago
orbiter 22ce4fb4dd better error handling for remote solr queries and exists-checks
11 years ago
reger 6c10b59f3e move bootstrap peers test systems to its test class
11 years ago
orbiter 1027f3d04a fix for the usage of ready-prepared solr queries, some queries are
11 years ago
orbiter 4b06adb751 fix for file urls
11 years ago
reger e5854a5cdb fix localhost link to opensearchdescription.xml
11 years ago
Michael Peter Christen b44626e55b fixed target_alt_t in webgraph
11 years ago
Michael Peter Christen 4eec1a7452 refactoring (change Metadata name of load time data structure to avoid
11 years ago
reger c95ba52cf0 improve logexception info
11 years ago
orbiter e441831a24 reverted toString() change in AnchorURL to prevent mistakenly used
11 years ago
reger a2cb366b25 Combine /heuristic search modifier with opensearch configured targets
11 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
11 years ago
Michael Peter Christen e039e78210 small bugfixes
11 years ago
Michael Peter Christen 32a2ff925c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen d07cdd8c3b added SolrCloud access mode and configuration
11 years ago
Michael Peter Christen 8514bffc22 enhanced postprocessing status report
11 years ago
reger b24572f304 fix GSA filter query assignment
11 years ago
Michael Peter Christen b5fc2b63ea removed exist() retrieval functions from error cache and replaced it
11 years ago
Michael Peter Christen dd5cdfe212 reverted filter query hack, it did not work
11 years ago
Michael Peter Christen 5326970d6c enhanced solr queries for single document extraction
11 years ago
Michael Peter Christen 525575bd97 added debugging of filter queries in thread dump thread names
11 years ago
Michael Peter Christen f319ef268f testing filter queries instead of queries to retrieve documents by id
11 years ago
Michael Peter Christen fd87fa1613 removed more unnecessary exist-checks in ErrorCache
11 years ago
Michael Peter Christen a694b6a8fc another fix for unique field computation
11 years ago
Michael Peter Christen fb3dd56b02 fix for processing of noindex flag in http header
11 years ago
Michael Peter Christen b0d941626f fixed bugs in canonical, robots and title/description unique calculation
11 years ago
Michael Peter Christen f3a6b6e21e fix for bad URL decoding
11 years ago
Michael Peter Christen 1092e798a5 fixed double content postprocessing
11 years ago
reger 1f94df29e7 fix NPE in solr rss where snippet contains only the title text
11 years ago
Michael Peter Christen 36e623d8bf enhanced metadata enrichment for media file type search:
11 years ago
Michael Peter Christen 49886fab08 enhanced debugging
11 years ago
Michael Peter Christen b893c42a0f bugfix for image search
11 years ago
reger 90c4576361 add a link to recrawl index entry to metadata html page
11 years ago
Michael Peter Christen e132689818 fixed and enhanced Base64 (en)coder (again)
11 years ago
Michael Peter Christen 2415e3db43 enhanced ASCII byte[] -> String conversion
11 years ago
Michael Peter Christen 4751ed974f enhanced base64 encoding
11 years ago