Commit Graph

11215 Commits (3073c69aee02460ec622a90f196277014a0d6a65)
 

Author SHA1 Message Date
reger 7328c2883b fix type in .init description
10 years ago
reger 94819f0797 set .ini default boost fields to same as assigned by button "reset to default"
10 years ago
reger b4b937a046 update to pdfbox 1.8.6
10 years ago
orbiter 1027f3d04a fix for the usage of ready-prepared solr queries, some queries are
10 years ago
Michael Peter Christen f94c91315b if the webgraph is used, then use it also for reference computation to
10 years ago
Michael Peter Christen 6e1dc444c3 added a snippet test function in ViewFile: you can now search for a
10 years ago
Michael Peter Christen c63e93df46 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 1bf605b6d1 toString() fix
10 years ago
orbiter 4b06adb751 fix for file urls
10 years ago
orbiter 08409ec680 no idea why the words max was an ordered one. This change increaes speed
10 years ago
reger dd311ddac9 Merge origin/master
10 years ago
reger e5854a5cdb fix localhost link to opensearchdescription.xml
10 years ago
reger 29d1945c16 fix double &query parameter (index.html)
10 years ago
Marc Nause 172d7e68da Updated commandline reconfiguration tool.
10 years ago
Michael Peter Christen b44626e55b fixed target_alt_t in webgraph
10 years ago
Michael Peter Christen 504327b15c fix for condition for writing the webgraph
10 years ago
Michael Peter Christen 542c20a597 changed handling of crawl profile field crawlingIfOlder: this should be
10 years ago
Michael Peter Christen 4eec1a7452 refactoring (change Metadata name of load time data structure to avoid
10 years ago
reger c95ba52cf0 improve logexception info
10 years ago
reger 7f0e757bb5 fix bookmark.rss
10 years ago
orbiter e441831a24 reverted toString() change in AnchorURL to prevent mistakenly used
10 years ago
reger 697b9743e7 Add link to RemoteCrawl_p
10 years ago
reger 47f201a6b8 Add Solr default query fields (&qf) to select servlet
10 years ago
reger f96cfdc84d prevent array out of bound exception on getRankingProfile(x)
10 years ago
Michael Peter Christen 970368359b Merge branch 'master' of ssh://gitorious.org/yacy/rc1
10 years ago
Michael Peter Christen c4608469bf Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
10 years ago
reger 8004cfc961 fix input boostfield factor of 0.0 in RankingSolr
10 years ago
reger 5f5fb4ecdc remove unused static (RSS)search from protocol
10 years ago
reger 7c1706d83a use CRLF in generated bat command scripts for windows
10 years ago
reger a2cb366b25 Combine /heuristic search modifier with opensearch configured targets
10 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
10 years ago
Michael Peter Christen bf1b6b93e7 do not write CR values to webgraph if no CR values are computed
10 years ago
Michael Peter Christen e039e78210 small bugfixes
10 years ago
Michael Peter Christen 87f8118108 added option to delete documents from the webgraph
10 years ago
Michael Peter Christen 32a2ff925c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen d07cdd8c3b added SolrCloud access mode and configuration
10 years ago
Michael Peter Christen 8514bffc22 enhanced postprocessing status report
10 years ago
malykhin.dmitry 53ecd54b45 Update russian translation
10 years ago
reger f99f3d5cf2 fix button (clear list) text color in CrawlResults
10 years ago
reger b24572f304 fix GSA filter query assignment
10 years ago
Michael Peter Christen b5fc2b63ea removed exist() retrieval functions from error cache and replaced it
11 years ago
Michael Peter Christen 62c72360ee cleanup of checkAcceptanceInitially in CrawlStacker, should avoid
11 years ago
Michael Peter Christen dd5cdfe212 reverted filter query hack, it did not work
11 years ago
Michael Peter Christen b5d78ba156 reduced number of solr queries during crawling
11 years ago
Michael Peter Christen 5326970d6c enhanced solr queries for single document extraction
11 years ago
Michael Peter Christen 525575bd97 added debugging of filter queries in thread dump thread names
11 years ago
Michael Peter Christen f319ef268f testing filter queries instead of queries to retrieve documents by id
11 years ago
Michael Peter Christen fd87fa1613 removed more unnecessary exist-checks in ErrorCache
11 years ago
Michael Peter Christen f2b476e08b don't do a double check to solr for failed documents if they are not
11 years ago
Michael Peter Christen 06ab72d1af enhanced crawler host round-robin strategy
11 years ago