Commit Graph

1053 Commits (1f9389396a87bd8d328b4e6917651ba8bb279b61)

Author SHA1 Message Date
Michael Peter Christen 77662e08e1 concurrently initialize the error cache; extended also the cache by
11 years ago
Michael Peter Christen 07c5b57953 removed warnings
11 years ago
Michael Peter Christen 2e09da9832 npe fix
11 years ago
Michael Peter Christen d80418f1b1 added partial updates to solr during postprocessing: during
11 years ago
Michael Peter Christen b1cfbc4a04 added new solr field url_paths_count_i which can be used to enhance the
11 years ago
Michael Peter Christen 30d4402cd1 fixed location search
11 years ago
Michael Peter Christen 8c1a89cb34 added another decoration flag to switch off network graphics in crawler
11 years ago
Michael Peter Christen 5082feb103 less volume for effect sounds
11 years ago
Michael Peter Christen 0bfc69b29b more ipv6 bugfixes
11 years ago
Michael Peter Christen 883622306e Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 0843b12ef3 ipv6 fix: avoid that shrinked own ip set is overwritten with (non-valid)
11 years ago
orbiter cddf884bc4 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
Michael Peter Christen 74957f3760 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 2a052f446a Added an experimental audio feedback system.
11 years ago
Marc Nause 1e6e69bc40 Finished implementation of UPNP:
11 years ago
orbiter f3a12801f0 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter d93325a578 lazy handling of process_sxt field (part of postprocessing)
11 years ago
reger b5ca20de15 preserve content_type (mime) if supplied in preference of construct in from file type.
11 years ago
reger fb1fcc2b03 handle noarchive tag, skip writing page to cache
11 years ago
Michael Peter Christen 3073c69aee Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 6491270b3a large IPv6 redesign of peer ping methods!
11 years ago
reger 8b1ce49ee6 remove unused variable timeout
11 years ago
orbiter a922b122a3 added a hack to forward solr search results from an external attached
11 years ago
Michael Peter Christen 2645dc816a added warning for not well-formed postprocessing queries
11 years ago
Michael Peter Christen 6d3d4c4ea6 changed the concurrent enumeration of query results in such a way that
11 years ago
Michael Peter Christen ad35d9294f added a 'stats' table which records some peer statistics twice every
11 years ago
reger 8284ea751a catch TimeoutException during ping and do not delete yacy.conf during prereadconfigfile
11 years ago
reger ffa7c7116f better fix for NPE in image search
11 years ago
Michael Peter Christen f1032fb8fe more enhancements to image search in case that a restriction to a single
11 years ago
Michael Peter Christen 475125f9d7 hack to get more results when doing a remote site search
11 years ago
Michael Peter Christen 81f9b34da7 increaesed ability ot search for all images on a single server within
11 years ago
reger b5e0f70197 - remove repositoryPath post from ConfigBasic (obsolete)
11 years ago
reger 8931e14514 fix NPE in image search
11 years ago
Michael Peter Christen 1735dbc9d9 enhanced image search: bugfixes and performance enhancements
11 years ago
Michael Peter Christen ebd0be2cea fixes and speed updates for search process
11 years ago
Michael Peter Christen 7611bf79bd Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
11 years ago
Michael Peter Christen 524bedc00a fixed text in startup tray icon and added shutdown icon during shutdown
11 years ago
Michael Peter Christen e87dc08c0d set the correct fail time in error docs
11 years ago
Michael Peter Christen a7dd89c4de changed method to write the citation index: do not catch up references
11 years ago
orbiter f318d7c285 enhanced date-ordered ranking
11 years ago
reger a6891ff7f8 fix Querygoal.parse exception on +/-null-term
11 years ago
orbiter a65df4ce7e do not push noindex errors into log if in intranet mode. noindex
11 years ago
Marc Nause 2af56fa37d Improved UPnP. (still not perfect)
11 years ago
orbiter d68438c3d9 make sure that the postprocessing background thread never dies by any
11 years ago
reger e88537522d allow single quote " ' " in query
11 years ago
orbiter 487021fb0a snippet computation update
11 years ago
orbiter 927aaa95a6 concurrency bugfix
11 years ago
reger 7584352e7b use more predefined Solr query parameter constants
11 years ago
reger f9db5dd6c5 reduce doublecontent check document (prevent out of memory)
11 years ago
reger a8508417d1 catch NPE during crawl (OAI import)
11 years ago
Michael Peter Christen 6344718f8b reducing the concurrent query stack size and reduced concurrency of
11 years ago
Michael Peter Christen c465b791af typo
11 years ago
Michael Peter Christen 191ec8c82a added concurrency to postprocess rewrite process
11 years ago
Michael Peter Christen a1e8bdd5e9 log ppm instead of docs/second
11 years ago
Michael Peter Christen cc0ded7abd set process type of web graph according to fields as defined in the
11 years ago
Michael Peter Christen 12fb9d7cd1 log postprocessing constraints in case that postprocessing is not
11 years ago
Michael Peter Christen 338f574bdc no sorting if http/www unique fields are not demanded (makes query
11 years ago
Michael Peter Christen 0ceeceb35e more logic on Solr queries; usage of the query terms in posprocessing,
11 years ago
orbiter 4099296b45 added new classes which shall reduce call overhead to Solr (stub)
11 years ago
orbiter 3491ab4c38 removed unused images from webgraph edge computation
11 years ago
orbiter 2371d6b8db target linktexts must be string to enable search facets on these fields
11 years ago
Michael Peter Christen 001e05bb80 do not store failure of loading of robots.txt into the index as a fail
11 years ago
Michael Peter Christen 05d58e4df0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 98f45c9032 fix for image alt attachment to AnchorURLs in html parser.
11 years ago
orbiter 22ce4fb4dd better error handling for remote solr queries and exists-checks
11 years ago
orbiter 738989aab7 reverted commit f94c91315b because the
11 years ago
Michael Peter Christen c115f3869c enhanced snippet computation and test method in ViewFile
11 years ago
orbiter 1027f3d04a fix for the usage of ready-prepared solr queries, some queries are
11 years ago
Michael Peter Christen f94c91315b if the webgraph is used, then use it also for reference computation to
11 years ago
Michael Peter Christen 6e1dc444c3 added a snippet test function in ViewFile: you can now search for a
11 years ago
Michael Peter Christen b44626e55b fixed target_alt_t in webgraph
11 years ago
Michael Peter Christen 504327b15c fix for condition for writing the webgraph
11 years ago
Michael Peter Christen 542c20a597 changed handling of crawl profile field crawlingIfOlder: this should be
11 years ago
Michael Peter Christen 4eec1a7452 refactoring (change Metadata name of load time data structure to avoid
11 years ago
reger f96cfdc84d prevent array out of bound exception on getRankingProfile(x)
11 years ago
reger a2cb366b25 Combine /heuristic search modifier with opensearch configured targets
11 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
11 years ago
Michael Peter Christen bf1b6b93e7 do not write CR values to webgraph if no CR values are computed
11 years ago
Michael Peter Christen d07cdd8c3b added SolrCloud access mode and configuration
11 years ago
Michael Peter Christen 8514bffc22 enhanced postprocessing status report
11 years ago
Michael Peter Christen b5fc2b63ea removed exist() retrieval functions from error cache and replaced it
11 years ago
Michael Peter Christen 62c72360ee cleanup of checkAcceptanceInitially in CrawlStacker, should avoid
11 years ago
Michael Peter Christen b5d78ba156 reduced number of solr queries during crawling
11 years ago
Michael Peter Christen fd87fa1613 removed more unnecessary exist-checks in ErrorCache
11 years ago
Michael Peter Christen f2b476e08b don't do a double check to solr for failed documents if they are not
11 years ago
orbiter dab9a0786a Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter 51bf5c85b0 Renamed the transmission cloud to buffer in dispatcher since the name
11 years ago
Michael Peter Christen fb3dd56b02 fix for processing of noindex flag in http header
11 years ago
Michael Peter Christen b0d941626f fixed bugs in canonical, robots and title/description unique calculation
11 years ago
reger d9472d043a cleanup older unused classes
11 years ago
reger 665e12f88e move startup time from old serverCore to switchboard (most used here)
11 years ago
reger 336425912a remove unused localSearchThread from SearchEvent
11 years ago
Michael Peter Christen 1092e798a5 fixed double content postprocessing
11 years ago
orbiter 59160984cc timeline performance update
11 years ago
orbiter 2073e69034 fix for long periods in timeline
11 years ago
Michael Peter Christen 09dcdb9b19 update to solr 4.9.0
11 years ago
Michael Peter Christen 1cd4b2e8be Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 8c52f0651b refactoring of AccessTracker events & timeline fix
11 years ago
reger 431a5f9c4e added test case for TextSnippet,
11 years ago
Michael Peter Christen 5b94a257ce no timeout for large reference collections
11 years ago