Commit Graph

1020 Commits (a67a46541586a7bf829f9445759d6748b8cfc54f)

Author SHA1 Message Date
Michael Peter Christen 1735dbc9d9 enhanced image search: bugfixes and performance enhancements
10 years ago
Michael Peter Christen ebd0be2cea fixes and speed updates for search process
10 years ago
Michael Peter Christen 7611bf79bd Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
10 years ago
Michael Peter Christen 524bedc00a fixed text in startup tray icon and added shutdown icon during shutdown
10 years ago
Michael Peter Christen e87dc08c0d set the correct fail time in error docs
10 years ago
Michael Peter Christen a7dd89c4de changed method to write the citation index: do not catch up references
10 years ago
orbiter f318d7c285 enhanced date-ordered ranking
10 years ago
reger a6891ff7f8 fix Querygoal.parse exception on +/-null-term
10 years ago
orbiter a65df4ce7e do not push noindex errors into log if in intranet mode. noindex
10 years ago
Marc Nause 2af56fa37d Improved UPnP. (still not perfect)
10 years ago
orbiter d68438c3d9 make sure that the postprocessing background thread never dies by any
10 years ago
reger e88537522d allow single quote " ' " in query
10 years ago
orbiter 487021fb0a snippet computation update
10 years ago
orbiter 927aaa95a6 concurrency bugfix
10 years ago
reger 7584352e7b use more predefined Solr query parameter constants
10 years ago
reger f9db5dd6c5 reduce doublecontent check document (prevent out of memory)
10 years ago
reger a8508417d1 catch NPE during crawl (OAI import)
10 years ago
Michael Peter Christen 6344718f8b reducing the concurrent query stack size and reduced concurrency of
10 years ago
Michael Peter Christen c465b791af typo
10 years ago
Michael Peter Christen 191ec8c82a added concurrency to postprocess rewrite process
10 years ago
Michael Peter Christen a1e8bdd5e9 log ppm instead of docs/second
10 years ago
Michael Peter Christen cc0ded7abd set process type of web graph according to fields as defined in the
10 years ago
Michael Peter Christen 12fb9d7cd1 log postprocessing constraints in case that postprocessing is not
10 years ago
Michael Peter Christen 338f574bdc no sorting if http/www unique fields are not demanded (makes query
10 years ago
Michael Peter Christen 0ceeceb35e more logic on Solr queries; usage of the query terms in posprocessing,
10 years ago
orbiter 4099296b45 added new classes which shall reduce call overhead to Solr (stub)
10 years ago
orbiter 3491ab4c38 removed unused images from webgraph edge computation
10 years ago
orbiter 2371d6b8db target linktexts must be string to enable search facets on these fields
10 years ago
Michael Peter Christen 001e05bb80 do not store failure of loading of robots.txt into the index as a fail
10 years ago
Michael Peter Christen 05d58e4df0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 98f45c9032 fix for image alt attachment to AnchorURLs in html parser.
10 years ago
orbiter 22ce4fb4dd better error handling for remote solr queries and exists-checks
10 years ago
orbiter 738989aab7 reverted commit f94c91315b because the
10 years ago
Michael Peter Christen c115f3869c enhanced snippet computation and test method in ViewFile
10 years ago
orbiter 1027f3d04a fix for the usage of ready-prepared solr queries, some queries are
10 years ago
Michael Peter Christen f94c91315b if the webgraph is used, then use it also for reference computation to
10 years ago
Michael Peter Christen 6e1dc444c3 added a snippet test function in ViewFile: you can now search for a
10 years ago
Michael Peter Christen b44626e55b fixed target_alt_t in webgraph
10 years ago
Michael Peter Christen 504327b15c fix for condition for writing the webgraph
10 years ago
Michael Peter Christen 542c20a597 changed handling of crawl profile field crawlingIfOlder: this should be
10 years ago
Michael Peter Christen 4eec1a7452 refactoring (change Metadata name of load time data structure to avoid
10 years ago
reger f96cfdc84d prevent array out of bound exception on getRankingProfile(x)
10 years ago
reger a2cb366b25 Combine /heuristic search modifier with opensearch configured targets
10 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
10 years ago
Michael Peter Christen bf1b6b93e7 do not write CR values to webgraph if no CR values are computed
10 years ago
Michael Peter Christen d07cdd8c3b added SolrCloud access mode and configuration
10 years ago
Michael Peter Christen 8514bffc22 enhanced postprocessing status report
10 years ago
Michael Peter Christen b5fc2b63ea removed exist() retrieval functions from error cache and replaced it
11 years ago
Michael Peter Christen 62c72360ee cleanup of checkAcceptanceInitially in CrawlStacker, should avoid
11 years ago
Michael Peter Christen b5d78ba156 reduced number of solr queries during crawling
11 years ago
Michael Peter Christen fd87fa1613 removed more unnecessary exist-checks in ErrorCache
11 years ago
Michael Peter Christen f2b476e08b don't do a double check to solr for failed documents if they are not
11 years ago
orbiter dab9a0786a Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter 51bf5c85b0 Renamed the transmission cloud to buffer in dispatcher since the name
11 years ago
Michael Peter Christen fb3dd56b02 fix for processing of noindex flag in http header
11 years ago
Michael Peter Christen b0d941626f fixed bugs in canonical, robots and title/description unique calculation
11 years ago
reger d9472d043a cleanup older unused classes
11 years ago
reger 665e12f88e move startup time from old serverCore to switchboard (most used here)
11 years ago
reger 336425912a remove unused localSearchThread from SearchEvent
11 years ago
Michael Peter Christen 1092e798a5 fixed double content postprocessing
11 years ago
orbiter 59160984cc timeline performance update
11 years ago
orbiter 2073e69034 fix for long periods in timeline
11 years ago
Michael Peter Christen 09dcdb9b19 update to solr 4.9.0
11 years ago
Michael Peter Christen 1cd4b2e8be Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 8c52f0651b refactoring of AccessTracker events & timeline fix
11 years ago
reger 431a5f9c4e added test case for TextSnippet,
11 years ago
Michael Peter Christen 5b94a257ce no timeout for large reference collections
11 years ago
Michael Peter Christen f5b817bac4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger a5707cd2eb enable proper Author navigator
11 years ago
Michael Peter Christen 74206a10c7 refactoring
11 years ago
orbiter fec673c9d1 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter c59da9fe7a added access tracker log reader stub
11 years ago
Michael Peter Christen 36e623d8bf enhanced metadata enrichment for media file type search:
11 years ago
Michael Peter Christen b893c42a0f bugfix for image search
11 years ago
orbiter 0bbb5040b8 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter 9d5d86cd03 Added filter query options to the ranking servlet /RankingSolr_p.html.
11 years ago
Michael Peter Christen d2151857f1 Added collection navigation:
11 years ago
Michael Peter Christen 74c249288a added a push api to make it possible to upload files directly without
11 years ago
Michael Peter Christen ba6ffddefc refactoring
11 years ago
Michael Peter Christen 0c324d735c NPE fix for postprocessing without term index
11 years ago
Michael Peter Christen 922979aae1 added option to prefer http over https in unique-protocol ranking
11 years ago
Michael Peter Christen b3b174e2b8 fixed webgraph postprocessing and status display in Crawler_p servlet
11 years ago
Michael Peter Christen f23c4142e0 added option to configure a custom user agent within allip networks
11 years ago
Michael Peter Christen 8ad41a882c fixed several problems with postprocessing:
11 years ago
Michael Peter Christen ff5b3ac84d added new fields http_unique_b and www_unique_b which can be used for
11 years ago
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
Michael Peter Christen 53948da7d0 tried to make last_modified recognition smarter
11 years ago
Michael Peter Christen 6634b5b737 debug code for index distribution testing
11 years ago
orbiter 97983ba89f fixed generics warnings for generic array instantiation that appeared
11 years ago
sixcooler 830057d788 lower Segment-size (hope to get Segments of 10GB)
11 years ago
orbiter c028ae9b09 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger e31493e139 "Use remote proxy for yacy" has no function, remove option and related config item
11 years ago
orbiter 0d8072aa99 removed warnings
11 years ago
Michael Peter Christen a1ac4c3b76 automatically clear graphics cache
11 years ago
reger 1432a817dd respect "index media" switched off in CrawlStartExpert.html
11 years ago
Michael Peter Christen 4e734815e8 enhanced snippets: remove lines which are identical to the title and
11 years ago
Michael Peter Christen e84e07399a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger 8a7c68e4c7 content of surrogates/out never accessed (remove)
11 years ago
Michael Peter Christen 229f2248b8 added configuration option for maxmimum load and minimum ram for
11 years ago
orbiter 8e5ce7cd51 fixed a situation where finished crawls had not been detected.
11 years ago