Commit Graph

4000 Commits (5216c681a9971b921aea8c041106df8b2102055b)

Author SHA1 Message Date
luccioman d90b001e1b Improved previous merge "Show ranking in HTML UI".
8 years ago
luccioman 0f0f42b509 Added some JavaDoc
8 years ago
reger 077d062be3 Adjust mergeDocuments to keep youngest last-modified date of document
8 years ago
luccioman 654801523e Fixed StringIndexOutOfBoundsException case.
8 years ago
luccioman 522a268305 Improved new blacklist entries URL scheme detection.
8 years ago
luccioman 532981b363 Updated putHTML() JavaDoc
8 years ago
luccioman 58d23047dd Handle '?' and '+' chars as valid wild cards when adding to blacklist.
8 years ago
luccioman a87281b498 Added MediaWiki dump import scheduling feature.
8 years ago
luccioman edd7ccac40 Added some JavaDoc
8 years ago
luccioman 79fdf14b0a Fixed regression introduced by commit 9ad4d16
8 years ago
Michael Peter Christen 7678fd67e3 copied fix from yacy_grid_parser for wrong array type
8 years ago
Michael Peter Christen 200b100fb8 added patch to rewrite altered yacy grid schema into yacy schema
8 years ago
reger 9ad4d16829 Add a responseHeader to the solr index export with a format identifier
8 years ago
luccioman 9697209ef6 Fixed Index Export feature for compatibility with old indexed documents.
8 years ago
luccioman 88c062639b Added some JavaDoc
8 years ago
luccioman 31fff2c986 Extended WikiCode template inclusion syntax support.
8 years ago
Michael Peter Christen 973d74712f added yacy grid flatjson surrogate parser
8 years ago
luccioman b1da92648e Fixed surrogates import monitoring page (/CrawlResults.html?process=7)
8 years ago
luccioman 527d494c1a Fixed "Unchecked conversion" compilation warnings.
8 years ago
reger c77e43a391 Take out mailto collect in internal parsed document
8 years ago
Michael Peter Christen 335868edba Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
reger bec34d3546 Add url input field as source for WarcImporter
8 years ago
luccioman f66438442e Extended Mediawiki dump import to remote URLs.
8 years ago
luccioman e5c3b16748 Improved http client close time on stream processing errors.
8 years ago
luccioman 23775e76e2 Fixed endless loop case in wikicode processing.
8 years ago
luccioman 0bc868a819 Improved support for non ASCII chars in local file system URLs
8 years ago
reger 7b80189bda Activate hosts navigator plugin. This includes rwi results in the navigator
8 years ago
Michael Peter Christen f5ad29edb1 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 76e9135526 added flatjson parser (stub, unfinished)
8 years ago
reger b7417ac329 Introduce a Keyword search navigator using the index field keywords.
8 years ago
luccioman 09e72eb0a4 Set Config Portal as a private administration page.
8 years ago
reger ba339a2a45 Add servlet to import warc file from filesystem IndexImportWarc_p.html.
8 years ago
Michael Peter Christen 1d81b8f102 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 69081bce00 added export to elasticsearch. The export dump can easily be imported to
8 years ago
reger 510f11d374 Implement surrogate import from Warc archives (as first option handle
8 years ago
luccioman 4b649b0a11 Fixed NPE case and API URL link on Solr HTML output for webgraph core.
8 years ago
luccioman af28a07780 Updated API calls recording/replay with recent changes.
8 years ago
reger 81670c3484 One more use of SwitchboardConstants.SERVER_PORT constant,
8 years ago
luccioman cde237b687 Enforced access controls on some administrative actions.
8 years ago
luccioman df5970df6d Extended Apache HTTP Digest Auth. for use of YaCy encoded password
8 years ago
reger f05976c017 Display the local search word statistic in alphabetic order
8 years ago
reger 3dd23c178b Introduce the option to configure a shutdown port.
8 years ago
reger a2afb4bae0 add switchboardconstants for server ports config keys
8 years ago
reger 56d0a87a83 remove double occurrence of geo:lat in rss tokens
8 years ago
reger b4fa1141b8 implement RequestHeader getRequestURI, getRequestURL for legacy request
8 years ago
reger 209a7374bd remove unused import pdfParser
8 years ago
reger de1c1c16db Improve pdf text extraction resource handling.
8 years ago
reger 9b6d1abd9e eliminate some compiler unchecked and deprecation warnings
8 years ago
reger 18c7563dbe Extend DCEntry.getLanguage convert to ISO639-1 codes for more languages
8 years ago
reger ce87025462 further avoid to set connect info properties as header value
8 years ago
reger cd4d891ea4 use pre-defined "Connection" header key, replace deprecated
8 years ago
luccioman 0173b0bc32 Added an advanced settings page for referrer policy settings.
8 years ago
reger 81963a89fe fix proxyservlet response url to respect http scheme if a relative
8 years ago
luccioman cdcd923375 Privacy enhancement : added settings to control referrer policy.
8 years ago
reger 86534a56f7 fixed ReindexSolrBusyThread new and unexpected repeat of same query with
8 years ago
reger 275c0cddd1 Adjust DefaultServlet test case to recent change,
8 years ago
reger 41e2ee0eca Fix call parameter for ConnectionInfo in MonitorHandler
8 years ago
luccioman ac766327d3 Switched a few more Solr fields from strictly mandatory to optional
8 years ago
reger f254fcfc67 fix htmlParser <script> text extraction on code containing expression
8 years ago
luccioman 2f191e0e1c Improved MultiProtocolURL non ASCII characters support.
8 years ago
luccioman 18e8b3a220 Merge branch 'escape' of https://github.com/sinkuu/yacy_search_server
8 years ago
reger 7419989de3 Correct dublincore title property text to lowercase in htmlresponsewriter,
8 years ago
Burkhard 4fdc11cae8 Update SearchEvent.java
8 years ago
luccioman cdc7f3e431 Switched some Solr fields from mandatory to optional
8 years ago
luccioman 3475d8c1a9 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman c68a8be2d9 Refactored and enforced Solr mandatory fields for proper operation
8 years ago
reger 334c70c37a correct fromDate init value on missing param in api/timeline_p servlet
8 years ago
reger cc770512d5 add hint of query syntax in AccessTracker log (qs=normal querystring,
8 years ago
luccioman e5858bc8c8 Fixed a NullPointerException case possible on Index Export
8 years ago
reger 7e53860fc7 fix NPE in HTMLResponseWriter on missing document title
8 years ago
reger 5e8879beb7 Reduce self generated content for text_t (visible text index field)
8 years ago
luccioman 6e89d125f2 Added robots.txt support for heuristics federated search.
8 years ago
sinkuu a46b232bf1 Use java.net.URLDecoder
8 years ago
luccioman bf16de29c1 Added support for HTML OpenSearch results.
8 years ago
luccioman 54405577aa Replaced absolute redirection locations by relative ones when possible.
8 years ago
luccioman 1857651988 Added a new Debug/Analysis advanced settings subsection.
8 years ago
luccioman 526f2d6a8b Fixed NPE case occurring when local solr index is disabled in search.
8 years ago
luccioman def55ec166 Improved termination of timed out remote solr requests to peers.
8 years ago
luccioman 08de58b6d3 Named a Thread without name for easier monitoring
8 years ago
luccioman 9a5a124bf2 Distinguished solr connectors thread names for easier monitoring.
8 years ago
reger 1f497ccad5 Add consistency check for related index fields upon load and save of
8 years ago
luccioman 68afe900d0 Added user-friendly controls over disk usage configuration settings.
8 years ago
reger 95d2a28599 adjust the Field-Reindex Thread to verify and update the document id
8 years ago
luccioman fc01b69eca Fixed local image search pagination regression.
8 years ago
Michael Peter Christen 02d0b3172c Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen d4f45cf05e added dc.date.modified and dc.date.created to date parser
8 years ago
reger f9180fabc4 assure that RWI Index.Segment IODispatcher is not blocking on shutdown
8 years ago
reger e61ee180a7 Group all proxy settings on System Administration by adding settings of
8 years ago
luccioman 39e081ef38 Fixed display of crawler pending URLs counts in HostBrowser.html page.
8 years ago
reger df80c57842 add ukr and pol to DCEntry.getLanguage ISO639-2 3-char language code
8 years ago
luccioman e048e74072 Added an optional parameter to webstructure.xml api.
8 years ago
reger 581b00cc20 remove obsolete lastmodified calculation in WebgraphConfig
8 years ago
luccioman 5c8958bcea Updated Javadoc and Junit tests for the WebStructureGraph class.
8 years ago
luccioman d9766ca981 Fixed WatchWebStructure_p.html render to include https URLs.
8 years ago
luccioman ed3dd5e31a Fixed webstructure.xml API used with a domain name 'about' parameter.
8 years ago
luccioman 0da1e6ba16 Factored code re-implementing DigestURL.hosthash() method.
8 years ago
luccioman 86adfef30f Added automated unit tests and perfs test for WebStructureGraph class.
8 years ago
luccioman 9cea7cbb10 Detailed some Javadoc related to /api/webstructure.xml usage.
8 years ago
luccioman 6a4d51d8f9 Cleaned up some Javadoc warnings.
8 years ago
luccioman 86dc198698 Fixed some JavaDocs broken links.
8 years ago