yacy_search_server

Commit Graph

Author	SHA1	Message	Date
reger	c4d5f1fc54	upd to slf4j-1.7.24.jar	8 years ago
reger	c4b90eae98	upd to icu4j-58_2.jar	8 years ago
reger	a2afb4bae0	add switchboardconstants for server ports config keys	8 years ago
reger	e0c5b28331	update to jsoup-1.10.2.jar	8 years ago
reger	5b5ada38c3	update to jsch-0.1.54.jar	8 years ago
reger	038b9cd98e	update translation for ConfigNetwork_p.html	8 years ago
reger	f7fce1baad	make digest default authentication in defaults/web.xml	8 years ago
reger	56d0a87a83	remove double occurrence of geo:lat in rss tokens	8 years ago
reger	882d99dae4	upd to metadata-extractor-2.10.1.jar	8 years ago
reger	b4fa1141b8	implement RequestHeader getRequestURI, getRequestURL for legacy request	8 years ago
reger	209a7374bd	remove unused import pdfParser	8 years ago
reger	de1c1c16db	Improve pdf text extraction resource handling. For short pdf <= 3 pages use already extracted content, only for long pdf > 3 pages reassign content and close internal writer (to directly free buffers)	8 years ago
reger	52c9d0c858	upd to pdfbox-2.0.4.jar	8 years ago
reger	9b6d1abd9e	eliminate some compiler unchecked and deprecation warnings in nav plugins by explicit type declaration and replacing date.getYear with Calendar.get	8 years ago
reger	6eb7d27449	upd to httpclient v4.5.3	8 years ago
luccioman	8e77fe3860	Fixed unresolved pattern case in search results progress bar. This is a fix for mantis 715 (http://mantis.tokeek.de/view.php?id=715). A possible path scenario that could lead to this case : - YaCy is running low in memory - a search is requested - before the end of search results rendering, the cleanup job runs and deletes the running search event from the cache because of short memory - then yacysearchitem renders with "-UNRESOLVED_PATTERN-" parameter values passed to the statistics() JavaScript function	8 years ago
luccioman	79df5bb20a	Fixed settingsAck_p.html back link for case where referrer is stripped.	8 years ago
reger	18c7563dbe	Extend DCEntry.getLanguage convert to ISO639-1 codes for more languages by using icu.ULocale for languages not already covered (ICU normalizes to ISO639-1 2 char codes). Add test class Use DublinCore vocabulary declarations in DCEntry and SurrogateReader for easier usage debugging, Init SurrogateReader.inputSource on first use.	8 years ago
reger	ce87025462	further avoid to set connect info properties as header value following comment "use of properties as header values is discouraged" in case where (proxy)HTTPClient overwrites values with supplied url. Use defined request.referer procedure in response class.	8 years ago
reger	cd4d891ea4	use pre-defined "Connection" header key, replace deprecated	8 years ago
luccioman	5b03feb776	Fixed unresolved pattern case on /yacysearchlatestinfo.json api	8 years ago
luccioman	0173b0bc32	Added an advanced settings page for referrer policy settings. Feedback will be welcome, notably on the descriptive content of this page.	8 years ago
reger	81963a89fe	fix proxyservlet response url to respect http scheme if a relative Location header is returned.	8 years ago
luccioman	9d9f86dcdd	Updated Archive-It heuristics URL. The archive-it OpenSearch URL requested without restriction on collections ("i" parameter) almost always ends up with timeout or fails.	8 years ago
luccioman	cdcd923375	Privacy enhancement : added settings to control referrer policy. HTTP "Referer" header sent by the browser when using YaCy can now be controlled either with the referrer meta tag as a global policy, or only for search result links by adding the attribute rel="noreferrer". To improve privacy with the fewest possible regressions, the default is set as meta tag with value "origin-when-cross-origin" : internal YaCy links behavior is not affected, but when visiting external websites referrer url is not empty but stripped from query parameters and path. Older browsers, Safari, MS IE and Edge do not support the referrer meta tag, so the standard but less flexible noreferrer link type can also be enabled as an alternative. User-friendly settings page to be implemented.	8 years ago
reger	86534a56f7	fixed ReindexSolrBusyThread new and unexpected repeat of same query with low number of found documents - by adding additional end condition to remove processed query with number of found docs <= process-chunck-size. Noticed on query h4_txt:[* TO *], found 21, process 21, call of commit happened but on next cycle same query again 21 docs found (while h4_txt was removed from schema and committed inputdocuments).	8 years ago
reger	0aa0dd0b5b	fix delta time calculation in PerformanceSearch_p for the 1. entry (INITIALIZATION displayed absolute date, set delta to 0 for 1. entry)	8 years ago
luccioman	13c5c09518	Fixed datacite.org heuristics base url. The datacite Solr search http URL was returning http status 301 in order to redirect to its https version, thus making that YaCy heuristic always fail.	8 years ago
reger	275c0cddd1	Adjust DefaultServlet test case to recent change, deprecate unused CONNECTION_PROP_PROTOCOL (also as it might be misleading with getProtocol vs getScheme)	8 years ago
reger	41e2ee0eca	Fix call parameter for ConnectionInfo in MonitorHandler (expected scheme e.g. http, was protocol version). Deprecate obsolete custom X-...-Scheme header constant. Use existing FORMAT_ANSIC Dateformatter in HeaderFramework. Correct htmlParserTest (del one not intended println)	8 years ago
luccioman	9e626f6b00	Added a hint title for required fields in the Solr Schema editor	8 years ago
luccioman	ac766327d3	Switched a few more Solr fields from strictly mandatory to optional	8 years ago
reger	f254fcfc67	fix htmlParser <script> text extraction on code containing expression recognized as tag like 1<a reported in https://github.com/yacy/yacy_search_server/issues/109 Script content is ignored by default, but the text is filtered for html tags. Modified scraper to skip tag filtering while within a <script> section (until a closing tag is detected </script>). Possible side effect, missing </script> end-tag will truncate trailing content text.	8 years ago
luccioman	2f191e0e1c	Improved MultiProtocolURL non ASCII characters support. After @sinkuu Pull Request #108 added JUnit tests, updated some JavaDoc and also improved URL tokenization to support non ASCII characters.	8 years ago
luccioman	18e8b3a220	Merge branch 'escape' of https://github.com/sinkuu/yacy_search_server	8 years ago
luccioman	562fc14eb9	Merge pull request #110 from goofy-bz/patch-1 Fixing some typos	8 years ago
goofy-bz	72a1bc0af1	Fixing some typos up to line #1000 only	8 years ago
reger	7419989de3	Correct dublincore title property text to lowercase in htmlresponsewriter, remove unused (carry over) local variable Do the same for other responsewriter.	8 years ago
Burkhard	4fdc11cae8	Update SearchEvent.java Fix NPE on disabled local SolrIndex, occurring on search moving to the 2nd result page. The debug-purpose-only setting for disabling local SolrIndex (System Admin -> Debug Settings) should long term probably be removed from production code.	8 years ago
luccioman	cdc7f3e431	Switched some Solr fields from mandatory to optional These fields are default enabled but with no doubt not strictly mandatory with the current code base. As reported by @reger24, splitting between essential mandatory and optional fields is still to be improved to reflect the current YaCy needs.	8 years ago
reger	7c188ad092	Add extract of queries.log in form of top search word cloud (last 7 days) to AccessTracker_p.html (Network Access -> Local Search Log page). It displays top 20 words of search queries.	8 years ago
luccioman	3475d8c1a9	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	8 years ago
luccioman	c68a8be2d9	Refactored and enforced Solr mandatory fields for proper operation - Added a new method to check activation of mandatory fields on Collection Configuration commit, consistently with checks previously performed in Switchboard startup and with mandatory fields in the default schema. - Reorganized default schema and CollectionConfiguration enumeration : moved no more mandatory fields in a specific section, and moved fields enabled at startup to the mandatory section. - Marked mandatory fields as required and with stronger font in the IndexSchema_p.html page	8 years ago
reger	334c70c37a	correct fromDate init value on missing param in api/timeline_p servlet revert test modification from last commit in AccessTracker.main	8 years ago
reger	cc770512d5	add hint of query syntax in AccessTracker log (qs=normal querystring, sq=solr-querystring) to allow to filter simple text queries for processing, remove toString for counter parameter use more predefined constants in solrservlet	8 years ago
luccioman	e5858bc8c8	Fixed a NullPointerException case possible on Index Export As reported by Palulukas in YaCy forum (http://forum.yacy-websuche.de/viewtopic.php?f=18&t=5944&sid=dcef5b899ab4aa9b40e3a3d158c13aed#p33454) the Index Export operation can fail, notably when the Solr index contains one or more documents with empty (despite required) "load_date_dt" field. This fixes the export failure when the situation finally occurs, but more should be done to harden verifications on minimum required fields.	8 years ago
reger	7e53860fc7	fix NPE in HTMLResponseWriter on missing document title	8 years ago
reger	5e8879beb7	Reduce self generated content for text_t (visible text index field) to avoid repeat of tokenized url as description, continuation of `7e09bff4a1` `1409cabe8b` Add some javadoc, and not needed remove of omitted fields in postprocessing.	8 years ago
reger	6ec6ab55ba	removed faroo news from default opensearch config As @luccioman informed, it's only useable with a free api key http://www.faroo.com/hp/api/api.html http://blog.faroo.com/2013/06/30/faroo-introduces-an-api-key/	8 years ago
luccioman	6e89d125f2	Added robots.txt support for heuristics federated search. As noticed by @reger24, abusive use of OpenSearch systems should be prevented, especially if allowing to parse and reuse HTML results. robots.txt file is now checked before requesting an external OpenSearch system to respect the host exclusions and eventual crawl-delay value. The check is also performed when trying to add a new OpenSearch URL template through the /ConfigHeuristics_p.html admin page.	8 years ago

13101 Commits (c4d5f1fc54136fb69af8e9bbd1aeee5fd05685a4) All Branches