yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	37827b6788	removed doubes from getpageinfo	5 years ago
Michael Peter Christen	f03e16d3df	enhanced crawl start url check experience urls are now urlencoded and a check is also performed in case that an url is copied into the url field using copy-paste	5 years ago
Michael Christen	41f9b8517f	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	5 years ago
Michael Christen	4ccd1ea3c0	new servlet path "p2p" with a test class. Call the class with http://localhost:8090/p2p/seeds.json	5 years ago
Michael Peter Christen	f7c97fd99e	scanner crawl starts wants non-parseable files	5 years ago
Michael Peter Christen	a20b61f5c0	fix for bad json	5 years ago
Michael Peter Christen	d62a8ec542	masking connects	5 years ago
Michael Peter Christen	5eb0033aef	typo	5 years ago
Michael Peter Christen	2c0742fc43	added json version of peer list	5 years ago
Michael Christen	cfa27d2fd5	fixed links	5 years ago
Michael Peter Christen	0bddf2d895	switched url and snippet position	5 years ago
Michael Peter Christen	2999f4b985	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	5 years ago
Michael Peter Christen	449780f762	enhanced search result design	5 years ago
Michael Christen	cdc7adedc2	added sponsor link	5 years ago
Michael Christen	f2d45ebb87	design updates + added link to new forum	5 years ago
Michael Peter Christen	789670bd8c	design changes - more space	5 years ago
Michael Christen	3a46b07603	fixed many links to old forum, now https://searchlab.eu	6 years ago
luccioman	6b45cd5799	New optional crawl filter on the URL a doc must match to crawl its links For finer control over which parsed documents can trigger an addition of their links to the crawl stack, complementary to the existing crawl depth parameter.	6 years ago
luccioman	d16bc99835	Added "Show Metadata" links to the ViewFile.html links mode To conveniently follow parsed links in the file viewer	6 years ago
luccioman	8c068a9c99	Better HTML text semantics for technical descriptions	6 years ago
luccioman	a5771b1f14	Made SNI extension user configurable without the need for server restart TLS Server Name Indication (SNI) extension activation can now be configured with the new Settings_p.html?page=httpClient administration page. SNI extension is also now enabled by default, as in 2019 the unrecognized_name(112) alert is more properly handled by major web servers TLS implementations, following the RFC 6066 standard. Related YaCy issues : #153 #189 and #272 JDK 1.7 bug : https://bugs.java.com/bugdatabase/view_bug.do?bug_id=7127374 Apache httpd issue : https://bz.apache.org/bugzilla/show_bug.cgi?id=56241 RFC 6066 : https://tools.ietf.org/html/rfc6066#section-3	6 years ago
luccioman	42c8a251c8	Render a relevant message and status on blocked search requests When unauthenticated (or with insufficient rights) client is blocked either because blacklisted or excessive request rate, render an error message and a relevant HTTP status for API requests, instead of an empty response that appears broken.	6 years ago
luccioman	a8316c79da	Allow JS resorting of search results by unauthenticated users Acces rate limitations to this search mode by unauthenticated users are set low by default to prevent unwanted server overload but can be customized through the SearchAccessRate_p.html configuration page Fixes #291	6 years ago
luccioman	0ab2b49c31	Made /yacysearch access rate limitations user configurable With a new admin page at /SearchAccessRate_p.html in menu Network Access > Local Search > Access Rate Limitations	6 years ago
luccioman	630fa0015a	P2P/Privacy switch buttons support with JavaScript disabled	6 years ago
luccioman	74fd2f30fa	Support for search result switch buttons with JavaScript disabled	6 years ago
luccioman	ebc583cdb2	Properly render the href attribute of the active page button	6 years ago
luccioman	093ea9586c	Properly fill current page number to new server side pagination template When current page is automatically reset to zero because of a new search request.	6 years ago
luccioman	6e9d5f60ad	Server side initial pagination links rendering For better support of the search page usage with JavaScript disabled. Reduces also the number of initial refreshes of the paginations links. When JavaScript is enabled, pagination links are still regularly refreshed until all the search feeds are terminated on server side.	6 years ago
luccioman	4b9cc4746d	Upgraded Bootstrap dependency from v3.3.7 to v3.4.1 Non regressions tested on the following platforms : Linux Debian Stretch : - Firefox 60.5.1esr - Chromium 72.0.3626.96 Windows 10 : - Firefox 65.0.1 - Chrome 72.0.3626.109 - Edge 25.10586.672.0 - IE 11.1540.10586.0 Mac OS : - Safari 11.0	6 years ago
luccioman	c617ea58a0	Render additional embedded audios from links on extended audio search	6 years ago
luccioman	69f1971052	Added basic controls to play all audio results. Not displayed when JavaScript is disabled.	6 years ago
luccioman	9782a98a9c	Added the possibility to customize facets sort type and direction Previously search navigators/facets elements were sorted only by counts. Now from the ConfigSearchPage_p.html admin page, sort direction (ascending/descending) and type (on counts or labels) can be customized independently for each navigator.	6 years ago
sgaebel	c2398fd890	remove warnings: 'Statement unnecessarily nested within else clause'	6 years ago
sgaebel	8d2e7262d9	Recrawl: - set the chunksize to 100 to meet the max of the embedded solr - re-enable sorting (the case where we switched it of should be away) - enable recrawling on remote-solr	6 years ago
luccioman	60b520fb13	Cleaned up Spanish translation after merge of PR #238 * Fixed some indentation * Removed untranslated entries	6 years ago
luccioman	cd72515188	Merge pull request #238 from ivanhercaz/esLang [WIP] Spanish translation	6 years ago
luccioman	2f75e2d9c8	Fixed a case of NullPointerException on disconnected RWI data structure	6 years ago
luccioman	e85f231bdf	Fixed termination of Host browser and link structure Solr query threads On some conditions (especially when reaching timeout), concurrent Solr query tasks used by the /HostBrowser.html and /api/linkstructure.json never terminated, thus leaking resources, as reported by @Vort in issue #246	6 years ago
luccioman	260ac11c65	Limit length of initially visible text in link structure graph nodes To improve a bit readability of graphs having a large number of nodes.	6 years ago
luccioman	5a8d9abd8a	Upgraded d3js dependency from 3.4.4 to 5.7.0	6 years ago
luccioman	9f8e1994a4	Added missing CSS width units to some HostBrowser.html styling	6 years ago
luccioman	0b1d2cb0dd	Fixed "TypeError: table.tBodies[0] is undefined" host browser JS error Traced in browser console when a host details table is empty.	6 years ago
luccioman	fcf6b16db4	Added new crawler attribute for finer control over Media Type detection New "Media Type detection" section in the advanced crawl start page allow to choose between : - not loading URLs with unknown or unsupported file extension without checking the actual Media Type (relying Content-Type header for now). This was the old default behavior, faster, but not really accurate. - always cross check URL file extension against the actual Media Type. This lets properly parse URLs ending with an apparently odd file extension, but which have actually a supported Media Type such as text/html. Sample URLs with misleading file extensions added as documentation in the crawl start page. fixes issue #244	6 years ago
luccioman	88d0ed676c	Render http status instead of null responses on snapshot api errors	6 years ago
luccioman	92e10d7d1c	Added a crawl start hint message on availability or not of wkhtmltopdf As this tool is required to produce pdf snapshots	6 years ago
luccioman	8852c97cee	Added basic styling for cleaner rendering of missing image snapshots For the output of the Solr snapshots writer	6 years ago
luccioman	746e0e788d	Render a relevant HTTP status code on snapshot image rendering error Instead of a null response body which is not very helpful.	6 years ago
luccioman	753bda1409	Fixed remaining blacklist entries improper decoding of '+' character In the blacklist cleaner and import/export administration pages.	6 years ago
luccioman	61c337f29a	Decode blacklist entries for easier edition of non ascii chars Not using the JDK URLDecoder.decode() function, as it strips '+' characters when they occur after '?' (both characters having regular expression semantics when used in blacklist path patterns)	6 years ago

1 2 3 4 5 ...

5965 Commits (749671d945f085f42d81614116d127dbafa7261b)