yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	910a496c9f	replaced http links with https	4 months ago
okybaca	4add1f6bc7	replaced all the links to legacy legacy wiki to legacy wiki	1 year ago
Michael Peter Christen	60c9986a0e	new release file names with date and git hash ...without reference to 9000ish SVN	2 years ago
Michael Peter Christen	33889b4501	moved more servlets to new location	2 years ago
Michael Peter Christen	6d388bb7bf	refactoring - moved htroot/yacy classes	2 years ago
Michael Peter Christen	9c1bc533fa	removed hazelcast because it is phoning home, see also: https://github.com/yacy/yacy_search_server/issues/504	2 years ago
Thomas Koch	3116713672	rm buildDate from build.xml and its usages The https://reproducible-builds.org project invests a lot of work to make builds reproducible. This is a security property. It allows to compare the build of binaries from different builder machines. If they are identical, it means that either the builds have not been manipulated or an attacker managed to attack all builder machines in exactly the same way. One problem that the reproducible-builds project often sees is that projects include the build time in their binaries. This makes builds unreproducible for apparently no reason. The build date should not be of interest since binaries built on different dates but from the same source code should not be different. Thus I decided to remove the build date instead of re-implementing the functionality without the GitRev task. Anyways the reported date was not the build date but the date of the last git commit which is even less informative. The git commit ID would have information value but should only be relevant for "nightly builds".	2 years ago
Michael Peter Christen	3d138d3fdd	catch error when initializing hazelcast should fix https://github.com/yacy/yacy_search_server/issues/468	2 years ago
reger24	05d6d0405f	Move sub-menu UI Translations from public Status to secure Sys Administration - as UI Translation (TransNews_p.html) is a secured page - it uses for publishing internal News system but belongs not really to "Community Data"	3 years ago
reger24	6a5f0b3684	Servlet IndexBroser_p add button "Add to blacklist" allows to add the displayed host to the default blacklist	3 years ago
Michael Peter Christen	bd3f2483a1	replaced url and date retrieval by only url retrieval This should prevent that the search index is used for freshness of the index entry.	3 years ago
Michael Peter Christen	63ad8ce6b2	removed ymarks had not been used since a long time	3 years ago
Michael Peter Christen	dbd211a1ad	removed/replaced reflection in memory tool	4 years ago
Michael Peter Christen	1cdb21592b	added hazelcast and some modifications to align legacy YaCy with YaCyGrid	4 years ago
Michael Peter Christen	198826c362	added network scanner process to discover all YaCy peers in the intranet this will be used to wire YaCy peers in a kubernetes cluster	4 years ago
Michael Peter Christen	d9602e8325	Implemented a new syntax in the template engine to simplify json APIs Added also an example for one of the existing APIs. The problem is the comma separator between objects which must not be there for the last entry in a sequence. The new syntax adds the separator symbol automatically.	4 years ago
Michael Peter Christen	022fb15670	fix for https://github.com/yacy/yacy_search_server/issues/385	4 years ago
Michael Peter Christen	e0ad8ca9da	replaced json library from JSON.org with libandroid-json-java This fixes https://github.com/yacy/yacy_search_server/issues/347	5 years ago
Michael Peter Christen	0a7bda2a21	removed JSON-evil license line These classes had been my own creative work. Just the copyright line had appeared possibly due to a bad copy-paste activity, unaware that the line is a non-free addition.	5 years ago
Michael Peter Christen	37827b6788	removed doubles from getpageinfo	5 years ago
luccioman	2f75e2d9c8	Fixed a case of NullPointerException on disconnected RWI data structure	6 years ago
luccioman	88d0ed676c	Render http status instead of null responses on snapshot api errors	6 years ago
luccioman	746e0e788d	Render a relevant HTTP status code on snapshot image rendering error Instead of a null response body which is not very helpful.	6 years ago
luccioman	79bd9f623a	Updated YaCy home page embedded links from http to https scheme	7 years ago
luccioman	addd18c993	Removed some remaining uses of deprecated Seed.getIP()	7 years ago
luccioman	0a058ba6af	Keep https in result message URL when push_p API is requested over https	7 years ago
luccioman	dbf4c1cd76	Improved blacklist entries editing operations : - Fixes issue #160 : handle properly syntax exceptions with a user friendly message - Fixes loss of information on multiple blacklist entries editions - Fixes loss of entries when moving entries from one list to another	7 years ago
luccioman	5db1c9155a	Do locale independent case conversion on hosts, schemes, and file exts. Required for proper operation when the default system locale is Turkish, as dotless and dotted i characters have specific case conversion rules in this language.	7 years ago
luccioman	1de86cf1bf	Fixed JPEG snapshot resizing when running on OpenJDK. Resizing JPEG snapshot images through /api/snapshot.jpg failed when running on OpenJDK, but rendered successfully with an Oracle JDK. Details in mantis 772 ( http://mantis.tokeek.de/view.php?id=772 ). Removing any alpha component (useless in snapshot images) from the rendered resized image solves the issue.	7 years ago
luccioman	a17a418e78	Fixed NullPointerException cases on snapshot images parsing.	7 years ago
luccioman	285f0d6a39	Consistently encode snapshot image with format requested on the API. Previously, calling /api/snapshot.png rendered JPEG encoded images.	7 years ago
luccioman	4eba88f2ff	Removed some unnecessary uses of java.lang.reflect api. This improves code browsing and readability, making search by references or call hierarchy IDE features more accurate.	7 years ago
luccioman	3f0446f14b	Ensure proper synchronous robots entry retrieval on first check. Previously, when checking for the first time the robots.txt policy on an unknown host (not cached in the robots table), result was always empty in the /getpageinfo_p.xml api and in the /CrawlCheck_p.html page. Next calls returned however the correct information.	7 years ago
reger	a21789d4e7	Fix unresolved pattern in api/share.html by init some display var's	7 years ago
luccioman	bf55f1d6e5	Started support of partial parsing on large streamed resources. Thus enable getpageinfo_p API to return something in a reasonable amount of time on resources over MegaBytes size range. Support added first with the generic XML parser, for other formats regular crawler limits apply as usual.	7 years ago
luccioman	8da3174867	Ensure lower case conversion consistency with any default locale. Especially for Turkish speaking users using "tr" as their system default locale : strings for technical stuff (URLs, tag names, constants...) must not be lower cased with the default locale, as 'I' doesn't become 'i' like in other locales such as "en", but becomes 'ı'.	7 years ago
luccioman	0f80c978d6	Limit the number of initially previewed links in crawl start pages. This prevents rendering a big and inconvenient scrollbar on resources containing many links. If really needed, preview of all links is still available with a "Show all links" button. Doesn't affect the number of links used once the crawl is effectively started, as the list is then loaded again server-side.	8 years ago
luccioman	cbccf97361	Added JavaDoc to the getpageinfo_p API servlet.	8 years ago
luccioman	bd88fd303e	Deprecated duplicated and internally unused getpageinfo servlet. Redirections set for the transition of any eventual external uses: - /api/getpageinfo.xml to /api/getpageinfo_p.xml - /api/getpageinfo.json to /api/getpageinfo_p.json	8 years ago
reger	a2afb4bae0	add switchboardconstants for server ports config keys	8 years ago
reger	334c70c37a	correct fromDate init value on missing param in api/timeline_p servlet revert test modification from last commit in AccessTracker.main	8 years ago
luccioman	e048e74072	Added an optional parameter to webstructure.xml api. This new "documentStructure" parameter can be set to false to only get hosts accumulated references on a resource and thus prevent scraping the specified URL and getting citations references. Also set WebStructureGraph constants as final and updated the Javadoc with example api call URLs.	8 years ago
luccioman	17b7c92009	Made sure webstructure.xml API produces valid XML. Host names should not contain XML special characters such as quotation mark, but at this stage the WebGraph may have mistakenly recorded a host name with such characters. What's more the DigestURL constructor does not prevent this. By the way using serverObjects.putXML to encode host names we ensure here the rendered XML is well formed and can be parsed by external tools even if a structure entry is incorrect.	8 years ago
luccioman	ed3dd5e31a	Fixed webstructure.xml API used with a domain name 'about' parameter. As described in mantis 720 (http://mantis.tokeek.de/view.php?id=720), when requesting this API with a domain name instead of a complete URL only HTTP references on default port were listed.	8 years ago
luccioman	f793d97e56	Factored common code with DigestURL.hosthash()	8 years ago
luccioman	9cea7cbb10	Detailed some Javadoc related to /api/webstructure.xml usage.	8 years ago
reger	c50e23c495	reduce creation of empty legacy RequestHeader() in situation where null is acceptable (less for garbage collection).	8 years ago
reger	f45945cada	increase use of header const for custom "EXT" header	8 years ago
luccioman	812abfc868	Converted one more set of URLs to pure relative ones. Easier YaCy peer configuration behind a reverse proxy subfolder : no need for the reverse proxy to rewrite HTML links or URLs in css files. Tested on Debian Jessie with an apache2 reverse proxy. See related mantis issues http://mantis.tokeek.de/view.php?id=106 and http://mantis.tokeek.de/view.php?id=701	8 years ago
luccioman	74fec066f4	Converted more URLs to pure relative ones. Easier YaCy peer configuration behind a reverse proxy subfolder : no need for the reverse proxy to rewrite HTML links or URLs in css files. Tested on Debian Jessie with an apache2 reverse proxy. See related mantis issues http://mantis.tokeek.de/view.php?id=106 and http://mantis.tokeek.de/view.php?id=701	8 years ago

484 Commits (3c88f87831dd7781aefc4bf8abb9a1dee7bff30b)