yacy_search_server

Commit Graph

Author	SHA1	Message	Date
luccioman	4f0ab318ef	Fixed snippets statistics displayed "provided by Solr" count	7 years ago
luccioman	e115e57cc7	Reduced text snippet extraction processing time. By not generating MD5 hashes on all words of indexed texts, processing time is reduced by 30 to 50% on indexed documents with more than 1Mbytes of plain text.	7 years ago
luccioman	ce289ebaf7	Upgraded ConfigNetwork_p html doctype and added language attribute	7 years ago
luccioman	16254fac1e	Removed unpaired select closing tag	7 years ago
luccioman	692c1cfdde	Added a UI section to configure encryption of peers communications	7 years ago
luccioman	e67df103b5	Removed more remaining uses of deprecated Seed.getIP() function.	7 years ago
luccioman	addd18c993	Removed some remaining uses of deprecated Seed.getIP()	7 years ago
luccioman	c35d0568b6	Support for preferred https in peers communication on more operations	7 years ago
luccioman	0a058ba6af	Keep https in result message URL when push_p API is requested over https	7 years ago
luccioman	8bc36506f2	Enforced access controls on basic administration settings pages. Ensuring http post method is used for operations with server-side effects (in respect of http semantics), and a valid transaction token is provided by the user-agent.	7 years ago
luccioman	a3ec7a7a5f	Added analysis optional setting to compute statistics on text snippets Thus producing some basic stats on processing times for snippets generation and counts on snippets per source type.	7 years ago
luccioman	72808655a5	Added controls on mode switch when attached to remote Solr instance(s) - to prevent unwanted exposure of index entries about private local/intranet documents when switching from "Intranet Indexing" mode while attached to remote Solr instance(s) - to warn user about remote Solr instance(s) still attached when switching from modes other than "Intranet Indexing"	7 years ago
luccioman	2af3bf79c7	Improve rendering of remote Solr admin URLs - properly handle IPv6 loopback address replacement - replace loopback address or host only when accessing peer remotely - replace loopback part with the peer hostname as requested rather than with its seed public IP as this works better for Intranet mode and when peer is behind a reverse proxy.	7 years ago
luccioman	0d34034f17	Ensure an embedded Solr is available for Solr dump/restore operations Otherwise, these operations triggered NullPointerException when only an external Solr index is attached.	7 years ago
luccioman	d92b191942	Ensure no remote Solr is attached before "Shut Down and Re-Start Solr" Otherwise once this operation is applied, the remote Solr(s) instances are deconnected and the embedded Solr is connected even if disabled by setting "core.service.fulltext". Also use constants for related default setting values.	7 years ago
luccioman	69690c13a0	Optionally allow external Solr server with self-signed certificate This is necessary when you want to attach to a dedicated external Solr server protected with basic http authentication and requested over https but having only a self-signed certificate.	7 years ago
luccioman	211f3d04ab	Added hint message inciting to check accounts settings on fresh install When unrestricted access from localhost is set and the accounts config page has not been visited at all.	7 years ago
luccioman	2fd4d05e2f	Added a shared Java constant for setting key server.servlets.called	7 years ago
luccioman	033f7c4c00	Adjusted localhost/qualified account admin access informational texts. Following remarks from @etam on issue #170	7 years ago
luccioman	05702c2ced	Adjusted api table query matching strategies When inlined (for example in the CrawlProfileEditor_p.html page) : search only on the comment, as the url is not visible On regular display : search on comment OR url, instead of comment AND url. Otherwise searching on comments terms is almost useless as these terms are not necessarily present in the url.	7 years ago
luccioman	65451a3d62	Fixed start record on the last api table results page When the last results page size was lower than maximumRecords, results from the previous page where displayed again.	7 years ago
luccioman	86c902b853	Enable api table page navigation with search query Applied the same default results page size as when a type filter is defined for proper and consistend page navigation when combining type filter and search query.	7 years ago
luccioman	9c7faa04d8	Display the total number of matching items when filtering on table API Notably for a proper page navigation of the crawl scheduler table (CrawlProfileEditor_p.html page).	7 years ago
luccioman	311e91ff77	Added hint to clarify results rendered dates and 'Sort by date' switch	7 years ago
luccioman	90dc580158	Fixed initial ViewFile mode and suggestions links from previous commit	7 years ago
luccioman	0b6aed4de6	Keep the selected view mode when typing a new URL in the ViewFile page Otherwise, when interested in viewing `Link List` for example, each time you typed a new URL, `Parsed Sentences` view mode was selected as default and you had to selected again the view mode you are insterested in.	7 years ago
luccioman	db55eaa673	Updated link to Solr Function Queries documentation page	7 years ago
luccioman	7496df93c3	Fixed error 414 (URI Too Long) when manually selecting to many RSS items Switched form method to HTTP POST to prevent this.	7 years ago
luccioman	fb3032c530	Added a crawl filtering possibility on documents Media Type (MIME)	7 years ago
luccioman	90d4802082	Updated link URL to IANA Media Types with https	7 years ago
luccioman	e45afedee4	Added support for enclosures (media links) to the RSS loader	7 years ago
luccioman	aaefd5219c	Reduce log verbosity of RSS loader on feed items with no link	7 years ago
Michael Peter Christen	187075b878	added nav filter	7 years ago
luccioman	07e8628853	Added HTML5 embedded audio for results playing on supporting browsers Restricted to authenticated or localhost users only to prevent redistribution license issues.	7 years ago
luccioman	46c9da6428	Allow creation of vocabularies from remote CSV file URLs.	7 years ago
luccioman	348d07a999	Enforced controls on vocabulary editing operations.	7 years ago
luccioman	2532db2ce6	Vocabulary editor : use accessible labels and CSS for elements position	7 years ago
luccioman	ac14437316	Vocabulary_p.html : richer semantics for HTML tables Also replaced deprecated attributes	7 years ago
luccioman	b67742336e	Provide user interface messages on vocabulary creation read/write errors	7 years ago
luccioman	ea57763294	Mark vocabulary name field as required using html instead of JavaScript	7 years ago
luccioman	39ec8cba37	Fixed Vocabulary_p.html HTML validation errors. Validated with Validated with Nu Html Checker 17.11.1.	7 years ago
luccioman	7c644090ff	Fixed CrawlStartExpert.html HTML validation errors Validated with Nu Html Checker 17.11.1	7 years ago
luccioman	519fc9a600	Issue #156 : new option to clean up (or not) search cache on crawl start Prevent also unnecessary search event cache clean-up on each access to the crawl monitor page (Crawler_p.html).	7 years ago
luccioman	3e8dd90211	Use https rather than http in links and queries to openstreetmap.org	7 years ago
luccioman	8d7099a081	Handle escaped line breaks and separators in vocabulary import from CSV	7 years ago
luccioman	09f93fed0e	Added a line start field for vocabulary import from CSV file As a convenience to ignore eventual CSV header lines	7 years ago
luccioman	d28d612069	Added option to choose field delimiter in vocabulary import from CSV	7 years ago
luccioman	95f1954c78	Adjusted last blacklist entry example for a more accurate description As discussed in issue #160 , blacklist entries can indeed currently not be "complete" regular expressions, but must be structured as a domain part, a separator character ('/'), and a path part.	7 years ago
luccioman	dbf4c1cd76	Improved blacklist entries editing operations : - Fixes issue #160 : handle properly syntax exceptions with a user friendly message - Fixes loss of information on multiple blacklist entries editions - Fixes loss of entries when moving entries from one list to another	7 years ago
reger	5df72c1c65	Remove now obsolete html for language-nav and ISO639 jar reference	7 years ago
reger	87077b8fb6	Adjust and move Language Navigator to be member of the navigatior plugin list.	7 years ago
luccioman	eb20589e29	Fixed issue #158 : completed div CSS class ignore in crawl	7 years ago
luccioman	fa65fb1a03	Fixed loss of search modifiers on bookmark, recommand or delete result	7 years ago
luccioman	0cdee4e26a	Fixed loss of "meanCount" search param when using facets or page buttons Then on new search queries, no suggestions at all could be displayed.	7 years ago
luccioman	117a859879	Do not clear all search modifiers when unselecting one modifier. Previously, when clicking a selected facet in the search results page to unselect it, all other eventually selected modifiers/facets were also removed.	7 years ago
luccioman	a9dc0874c0	Remove old query terms from search results suggestions links. Especially when old terms were misspelled, suggestions links then provided most of the time empty results.	7 years ago
luccioman	c71b545235	Enable results suggestions (Did you Mean) even when RWI is not enabled. RWI is no more necessary for suggestions processing since commit `c40ba51ca6`. Revealed by a question about spell check from ouahpiti on YaCy forum (http://forum.yacy-websuche.de/viewtopic.php?f=23&t=6084 ).	7 years ago
luccioman	9412881230	Added basic support for autotagging microdata annotated item types. With the appropriate vocabulary settings in Vocabulary_p.html page, this can produce Vocabulary search facets displaying item types referenced in html documents by microdata annotation. Tested notably, but not limited to, vocabulary classes/types defined by Schema.org and Dublin Core.	7 years ago
luccioman	539925a275	Added an utility to generate/update XLIFF master file from lng files.	7 years ago
luccioman	41a6b052d9	Updated master and French translation for the IndexReIndexMonitor_p page	7 years ago
luccioman	929e0d6eae	Replaced improper ByteBuffer.equals() implementation by Arrays.equals() Renamed also ByteBuffer.equals() to startsWith() as this is the appropriate function implementation semantics.	7 years ago
luccioman	8b572b7337	Commit Solr index before simulating or starting recrawl job. This ensures up-to-date simulation query results, and recrawl processing.	7 years ago
luccioman	5e2812c060	Automatically refresh running recrawl report when JavaScript is enabled. For users who would prefer to keep JavaScript disabled, a manual Refresh button is still available.	7 years ago
luccioman	0fce264ba4	Set reindex page to html5 and removed presentational only html tables.	7 years ago
luccioman	83df922afc	Removed unused duplicated HTML id on header hidden field	7 years ago
luccioman	4e03335625	Added more details to the recrawl job report	7 years ago
luccioman	d95d393a0d	Add a query link to local Solr to browse selected recrawl candidates	7 years ago
luccioman	59f7763af6	Display recrawl job report also when job is actively running	7 years ago
luccioman	0c9e0b3566	Record recrawl calls to make them schedulable	7 years ago
luccioman	433e241e4f	Added a report info box about eventual last terminated recrawl job For easier monitoring of recrawls.	7 years ago
luccioman	b2af25b14f	Added a stop condition to the Recrawl busy thread	7 years ago
luccioman	421728d25a	Made possible to customize selection query before launching a recrawl	7 years ago
luccioman	fab6e54fec	Enforced controls (HTTP method, token) on ReIndex and ReCrawl operations	7 years ago
luccioman	8a4ea1c11e	Added UI switch to control content domain constraint per search request	7 years ago
luccioman	36a45b3905	Added UI setting for strictness of content-type checking on media search	7 years ago
luccioman	e6907fdab3	Added optional search parameter/setting to control content domain filter Thus allowing to choose at configuration or per search request, whether extending or not results beyond strict content domain filter (image, video, audio or application). Related graphical controls to be added to user interface.	7 years ago
luccioman	d42c1773c8	Added UI setting for optional encryption with https on p2p searches	7 years ago
luccioman	09c4ee56a7	Added optional https support for remote crawl and profile operations	7 years ago
luccioman	5db1c9155a	Do locale independant case conversion on hosts, schemes, and file exts. Required for proper operation when the default system locale is Turkish, as dottless and dotted i characters have specific case conversion rules in this language.	7 years ago
luccioman	1c4803e40a	Enable optional https support for /yacy/transferURL API calls. Also updated some Javadoc and consistently use Switchboard instance as a constructor parameter where relevant.	7 years ago
luccioman	79a2ba306a	Updated links to Java Regular Expressions documentation to version 8	7 years ago
luccioman	17e004599d	Started implementing optional https preference for protocol operations Introduced through the new configurable setting network.unit.protocol.https.preferred, defaulting to false for now. Let choose to prefer using https when available on remote peers to perform YaCy protocol operations including notably hello or transferRWI. Not yet implemented for every YaCy protocol operations.	7 years ago
ScRe13	bb3d3fe074	fixed default loading default settings; load was populated with wrong value	7 years ago
reger	20bba135fe	Show hide or show public surftip button depending on current config status, to show the button to switch the status (hiding button of current status)	7 years ago
Michael Peter Christen	b907819cb4	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	7 years ago
Michael Peter Christen	25573bd5ab	added a crawl filter based on <div> tag class names When a crawl is started, a new field to exclude content from scraping is available. The field can be identified with the class name of div tags. All text contained in such a div tag where the configured class name(s) match are not indexed, while the remaining page is indexed.	7 years ago
luccioman	640fed2a9c	Removed Java 1.8 no more necessary version checking (fixes issue #147 ) Java 1.8 is by the way now a prerequisite to run from latest sources.	7 years ago
luccioman	d95b288f19	Removed use of deprecated Jetty IPAccessHandler for client filtering. Upgraded to InetAccessHandler. Added InetPathAccessHandler extension to InetAccessHandler to maintain path patterns capability previously available in IPAccessHandler but lost in InetAccessHandler. Filtering on IPv6 addresses is now supported. Support for deprecated pattern formats such as "192.168." and "192.168.1.1/path" has been removed, but startup automated migration should convert such patterns eventually present in serverClient.	7 years ago
Michael Peter Christen	607b39b427	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git Conflicts: htroot/yacysearchitem.java	7 years ago
Michael Peter Christen	4355de0f3c	(more!) evaluation of XRealIP from nginx reverse proxy	7 years ago
luccioman	f9cba827c0	Made "tld:" modifier case insensitive and IDN complient. Thus allowing typing internationalized top-level domains with non ASCII characters as tld: modifier.	7 years ago
luccioman	c5c3cc1274	Use HTTP Post operation for resetting memory monitoring state. Fixes issue #145 Also added textual hint on the button, and display it only when it makes sense, that is to say when the memory state is 'exhausted'.	7 years ago
luccioman	cb10daba92	Renamed Chinese & Greek lng files using ISO639-1 codes. Previously named with their ISO 3166-1 country code : this way, when setting language to "Browser" in ConfigBasic.html, it didn't work properly when browser preferred language was Chinese or Greek as their respective language codes are "zh" and "el" (not "cn" and "gr" which are their country codes)	7 years ago
luccioman	4b61edff32	Added a help link to ISO 639-1 language codes list ref	7 years ago
luccioman	a994d439af	Added description of spatial restrictions in search options	7 years ago
luccioman	8a48f80909	Added language HTML attribute to the search home page.	7 years ago
luccioman	5ff76fdcb9	Fixed spelling	7 years ago
luccioman	2c3f0ff9e8	Updated search page keyboard shortcuts descriptions.	7 years ago
luccioman	af825e9ffc	Use accessible labels for search home page radio buttons.	7 years ago
luccioman	8e732d437c	Enable HTTP Digest authentication for non admin users. Also ensure authentication is not lost by Digest timeout when navigating between index.html and search results page. This way, running searches with extended features on a remote peer or a password protected peer works with a regular user (with "Extended search" rights). When authenticating on the search page with a user without "Extended search" rights, it appears as authenticated, but has just its usual access to the public search features.	7 years ago
luccioman	5161451a35	Stay authenticated when going to the search start page. Otherwise, when authenticated as admin and navigating from search results or admin pages to the search start page (/index.html), if nothing is done on that page within HTTP Digest Auth timeout (about 2mn), then search is performed without authentication and so without extended search features.	7 years ago
luccioman	d0bed78d02	Use the same top nav bar on index.html and search results. Thus eventually including the same optional login link/status in the search start page than in the results page, for the same convenient login without the need to use the Administration section.	7 years ago
luccioman	f678394ce5	Fixed loss of index page form values on 'more options' link click. Restores the behavior introduced eleven years ago (see commit `479861a3cf`) and lost by mistake 3 years ago (see commit `617dd9c97b`), when the click handler started referencing a missing HTML id.	7 years ago
luccioman	af198b990b	Added an optional login link/status to the search public top nav bar. Thus allowing a more convenient way (wihout the need to go to the admin section) to login when searching on your remote or password protected peer and benefit from extended search features such as Heuristics, Bookmarking or JavasScript resorting. Can be disabled using the ConfigSearchPage_p.html.	7 years ago
luccioman	1de86cf1bf	Fixed JPEG snapshot resizing when running on OpenJDK. Resizing JPEG snapshot images through /api/snapshot.jpg failed when running on OpenJDK, but rendered successfully with a Oracle JDK. Details in mantis 772 ( http://mantis.tokeek.de/view.php?id=772 ). Removing any alpha component (useless in snapshot images) from the rendered resized image solves the issue.	7 years ago
luccioman	a17a418e78	Fixed NullPointerException cases on snapshot images parsing.	7 years ago
luccioman	285f0d6a39	Consistently encode snapshot image with format requested on the API. Previously, calling /api/snapshot.png rendered JPEG encoded images.	7 years ago
luccioman	4da15db998	Fixed search result Snapshots link. Previously rendered as a broken URL containing the absolute file path of a snapshot on the search server. Now rendered as a valid URL linking to the /api/snapshot API to provide available snapshot content. Snapshot format is selected among the available ones in the following order of preference : JPG/PNG, PDF, and XML.	7 years ago
luccioman	fe75f326d8	Fixed ProfilingGraph calculation integer overflows and added test class. Complementary to fix proposed in PR #128 by @otteresk.	7 years ago
luccioman	8303e15419	Reduced number of search navigators refresh requests in JS resort mode The SearchEvent listen to changes on each of its navigators, and the information about their overall state is sent with each fetched search item (as a "data-nav-generation" attribute). Then the browser can regularly fetch a fresh version of yacysearchtrailer.html only if necessary (when that nav-generation value change).	7 years ago
luccioman	2ac78e2cca	Addedd missing parameters to yacysearchtrailer call on JS resort mode	7 years ago
luccioman	dbff7b14fc	Add a configurable limit to tags initially displayed in search results When the limit is reached, a button allow expanding/collapsing remaining tags. When this feature is activated without a limit to the number of displayed tags, when encountering search results with a very large number of keywords, the results page can become almost unusable (very long vertical scrollbar)	7 years ago
reger	f8c7d0265e	Adjust tags css style in ConfigSearchPage to equal search page	7 years ago
luccioman	fcea6def72	Added textual hints to language radio buttons labels As an help and accessible alternative to visual styling marking whether a language is available in browser preferred lang mode.	7 years ago
luccioman	27ab733685	Ensure private search features are not lost on Digest auth timeout This is a fix for mantis 766 ( http://mantis.tokeek.de/view.php?id=766 ) Since the upgrade to Digest authentication, access to protected search features was indeed disabled once the Digest nonce timed out. After Digest auth timeout the browser no more sent authentication information and as the search results page is not private, protected features were simply be hidden without asking browser again for authentication. Adding a supplementary parameter when accessing the search results as authenticated fixes this.	7 years ago
reger	dd82f85953	Add links to the optional keyword tags of search result If swichted on link (click) to the tag adds the keyword to the search query. If a keyword navigator is active the selected keyword adds or replaces a query keyword: modifier (currently replace was choosen as multiple keywords are not fully supported yet)	7 years ago
luccioman	fc28c58731	Added missing accessible labels to ConfigSearchPage_p.html	7 years ago
luccioman	8294374c10	Fixed ConfigSearchPage_p HTML validation errors. Validated with Nu Html Checker 17.9.0	7 years ago
luccioman	57a33aefb0	Removed unnecessary max counts init on empty search navigators.	7 years ago
luccioman	b1e7bd0dd6	Restrict Search Result Layout modification to HTTP POST only.	7 years ago
luccioman	ef8aea7f8d	Made the dates navigator max elements number user configurable. Also used object properties on QueryParams instances, rather than using mutable class (static) properties.	7 years ago
luccioman	0b0980b364	Improved accessibility of histograms widgets. Added keyboard navigation support and missing WAI-ARIA attributes. Tested with NVDA 2017.3 screenreader on recent major browsers.	7 years ago
luccioman	62c7cd9a77	Upgraded JavaScript lib raphael.js from 2.1.3 to 2.2.7	7 years ago
luccioman	cbbc7b43d3	Refresh paginations buttons instead of fully rendering each time. This prevent the already displayed pagination buttons to be unresponsive when clicking on them while the rendering JS function is running.	7 years ago
luccioman	18412dca21	Handle JS refreshing of belatedly added search navigators	7 years ago
luccioman	9049a926a5	Restrict JS results resorting to authenticated users. Until a more efficient DOM refresh model needing less XHR requests per search is implemented.	7 years ago
luccioman	4ab961fa46	Added HTML ids to search navigators for a more reliable JS refreshing.	7 years ago
luccioman	ad61a3afed	Results JS resort : properly handle results with same ranking value.	7 years ago
luccioman	57a1007772	Added new graphical setting for browser JS/On demand results resorting.	7 years ago
luccioman	d00a35576c	Apply JS resort only when currently relevant : p2p text search	7 years ago
luccioman	4e3c928d31	Do not animate unnecessarily when changing page on JS sorted results.	7 years ago
luccioman	fb6743e8f8	Prevent unnecessary DOM finds in JS resorting functions. Also removed now unused functions earlierPage() and laterPage().	7 years ago
luccioman	b1b9ffbbc8	Stop updating results with JS resorting on server feeds termination	7 years ago
luccioman	6f5e55c9f0	Updated the JavaScript license information page	7 years ago
luccioman	c7149acb48	Disabled as default verbose browser console logs in yacysort.js	7 years ago
luccioman	b50700c35f	Added missing copyright header to the yacysort.js file	7 years ago
luccioman	86d41f0242	Moved the JS resort specific styling to the usual YaCy CSS location	7 years ago
luccioman	9e86d183b8	Disable manual search results resorting when resorting is done with JS Also added a constant for the js resorting setting key.	7 years ago
luccioman	4ccd38357f	Trigger js resorting animations using only CSS classes. Also added some more descriptive comments.	7 years ago
luccioman	e40a225bc1	Merge branch 'javascript-resort' of https://github.com/Scarfmonster/yacy_search_server into jsResort	7 years ago
Ryszard Goń	2af011243f	Javascript re-sorting: Remove potentially breaking display property and reset max-height when animation is finished.	7 years ago
Ryszard Goń	634f52fefc	Javascript re-sorting: replace jQuery show() with css animations	7 years ago
luccioman	5d3ceb31b7	Improved search navigators counters accuracy and consistency. - added some missing increments from RWI results - decrement relevant navigator counts when solr or RWI results are evicted because duplicates detection or constraints checked belatedly - do not compute facets when unnecessary to avoid unwanted CPU load - do not increment from facets when already done - do not rely on facets on remote solr peers requests, as most of the time only a limited part of their total results if fetched (thus also preventing unnecessary load on remote peers) - use a concurrency friendly score map for the dates navigators to prevent unwanted ConcurrentModificationExceptions This improves the situation for the most obvious inconsistencies in search navigators counts, but more has to be done for a true accuracy (notably when query modifiers constraints are applied belatedly - after the solr or RWI retrieval request - such as the content domain constraint)	7 years ago
JeremyRand	ab0e50b941	Javascript re-sorting: optimize the jQuery selectors a little bit.	7 years ago
JeremyRand	86b5094970	Fix numbered page navigation from getting corrupted when statistics() runs.	7 years ago
JeremyRand	a888254769	Add UI for numbered page navigation when Javascript re-sorting is enabled.	7 years ago
JeremyRand	74333c931e	Fix the sidebar item "Wiki Name Space" with Javascript re-sorting.	7 years ago
JeremyRand	4a9e64caea	(WIP) Add numbered page navigation when Javascript re-sorting is enabled. TODO: Add UI for selecting the number.	7 years ago
JeremyRand	6ec256dc34	(WIP) Fix the sidebar when Javascript resorting is in use. TODO: Add some markup so that DOM traversal in the animations is less painful.	7 years ago
JeremyRand	d37df75afa	(WIP) Optionally sort HTML search items via Javascript. TODO: Expose a GUI setting for this.	7 years ago
JeremyRand	61be709a97	Add data-ranking attribute to each HTML search item.	7 years ago
luccioman	a28428047a	Fixed count of filtered results from local solr. Was inadequately modified in my previous related commits (making next pages buttons unavailable in Search portal mode), as SearchEvent.local_solr_available did not count the total filtered results but only the ones within the currently fetched result page(s).	7 years ago
luccioman	30c2f50e0b	Use final results counts in progress bar detailed statistics. Using unfiltered detailed counts (local and remote entries found before doubles detection and before applying query modifiers) was confusing and inconsistent with the total count. It could let think more results are to come in the next pages, without understanding why they are not displayed.	7 years ago
luccioman	8b25b485eb	Make result action links visible when focusing them with keyboard.	7 years ago
luccioman	3e933979df	Removed duplicate HTML class attribute.	7 years ago
luccioman	ce22076920	Fixed Unresolved_Pattern occurence on results favicon HTML id.	7 years ago
luccioman	a1a0515312	Added a button to manually refresh sorting of p2p search results. As a server-side oriented alternative to the JavaScript realtime resorting feature proposed in PR #104. The goal is the same as in this PR : having the possibility compensate the network latency of various peers results fetching and obtain once possible a consistently ranked result set.	7 years ago
luccioman	4eba88f2ff	Removed some unnecessary uses of java.lang.reflect api. This improves code browsing and readability, making search by references or call hierarchy IDE features more accurate.	7 years ago
reger	51a4e03c93	Allow to stop currently running warc import (stop button)	7 years ago
luccioman	3f0446f14b	Ensure proper synchronous robots entry retrieval on first check. Previously, when checking for the first time the robots.txt policy on a unknown host (not cached in the robots table), result was always empty in the /getpageinfo_p.xml api and in the /CrawlCheck_p.html page. Next calls returned however the correct information.	7 years ago
luccioman	b23a563065	Prevent search result failure on incomplete images information. Complements the recent modification related to images in commit `7f395ef`. Unfortunately many documents metadata fetched from the freeworld p2p network have only partial information about embedded images. Without proper error handling, this made many searches in p2p mode to fail completely.	7 years ago
Michael Peter Christen	7f395ef937	added image link in search results This should be a help to make a preview of search results. The image is computed from the list of embedded images, it is always the first image in that list. In rss-type results the image is presented like <media:content medium="image" url="https://abc.xyz/logo.png"/> as defined in http://www.rssboard.org/media-rss#media-content	7 years ago
reger	4979439e87	Skip public post of jre version. Added to determine switch to java8 `596b5dfa59`	7 years ago
reger	588c6e96fb	upd version for typeahead.jquery.js in jslicense.html	7 years ago
luccioman	8100c033a2	URL Viewer : apply crawler size limits when adding to local index. This allow large files parsing and preview, while preventing unwanted OutOfMemory errors which are likely to occur when adding to the Solr Index resources larger than configured crawler limits.	7 years ago
reger	e5cff062b5	Clean up redundant but obsolete jquery.rdfquery-core-1.0.js script lib	7 years ago
reger	23bda133d2	Fix css conflict of YMarks.html to make it viewable. yacy-ymarks.css sidebar conflicts with bootstraps sidebar (different overlay settings). Simply renamed it to ymark-sidebar.	7 years ago
reger	a21789d4e7	Fix unresolved pattern in api/share.html by init some display var's	7 years ago
luccioman	bf55f1d6e5	Started support of partial parsing on large streamed resources. Thus enable getpageinfo_p API to return something in a reasonable amount of time on resources over MegaBytes size range. Support added first with the generic XML parser, for other formats regular crawler limits apply as usual.	7 years ago
luccioman	1b3c169a9c	URL Viewer : decode raw text using the eventual response charset. When provided, or decode as UTF-8 as previously done.	7 years ago
reger	e6e20dab52	upd to Jetty 9.4.6.v20170531 Modify loginservice to the changes in Jetty, partially based on pull request #101 https://github.com/yacy/yacy_search_server/pull/101 bu @automenta	7 years ago
luccioman	e4c730b99f	Updated PerformanceQueues_p.xml API with last related servlet changes	7 years ago
luccioman	dcc56318bb	Made remote search max system load limits configurable from UI. As reported by davide on YaCy forums ( http://forum.yacy-websuche.de/viewtopic.php?f=23&t=6004 ) when the system is on high load, unless reading carefully YaCy configuration file, it could be difficult to understand why remote search results are not fetched.	7 years ago
luccioman	4b72b29ea2	Added an informative title on the crawl start robots.txt status icon	7 years ago
luccioman	d08f31c3a8	Crawl start Ajax request : properly handle eventual XML parsing errors Otherwise on a malformed getpageinfo_p XML response (from the browser point of view), JavaScript errors where thrown and the ajax status steering wheel remained displayed indefinitely.	7 years ago
luccioman	8da3174867	Ensure lower case conversion consistency with any default locale. Especially for Turkish speaking users using "tr" as their system default locale : strings for technical stuff (URLs, tag names, constants...) must not be lower cased with the default locale, as 'I' doesn't becomes 'i' like in other locales such as "en", but becomes 'ı'.	7 years ago
luccioman	c41b31dcb3	Cleaned up memory usage page HTML - fixed validation errors - removed deprecated attributes - improved accessibility with richer table semantics (headers and caption elements) and language declaration	8 years ago
luccioman	0487336ec3	Prevent integer overflow in table statistics and use strong typing	8 years ago
luccioman	0f80c978d6	Limit the number of initially previewed links in crawl start pages. This prevent rendering a big and inconvenient scrollbar on resources containing many links. If really needed, preview of all links is still available with a "Show all links" button. Doesn't affect the number of links used once the crawl is effectively started, as the list is then loaded again server-side.	8 years ago
luccioman	32288a8999	Merge branch 'master' of https://github.com/yacy/yacy_search_server	8 years ago
luccioman	e9b4b29f90	Limit scope of some local JavaScript variables.	8 years ago
Michael Peter Christen	369b8e0e0b	added json(p) endpoint for crawl start	8 years ago
luccioman	9dd790087d	Added HT Cache basic statistics (hit rate)	8 years ago
luccioman	28b451a0b3	Made Cache compression level and lock timeout user configurable	8 years ago
Michael Peter Christen	6fe735945d	migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8 Also: now Version 1.921	8 years ago
luccioman	8399275142	Properly close file output streams even on exceptions scenarios.	8 years ago
reger	632354e2ff	Tokenize result entry keywords and add some styling for display	8 years ago
reger	a814f3d885	Introduce keyword query parameter This enables keyword navigator to filter on keywords. Added search page output and layout config for keywords, allowing e.g. in Intranet use to display the keywords. No styling or links applied to the keyword text (but is desirable possibly in combination with bootstrap-tagsinput for future/intranet).	8 years ago
luccioman	cbccf97361	Added JavaDoc to the getpageinfo_p API servlet.	8 years ago
luccioman	bd88fd303e	Deprecated duplicated and internally unused getpageinfo servlet. Redirections set for the transition of any eventual external uses: - /api/getpageinfo.xml to /api/getpageinfo_p.xml - /api/getpageinfo.json to /api/getpageinfo_p.json	8 years ago
luccioman	1be4d32f99	Restored search page default behavior for Tab, Page Up and Down keys Replaced by shortcuts defined by the HTML "accesskey" attribute which has the advantage to be advertised by screen readers when focusing the corresponding buttons, contrary to custom JavasScript key handlers. Now With Firefox : - "Alt + Shift + n" for next page - "Alt + Shift + p" for previous page Following ARIA recommendation : "keyboard shortcuts enhance, not replace, standard keyboard access." ( see https://www.w3.org/TR/wai-aria-practices/#kbd_shortcuts_behavior_design) Fix for mantis 711 (http://mantis.tokeek.de/view.php?id=711)	8 years ago
luccioman	45346c1be8	Added missing accessibility attributes on search results progress bar.	8 years ago
luccioman	91a06bc669	Annotated search result information separators for screen readers.	8 years ago
luccioman	31ad043bb9	Added user interface feedback on results feeding termination status. Added as an additional icon with title in the search progress bar, to inform about background search feeder threads terminated or still running. While giving a bit more information to users about the p2p search process, this can help choosing whether or not wait a little bit more time before going to the next page, in order to get results from various sources sorted as best as possible (see #91 for a discussion about sorting accuracy and network latency). Other related modifications included : - regular updates to statistics in the progress bar until the background feeders are completely terminated. - removed some uses of unsecure and discouraged JavaScript elements	8 years ago
luccioman	d90b001e1b	Improved previous merge "Show ranking in HTML UI". - added the new setting as configurable in the "Debug/Analysis" settings page. Debug/analysis is its main purpose for now as there is currently no nice and "understansable" ranking score info servlet (see forum discussion http://forum.yacy-websuche.de/viewtopic.php?f=8&t=5884 ) - render in the "Search Page Layout" page preview when enabled - added constants	8 years ago
luccioman	efe1232d90	Merge branch 'html-show-ranking' of https://github.com/JeremyRand/yacy_search_server Conflicts: defaults/yacy.init	8 years ago
luccioman	4564541b3b	Fixed blacklist Regex containing '+' characters rendering. As reported on YaCy forum by shni (http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5970) when a blacklist entry contained both '?' and '+' characters, the '+' chars were wrongly decoded and rendered as spaces.	8 years ago
luccioman	0612a8f4f2	Fixed the previously added link to scheduled dump operations.	8 years ago
luccioman	a87281b498	Added MediaWiki dump import scheduling feature. Checking the last modified date by default to prevent unnecessary long running operations.	8 years ago
luccioman	10c03c6c64	Improved MediaWiki dump import monitoring. When import thread is terminated : - now stop refreshing and stay on the monitoring page to give user a feedback after a long running import - added link to the next monitoring step : results from surrogates reader - added link to new import On the new import page, added a link on the eventual last import report.	8 years ago
luccioman	8d288f5dba	Crawl results page : apply table lines number limit. Take into account the already existing default limit value (especially useful after a long crawl or surrogates import), or a custom one from parameter "count". Added a "Show all" link for convenience.	8 years ago
reger	c77e43a391	Take out mailto collect in internal parsed document As earlier plans to make use of mailto as separate webgraph entity didn't materialize (see http://forum.yacy-websuche.de/viewtopic.php?f=8&t=5726&p=32493&hilit=mailto#p32493) free the unused handling and resources.	8 years ago
reger	bec34d3546	Add url input field as source for WarcImporter allowing to import warc from url without prior download.	8 years ago
reger	d3df8a46c4	fix unresolved_pattern on missing post parameter api/message.html	8 years ago
luccioman	f66438442e	Extended Mediawiki dump import to remote URLs. When using a public HTTP URL in /IndexImportMediawiki_p.html, the remote file now is directly streamed and processed, allowing import of several GB dumps even with a low memory remote peer, and without need to manually download the dump file first.	8 years ago
luccioman	7edddd7b0d	Improved error reports on various wiki dump prerequisites failure cases. Also added some JavaDoc.	8 years ago
luccioman	dfe8d4139b	Used a text input for wiki dump import file selection. Using an HTML "file" input was confusing (as reported by promocore on YaCy forum : http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5965) , and it only worked with MS IE/Edge on a local YaCy peer : - for security reasons some current major browsers such as Firefox or Chrome do not allow to send full file path information when using a file form input - the local file system selection popup doesn't make sense when you want to import a dump on a remote YaCy server	8 years ago
reger	3a71430030	Adjust ConfigSearchPage_p to activated hosts navigator as plugin	8 years ago
reger	7b80189bda	Activate hosts navigator plugin. This includes rwi results in the navigator count. This might be tangential related to http://mantis.tokeek.de/view.php?id=736 as the example includes a local index search, while rwi results are not counted.	8 years ago
reger	05a1b14b4a	add missing text from ConfigRobotsTxt_p to master.lng and link to Translation Editor to Translation News page.	8 years ago
reger	a39c00a93f	add servlet to list user in UserDB and made user editor available in separate servlet for a quick and easy overview of configured user and selection for edit.	8 years ago
reger	a4498e17c0	fix edit current user form to required post mehtod introduced with `cde237b687`	8 years ago
luccioman	665d087d76	Enforced access controls on a few more administration pages. - ensure use of HTTP POST method when performing server side effect operations - transaction token required to ensure the request has effectively been requested by user interaction	8 years ago
luccioman	0feded21dd	Escaped HTML eventually active content from recorded API call comments.	8 years ago
luccioman	09e72eb0a4	Set Config Portal as a private administration page. Consistently with its required action from submission credentials, and because external unauthenticated users do not need to access these settings.	8 years ago
reger	9339a6a4c5	use css error class for error msg in IndexImportOAIPMH_p.html, adjust to xhtml <p> usage rule	8 years ago
reger	ba339a2a45	Add servlet to import warc file from filesystem IndexImportWarc_p.html. Apply Importer interface to WarcImporter	8 years ago
Michael Peter Christen	1d81b8f102	Merge branch 'master' of git@github.com:yacy/yacy_search_server.git	8 years ago
Michael Peter Christen	69081bce00	added export to elasticsearch. The export dump can easily be imported to elasticsearch using the command curl -XPOST localhost:9200/collection1/yacy/_bulk --data-binary @yacy_dump_XXX.flatjson	8 years ago
luccioman	5b5b9d5d96	URL Viewer : only display the link to metadata when metadata exists	8 years ago
luccioman	39ffa42a3c	Modified RWI settings page radio click event to use HTTP POST	8 years ago
luccioman	af28a07780	Updated API calls recording/replay with recent changes. - enabled HTTP POST calls with Digest HTTP authentication - made API calls compatible with API newly restricted to HTTP POST only with transaction token validation - ensured backward compatibility with older entries recorded as HTTP GET	8 years ago
luccioman	cde237b687	Enforced access controls on some administrative actions. - ensure use of HTTP POST method : HTTP GET should only be used for information retrieval and not to perform server side effect operations (see HTTP standard https://tools.ietf.org/html/rfc7231#section-4.2.1) - a transaction token is now required for these administrative form submissions to ensure the request can not be included in an external site and performed silently/by mistake by the user browser	8 years ago
reger	cbf58d5f0a	Add hint text to default ServerAcess Port Settings page	8 years ago
reger	f05976c017	Display the local search word statistic in alphabetic order	8 years ago
reger	3dd23c178b	Introduce the option to configure a shutdown port. A port value of -1 will disable this option. If set to a value greater 0, YaCy listens on this of on the local loopback address (127.0.0.1) for a shutdown or restart signal. E.g. connect to http://localhost:8005/shutdown will stop the YaCy server. http://localhost:8005/restart will restart it. This option allows to stop YaCy locally independant from the web web frontend (which might be configured for password protected remote access).	8 years ago
reger	a2afb4bae0	add switchboardconstants for server ports config keys	8 years ago
reger	038b9cd98e	update translation for ConfigNetwork_p.html	8 years ago
luccioman	8e77fe3860	Fixed unresolved pattern case in search results progress bar. This is a fix for mantis 715 (http://mantis.tokeek.de/view.php?id=715). A possible path scenario that could leading to this case : - YaCy is running low in memory - a search is requested - before the end of search results rendering, the cleanup job runs and deletes the running search event from the cache because of short memory - then yacysearchitem renders with "-UNRESOLVED_PATTERN-" parameter values passed to the statistics() JavaScript function	8 years ago
luccioman	79df5bb20a	Fixed settingsAck_p.html back link for case where referrer is stripped.	8 years ago
luccioman	5b03feb776	Fixed unresolved pattern case on /yacysearchlatestinfo.json api	8 years ago
luccioman	0173b0bc32	Added an advanced settings page for referrer policy settings. Feedback will be welcome, notably on the descriptive content of this page.	8 years ago
luccioman	cdcd923375	Privacy enhancement : added settings to control referrer policy. HTTP "Referer" header sent by the browser when using YaCy can now be controlled either with the referrer meta tag as a global policy, or only for search result links by adding the attribute rel="noreferrer". To improve privacy with the less possible regressions, the default is set as meta tag with value "origin-when-cross-origin" : internal YaCy links behavior is not affected, but when visiting external websites referrer url is not empty but stripped from query parameters and path. Older browsers, Safari, MS IE and Edge do not support the referrer meta tag, so the standard but less flexible noreferrer link type can also be enabled as an alternative. User-friendly settings page to be implemented.	8 years ago
reger	0aa0dd0b5b	fix delta time calculation in PerformanceSearch_p for the 1. entry (INITIALIZATION displayed absolute date, set delta to 0 for 1. entry)	8 years ago
luccioman	9e626f6b00	Added a hint title for required fields in the Solr Schema editor	8 years ago
reger	7c188ad092	Add extract of queries.log in form of top search word cloud (last 7 days) to AccessTracker_p.html (Network Access -> Local Search Log page). It displays top 20 words of search queries.	8 years ago
luccioman	3475d8c1a9	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	8 years ago
luccioman	c68a8be2d9	Refactored and enforced Solr mandatory fields for proper operation - Added a new method to check activation of mandatory fields on Collection Configuration commit, consistently with checks previously performed in Switchboard startup and with mandatory fields in the default schema. - Reorganized default schema and CollectionConfiguration enumeration : moved no more mandatory fields in a specific section, and moved fields enabled at startup to the mandatory section. - Marked mandatory fields as required and with stronger font in the IndexSchema_p.html page	8 years ago
reger	334c70c37a	correct fromDate init value on missing param in api/timeline_p servlet revert test modification from last commit in AccessTracker.main	8 years ago
luccioman	6e89d125f2	Added robots.txt support for heuristics federated search. As noticed by @reger24, abusive use of OpenSearch systems should be prevented, especially if allowing to parse and reuse HTML results. robots.txt file is now checked before requesting an external OpenSearch system to respect the host exclusions and eventual crawl-delay value. The check is also performed when trying to add a new OpenSearch URL template through the /ConfigHeuristics_p.html admin page.	8 years ago
reger	a011a97de9	make ConfigParser a protected page, for consistent behavior of locked menu items.	8 years ago
luccioman	54405577aa	Replaced absolute redirection locations by relative ones when possible. This makes integration of YaCy behind a reverse proxy subfolder easier.	8 years ago
luccioman	1857651988	Added a new Debug/Analysis advanced settings subsection. As discussed in PR #93 with @JeremyRand and @reger24 this new advanced settings page includes: - a new setting to control remote Solr responses encoding - some existing debug settings which could not be set through the admin user interface	8 years ago
luccioman	94af489f14	Removed deprecated "localMissCount" prop from yacysearchlatestinfo.json. This property has been deprecated four years ago by commit `d74472f562`. For any active search event id, it was then always filled with "-UNRESOLVED_PATTERN-".	8 years ago
luccioman	f6ad927a14	Refactored the DHT-Trigger section in Performance_p.html page. This is to be more easily understandable and to reflect more accurately the current memory strategies implementations that eventually set the "proper" state not only because DHT reception.	8 years ago
luccioman	b51fd9467c	Fixed unresolved pattern on directory entries in HostBrowser.xml api. As described in mantis 725 (http://mantis.tokeek.de/view.php?id=725) the HostBrowser.xml api directory entries had incorrect count attribute value. This was because the HostBrowser html page and backing template servlet evolved, but modifications were not reported on the xml api.	8 years ago
reger	f6b08443f0	adjust column layout in Settings_Proxy.inc	8 years ago
luccioman	95b63f5126	Added a CSS class for infobox block. This will prevent mistakenly hiding a div element not designed to be an infobox but having a ".info" parent (After having previously added the possibility for a div - and not only a span element - to be an infobox).	8 years ago
luccioman	68afe900d0	Added user-friendly controls over disk usage configuration settings. As mentioned in issue #103, control settings over YaCy disk usage already existed but lacked a user-friendly way to set them. I added it to the Performance_p.html administration page with a little refactoring on the "Resource Observer" fieldset for improved accessibility and HTML standards respect. Also added the possibility to enable/disable the autoregulation fonction from this page.	8 years ago
luccioman	d0182e4797	Improved Index Browser accessibility with semantically richer html tags. Made use of ol, li, thead, th, tbody, h1 and h2 html tags. Added aria-label attributes to provide alternative textual information previously only conveyed by color cue. Tested behavior with NVDA 2016.4 screen reader.	8 years ago

... 3 4 5 6 7 ...

6069 Commits (961d3cc8afc54a83aa841d5b958a401265753d08)