yacy_search_server

Commit Graph

Author	SHA1	Message	Date
luccioman	d95d393a0d	Add a query link to local Solr to browse selected recrawl candidates	7 years ago
luccioman	59f7763af6	Display recrawl job report also when job is actively running	7 years ago
luccioman	0c9e0b3566	Record recrawl calls to make them schedulable	7 years ago
luccioman	433e241e4f	Added a report info box about eventual last terminated recrawl job For easier monitoring of recrawls.	7 years ago
luccioman	b2af25b14f	Added a stop condition to the Recrawl busy thread	7 years ago
luccioman	421728d25a	Made possible to customize selection query before launching a recrawl	7 years ago
luccioman	fab6e54fec	Enforced controls (HTTP method, token) on ReIndex and ReCrawl operations	7 years ago
luccioman	8a4ea1c11e	Added UI switch to control content domain constraint per search request	7 years ago
luccioman	36a45b3905	Added UI setting for strictness of content-type checking on media search	7 years ago
luccioman	e6907fdab3	Added optional search parameter/setting to control content domain filter Thus allowing to choose at configuration or per search request, whether extending or not results beyond strict content domain filter (image, video, audio or application). Related graphical controls to be added to user interface.	7 years ago
luccioman	d42c1773c8	Added UI setting for optional encryption with https on p2p searches	7 years ago
luccioman	09c4ee56a7	Added optional https support for remote crawl and profile operations	7 years ago
luccioman	5db1c9155a	Do locale independant case conversion on hosts, schemes, and file exts. Required for proper operation when the default system locale is Turkish, as dottless and dotted i characters have specific case conversion rules in this language.	7 years ago
luccioman	1c4803e40a	Enable optional https support for /yacy/transferURL API calls. Also updated some Javadoc and consistently use Switchboard instance as a constructor parameter where relevant.	7 years ago
luccioman	79a2ba306a	Updated links to Java Regular Expressions documentation to version 8	7 years ago
luccioman	17e004599d	Started implementing optional https preference for protocol operations Introduced through the new configurable setting network.unit.protocol.https.preferred, defaulting to false for now. Let choose to prefer using https when available on remote peers to perform YaCy protocol operations including notably hello or transferRWI. Not yet implemented for every YaCy protocol operations.	7 years ago
ScRe13	bb3d3fe074	fixed default loading default settings; load was populated with wrong value	7 years ago
reger	20bba135fe	Show hide or show public surftip button depending on current config status, to show the button to switch the status (hiding button of current status)	7 years ago
Michael Peter Christen	b907819cb4	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	7 years ago
Michael Peter Christen	25573bd5ab	added a crawl filter based on <div> tag class names When a crawl is started, a new field to exclude content from scraping is available. The field can be identified with the class name of div tags. All text contained in such a div tag where the configured class name(s) match are not indexed, while the remaining page is indexed.	7 years ago
luccioman	640fed2a9c	Removed Java 1.8 no more necessary version checking (fixes issue #147 ) Java 1.8 is by the way now a prerequisite to run from latest sources.	7 years ago
luccioman	d95b288f19	Removed use of deprecated Jetty IPAccessHandler for client filtering. Upgraded to InetAccessHandler. Added InetPathAccessHandler extension to InetAccessHandler to maintain path patterns capability previously available in IPAccessHandler but lost in InetAccessHandler. Filtering on IPv6 addresses is now supported. Support for deprecated pattern formats such as "192.168." and "192.168.1.1/path" has been removed, but startup automated migration should convert such patterns eventually present in serverClient.	7 years ago
Michael Peter Christen	607b39b427	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git Conflicts: htroot/yacysearchitem.java	7 years ago
Michael Peter Christen	4355de0f3c	(more!) evaluation of XRealIP from nginx reverse proxy	7 years ago
luccioman	f9cba827c0	Made "tld:" modifier case insensitive and IDN complient. Thus allowing typing internationalized top-level domains with non ASCII characters as tld: modifier.	7 years ago
luccioman	c5c3cc1274	Use HTTP Post operation for resetting memory monitoring state. Fixes issue #145 Also added textual hint on the button, and display it only when it makes sense, that is to say when the memory state is 'exhausted'.	7 years ago
luccioman	cb10daba92	Renamed Chinese & Greek lng files using ISO639-1 codes. Previously named with their ISO 3166-1 country code : this way, when setting language to "Browser" in ConfigBasic.html, it didn't work properly when browser preferred language was Chinese or Greek as their respective language codes are "zh" and "el" (not "cn" and "gr" which are their country codes)	7 years ago
luccioman	4b61edff32	Added a help link to ISO 639-1 language codes list ref	7 years ago
luccioman	a994d439af	Added description of spatial restrictions in search options	7 years ago
luccioman	8a48f80909	Added language HTML attribute to the search home page.	7 years ago
luccioman	5ff76fdcb9	Fixed spelling	7 years ago
luccioman	2c3f0ff9e8	Updated search page keyboard shortcuts descriptions.	7 years ago
luccioman	af825e9ffc	Use accessible labels for search home page radio buttons.	7 years ago
luccioman	8e732d437c	Enable HTTP Digest authentication for non admin users. Also ensure authentication is not lost by Digest timeout when navigating between index.html and search results page. This way, running searches with extended features on a remote peer or a password protected peer works with a regular user (with "Extended search" rights). When authenticating on the search page with a user without "Extended search" rights, it appears as authenticated, but has just its usual access to the public search features.	7 years ago
luccioman	5161451a35	Stay authenticated when going to the search start page. Otherwise, when authenticated as admin and navigating from search results or admin pages to the search start page (/index.html), if nothing is done on that page within HTTP Digest Auth timeout (about 2mn), then search is performed without authentication and so without extended search features.	7 years ago
luccioman	d0bed78d02	Use the same top nav bar on index.html and search results. Thus eventually including the same optional login link/status in the search start page than in the results page, for the same convenient login without the need to use the Administration section.	7 years ago
luccioman	f678394ce5	Fixed loss of index page form values on 'more options' link click. Restores the behavior introduced eleven years ago (see commit `479861a3cf`) and lost by mistake 3 years ago (see commit `617dd9c97b`), when the click handler started referencing a missing HTML id.	7 years ago
luccioman	af198b990b	Added an optional login link/status to the search public top nav bar. Thus allowing a more convenient way (wihout the need to go to the admin section) to login when searching on your remote or password protected peer and benefit from extended search features such as Heuristics, Bookmarking or JavasScript resorting. Can be disabled using the ConfigSearchPage_p.html.	7 years ago
luccioman	1de86cf1bf	Fixed JPEG snapshot resizing when running on OpenJDK. Resizing JPEG snapshot images through /api/snapshot.jpg failed when running on OpenJDK, but rendered successfully with a Oracle JDK. Details in mantis 772 ( http://mantis.tokeek.de/view.php?id=772 ). Removing any alpha component (useless in snapshot images) from the rendered resized image solves the issue.	7 years ago
luccioman	a17a418e78	Fixed NullPointerException cases on snapshot images parsing.	7 years ago
luccioman	285f0d6a39	Consistently encode snapshot image with format requested on the API. Previously, calling /api/snapshot.png rendered JPEG encoded images.	7 years ago
luccioman	4da15db998	Fixed search result Snapshots link. Previously rendered as a broken URL containing the absolute file path of a snapshot on the search server. Now rendered as a valid URL linking to the /api/snapshot API to provide available snapshot content. Snapshot format is selected among the available ones in the following order of preference : JPG/PNG, PDF, and XML.	8 years ago
luccioman	fe75f326d8	Fixed ProfilingGraph calculation integer overflows and added test class. Complementary to fix proposed in PR #128 by @otteresk.	8 years ago
luccioman	8303e15419	Reduced number of search navigators refresh requests in JS resort mode The SearchEvent listen to changes on each of its navigators, and the information about their overall state is sent with each fetched search item (as a "data-nav-generation" attribute). Then the browser can regularly fetch a fresh version of yacysearchtrailer.html only if necessary (when that nav-generation value change).	8 years ago
luccioman	2ac78e2cca	Addedd missing parameters to yacysearchtrailer call on JS resort mode	8 years ago
luccioman	dbff7b14fc	Add a configurable limit to tags initially displayed in search results When the limit is reached, a button allow expanding/collapsing remaining tags. When this feature is activated without a limit to the number of displayed tags, when encountering search results with a very large number of keywords, the results page can become almost unusable (very long vertical scrollbar)	8 years ago
reger	f8c7d0265e	Adjust tags css style in ConfigSearchPage to equal search page	8 years ago
luccioman	fcea6def72	Added textual hints to language radio buttons labels As an help and accessible alternative to visual styling marking whether a language is available in browser preferred lang mode.	8 years ago
luccioman	27ab733685	Ensure private search features are not lost on Digest auth timeout This is a fix for mantis 766 ( http://mantis.tokeek.de/view.php?id=766 ) Since the upgrade to Digest authentication, access to protected search features was indeed disabled once the Digest nonce timed out. After Digest auth timeout the browser no more sent authentication information and as the search results page is not private, protected features were simply be hidden without asking browser again for authentication. Adding a supplementary parameter when accessing the search results as authenticated fixes this.	8 years ago
reger	dd82f85953	Add links to the optional keyword tags of search result If swichted on link (click) to the tag adds the keyword to the search query. If a keyword navigator is active the selected keyword adds or replaces a query keyword: modifier (currently replace was choosen as multiple keywords are not fully supported yet)	8 years ago
luccioman	fc28c58731	Added missing accessible labels to ConfigSearchPage_p.html	8 years ago
luccioman	8294374c10	Fixed ConfigSearchPage_p HTML validation errors. Validated with Nu Html Checker 17.9.0	8 years ago
luccioman	57a33aefb0	Removed unnecessary max counts init on empty search navigators.	8 years ago
luccioman	b1e7bd0dd6	Restrict Search Result Layout modification to HTTP POST only.	8 years ago
luccioman	ef8aea7f8d	Made the dates navigator max elements number user configurable. Also used object properties on QueryParams instances, rather than using mutable class (static) properties.	8 years ago
luccioman	0b0980b364	Improved accessibility of histograms widgets. Added keyboard navigation support and missing WAI-ARIA attributes. Tested with NVDA 2017.3 screenreader on recent major browsers.	8 years ago
luccioman	62c7cd9a77	Upgraded JavaScript lib raphael.js from 2.1.3 to 2.2.7	8 years ago
luccioman	cbbc7b43d3	Refresh paginations buttons instead of fully rendering each time. This prevent the already displayed pagination buttons to be unresponsive when clicking on them while the rendering JS function is running.	8 years ago
luccioman	18412dca21	Handle JS refreshing of belatedly added search navigators	8 years ago
luccioman	9049a926a5	Restrict JS results resorting to authenticated users. Until a more efficient DOM refresh model needing less XHR requests per search is implemented.	8 years ago
luccioman	4ab961fa46	Added HTML ids to search navigators for a more reliable JS refreshing.	8 years ago
luccioman	ad61a3afed	Results JS resort : properly handle results with same ranking value.	8 years ago
luccioman	57a1007772	Added new graphical setting for browser JS/On demand results resorting.	8 years ago
luccioman	d00a35576c	Apply JS resort only when currently relevant : p2p text search	8 years ago
luccioman	4e3c928d31	Do not animate unnecessarily when changing page on JS sorted results.	8 years ago
luccioman	fb6743e8f8	Prevent unnecessary DOM finds in JS resorting functions. Also removed now unused functions earlierPage() and laterPage().	8 years ago
luccioman	b1b9ffbbc8	Stop updating results with JS resorting on server feeds termination	8 years ago
luccioman	6f5e55c9f0	Updated the JavaScript license information page	8 years ago
luccioman	c7149acb48	Disabled as default verbose browser console logs in yacysort.js	8 years ago
luccioman	b50700c35f	Added missing copyright header to the yacysort.js file	8 years ago
luccioman	86d41f0242	Moved the JS resort specific styling to the usual YaCy CSS location	8 years ago
luccioman	9e86d183b8	Disable manual search results resorting when resorting is done with JS Also added a constant for the js resorting setting key.	8 years ago
luccioman	4ccd38357f	Trigger js resorting animations using only CSS classes. Also added some more descriptive comments.	8 years ago
luccioman	e40a225bc1	Merge branch 'javascript-resort' of https://github.com/Scarfmonster/yacy_search_server into jsResort	8 years ago
Ryszard Goń	2af011243f	Javascript re-sorting: Remove potentially breaking display property and reset max-height when animation is finished.	8 years ago
Ryszard Goń	634f52fefc	Javascript re-sorting: replace jQuery show() with css animations	8 years ago
luccioman	5d3ceb31b7	Improved search navigators counters accuracy and consistency. - added some missing increments from RWI results - decrement relevant navigator counts when solr or RWI results are evicted because duplicates detection or constraints checked belatedly - do not compute facets when unnecessary to avoid unwanted CPU load - do not increment from facets when already done - do not rely on facets on remote solr peers requests, as most of the time only a limited part of their total results if fetched (thus also preventing unnecessary load on remote peers) - use a concurrency friendly score map for the dates navigators to prevent unwanted ConcurrentModificationExceptions This improves the situation for the most obvious inconsistencies in search navigators counts, but more has to be done for a true accuracy (notably when query modifiers constraints are applied belatedly - after the solr or RWI retrieval request - such as the content domain constraint)	8 years ago
JeremyRand	ab0e50b941	Javascript re-sorting: optimize the jQuery selectors a little bit.	8 years ago
JeremyRand	86b5094970	Fix numbered page navigation from getting corrupted when statistics() runs.	8 years ago
JeremyRand	a888254769	Add UI for numbered page navigation when Javascript re-sorting is enabled.	8 years ago
JeremyRand	74333c931e	Fix the sidebar item "Wiki Name Space" with Javascript re-sorting.	8 years ago
JeremyRand	4a9e64caea	(WIP) Add numbered page navigation when Javascript re-sorting is enabled. TODO: Add UI for selecting the number.	8 years ago
JeremyRand	6ec256dc34	(WIP) Fix the sidebar when Javascript resorting is in use. TODO: Add some markup so that DOM traversal in the animations is less painful.	8 years ago
JeremyRand	d37df75afa	(WIP) Optionally sort HTML search items via Javascript. TODO: Expose a GUI setting for this.	8 years ago
JeremyRand	61be709a97	Add data-ranking attribute to each HTML search item.	8 years ago
luccioman	a28428047a	Fixed count of filtered results from local solr. Was inadequately modified in my previous related commits (making next pages buttons unavailable in Search portal mode), as SearchEvent.local_solr_available did not count the total filtered results but only the ones within the currently fetched result page(s).	8 years ago
luccioman	30c2f50e0b	Use final results counts in progress bar detailed statistics. Using unfiltered detailed counts (local and remote entries found before doubles detection and before applying query modifiers) was confusing and inconsistent with the total count. It could let think more results are to come in the next pages, without understanding why they are not displayed.	8 years ago
luccioman	8b25b485eb	Make result action links visible when focusing them with keyboard.	8 years ago
luccioman	3e933979df	Removed duplicate HTML class attribute.	8 years ago
luccioman	ce22076920	Fixed Unresolved_Pattern occurence on results favicon HTML id.	8 years ago
luccioman	a1a0515312	Added a button to manually refresh sorting of p2p search results. As a server-side oriented alternative to the JavaScript realtime resorting feature proposed in PR #104. The goal is the same as in this PR : having the possibility compensate the network latency of various peers results fetching and obtain once possible a consistently ranked result set.	8 years ago
luccioman	4eba88f2ff	Removed some unnecessary uses of java.lang.reflect api. This improves code browsing and readability, making search by references or call hierarchy IDE features more accurate.	8 years ago
reger	51a4e03c93	Allow to stop currently running warc import (stop button)	8 years ago
luccioman	3f0446f14b	Ensure proper synchronous robots entry retrieval on first check. Previously, when checking for the first time the robots.txt policy on a unknown host (not cached in the robots table), result was always empty in the /getpageinfo_p.xml api and in the /CrawlCheck_p.html page. Next calls returned however the correct information.	8 years ago
luccioman	b23a563065	Prevent search result failure on incomplete images information. Complements the recent modification related to images in commit `7f395ef`. Unfortunately many documents metadata fetched from the freeworld p2p network have only partial information about embedded images. Without proper error handling, this made many searches in p2p mode to fail completely.	8 years ago
Michael Peter Christen	7f395ef937	added image link in search results This should be a help to make a preview of search results. The image is computed from the list of embedded images, it is always the first image in that list. In rss-type results the image is presented like <media:content medium="image" url="https://abc.xyz/logo.png"/> as defined in http://www.rssboard.org/media-rss#media-content	8 years ago
reger	4979439e87	Skip public post of jre version. Added to determine switch to java8 `596b5dfa59`	8 years ago
reger	588c6e96fb	upd version for typeahead.jquery.js in jslicense.html	8 years ago
luccioman	8100c033a2	URL Viewer : apply crawler size limits when adding to local index. This allow large files parsing and preview, while preventing unwanted OutOfMemory errors which are likely to occur when adding to the Solr Index resources larger than configured crawler limits.	8 years ago
reger	e5cff062b5	Clean up redundant but obsolete jquery.rdfquery-core-1.0.js script lib	8 years ago

1 2 3 4 5 ...

5853 Commits (1e4ceaac3f8dfcfe7a6bc969827a7ae49d719f00)