YaCy '#[clientname]#': Heuristics Configuration

#%env/templates/metas.template%# #%env/templates/header.template%# #%env/templates/submenuSearchConfiguration.template%#

Heuristics Configuration

A heuristic is an 'experience-based technique that help in problem solving, learning and discovery' (wikipedia). The search heuristics that can be switched on here are techniques that help the discovery of possible search results based on link guessing, in-search crawling and requests to other search engines. When a search heuristic is used, the resulting links are not used directly as search result but the loaded pages are indexed and stored like other content. This ensures that blacklists can be used and that the searched word actually appears on the page that was discovered by the heuristic.

search-result: shallow crawl on all displayed search results

add as global crawl job

When a search is made then all displayed result links are crawled with a depth-1 crawl. This means: right after the search request every page is loaded and every page that is linked on this page. If you check 'add as global crawl job' the pages to be crawled are added to the global crawl queue (remote peers can pickup pages to be crawled). Default is to add the links to the local crawl queue (your peer crawls the linked pages).

Available/Active Opensearch System #{osdcfg}# #{/osdcfg}#

Active	Title	Comment	Url (format opensearch Url template syntax)	delete
	#[title]#	#[comment]#
new

#[osderrmsg]#

With the button "discover from index" you can search within the metadata of your local index (Web Structure Index) to find systems which support the Opensearch specification. The task is started in the background. It may take some minutes before new entries appear (after refreshing the page). Alternatively you may copy & paste a example config file located in defaults/heuristicopensearch.conf to the DATA/SETTINGS directory. For the discover function the web graph option of the web structure index and the fields target_rel_s, target_protocol_s, target_urlstub_s have to be switched on in the webgraph Solr schema. #{osdsolrfieldswitch}##{/osdsolrfieldswitch}#

#%env/templates/footer.template%#