<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
<html>
<head>
<title>YaCy: Technology</title>
<meta http-equiv="content-type" content="text/html;charset=iso-8859-1">
<!-- <meta name="Content-Language" content="German, Deutsch, de, at, ch"> -->
<meta name="Content-Language" content="English, Englisch">
<meta name="keywords" content="YaCy HTTP Proxy search engine spider indexer java network open free download Mac Windows Software development">
<meta name="description" content="YaCy P2P Web Search: Technology">
<meta name="copyright" content="Michael Christen">
<script src="navigation.js" type="text/javascript"></script>
<link rel="stylesheet" media="all" href="style.css">
<!-- Realisation: Michael Christen; Contact: mc<at>anomic.de-->
</head>
<body bgcolor="#fefefe" marginheight="0" marginwidth="0" leftmargin="0" topmargin="0">
<SCRIPT LANGUAGE="JavaScript1.1"><!--
globalheader();
//--></SCRIPT>
<NOSCRIPT>
<table border="0" cellspacing="0" cellpadding="0" width="100%">
<tr><td></td></tr>
<tr><td height="1" bgcolor="#000000"></td></tr>
<tr><td>
<!-- start headline -->
<table bgcolor="#4070A0" border="0" cellspacing="0" cellpadding="0" width="100%">
<tr><td width="180" height="80" rowspan="3"><a href="http://www.yacy.net"><img border="0" src="grafics/yacy.gif" align="top"></a></td>
<td></td><td width="120"></td></tr>
</table>
<!-- end headline -->
</td></tr>
<tr><td height="2"></td></tr>
<tr><td>
<table border="0" cellspacing="0" cellpadding="0" width="100%">
<tr>
<td width="100" valign="top">
<!-- start lmenue -->
<table width="100%" border="0" cellspacing="0" cellpadding="0">
<tr><td height="2"></td></tr>
<tr><td height="20" class="white" bgcolor="#BDCDD4" valign="middle">&nbsp;<a href="index.html" class="dark">Main Index</a></td></tr>
</table>
<!-- end lmenue -->
</td>
<td width="10" valign="top"></td>
<td valign="top">
<table border="0" cellspacing="0" cellpadding="0" width="100%">
<tr><td height="2"></td></tr>
<tr><td><br>
</NOSCRIPT>
<!-- ----- HERE STARTS CONTENT PART ----- -->
<h2>Web Search Technology</h2><br>
<p><img width="480" src="grafics/YaCy_Technology_Components.png" align="center"></p>
<p>YaCy consists mainly of four parts:
a <b>web crawler</b>, an <b>indexer</b>,
a built-in <b>database engine</b> and
the <b>p2p index exchange</b> protocol, which is based on HTTP.
The YaCy search engine is accessed through the <b>built-in HTTP server</b>.
All parts of this architecture are included in the YaCy distribution.</p><br><br>
<p><img width="480" src="grafics/YaCy_Technology_UserInterface.png" align="center"></p>
<p>YaCy has a built-in HTTP server,
and the user interface consists of web pages served by that server.
You submit search requests to YaCy from within your web browser.
</p><br><br>
<p><img width="480" src="grafics/YaCy_Technology_Crawler.png" align="center"></p>
<p>A web search engine can only search web pages that have been <i>crawled</i>, which means that
the pages linked from a start point, the pages linked from those pages (and so on) have been loaded. YaCy has an integrated web crawler.</p><br><br>
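<p>The idea of crawling outward from a start point can be sketched as a breadth-first traversal.
This is only an illustration, not YaCy's crawler: the class <code>CrawlSketch</code>, the method <code>crawl</code>,
the in-memory link map that stands in for real page loading, and the <code>maxDepth</code> limit are all assumptions made for this example.</p>

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class CrawlSketch {

    // Breadth-first traversal of a link graph (hypothetical helper).
    // `links` maps each page to the pages it links to; a real crawler
    // would load each page over HTTP and extract the links instead.
    // Pages more than `maxDepth` links away from `start` are not visited.
    static List<String> crawl(Map<String, List<String>> links, String start, int maxDepth) {
        Set<String> seen = new LinkedHashSet<>();
        seen.add(start);
        List<String> level = List.of(start);
        for (int depth = 0; depth < maxDepth && !level.isEmpty(); depth++) {
            List<String> next = new ArrayList<>();
            for (String page : level) {
                for (String target : links.getOrDefault(page, List.of())) {
                    // Set.add returns false for pages already seen,
                    // so every page is queued at most once.
                    if (seen.add(target)) {
                        next.add(target);
                    }
                }
            }
            level = next;
        }
        return new ArrayList<>(seen);
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = Map.of(
            "start", List.of("a", "b"),
            "a", List.of("c", "start"),
            "c", List.of("d"));
        System.out.println(crawl(links, "start", 2));
    }
}
```

<p>Tracking already-seen pages keeps the traversal from looping when pages link back to each other, and the depth limit bounds how far the crawl moves away from the start point.</p>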
<p><img width="480" src="grafics/YaCy_Technology_Indexing.png" align="center"></p>
<p>Before a huge number of web pages can be searched efficiently, the pages must be <i>indexed</i>.
This is a complex process, which runs inside YaCy without any user action.
Once the web pages have been indexed, a single YaCy installation can efficiently provide search results
from more than 10 million web pages.</p>
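<p>Why indexing makes searching efficient can be illustrated with a minimal inverted index, which maps each word to the pages that contain it.
This is a sketch under assumptions, not YaCy's indexer or database engine: the class <code>IndexSketch</code>, the methods <code>build</code> and <code>search</code>,
and the simple word splitting are hypothetical choices for this example.</p>

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

public class IndexSketch {

    // Builds an inverted index: word -> set of page names containing it.
    // `pages` maps a page name to its plain text (hypothetical input format).
    static Map<String, Set<String>> build(Map<String, String> pages) {
        Map<String, Set<String>> index = new HashMap<>();
        for (Map.Entry<String, String> page : pages.entrySet()) {
            for (String word : page.getValue().toLowerCase(Locale.ROOT).split("\\W+")) {
                if (!word.isEmpty()) {
                    index.computeIfAbsent(word, w -> new TreeSet<>()).add(page.getKey());
                }
            }
        }
        return index;
    }

    // Returns the pages that contain every word of the query.
    // Each word costs one map lookup; no page text is scanned at search time.
    static Set<String> search(Map<String, Set<String>> index, String query) {
        Set<String> result = null;
        for (String word : query.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (word.isEmpty()) {
                continue;
            }
            Set<String> hits = index.getOrDefault(word, Set.of());
            if (result == null) {
                result = new TreeSet<>(hits);
            } else {
                result.retainAll(hits);
            }
        }
        return result == null ? new TreeSet<>() : result;
    }

    public static void main(String[] args) {
        Map<String, Set<String>> index = build(Map.of(
            "page1", "Peer to peer search",
            "page2", "Web search engine"));
        System.out.println(search(index, "peer search"));
    }
}
```

<p>The work of reading the page text is done once, at indexing time; a search then only looks up the query words and intersects the resulting page sets.</p>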
<!-- ----- HERE ENDS CONTENT PART ----- -->
<SCRIPT LANGUAGE="JavaScript1.1"><!--
globalfooter();
//--></SCRIPT>
<NOSCRIPT>
<br><br></td></tr></table>
</td>
<td width="10" valign="top">
</td>
</tr></table>
</td></tr></table>
</NOSCRIPT>
</body>
</html>