<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd"> <html> <head> <title>YaCy: Technology</title> <meta http-equiv="content-type" content="text/html;charset=iso-8859-1"> <!-- <meta name="Content-Language" content="German, Deutsch, de, at, ch"> --> <meta name="Content-Language" content="English, Englisch"> <meta name="keywords" content="YaCy HTTP Proxy search engine spider indexer java network open free download Mac Windwos Software development"> <meta name="description" content="YaCy P2P Web Search: Technology"> <meta name="copyright" content="Michael Christen"> <script src="navigation.js" type="text/javascript"></script> <link rel="stylesheet" media="all" href="style.css"> <!-- Realisation: Michael Christen; Contact: mc<at>anomic.de--> </head> <body bgcolor="#fefefe" marginheight="0" marginwidth="0" leftmargin="0" topmargin="0"> <SCRIPT LANGUAGE="JavaScript1.1"><!-- globalheader(); //--></SCRIPT> <NOSCRIPT> <table border="0" cellspacing="0" cellpadding="0" width="100%"> <tr><td></td></tr> <tr><td height="1" bgcolor="#000000"></td></tr> <tr><td> <!-- start headline --> <table bgcolor="#4070A0" border="0" cellspacing="0" cellpadding="0" width="100%"> <tr><td width="180" height="80" rowspan="3"><a href="http://www.yacy.net"><img border="0" src="grafics/yacy.gif" align="top"></a></td> <td></td><td width="120"></td></tr> </table> <!-- end headline --> </td></tr> <tr><td height="2"></td></tr> <tr><td> <table border="0" cellspacing="0" cellpadding="0" width="100%"> <tr> <td width="100" valign="top"> <!-- start lmenue --> <table width="100%" border="0" cellspacing="0" cellpadding="0"> <tr><td height="2"></td></tr> <tr><td height="20" class="white" bgcolor="#BDCDD4" valign="middle"> <a href="index.html" class="dark">Main Index</a></td></tr> </table> <!-- end lmenue --> </td> <td width="10" valign="top"></td> <td valign="top"> <table border="0" cellspacing="0" cellpadding="0" width="100%"> <tr><td height="2"></td></tr> <tr><td><br> </NOSCRIPT> <!-- ----- HERE STARTS CONTENT PART ----- --> <h2>Web Search Technology</h2><br> <p><img width="480" src="grafics/YaCy_Technology_Components.png" align="center"></p> <p>YaCy consists mainly of four parts: a <b>web crawler</b>, an <b>indexer</b>, a built-in <b>database engine</b> and the <b>p2p index exchange</b> protocol, based on http. The YaCy search engine can be accessed through the <b>built-in http server</b>. All parts of this architecture are included in the YaCy distribution.</p><br><br> <p><img width="480" src="grafics/YaCy_Technology_UserInterface.png" align="center"></p> <p>YaCy has a built-in http server, and the user interface is realized as web pages on the own web server. A search request to YaCy is done inside your web browser. </p><br><br> <p><img width="480" src="grafics/YaCy_Technology_Crawler.png" align="center"></p> <p>A web search engine can only search web pages that had been <i>crawled</i>, which means that all pages of subpages (and so on) of a start point had been loaded. YaCy has an integrated web crawler.</p><br><br> <p><img width="480" src="grafics/YaCy_Technology_Indexing.png" align="center"></p> <p>Before a huge number of web pages can be searched efficiently, the pages must be <i>indexed</i>. This is a very difficult process which runs inside YaCy without any user action. After indexing of web pages a single YaCy installation is able to provide search results from more that 10 million of web pages efficiently.</p> <!-- ----- HERE ENDS CONTENT PART ----- --> <SCRIPT LANGUAGE="JavaScript1.1"><!-- globalfooter(); //--></SCRIPT> <NOSCRIPT> <br><br></td></tr></table> </td> <td width="10" valign="top"> </td> </tr></table> </td></tr></table> </NOSCRIPT> </body> </html>