new limitation documentation

just replaced two by four
pull/402/head
Michael Peter Christen 4 years ago
parent 3078b74e1d
commit 256fa3d985

@@ -105,7 +105,7 @@
 </fieldset>
 <h3>Hints</h3>
 <ul>
-<li><h4>Crawl Speed Limitation</h4> No more that two pages are loaded from the same host in one second (not more that 120 document per minute) to limit the load on the target server.</li>
+<li><h4>Crawl Speed Limitation</h4> No more that four pages are loaded from the same host in one second (not more that 120 document per minute) to limit the load on the target server.</li>
 <li><h4>Target Balancer</h4> A second crawl for a different host increases the throughput to a maximum of 240 documents per minute since the crawler balances the load over all hosts.</li>
 <li><h4>High Speed Crawling</h4> A 'shallow crawl' which is not limited to a single host (or site)
 can extend the pages per minute (ppm) rate to unlimited documents per minute when the number of target hosts is high.
