yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	e3ef4e3021	- increased default peer ping time from 2 minutes to 1 minute - filtering out too old peers when reading seed lists (limit is now 240 minutes) - added concurrent host names resolving in front of the http client because the http client uses the java built-in DNS resolve which is not multithreading-safe (i have seen deadlocks in thread dumps showing that this bug in jdk is still there) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7515 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	cd19d0517e	added dns resolve to HTTPClient POST using a dns cache to prevent the not-thread-safe built-in dns cache inside apache http client from being used git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7513 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d28f8040e0	removed unnecessary recording function that caused also a performance problem after serving too much files git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7512 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	af87af0d4c	- removed synchronization in serverSwitch which should improve speed - fixed wrong assert in network graph - enhanced double check method in table class git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7511 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4bd65532da	initialization of libraries concurrently (faster start-up) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7510 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	57e6728cb7	- removed usage of /etc/alternatives/www-browser because of problems with lynx, see: http://forum.yacy-websuche.de/viewtopic.php?p=21959#p21959 please look if the browser that is linked with /etc/alternatives/www-browser can be detected and insert call again if it can be made sure that this does not call lynx - replaced severe warnings with just warnings in yacyClient git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7506 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d84b4a072e	healing for some OOM problems git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7502 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	82f262f685	- enhanced circle drawing speed - beautified 'moving dot' feature (using smaller and correctly positioned dots) - added moving dots to DHT transfer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7500 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	29dc416ac6	more animations in graphics. See network and access picture. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7498 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a80ee9a03d	THE GRID is coming to YaCy .. see new animated graphics on http://localhost:8090/AccessGrid_p.html showing incoming and outgoing connections in an animated way git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7496 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	ce012e11aa	*) deleted LogStatistics since the page did not work anymore and it seemed to be obsolete, tell me if you miss it and I will add it again *) a few minor changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7494 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	c5051c4020	*) fixed bug which caused entries to not be deleted when deleting by URL on IndexCreateWWWLocalQueue_p.html (I hope this did not break anything else) *) cleaned up code a little bit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7493 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d58071947a	maybe terminateOldSessions is too slow, removed sleep git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7492 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6c52e31993	new methods to open a browser - if YaCy is started with the option -gui, it is not in headless mode. Then the java 1.6 browse method is used if all other methods fail - in linux, the path /etc/alternatives/www-browser is used if no firefox is installed git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7480 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5892fff51f	introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a multi-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased. Please see new comments in yacy.network.freeworld.unit for details of the new DHT selection methods. The number of maximum peers is now not fixed to a specific number but may increase with - the partition exponent - the number of redundant peers - the robinson burst percentage - the multiword burst percentage The maximum can then be the number of senior peers (all visible peers). git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7479 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4588b5a291	- fixed document number limitation for crawls that restrict the number of documents per domain - some restructuring of the document counting and logging structures was necessary - better abstraction of CrawlProfiles - added deletion of logs to the index deletion option (if the index is deleted using the servlets) which is necessary to reset the domain counters for the page limitation - more refactoring to get the LibraryProvider more clean - some refactoring of the Condenser class git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7478 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	64f32e8f00	*) replaced all IPs in IP filters for proxy with the proper regular expression *) some cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7477 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	93732d6773	increased number of target peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7468 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	70ca7cec8c	fix for http://forum.yacy-websuche.de/viewtopic.php?p=21763#p21763 and another fix for non-working global search when search options are switched off git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7467 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fe93caac5a	added flags and administration options to show advanced search and to show search result attributes (for each search result) Administration can be done at ConfigPortal.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7466 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5905f912c5	replaced more double types with float git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7462 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0cdfb82963	replaced more appearance of double values by float values git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7461 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	eb12e15738	moved all Double values to Float values because of http://www.exploringbinary.com/java-hangs-when-converting-2-2250738585072012e-308/ YaCy does not really need double-precision floating point computation anywhere, so this should not affect any feature git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7460 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	982aa689ef	* fix StringIndexOutOfBoundException in WebStructureGraph * add better escaping to saveMap and loadMap git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7458 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	88773e4daa	changed the default port from 8080 to 8090 see also: http://forum.yacy-websuche.de/viewtopic.php?p=21683#p21683 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7454 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6c35b68f17	- removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file - the own seed file gets the lead for storage of the peer name - exchanged default peer name generation method with one that does not use the local ip - default peer names are now strings starting with '_anon' - added another switch to suppress forwarding to ConfigBasic if the name was already changed - replaced all usages of the yacy.conf peerName with access to the local seed - changes to the peer name are now applied directly and not after the next peer ping git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7453 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	786166041a	- added recording of all accessed and submitted servlets - this recording is then used to redirect from the Status.html page to BasicConfig in case that servlet was never submitted - this acts as an addition to the new default pop-up page 'index.html' which offers an administration link to Status.html. For a first-time user this then redirects directly to the former start page BasicConfig.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7451 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	28f669bf0b	- fixed/enhanced move to SD/16:9 images (network, web structure) - added logging in peer ping to analyse time-consuming elements which could be cause for disappearing peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7450 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0376f73fdb	extended seed list uploader: do not only upload all active peers but also some more peers that are passive but had been active in the last 24 hours git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7449 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	991b92f4ae	enhanced network graphic git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7446 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	3ae8f40fc8	removed yacy.network.group - this feature was never used git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7442 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	efb4ca8fa8	modified auto-delete of search failure-words: - words are now not deleted from the search index automatically if index receive is switched off - a flag in the network definition defines if this feature is switched on at all - the search filter for not-found word references is switched off for server-side remote searches git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7441 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	f1f03d8c90	more logging for strange network loading bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7438 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	4e29e9712a	* create cleanupjob for cached failed urls git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7437 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	a321c7673d	* adminAccountForLocalhost only for localhost * yacy crawls local domains also, if no password is set (the interface is already protected) * it's not required anymore, to set a password in intranet mode git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7436 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	48463c4507	*) General private License? ;-) *) minor code changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7432 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	c93f4dda72	- cleaned up yacy news - removed unused methods - avoid news generation in case that the peer runs in robinson mode git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7431 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6c1b14c8e1	- more control in access tracker: count number of returned search results (not only info how much is in the index) - extended query params for this - enhanced cora git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7430 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	9f38c0023d	*) Minor changes, mainly cleaning up a little bit, no functional changes. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7428 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	54e77e6255	refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7426 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	10ae8d961b	- cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring) - cleaned up (removed special code and documentation for 27c3) - added remote search functions to be used within cora git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7420 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
lotus	0e54233408	UPnP: map port again if we are not reachable (e.g. when router rebooted) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7419 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
lotus	b1484299b2	same units for memory observer configuration (MiB) old setting for DHT (RAM) will be lost after update can be set on /Performance_p.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7418 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	89ae6101b9	fix for NPE and added comment in search result git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7412 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0769f4caa6	added search suggestions for interactive search: is only shown if there are no search results git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7411 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a4c9d27287	- moved some variables from Switchboard to new class AccessTracker - added a limitation in access tracking to delete queries which are older than 10 minutes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7410 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	e4aabaa1c3	* fix negative filelength for files >2G git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7408 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	cdfe8afe3f	fix for really bad table iteration implementation: reduction of IO git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7407 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	ee3cef91e8	* fix filesize in ftp crawls git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7402 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b2ed4cfaf8	more small bugfixes and light refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7401 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	3d95981f7d	*) cleaning up the code a little bit *) minor changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7396 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6b70393d1d	- new java version 1.6 - replaced old gif animator by java 1.6 gif animator git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7388 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e88c428008	fix to ftp loader git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7387 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	9b25a33fd9	- fixed numerous bugs - better document names - fixed problem with ftp crawling - added automatic removal of search results from services that are not online according to the latest network scan: this does not delete the index but just does not show them. after the next network scan when the server is available again, the results are shown again. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7385 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	7bdb13bf7f	more fixes to smb crawling: better file names git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7384 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	94c48500cc	several fixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7383 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	58b59f9bc8	- a collection of bug fixes and some redesign of the Scanner class - fixed smb crawling - added smbget to download script generation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7381 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	c54170421a	fix for npe git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7379 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	6f4f957e50	*) cleaning up the code a little bit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7377 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	2521677a45	* deny adminForLocalhost and intranet network setup also on bootup and not only on network switch * require authentication for yacybot whatever adminForLocalhost is set to (after this patch, is the rule from above really necessary, the crawler also checks the robots.txt) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7376 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	9d2159582f	* fix system update if urls are in blacklist (for example for very general blacklists like *.de) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7375 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	56264dcc17	- added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls - integrated new parser into loader processes: enrich document parser - fixed a concurrent modification exception in kelondro iterator - hand-over of document size from crawler to indexer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7374 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	acab6801d9	added new network scanner - you can scan any ip or host in the internet for services - this replaces the intranet scanner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7371 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a563b05b60	enhanced crawler: - added a new queue 'noload' which can be filled with urls where it is already known that the content cannot be loaded. This may be because there is no parser available or the file is too big - the noload queue is emptied with the parser process which indexes the file names only - the 'start from file' functionality now also reads from ftp crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7368 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	c36da90261	added a very fast ftp file list generator to site crawler: - when a site-crawl for ftp sites is now started, then a special directory-tree harvester gets the complete directory structure of a ftp server at once - the harvester runs concurrently and feeds into the normal crawl queue also in this: - fixed the 'start from file' crawl function - added a link detector for the html parser. The html parser can now also extract links that are not included in <a> tags. - this causes that a crawl start is now also possible from clear text link files git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7367 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4565b2f2c0	removed the display option from index.html, yacysearch.html and yacyinteractive.html instead, a setting at ConfigPortal.html can be made to define if the topmenu shall be shown at these pages or if there is no navigation at all. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7366 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fc2e41e691	added a forwarder for the default page. The forwarder forwards a browser to a different page if the root file index.html is accessed. This can be done by setting the name of the forwarder page to the field "Default index.html Page (by forwarder)" in /ConfigPortal.html The purpose is to forward to /yacyinteractive.html for the 27C3 FTP search platform git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7365 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	db99db4be9	some redesign of the search-fail-response mechanism: when a search fails for a single url because the snippet cannot be generated, then the url reference is deleted from the index. This mechanism was redesigned and enhanced. The process now also writes into the work tables into the table searchfl to prepare a re-indexing mechanism. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7364 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	4915d1781a	* use local backup-file, if remote network-definition is not available * resolve single point of failure in networks, managed by central network-definitions git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7363 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	18d33b5c6d	fixed several search result navigation bugs fixed bad behaviours during search result collection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7362 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	49b5a206cd	- better calculation of search result size - predefined search recommendations git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7361 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4e2c14efbb	fixed bugs in parser and ftp client git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7360 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	f0651e5f2f	added image search to yacyinteractive.html this causes that the search result view switches from list format to image preview format when a search is restricted to png, gif or jpg documents git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7358 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fffb91447a	fixed crawl queue delete function git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7357 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b769cce433	- added a catch-all parser for all documents that cannot be parsed: they will contribute only their document url to the search index - enhanced the pdf and torrent parser: better documents titles - enhanced the ftp client: more time-out time - fixed bugs in json for search results - enhanced yacyinteractive.html: added a file type navigator and a download-script generator for search result files Please have a look at yacyinteractive.html: this will become the hacker-download tool for 27c3! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7355 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	22453b13ad	implemented local host address discovery as posted in http://forum.yacy-websuche.de/viewtopic.php?p=21310#p21310 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7351 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	cc6499bf8d	- added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed - renamed YaCys search result modifications keywords for RECENT, NEAR and language: to the blekko slashtag naming scheme. YaCy now supports the following blekko-like slash built-in slashtags: /date - for search results ordered by date (most recent up) /near - for search results where search words appear near to each other (closest up) /language/<lang> - for a sorting by language where the wanted language gets up. Example: /language/de git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7350 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a9f754c45f	removed unused CR accumulation and distribution process this was never used and extended in the last years. The resulting YBR ranking criteria is still a good idea and will be used in the future. Possible generation methods for YBR ranking are: - "trust-rank" using the link structure as can be discovered in a single crawl (idea from FSCONS) - "block-rank" calculated from the local link structure - a distributed "block-rank" using the xml API to the link structure from other peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7349 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d4a1a1850b	removed warnings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7347 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	3b5830b7d4	*) Fixed typo. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7346 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	9b3fae9496	*) cleaning up the code a little bit *) program to interface, not implementation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7345 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	7bb4b001ed	- view image files from cache - fixed generic header settings; affects CORS functionality git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7344 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	e7552bd719	*) cleaning up the code a little bit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7343 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	737aaf6952	various small changes to ymarks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7339 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	8a50670546	some code clean up for the last post git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7338 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	442497868d	another step towards an auto tagging function for YMarks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7337 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	741a87a3e9	* make .yacy-domains crawlable (.yacy-domains are local domains, so only in custom networks/peers) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7334 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	dca9e16f51	* don't index pages, which redirect, twice * therefore auto-redirection of HTTPClient for crawling is disabled and the old code is reactivated git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7332 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	eb79b952ef	*) cleaner code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7331 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	38fdf43587	*) renamed classes according to standard Java coding conventions *) String.isEmpty() was introduced in Java 1.6, but we still use Java 1.5 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7330 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	025e3f4790	*) renamed classes according to standard Java coding conventions *) removed unused code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7328 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	3b9aa0504e	*) removed unused code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7327 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	db3db0fdb9	*) trying to make this class less confusing (probably failing) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7326 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	54e63b556e	intermediate step for a YMark auto-tagging function based on word frequencies. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7325 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	403ee9c014	added a drill-down for metadata and word count to /api/ymarks/test_treeview.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7324 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	11ae5b108e	enabled rebuildIndex for /Table_YMark_p.html (rebuilds the tags and folders index) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7320 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	94a9be18a4	added a ymark table administration: /Table_YMark_p.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7316 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	25339f93c7	more updates to ymarks - working xbel import/export - exported xbel includes yacy specific metadata but still validates against PUBLIC DTD git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7315 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	cdd65aca71	update to ymarks - get_xbel.xml is almost working - started ymark api documentation info.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7313 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	808edffaf6	ymarks - some refactoring - working xbel and html import (/api/ymarks/test_import.html) - working treeview (/api/ymarks/test_treeview.html) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7312 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	2c539b514a	* add domaincheck (local/global/domainlist) to urlcleaner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7311 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	117fc86b3d	fix for http://forum.yacy-websuche.de/viewtopic.php?p=21199#p21199 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7308 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	09badc697b	- low-memory patch for crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7304 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	becc463d8a	enhanced did-you-mean git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7300 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	43586a2ace	a update to ymarks (please test if you wish): - import HTML (e.g. FF export) via /api/ymarks/import.html - view your import via /api/ymarks/test.html - get a xml list via /api/ymarks/get_ymark_list.xml?tags=&folders= - delete bookmark tables via standard interface /Tables_p.html it is still very experimental!! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7299 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	93c535d111	fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113 fixed a concurrent modification exception during search and a time-out problem git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7298 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4c72885cba	added a sitemap entry parser and loader for sitemaps (a recursion if a sitemap refers to another sitemap) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7295 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	790e0b1894	- enhanced index deletion in IndexControlRWIs_p: delete also robots.txt database and cache if demanded - added option for details of deletion - added deletion to new ConfigHTCache_p servlet git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7294 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	f5324b27f2	more updates to the new bookmarks (ymarks).... - split YMarkTables and YMarkIndex in two different classes - HTML import is working properly - XBEL import is still broken git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7292 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	445619f3ec	added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7291 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	acd93b1b31	* add failsafe mechanism to domainlist retrieval domainlist is saved locally, if none of the given urls in network.unit.domainlist could be retrieved, the file from the last boot is used instead git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7289 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	70c95608d4	Added CORS Access header for yacysearch.rss output used some of the recommendations from Copro: http://forum.yacy-websuche.de/viewtopic.php?p=21015#p21015 Original Request: http://forum.yacy-websuche.de/viewtopic.php?p=20829#p20829 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7288 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
lotus	18729351e7	upnp: hint for wrongly detected local ip address git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7286 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	def4253555	* add option to network definition to provide a domainlist (syntax like in blacklists) * crawler and search allow only urls matching one in domainlist (if list is provided) * this may be useful to prevent dedicated networks from being "polluted" * FilterEngine is an improved Blacklist object; Blacklist may inherit from FilterEngine in the future git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7285 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	ac6b503adf	untar files without gzip decompression even if the file has gz extension. this is done when the decompression fails. decompressed gzip files with gz extension may appear if the server sets a gzip compression header git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7282 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	efe0667fdd	more new bookmark (ymarks) code with experimental html and xbel import git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7281 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
mikeworks	caabebf9be	Fixed spelling mistake omiting -> omitting in debug messages in ConfigUpdate_p.java and Switchboard.java git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7280 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	155d556568	- better memory protection - more logging - little bit of refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7278 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	7d8de34778	* add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7276 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	25a8e55bc9	more logging about bad seeds git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7275 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	959b8c6fa0	- allow greater seed size - more logging for bad seeds git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7274 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e103419a56	- removed <3 peers barrier for peer ping feedback - more logging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7273 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	d0e6c03b51	some updates to the new bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7272 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	facfd204e9	added a parent configuration option. see /ConfigPortal.html requested here: http://forum.yacy-websuche.de/viewtopic.php?p=21099#p21099 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7271 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e3964f2c31	better catch of network definition load error; continue with secondary network load definition location git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7270 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	65a0381f76	*) cleaning up code (still not done) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7267 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e3e3b49d52	- enhanced main release recognition - yacybot user agent now includes the yacy network name (not the peer name!) - refactoring and clean-up (mostly turned tab into spaces) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7266 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	9c94ebdee4	small changes to new bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7265 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	244b56e9d3	an update to the new bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7264 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	dc40f51b8d	*) added headlines as proposed by Vega *) <pre> will be displayed monospaced in wiki and blog again *) bugfix for <pre> spanning multiple lines *) replaced deprecated <s> tag with <span> equivalent git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7262 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	f035f257da	added some more bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7261 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	22ed9c380c	*) fixed bug which was introduced in r7226 (shame on me) which made wiki unusable (all entries were stored with empty subject as key -> edits were lost) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7260 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	60fd2e549d	* log failures when writing config file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7259 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	58e74282af	added a word counter statistic in condenser which is used by the did-you-mean to calculate best matches for given search words. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7258 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	863065abc4	added user agent logging to access tracker git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7256 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	a79728b97d	some updates to experimental bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7254 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	ef782cd026	and even more experimental bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7253 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	ed4371dcf3	enhanced navigation implementation and enhanced tag cloud computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7252 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	ca738ac924	- added a tag cloud to search results (using the topics) - some refactoring of score classes - added default package for new classes add_ymark and delete_ymark git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7251 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	7aca763ca8	Some more experimental bookmark code... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7250 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
apfelmaennchen	4270ed696c	Experimental code (I need to transfer the code to my macbook, sorry) for the new bookmarks API based on the Tables concept (same as for crawl starts). Currently you can add a bookmark by api/ymarks/add_ymark.xml?url=http://www.yacy.net&title=YaCy and watch the result via the standard view Tables_p.html. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7249 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e4d561971e	added more score cluster options and made score cluster usage more transparent git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7248 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e8f90201a5	fix for scheduling of rss feeds git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7247 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	7cd9d9d22a	- enhanced DidYouMean computation using a faster count on index entries; this allows results to be ranked better - added limitations on DidYouMean result sets according to input and output string length git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7246 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	de722090b5	enhancements in did-you-mean guessing git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7243 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a59c885ee0	autocomplete and did-you-mean can now understand _all_ languages and can generate suggestions in all languages and character types git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7242 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b7acd92ce4	Auto-Suggestions for YaCy Search: - added a suggest servlet according to opensearch and firefox standard - integrated the suggest servlet into opensearch description file - integrated an autocomplete plugin for jquery - added an autocomplete addition to the yacy search windows showing autosuggest queries git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7241 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6a166c2040	patches for bad proxy behaviour - accept ipv6 localhost clients - index media files (url only) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7238 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d607b30b6a	performance enhancements for search and code review for database functions - removed read cache from Records data structure because the read cache had no cache hit during search operation - copied old read-cache class to CachedRecords and the old, now new Records class does not have the cache any more and a code review checked that data structures and synchronization is clean - removed unnecessary synchronization from Table class during get() git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7237 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	45b1ab3d07	custom + generic skins: - added a generic skin which is filled with actual color assignment using a servlet - enabled css servlets - added a generic color scheme in configuration file - added configuration input in Customization/Appearance servlet - added a jquery color picker widget - attached the color picker widget to the generic colour definition input fields git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7235 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fcd40cd30f	- disabled domZones (buggy, must think about better solution) - increased time-out for dns resolver and isLocal property git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7233 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0d363a94d7	more performance hacks: this makes YaCy search results VERY fast for all verify=false search cases and it also enhances the search speed for all other snippet-fetch cases. With this change my peer performed 100 Queries Per Second (!!!) while doing 10 queries simultaneously (!!!) in an intranet index of 20000 URLs on my 16-core Mac. Check this yourself by doing: cd bin ./searchtestmulti.sh After finishing the run, divide 1000 by the given time per query (which is the qps for one thread) and then multiply again by 10 (because 10 search threads have been started) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7231 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b8aee6d402	performance hacks for better search performance git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7230 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	091dd3f6ec	- enhanced intranet search speed - enhanced intranet portscan speed (better time-out) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7227 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	b9f405d1e8	*) added comments *) more beautiful and easier to understand code (IMO) *) added display= parameter to a lot of links in Wiki.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7226 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6e6994e328	latest bugfixes to search and indexing function after test of demo presentation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7223 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	aacf572a26	- enhancements for search speed - bug fixes in many classes including basic data structure classes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7217 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	61c82f3105	gzip-compression @ transferRWI & transferURL back again. This reduces upload volume to suit the limited bandwidth of home users like me :-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7215 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	2c549ae341	fixed a number of small bugs: - better crawl start for file paths and smb paths - added time-out wrapper for dns resolving and reverse resolving to prevent blocking - fixed intranet scanner result list check boxes - prevented htcache usage in case of file and smb crawling (not necessary, documents are locally available) - fixed rss feed loader - fixed sitemap loader which had not been restricted to single files (crawl-depth must be zero) - clearing of crawl result lists when a network switch was done - higher maximum file size for crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7214 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	f6eebb6f99	replaced auto-dom filter with easy-to-understand Site Link-List crawler option - nobody understands the auto-dom filter without a lengthy introduction about the function of a crawler - nobody ever used the auto-dom filter other than with a crawl depth of 1 - the auto-dom filter was buggy since the filter did not survive a restart and then a search index contained waste - the function of the auto-dom filter was in fact to just load a link list from the given start url and then start separate crawls for all these urls restricted by their domain - the new Site Link-List option shows the target urls in real-time during input of the start url (like the robots check) and gives transparent feedback on what it does before it can be used - the new option also fits into the easy site-crawl start menu git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7213 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	c60aed4435	no caching in browser of dynamic web pages sent by YaCy http this may prevent unnecessary IO caused by cache storage of the browser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7207 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e63896f2a8	added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7203 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e54cb7fb0c	more bugfixes (also for latest commit) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7202 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d2fd93135c	- moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed - migrated the 'yacy' user agent to 'yacybot' in many client methods since the 'yacy' user agent is only used for the proxy git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7199 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	afa708d552	*) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent *) code changes (PMD) without functional changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7193 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a83186ac7d	fix for bug in cytrails git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7192 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	48c0d508ac	fixes for crawling of smb links (file length not always available) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7190 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0bc6284e27	- added bugfix for access tracker in case of concurrency conflicts - added missing entry for new icu4j path in Mac App git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7188 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	10a9cb1971	simplified snippet computation process and separated the algorithm into two classes also enhances selection criteria for best snippet line computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7182 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
lotus	4450c240b7	npe fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2982 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7181 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	84a023cbc8	fixed several search bugs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7180 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	97ee278931	enhanced search speed: - better control of number of running search threads - no time-out waiting time when no ranking feeding takes place - local search queries by a remote peer may be faster up to 300 milliseconds - a local search may even be faster git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7176 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	ee3820c9cc	more logging for strange "java.lang.NoClassDefFoundError: de/anomic/http/server/RequestHeader" error git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7175 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	377f001e0d	sorting of crawl profile names in crawl profile editor, see http://forum.yacy-websuche.de/viewtopic.php?p=20851#p20851 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7172 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	3552476fbe	completed migration from apache httpclient-3.1 to 4.1: - removed the library - added two classes from the httpclient-3.1 library as source code to YaCy because these classes were used by the YaCy HTTP Server - modified the added classes ChunkedInputStream and ContentLengthInputStream in such a way that: * there are no more dependencies to httpclient-3.1 * these classes are simplified to serve only the purpose of the YaCy httpd git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7171 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a2f9974745	some redesign in the access tracker to address sixcooler's question about a "smarter way for deleting the first Object": - not so much abstraction for a collection, makes use of remove() (no operands) possible - different way to delete elements in track (destructive, not constructive (less copies of elements in new queue)) - more abstraction for class api since no static class must be used any more git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7169 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	03f0414025	some minor correction of my last commit sorry for the noise git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7168 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	42fa0eadb1	fix endless loop: Collection does not support remove(int) (isn't there a smarter way for deleting the first Object?) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7167 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	5a9ea0308f	*) further simplification of wiki code parser (less redundancy in code, less magic numbers), still not done with it... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7166 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	37baa8bae3	- fixes for concurrency exceptions and failed database integrity verification - added link to yacystats peer when peer is more than one day old git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7164 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	29fe401f93	- some layout and text enhancement for site crawl start - Quix0rs patch from http://forum.yacy-websuche.de/viewtopic.php?p=20839#p20839 (parts) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7163 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	461a2a6ec7	enhanced remote crawling: - 300 ppm is default now (but this is switched off by default; if you switch it on you may want more traffic?) - better timing for busy queue - better amount of remote url retrieval - better time-out values - better tracking of availability of remote crawl urls - more logging for result of receipt sending git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7159 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	670ba4d52b	- removed the remote crawl option from the network configuration submenu and - added a remote crawl menu item to the index create menu. This menu also shows a list of peers that provide remote crawl urls - set remote crawl option by default to off. This option may be important but it also confuses first-time users git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7158 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	89c2d8b81e	better initial hash computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7157 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	34e2f7f487	enhanced snippet fetch strategy: concurrent snippet fetch even for offline-snippet searches. This improves speed since it is now possible to fetch snippets offline and parsing of source files from the htcache can be enhanced using concurrency. This improves local and remote search. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7156 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0cf006865e	refactoring and enhanced concurrency git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7155 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	83ac07874f	- corrected return value of put() methods (not used anywhere, so it did not harm before) - added use of LookAheadIterator which should prevent mistakes when coding iterators with embedded iterators - added a fail-safe reaction in case of database corruption using iterators over database elements (no interruption then) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7154 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5702419194	fixed a bug in HTTPClient: keep-alive must be set to false, otherwise servers hold connections 2 seconds open until response. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7151 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5870b13f3a	- code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart - changed data structure for sorting in search which performs better in that specific case (too many updates) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7150 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	ac1c08924e	more performance hacks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7149 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	14c843d364	more performance hacks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7148 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	39f409a7bb	performance hacks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7147 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	7ebef56add	- redesign of a part of the remote search client to make it possible to have a test environment for remote search performance tests - added a remote search test main methods in yacyClient git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7146 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	3c0e07ba72	removed all delays in shutdown process git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7143 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	64860dc1bb	enhanced search event logging (to be used for further improvements) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7140 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	17eebd4ef8	counting crawler traffic again: fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2808 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7138 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	32f73d1aaa	added copy for Info.plist for Mac application release updates (this file contains class paths and start parameters) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7133 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4c21d8dc9d	- changed default values for online caution (the pausing may not be necessary any more) - fixed bug in WeakPriorityBlockingQueue - show favicon faster using pre-loading (same technique as used for fast image search) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7130 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	570ca577c6	performance hacks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7129 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	348dece62f	redesign of the SortStack and SortStore classes: created a WeakPriorityBlockingQueue as special implementation of a PriorityBlockingQueue with a weak object binding. - better abstraction of ordering technique - fixed some bugs regarding result numbering (distinguish different counters in Queue) - fixed an ordering bug in post-ranking (ordering was decreased instead of increased) - reversed ordering numbering using a reversed ordering. The higher the ranking number the better (now). git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7128 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	114bdd8ba7	fixed old sitemap importer which was not able to parse urls containing post elements - removed old parser - removed old importer framework (was only used by removed old parser) - added a new sitemap parser in parser framework - linked new parser with parser access in old sitemap processing routines git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7126 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
lotus	6a09f1f7e5	fix dedicated upnp testing git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7122 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5fe828fa06	- replaced pdfbox and fontbox version 1.1.0 with 1.2.1 - added some clear statements that shall clear static cache size within the pdfbox library - the pdfbox library contains a memory leak; it is unsafe to run a peer with pdf parser permanently on. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7120 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	c757a4aa9f	- corrected lifetime computation for search events - made search event cache cleanup concurrent because cleanup may cause index modifications git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7119 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fb828f3767	- performance enhancements in search response time using faster query ID computation and an ID cache - code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7113 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	22047ffad5	enhanced computation speed of many replaceAll string operations git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7107 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e8228fba09	less locking in time format computation, caching and during secondary (remote) search evaluation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7106 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	9c0c94683c	because of a bug in the search result caching count, search results had not been generated as fast as possible. With this fix search results are (even) faster. Also enhanced: image search. This is now sped up using an image search result look-ahead git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7105 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fa2eb9676e	removed unused class git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7104 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
low012	5f391fcfa9	*) cleaned up in wikiCode parser (more to be done) *) HTML fixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7103 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b3f0d06444	fixed a problem with restarts in YaCy mac applications: the DATA directory path was not submitted when doing a restart. This solves the problem by: - storing the startup properties when yacy is started - using the properties in the restart-script again. this transports also the DATA directory location as parameter of the -gui option that is used when the Mac version of YaCy is started git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7102 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d4e4967e19	cleaned up code in yacyRelease (there will be work to do there) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7101 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	1da5241c2d	do not block server session if maximum number of sessions is reached, just try to clean up once git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7095 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5de70c3d7c	changed way of storage for search requests: - the search request cache can now get as large as 1000 entries - if more entries arrive, unused ones are deleted - the elements may stay in the cache up to 10 minutes and longer if they are used - the elements are deleted earlier than 10 minutes if the memory gets low This commit was mainly done for metager-feeding peers that have a query load of 50000 queries each day. Also added: - a monitor for cache hit/cache miss in PerformanceMemory_p.html (see at bottom of page) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7093 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	9d080f387e	change in handling of the all-visible home path for storage in YaCy: the home path can now be distinguished between - data home; the path where the DATA directory is created - application home; everything else This will make it possible to store application data on Mac releases within the ~/Library/YaCy directory; a place where Mac applications write their data. Similar techniques will be possible for debian and windows. To use the new data path, YaCy can be started with -start <data path> or -gui <data path> git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7092 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	65eaf30f77	redesign of crawl profiles data structure. target will be: - permanent storage of auto-dom statistics in profile - storage of profiles in WorkTable data structure not finished yet. No functional change yet. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7088 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	55da979291	disable revision detection for git git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7084 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	104318d58a	- added nice colors to feed indexing state messages - added a 'remove all' button for new and scheduled rss feed list - made adding of new rss feeds concurrent so the interface is more responsive git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7078 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4f22e2df41	bugfixes for - next-execution-time in scheduler - deletion of scheduled rss feed loading (now deletes also the scheduling entry) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7075 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	42414a6ae3	added two more tables in rss reader interface: - fresh recorded rss feeds (not yet loaded or in scheduler) - rss feeds in scheduler The first list has a button that can be used to place rss feeds into the scheduler The second list has a button to delete rss feeds from the scheduler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7074 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0010cd9db1	Support for indexing of RSS feeds! - added a scanning in html parser for rss feeds - storage of rss feed addresses, can be viewed with http://localhost:8080/Tables_p.html?table=rss - rss items retrieved by http://localhost:8080/Load_RSS_p.html (in Index Creation menu) can be selected and indexed - a rss feed retrieved in http://localhost:8080/Load_RSS_p.html can now be fully indexed - indexing of rss feeds can be placed in scheduler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7073 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0f276dd63f	- MapHeap now implements Map<byte[], Map<String, String>> - refactoring of method names to comply with Map method names git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7072 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	c60d0282fd	more abstraction for tables stored in heaps: the BEncodedHeap now implements Map<byte[], Map<String, byte[]>> This will make it possible that also different database storage types may be added that implement also the same Map<byte[], Map<String, byte[]>> interface. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7070 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	3197ca42ed	preparations to move the HTCache into cora: - move the header framework classes to cora - move the ARC caching classes to cora - refactoring of code to call these classes from cora git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7068 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	844f158686	- removed dependencies in header framework: moved http date methods from DateFormatter to HeaderFramework changed logging to log4j - added ftp load access to MultiProtocolURI - ensured termination of RSS feed iteration git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7067 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5e7081cd19	refactoring towards a unified loading mechanism for MultiProtocolURIs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7065 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	caece04f26	removed System.err and System.out usage from FTPClient; changed logging to log4j (preferred in yacy.cora) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7064 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	90531f78ff	refactoring of the cora package to get subpackages for http and ftp (smb to come) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7063 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	d0fb6bc2bc	cleaned up superfluous classes after sixcoolers migration to HttpComponents-Client-4.x git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7062 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	661867923a	... migrating to HttpComponents-Client-4.x ... The Client is dead, long live the Client! (no references to the old client) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7060 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	7aa860c505	- more logging - more stability for database heap in case of buffer failure git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7058 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4d5446d641	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7057 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	70dd26ec95	added the new crawl scheduling function to the crawl start menu: - the scheduler extends the option for re-crawl timing. Many people misunderstood the re-crawl timing feature because that was just a criterion for the url double-check and not a scheduler. Now the scheduler setting is combined with the re-crawl setting and people will have the choice between no re-crawl, re-crawl as was possible so far and a scheduled re-crawl. The 'classic' re-crawl time is set automatically when the scheduling function is selected - removed the bookmark-based scheduler. This scheduler was not able to transport all attributes of a crawl start and did therefore not support special crawling starts i.e. for forums and wikis - since the old scheduler was not able to crawl special forums and wikis, the must-not-match filter was statically fixed to all bad pages for these special use cases. Since the new scheduler can handle these filters, it is possible to remove the default settings for the filters - removed the busy thread that was used to trigger the bookmark-based scheduler - removed the crontab for the bookmark-based scheduler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7051 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5a994c9796	added a scheduler based on API actions - every process that is monitored with the API Steering interface can now be scheduled! - added input methods in Steering interface to set a scheduling time - added a view on the steering api that shows only crawl jobs inside the Crawl Profile servlet - added a scheduling call process in the cleanup process handler that triggers the scheduled processes This causes that the cleanup now also looks for scheduled processes. Such processes are therefore not executed at the same time as given in the target execution time but they will be executed within the cleanup process time window. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7050 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	189a986ebd	- modified api-call interface to record api calls with references to api-call database (carries pk) - added recording date, last execution date and next execution date for a scheduler (scheduler to be implemented next) - extended database access methods for more data formats, especially for date insert/retrieval - extended 'Steering' interface to show new database fields - migrated Steering to new http client - extended cora http client to transmit authentication and also added some convenience methods (http response code) - simplified database back-end (not so much specialized methods for multiple properties) - extended date formatter to produce a special format to show dates in html (  in spaces of date format) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7049 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	86d7f8a989	- the web visualization can now be generated in custom color - added input fields in WatchWebStructure_p.html - introduced enum classes for Draw Mode and Filter Mode git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7044 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	64d4204f44	fix for NPE in network image computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7043 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	7fdb17bb96	redirect uncaught exceptions to logging + small other changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7042 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	92df768c39	* fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2929&hilit= * strings for navigation links have to be urlencoded git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7037 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	eb97bed1df	patch for http://forum.yacy-websuche.de/viewtopic.php?p=20576#p20576 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7036 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	87b1684211	additional double-check in balancer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7035 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	0d81731e88	fixed crawler bug caused by NPE in logging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7033 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	a82a93f2fc	- better url double check in crawler - more logging for error urls git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7032 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	a6ed6e8cb9	... migrating to HttpComponents-Client-4.x ... make the occurrence of multiple header-keys possible git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7031 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	171f2bd84e	- removed unused network oanet - added new network definition 'allip' which can be used in networks where intranet and internet-addresses shall be indexed - added a auto-switch-off for global search if there are no global peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7030 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b480b7a4d0	fix for bug in last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7027 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	b12bfe1f91	better usage of OSM tile cache and YaCy cache by usage of better tile server computation based on a coordinate hash git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7026 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	388aa021c2	- concurrent loading of OSM tiles - added a 4-time re-try in case that tile server does not respond git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7025 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	301a59e07f	moved browser access method from kelondro/util/OS to gui/framework/Browser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7022 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	ec72387165	added a very early test version of a YaCy gui component. The gui currently does nothing else than providing a search window that sends the search string to the browser The gui is started when YaCy is started with the option -g or --gui, like ./startYACY.sh -g The gui will primary be used to provide a 'real' macintosh version that can be started and operated like any other macintosh application. A special mac application wrapper will follow. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7021 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	d88b9606d1	fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2923 + some client fine tune git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7020 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	6388a58fc7	better memory management and slightly less (in total and temporary) RAM allocation: - confirm that database objects that are not supposed to grow do not have an index memory management that is designed for growth - changed index sorting method in such a way that it allocates fewer objects during quicksort - database classes renaming (shorter, naming addresses that objects hold in RAM) - added a large number of asserts to check if objects actually take the RAM that they should have git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7019 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5924a0d851	- enhanced concurrency in database index access for multicore - added statistics about database index caches in PerformanceMemory_p.html - adapted many classes to use the new statistics - added missing close statements git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7018 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	610855e362	do not use network graph cache if called from authorized account git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7016 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	39d96abbb5	fix yacyRelease download (http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2920&p=20545#p20545) better cookie policy git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7014 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	c29f24a519	... migrating to HttpComponents-Client-4.x ... - Proxy - Release-download git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7011 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	e7ea3b3cc5	added a buffer for network images to reduced load on yacy.net network image server git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7007 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	d5c65b17a6	added another network activity visualization: show strong query activity as radiation around peer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7006 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	15e8c13526	... migrating to HttpComponents-Client-4.x ... (gzip decompression, httploader, robots, ...) + enable proxy-crawling while log is fine git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7001 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
mikeworks	aa663cda4d	ConfigUpdate_p.html and ConfigUpdate_p.java: Added check for downloaded releases and disabled buttons in case no new releases available de.lng: Updated German translation for additional String in ConfigUpdate_p.html XHTML 1.0 Strict fixes for all the other .html files yacy/ui/css/yacyui-portalsearch.css: added .hidden class that was removed from ConfigProperties_p.html Switchboard.java: Added URL for thread Remote Crawl Job and set URL for Remote Crawl URL Loader to null to fix empty href="" git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6996 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	b7102eff92	... migrating to HttpComponents-Client-4.x ... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6989 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
lotus	965aa97993	including sbbi upnplib as source again http://www.sbbi.net/site/upnp/index.html renamed package to yacy all options are also named "yacy" instead of "sbbi" git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6986 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
lotus	74f6fd229e	some comments + debug code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6985 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	52718e6dcb	... migrating to HttpComponents-Client-4.x ... monitoring: replaced unused 'idletime' by uploading bytes added some kind of 'upload-throttling' at dht-out :-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6983 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	89b0f5bce8	fix for exception in http://forum.yacy-websuche.de/viewtopic.php?p=20418#p20418 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6980 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	5fa8038f10	... migrating to HttpComponents-Client-4.x ... monitoring and first try to use remoteProxy git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6979 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	22dbbcfa56	better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6977 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
low012	0e6fed1fb6	- fewer HTML errors (according to https://addons.mozilla.org/de/firefox/addon/249/) - followed some suggestions by PMD git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6970 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	0e56d29335	... migrating to HttpComponents-Client-4.x ... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6968 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	e1316d12d0	... migrating to HttpComponents-Client-4.x ... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6966 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	c5c67f0504	start migrating to HttpComponents-Client-4.x see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2872 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6965 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	7188c54ddb	patch to get dht access to developer peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6958 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	b6fb239e74	redesign of parser interface: some file types are containers for several files. These containers had been parsed in such a way that the set of resulting parsed content was merged into one single document before parsing. Using this parser infrastructure it is not possible to parse document containers that contain individual files. An example is a rss file where the rss messages can be treated as individual documents with their own url reference. Another example is a surrogate file which was treated with a special operation outside of the parser infrastructure. This commit introduces a redesigned parser interface and a new abstract parser implementation. The new parser interface has now only one entry point and always returns a set of parsed documents. In case of single documents the parser method returns a set of one document. To be compliant with the new interface, the zip and tar parsers had also been completely redesigned. All parsers are now much simpler and cleaner in their structure. The switchboard operations had been extended to operate with sets of parsed files, not single parsed files. Additionally, parsing of jar manifest files had been added. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6955 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	150cf42a1b	migrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2 see http://www.gnu.org/licenses/license-list.html for explanation Since (as far as I know) nobody else has ever contributed to these files I may be allowed to just apply an older license. You may consider this as a dual-licensing and may use and optionally replicate the older files under GPL 3. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6952 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	11b7853940	added a configuration page for search heuristics. currently you can switch on there: - a site-operation heuristic that loads all direct links from a portal page if the site-operator is used - a direct crawl for search results from scroogle for the given search terms The configuration page can be found directly beside the network configuration page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6951 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	5d00888c95	- added animated visualization for DHT-in and DHT-out in network graphic - found and fixed a possible memory leak in YaCy internal RSS feed system - some refactoring in RSS feed mechanisms to make this possible git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6950 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	bf25407fdd	added peer hash to internal RSSFeed. The hash will be used to display news activities in the network graphic. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6949 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	1557e0f2d0	- some refactoring for internal RSSFeed (protocol of all actions as seen on status page) - added dht-out to internal RSSFeed (you can see now messages about distributed indexes on status page) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6948 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	5a4684f21f	allow words with length >= 2 (you can't search for 'wm' with 3-letter words...) lets try that. If we run into a memory problem because of too many 2-letter-words, then we must introduce whitelists for 2-letter words. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6947 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	37b8827a7a	- removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection. - cleaned up the code. The new eclipse helios provided new warnings for dead code. This change cleans up most of these warnings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6945 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	dcd01698b4	added a 'transition feature' that shall lower the barrier to move from ggle to yacy (yes!): Here a new concept called 'search heuristics' is introduced. A heuristic is a kind of 'shortcut' to good results in IT, here for good search results. In this case it will be used to get a very transparent way to compare what YaCy is able to produce as search result and what ggle produces as search result. Here is what you can do now: - add the phrase 'heuristic:scroogle' to your search query, like 'oil spill heuristic:scroogle' and then a call to scroogle is made to get anonymous search results from ggle. - these results are _not_ taken as meta-search results, but are used to instantly feed a crawling and indexing process. This happens very fast, here 20 results from scroogle are taken and loaded all simultaneously, parsed and indexed immediately and from the results of the parsed content the search result is fed, along with the normal p2p search - when new results from that heuristic (more to come) become part of the search results, then it is verified if such results are redundant to existing (they had been part of the normal YaCy search result anyway) or if they had been completely new to YaCy. - in the search results the new search results from heuristics are marked with a 'H ++' and search results from heuristics that had been already found by YaCy are marked with a 'H ='. That means: - you can now see YaCy and Scroogle search results in one result page but you also see that you would not have 'missed' the ggle results when you would only have used YaCy. - to make it short: YaCy now subsumes g**gle results. If you use only YaCy, you miss nothing. to come: a configuration page that lets you configure the usage of heuristics and get this feature by default. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6944 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	3a9dc52ac2	added a fascinating new way to search _and_ start a web crawl at the same time: implemented a hint from dulcedo "use site: - operator as crawl start point". YaCy already was able to search using a site-constraint. This function is now extended with an instant crawling feature. When you now use the site-operator, then the landing page of the site and every page that is linked from this page are loaded, indexed and selected for the search result within that search request. When the remote server responds quickly enough, then this process can result in search results during the normal search result preparation .. just in some seconds. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6941 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	2b4f8f6c06	animated network graphic! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6939 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	777195e8d1	more abstraction for access of LoaderDispatcher and cache git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6937 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	7bcfa033c9	more abstraction of the htcache when using the LoaderDispatcher: a cache access shall not made directly to the cache any more, all loading attempts shall use the LoaderDispatcher. To control the usage of the cache, a enum instance from CrawlProfile.CacheStrategy shall be used. Some direct loading methods without the usage of a cache strategy have been removed. This affects also the verify-option of the yacysearch servlet. If there is a 'verify=false' now after this commit this does not necessarily mean that no snippets are generated. Instead, all snippets that can be retrieved using the cache only are presented. This still means that the search hit was not verified because the snippet was generated using the cache. If a cache-based generation of snippets is not possible, then the verify=false causes that the link is not rejected. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6936 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	2ddb952a5c	added the (fixed and enhanced) secondary search process. The process was disabled for some time. The search process for more than one word should be enhanced now and produce many more results. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6933 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	58035ef784	fix in snippet loading git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6932 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	986d4f34d9	added a consistency check for new queues git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6931 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	73f03e05ee	fixed a bug in snippet fetch strategy: cache only does not help if resource can only be found in web git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6930 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	fbf021bb50	redesign of index abstract processing - currently disabled until enough peers have fix in SVN 6928 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6929 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	87087f12fe	- scanned remote search process and enhanced some data structure and synchronizations here and there - removed concurrency overhead for small number of index normalizations as it happens during remote search - removed 'load only parseable' constraint for snippet fetch because some resources may not have any url file extension and these had therefore not been parseable and searchable since they may become parseable after loading when their mime type is known - this partly fixes some problems with http://forum.yacy-websuche.de/viewtopic.php?p=20300#p20300 but more changes are necessary to get all expected search results git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6926 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	b62fb38344	fix for case where no release provider responds during auto-update (caused NPE) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6924 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	3a1cebb598	bugfixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6922 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	989819a28c	- reduced peer-ping time-out from 30 to 10 seconds - no re-try for the peer ping any more (it's a test, let's see what happens) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6921 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	b03caaa57a	better handling of OOM situations git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6918 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	56ff9d5fd4	- extended news size from 512 to 1024 characters - a new news db will be created (news1024.db), the old one (news.db) can be deleted - peers with too large news payload are not ignored any more (they may have been invisible because they had a too large news payload!) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6917 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	c71d829bb5	more time-out properties for http connection manager git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6912 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	60e71876ad	- more abstraction (HashMap -> Map) - more concurrency-awareness (HashMap -> ConcurrentHashMap) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6910 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	a83772c71b	fixes and enhancements for balancer: - crawl lists for each domain now uses a HandleSet which should use less memory than LinkedLists - but: fill more entries into the domain lists (all available entries) - fixes to selection criteria (best domain selection) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6909 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	9cde05418f	fixed url crawl list display git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6908 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago


4679 Commits (2512119e5f6641e176e7ec80346525dc9ceff350)