yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	84be912e90	fix for null pointer exception that occurred when missing user-agent in request header see also http://forum.yacy-websuche.de/viewtopic.php?f=6&t=78&hilit= git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3943 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c1aad9e508	added parameter for network graphic background git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3942 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	19786b73b6	next try for a better restart git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3941 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	08d5db6bb4	next try to fix the restart git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3939 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2ff94b2fb4	another try to fix the restart on linux (it works on mac) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3938 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c5c268c43e	tried to fix restart button kann das mal jemand auf seiner linux-platform testen und feed-back geben ob der restart funktionier ? git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3937 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e03fcf4627	SSI fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=29 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3936 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1a45ecb356	- fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=14&p=137#p137 - fix for missing restart script in ant built target - removed some more synchronization for size() operations - removed blocking statement on search page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3935 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1fa4feb8e6	added restart button. should work on linux and mac, but was only tested on mac should of course work on windows as before git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3934 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f1ed91a8e4	added option to allow/disallow DHT transmission during indexing see also http://forum.yacy.de/viewtopic.php?f=9&t=8 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3933 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	87afdfc2a7	fix for long waiting time during deletion of processed news see http://forum.yacy.de/viewtopic.php?f=6&t=6 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3932 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9bbd39b67c	- removed unfinished auto-updater from roland and martin - added new download-option for releases on the status page still mising: - thomas-style restart for linux/mac - untar/gunzip on shell basis (comes next) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3931 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	154ffd7c2c	fix for wrong http connection version and SSIs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3928 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1782ef57e5	- added SSI parser and include directive for <!--# include virtual="<file>" --> - added chunked file transfer for non-yacy clients - SSIs are streamed using chunked transfer, partly delivered pages can be seen in browser before transmission is finished - added client-side network unit identification - cleaned up code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3926 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6b4cfbd2d6	new network bootsraping method - no more contact to yacy.net (no remote superseed any more) - moved superseed file into new network unit definition - fixed build; includes new network bootstraping files now git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3922 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0e57a8062b	added network definition for different YaCy networks (needs much more work) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3919 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	1d41ebf489	) made age for deletion of too old seeds configurable ) changed naming-scheme of seed-deletion-properties git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3918 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	52cb3208d0	*) old (lastseen > 7d) peers are now automatically removed from passive and potential seed-dbs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3917 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	815e3da62f	fix for http://www.yacy-forum.de/viewtopic.php?p=37353#37353 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3913 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
michitux	25529290ca	- 2 small changes in documentation - hopefully fixed logging of GCs (in order to avoid things like "performed necessary GC, freed 18014398509481565 KB (requested/available/average: 4096 / 1631 / 2957 KB)") with the help of KoH git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3909 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	c59a7ce5c2	*) hopefully fixed a stupid bug (my fault of course) that sometimes messed up the marking of search words in the snippets (see http://www.yacy-forum.de/viewtopic.php?p=37329#37329 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3908 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6518bb6c08	changed release strategy: we will provide two different releases in the future, one standard release and one 'pro'-release. the 'pro'-release contains all additional parsers AND has different default performance values. The pro-version differs therefore from the previous 'all'-version by this default values. The pro-configuration is automatically choosen if the libx-folder exists. If a version is once initialized, its configuration stays independently from an existing libx folder. The ant targets had been changed. There are now 3 different targets to create standard and pro-releases, and one target to upgrade: - dist: creates a standard release (only, no libx target any more) - distPro: creates a pro-release (includes the libx) - distExt: creates a libx-release which includes the libx-folder only. It may be used to upgrade from standard to pro Furthermore, the naming of 'dev'-releases had been removed. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3902 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	069562a14d	fixed problem with re-crawl; replaced error file-db with ram-db git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3900 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c7a614830a	several bugfixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3899 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	465145cb6f	revert to insecure, but dau-proof defaults git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3898 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	7ad11ceaaa	security fix for peers without password. allow access only from localhost git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3897 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	2784820ee3	*) moving sleep to a better place git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3895 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	7a1b811d18	*) bugfix for SocketException: git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3893 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	71fd972ac0	- reduced default search time - catched case when web structure cannot be painted because of too less data - better logging when balance fails git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3892 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2b937abef1	slighlty different behavior in shutdown sequence for http server threads: - first close streams - make pause (that one that was made in httpdFileHandler) - close sockets git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3890 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e4aa8f2a08	disabled more sleep(200) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3889 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	cb38e57622	reduced httpd final waiting time git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3888 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b4585ad67d	im Sommer 2005 wurden die ersten pings zwischen YaCy-Peers ausgetauscht. Das klappte aber merkwürdigerweise nicht immer. Um das Protokoll zu testen schrieb ich eine einfache message-Funktion, so wie sie heute noch in YaCy drin ist. Aber auch die Messages funktionierten nicht richtig. Alex und ich haben lange Zeit gesucht, und den Fehler nie gefunden. Es stellte sich heraus das ein Timing-Detail das Problem lösen konnte, die Ursache haben wir bis heute nicht gefunden. Die Lösung des Problems bestand aus einem kurzen sleep, kurz bevor der httpd Daten zum client zurück geschrieben hat. Das ist natürlich eine fürchterlich schlechte Lösung. Bis heute war diese Sache im httpd. Mit diesem Commit habe ich den sleep auskommentiert, und es steht zu befürchten das wieder irgendwas nicht geht. Wenn jetzt das Netz zusammenbricht, keine pings mehr ankommen oder so, war es dieses sleep, das es verhinderte. Vorschläge willkommen. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3887 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f64d9f1c6c	removed forced termination in case that a previous bad termination is detected this will cause many users to be unsure what to do next an leave them helpless to simply delete the control file is the same thinig that the user is othervise forced to do git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3885 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e1d809d5f1	- more detailed logging of MEMORY messages - forced GCs don't contribute to heuristics anymore git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3881 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9f7765863b	bugfix for seed length control routine git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3879 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0b10ef64ba	better server access tracking git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3878 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4f5496062c	protection against too large seeds git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3877 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	684ded0e09	added new news types git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3876 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	669f840eab	- added ViewProfile / Impressum (default on) to local peer's robots.txt git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3874 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d7de0938a6	fix for http://www.yacy-forum.de/viewtopic.php?p=36587#36587 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3870 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	22ee85ca02	- specified exceptions thrown by ResourceInfoFactory and plasmaHTCache.loadResourceInfo() - caught possible NPE in CacheAdmin_p and added more error-cases - speeded up deletion of entries in the local crawl queue by crawl profile (it has been noted often that this deletion is slow) - added a bit javadoc git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3868 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	dfd5e823c3	automatic limitation of web structure host count git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3867 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	8b0aea6910	fixed automatic deletion of too many referenced hosts in web structure git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3866 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5dd9acc2a7	removed calls to deprecated methods git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3865 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	66ec8b63c1	added a httpd access tracker: - all requests to the own httdp can now be listed in the access tracker menu - the search statistics had been renamed to access tracker and extended by this tracker git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3861 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	71ca9aa6d4	- fix for changed blacklist types git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3857 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9a8a87612d	added new qph column to search tracker servlet git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3854 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e07458bad4	added time-out function to web analysis the default time-out is 1 second git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3852 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	4a1bc4743a	*)News-entries with blacklisted URLs are now ignored git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3849 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	6074264267	dynamic rights. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3847 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	99062c0c9e	*) SOAP should support authentication against the user-DB now (requested by KoH) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3846 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	339153d40e	*) favicons that are specified in the document content via html link-tags are now detected and displayed on the search page (requested by allo). git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3845 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	854eb1492f	.yacy /.yacyh urls for the feedreader git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3844 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	7a5b22a0b8	Integration of FeedReader in Bookmarks. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3841 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	6265d321bd	- more constants - display why global search is not available on search page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3839 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	7921f07c9d	userDB fix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3837 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	7b2e1bb8f2	Feedparser with reflection. TODO: This needs a special build.xml entry git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3832 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	8bff810d19	- fixed logging output of serverMemory.request() - don't start up if DATA/yacy.running exists as this is usually a sign of an already started yacy-instance git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3831 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	18a5380ee3	) situation-dependent lock-buttons for search-page ) removed one unused import and a double definition of "ogg" as media-type git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3817 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	f05ca43780	- the wiki-parser works for remote wiki-code now, not displaying links anymore as if they were local (ViewProfile comment) - fixed wrong link to CrawlStart on Status-page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3816 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	9d6605a83c	- fixed NPE in Blacklist Cleaner during deletion of more than one double entries - don't display responseHeader1.db in CacheAdmin_p anymore git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3814 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	594ff95955	:-( git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3801 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4ca797401e	fix for ConcurrentModificationException see http://www.yacy-forum.de/viewtopic.php?p=36566#36566 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3800 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7b904e0077	integrated robots.txt crawlDelay into the crawl balancer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3797 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	52cb033f01	- slightly different painting of web structure picture: hosts that have many own connections are painted farer away (this is not yet cato's idea, this will be implemented in another step) - doc update git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3796 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	30c3d909b1	- fixed charset problem in ConfigProfil_p.html (use accept-charset="UTF-8" in forms) - fixed wrong XML output if no peers are known in Network.xml - simplified parsing of table properties in wikiCode and ZTableToken - reimplemented GC heuristics. They are needed to constantly ensure that an amount of free memory is available which is higher than Java's max. limit for performing a Full GC (please use serverMemory.request(long, boolean) rather than serverMemory.available(long, boolean) to provide data for averaging over the last GCs) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3793 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	6c9df13552	more debugging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3791 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	4392ee0c51	BugFix for typo and wrong include git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3789 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	d1e1580223	Surftips Blacklist Blacklists List Hardcoded instead of only updated on firststart / migration.java git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3788 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	e1a5babff1	*) Logging GUI handler: line-size is now set to max-size if max-size was exceeded See: http://www.yacy-forum.de/viewtopic.php?p=36355 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3786 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	94cc9f05f5	*) Improvements for restart via update wrapper git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3785 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	44bac7dea1	*) blog-comments can now be moderated git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3778 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
borg-0300	2ab020445a	bugfix, i think - http://www.yacy-forum.de/viewtopic.php?t=4059 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3777 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	f89517203d	*) SOAP: new function to get the Performance Settings of Queues and Processes No items left in the yadmin SOAP-TODO :-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3776 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	92351c4dcb	*) SOAP: bookmarks list now indicates if a bookmark is private (requested by KoH) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3775 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	957a25afff	getRight(rightName) instead of get...Right() git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3774 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	e0e46d3aec	*) SOAP: new function doGarbageCollection (requested by KoH) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3773 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	1efe607c34	git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3771 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	a0149317ac	*) fixed bug where headlines were added to directory of a wiki page multiple times (http://www.yacy-forum.de/viewtopic.php?t=4034 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3762 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	ef24bed406	Sorry... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3760 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	a29cb2e1af	blupp git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3759 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	651b05ba43	*) wsdl file updated (requested by KoH) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3758 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0ae6664ad8	enhanced web structure picture - hand-over of get properties from web front-end to graphics generation - added depth-control buttons - added marking of anchor-points to highlight relation order - enhanced ymage graphics library git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3757 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	a585b4d41b	added web structure image see http://localhost:8080/WatchWebStructure_p.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3747 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	33ad0c8246	added a web structure computation and logging: - all web page parsing operations will now increase a web structure file - the file is computed in memory and dumped at shutdown-time to PLASMASB/webStructure.map in readable form (not a database) - the file can be used externally to analyse the link structure of the crawled pages - the web structure can also be retrieved using a xml-interface at http://localhost:8080/xml/webstructure.xml - the short-term purpose is the computation of a link-graph image (before linuxtag!) - a long-term purpose could be a decentralized computation of the citation rank git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3746 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	7904175338	- sorry for typos git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3743 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	baa9402b97	- wiki-parser is now configurable via the config setting wikiParser.class which holds the class-name for the parser to use git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3742 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	601fc7d1c5	- added source to J7Zip-modifed.jar and it's license (changelog is still to come) - moved HTML-*replace-methods from wikiCode to de.anomic.data.htmlTools - prepared use of different wiki parsers as suggested here: http://www.yacy-forum.de/viewtopic.php?p=34444#34444 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3741 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	0a64047081	- plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method) - added 7zip parser - added 'text/sgml' to realtime parseable mimetypes (sometimes returned by the mime type parser) - added new cached output stream class, very suitable for parsers because of limited memory git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3740 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	b1680ab71f	*) bugfix for ArrayIndexOutOfBoundsException in robots-parser (thanks to low012) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3739 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	b30e64daab	*) passing homepath to serverLog.configureLogging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3738 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	9a4375b115	*) robots.txt: adding support for crawl-delay git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3737 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	11ac7688d5	reverted a part of last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3736 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b3f97b5c38	git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3735 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	3c5ff7f735	adopted kelondroBytesIntMap to kelondroIntBytesMap git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3734 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5551ff5306	enhanced index storage data structure kelondroBytesIntMap this stores now two index structures, one for data that is aquired during start-up and one for data that is aquired during run-time. This reduces the grow factor, and should reduce the memory amount in case that a index-reorganisation happens. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3733 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	872eb46cb9	some redesign of the handling of the index for kelondroFlexTable git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3732 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	086239da36	- added servlet: remote crawler queue overview - added servlet: crawl profile editor git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3731 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	8ba81e0995	- added some comments (will get more in the near future) - added missing <label> to the search field in Network.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3728 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	65a8a9fc58	fix for nullpointer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3726 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b05e2314cf	another dht selection fix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3725 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	03c6551b0c	- fix for http://www.yacy-forum.de/viewtopic.php?t=3747 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3724 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b28e5d0ee9	protection against wrong word hash length see http://www.yacy-forum.de/viewtopic.php?p=35657#35657 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3723 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e897eb9b4a	fix for DHT selection target git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3722 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0384b8771b	fix for http://www.yacy-forum.de/viewtopic.php?p=35700#35700 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3719 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	578c2ef130	release 0.52 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3715 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	756a27049b	last-minute-feature 'newbie-selection' for workshop purpose: for remote search, always select all peers that are less than a day old (should be removed someday in the future if load is too high, which could mean when pph > 100) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3712 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	46367afaaa	update of memory-protection values see http://www.yacy-forum.de/viewtopic.php?p=35539#35539 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3709 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	ea87fe5d78	) Updated German translation ) Changed "Lost Handle" error to warning (masses of it if deleting crawl-profile) *) Removed unnecessary code from Windows script git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3708 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	85035dc319	addition to svn 3699: check send/receive if p2p-mode is activated git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3701 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	26f05d1fd0	avoid division by zero if search is done for no words this case is relevant if the bluewords (yacy.blue) are used git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3698 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2fa8b50e54	reverting svn 3691+3692 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3696 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	139c59ebbd	- fixed dht selction problem: the seed tables used a wrong ordering - cleaned some code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3693 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	22a0e9f117	more timeout-control git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3692 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	24db55a541	added timeout for httpd-sockets during read git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3691 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f3fee4d445	fix for http://www.yacy-forum.de/viewtopic.php?p=35322 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3689 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7f56c8d4aa	fixed some seed selection details git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3685 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e602436fda	fixed problem with cluster routing git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3684 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0831034e07	fixed non-termination bug for robinson remote crawl peer selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3681 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	68f5d64ae6	replaced yacy logo by better version git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3675 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d6480dc670	fix for long transfer pauses see http://www.yacy-forum.de/viewtopic.php?p=35243#35243 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3672 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	cb43ae11ba	*) Bugfix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3668 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	0b5fc3c28c	) moving date functions to serverDate class ) Sitemap-parser - logging added - parsing of modDate added git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3667 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	6f46245a51	) Bookmarks: Ajax icon is displayed while loading title ) First version of a sitemap parser added - currently only autodetection of sitemap files is supported *) DB-Import restructured - pause/resume should work again now git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3666 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	269d5ca45b	*) Bugfix for IllegalMonitorException git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3664 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	e8610f049a	*) new method allowing the updater to wait until yacy has finished startup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3662 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	111ba9e359	- fixed some width problems in new status page - fixed deadlock in dns cache - added termination security for DHT peer selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3660 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	74dd6cac95	) signal yacy shutdown to updater ) some javadoc added git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3658 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	43748f87fb	*) changes required for the uploader git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3655 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	e12e934ade	*) Fixed broken compile process. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3650 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	d6811ac243	) Moving tar.jar from libx to lib ) Enhanced interface git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3649 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	469583ea80	*) new interface class. should be implemented by the updater to allow communication between the updater and yacy (not yet functional) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3648 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	191ef16499	fixed wrong ordering that caused bad dht selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3646 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7cf8981a98	- added debugging code for wrong DHT target iterator - restricted distance constraint from 0.4 to 0.2 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3644 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	abb63e3289	) disabled system.exit() in case of YaCy shutdown as it kills the whole VM (including updater and other management threads) ) UpdateCheck-Thread now pauses for given interval correctly git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3642 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	dd44a1394f	disabled automatic performance setting change - during crawl start - each indexing cycle - for delay values - for short memory cycles git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3634 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b9add5cf37	some bugfixes: - dht iterator start point - wordIndex synchronization - surftipps url check git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3633 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	06b6e35484	fix for a null pointer exception if clusters are not defined git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3632 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	35c660654d	more debugging lines to fix bug for http://www.yacy-forum.de/viewtopic.php?p=34935#34935 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3629 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	47e90f31b2	fix for deadlock in plasmaWordIndex.addPageIndex synchronization for class method not necessary see also: http://www.yacy-forum.de/viewtopic.php?p=34959#34959 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3628 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	81844e85b2	- fixed more cluster routing problems - fixed a problem in remote search when balancer caused shift process to wait too long git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3627 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	304ed3f4d2	fix for remote crawl requests in clusters git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3626 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1979a167d3	fixed problem with cast git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3625 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e48189c710	enhanced cluster routing - cluster definitions can now contain an addition for local ip addresses - cluster-cluster communication uses the local ip address instead the global address, if one is given git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3624 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	e9d87b2fce	*) changes required for the uploader git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3621 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b33cef421e	better routing for public clusters git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3620 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	7c902996b5	*) changes required for the uploaderWrapper git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3618 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f73e1e3af9	fixed bugs in remote search setting for public clusters git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3615 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	485bf1ea83	bugfix for robinson/remote crawl bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3614 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	62c947b4aa	next try to fix deadlock in plasmaWordIndex see also: http://www.yacy-forum.de/viewtopic.php?p=34821#34821 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3607 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	871ee1ce0f	one step closer to automatic updates: automatically aquire release information from download archives web pages from latest.yacy-forum.net and yacy.net are retrieved, parsed, links wihin are analysed, sorted and the most recent developer and main releases are provided as direct download link on the status page, if it was discovered that a more recent version than the current version is available. This process is done only once during run-time of a peer, to protect our download archives from DoS by YaCy peers. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3606 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	ec225f9ab6	*) SOAP: adding methods to get the comment and MD5 checksum of a single file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3604 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	2399ed817c	) robots.txt parser now extracts the sitemap-URL (will be used later) ) some javadoc added *) junit testclass for robots.txt parser added git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3602 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	fa012789b2	tried to fix a deadlock problem durin shutdown see also: http://www.yacy-forum.de/viewtopic.php?p=34753#34753 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3601 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e192f616a2	collection of small bugfixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3600 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	64a6d6e5e6	added new set iterator (needed for last commit) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3599 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f8de19fb2f	robinson cluster: added client-side protocol implementation - the network configuration page shows a new option: robinson clusters - when a global search is made, all robinson peers are excluded, but: - robinson peers/clusters that provide peer tags and where search words match such tags, they are included in global search. Therefore, robinson peers/clusters support the global yacy network with their indexes, without doin DHT-exchange git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3598 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	50e1e61fa5	*) SOAP: adding functions to rename and move files git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3595 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	63a004abff	*) bugfix for Nullpointerexception git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3594 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	29fe2beac7	possibly fixed a deadlock cannot find forum link now for that git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3593 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	4f4d3d71dd	) Faster appearance of ConfigBasic by bypassing UPNP-scan in case of existing external connects ) Marked two deprecated source-points *) Added possibility to dump words from indexing to file. Should not affect performance in the current form. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3592 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	657585fe0d	network functions for robinson peers: server-side protection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3591 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	89c1511738	- added new Network Configuration menu, can be found in basic settings - new cluster functions will be available in this menu, but currently not enabled, because corresponding interface methods are not ready yet - shifted remote crawl settings to new network configuration menu - shifted DHT distribution/receive to the new network configuration menu - adopted some string constants - added cluster configuration settings to yacy.init git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3589 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	62b79aa0a9	bugfix for http://www.yacy-forum.de/viewtopic.php?p=34558#34558 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3586 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2f3b518169	temporary patch for startup-problem: http://www.yacy-forum.de/viewtopic.php?t=3854 This is a serious problem that is caused by the database bug between 0.511 - 0.513 which produced a large number of double-entries in the RWI index. The uniq()-method tries to fix this, and it does not terminate when the index is large and the number of double-occurrences is also large. This patch does simply implement a time-controlled termination, which does not heal the inconsistency problem. The uniq-method itself is correct and does not need a bugfix, the non-termination is simply caused by the large number of data that is shifted during the process. It was possible to reproduce this behaviour in a test environment. A real fix would need to: - enhance the uniq()-method by using a recursive, binary segmentation of the array to be fixed - uniq() must report the entries that are double - the double-entries must be deleted from the collection index (from the index and the collections) to heal the problem git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3583 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	e6fb6426a3	*) Some cosmetical changes and corrections git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3582 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	ba525ebf52	- re-enabled path optimization that was disabled during testing - re-implemented index load/extend optimization that was removed from kelondroFlexTable, this is now part of kelondroIntBytesIndex git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3580 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	595ee10468	fixed datatabase inconsistency bugs inserted many debug lines added a huge number of asserts extended database test methods git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3579 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	ca79362b9d	disabling auto-setting of remote crawl performance see also http://www.yacy-forum.de/viewtopic.php?t=3849 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3577 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7a7a1c7c29	fight against problems with remove-methods and synchronization - some bugs may have been fixed with wrong removal operations - removed temporary storage of remove-positions and replaced by direct deletions - changed synchronization - added many assets - modified dbtest to also test remove during threaded stresstest git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3576 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b6a5f53020	removed double synchronization from kelondroRecords.USAGE to prevent thread locking. The method synchronization should be sufficient git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3574 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	6186185775	*) Moved some comments to javadoc git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3573 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	063063aa0c	fix for 100% cpu bug during dht selection see also: http://www.yacy-forum.de/viewtopic.php?p=34068#34068 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3570 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
michitux	4990909178	Some bugfixes, new layout/style for image search results: * removed divide by zero bug when 20_dhtdistribution_busysleep is 0 * replaced German comment with wrong charset in source/de/anomic/plasma/plasmaCrawlBalancer.java by an English one * replaced the table-fix for floating behind snipped images by a br with clear * removed unnecessary old xhtml-files (were not in use, they were created when we weren't having xhtml for testing) * new layout for image-search results: replaced the old one with spans and tables inside (not valid) with new divs, now each image snippet container has the same size TODO: * the ids of the snippetLoading-divs aren't valid because ids must start with an alphabetic letter or an underscore, they have to be prefixed * in the returned snippet-xml is an unresolved pattern for status (the status is only set for text snippets) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3566 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	78d04bcbcf	fixed bug in search statistics git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3562 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b79b4082e2	completed search exclusion: - exclusion on index-level (not only from search snippets) - exclusion hand-over at remote search protocol git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3556 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	d66b0276e3	*) removed log-output for PPM-calc git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3553 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	4400acc27d	) created new 8 bit oldschool style font for possible future use ) main method is generalization of main method of ymageFontGenerator: it does not matter how many lines of how many bits a font is made of as long as the values stay the same within the font -> use this class as a template for your own font generators and be a happy camper ) main method checks if font is valid (96 characters, all letters must have same number of lines and same number of bits per line) ) *** I have not checked if the result is really a valid font so far. *** git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3552 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	25070822a5	fix for http://www.yacy-forum.de/viewtopic.php?p=33925#33925 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3551 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	06a7978730	moved url pattern matching for search to better place git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3550 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	159bd0cab5	diverses; b.o. fix for http://www.yacy-forum.de/viewtopic.php?p=33914#33914 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3549 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	cdc7b77a62	fix for http://www.yacy-forum.de/viewtopic.php?p=33916#33916 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3548 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	40c14a4f0e	- better implementation of search query properties - basic protection against start-up problems when database files are corrupted - auto-delete of not-critical databases during startup when load error occurs - on-the-fly reset option for all database tables - automatic on-the-fly reset for seed tables during enumeration exceptions git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3547 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	1696606b7f	*) changing loglevel of "PPM-Calculation" message git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3545 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	f30bf1683e	*) corrected spelling of captcha git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3544 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	fcdf000fbc	bugfix for http://www.yacy-forum.de/viewtopic.php?p=33838#33838 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3543 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	d7edc9740b	) added correct (c) and Last-data git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3542 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	ee241f32e6	*) very basic capcha class (see coding sections of forum for more details) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3541 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6e7340ef52	added exclusion search (you can now search and exclude words from the result with '-') git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3540 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e4734a8b6b	fix for fix in SVN 3537 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3539 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	356033aceb	fixed bug with continuous reset of balancer file index git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3537 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	ba2c307ab3	optimized memory allocation in kelondroRow.Entry such an entry cannot be instantiated without allocation of new byte[]; instead it can re-use memory from other kelondroRow.Entry objects. during bugfixing also other bugs may have been solved, maybe the INCONSISTENCY problem could have been solved. One cause can be missing synchronization during bulk storage when a R/W-path optimization is done. To test this case, the optimization is currently switched off. More memory enhancements can be done after this initial change to the allocation scheme. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3536 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	24ea4ca631	*) adding first version of postscript parser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3535 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	def0d6124e	*) trying to solve SecurityManager problem during init of soap engine git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3534 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	75eb65028a	*) adding a test if a seucrity manager is active git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3533 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	210ede8230	added a class for byte-array management. This was the result of a very large experiment to replace byte[] objects within kelondro. Frequent System.arraycopy are common when kelondroRow.Entry objects are handled. This class may be used to prevent this. However, experimental replacement of byte[] by kelondroByteArray in kelondroRow.Entry resulted in complete re-write of large parts of kelondro. This experiment did not completely lead to a result, because then the interface to kelondro had to be changed also from byte[] to kelondroByteArray, which may have caused a rewrite of large parts of YaCy. The experiment is therefore abanonded, but this class remains here without any function but possibly for future use. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3531 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	1b7fda12ee	*) SOAP: separate function to get the active/passive/potential peer list git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3526 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6488ec8a80	no deletions in index in case that snippet-loading fails and there is no network connection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3525 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	847349358b	less memory usage during collectionIndex-rebuild should also speed up that process a little bit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3524 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	8ef3ad12a7	*) fix for rare bug in PPM-calc git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3523 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	00bc0c1b47	*) new logging for PPM-Calculation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3522 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	5941577076	*) added some logging to PPM-Calculation to find a rare bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3521 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5c3afb3202	added option to configure a path to a secondary index location. this shall be used to store a fragment of the index on another physical device, to split IO load and enhance access speed. The index is splitted in such a way that the LURLs are stored to the secondary location, and the RWIs to the primary location. This is especially useful for environments where symbolic links are not possible and may cause IO access even if there is no write access to the device which hosts the symbolic link. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3519 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	c2e6afbd69	*) bugfix: setting mimeType properly for dir listing with e.g. "?format=xml" git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3516 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	242c19b480	completed TLD categorization git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3515 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	b99f9d870d	*) fixed double selection of peers for the same DHT-chunk. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3513 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	f20b596dc0	) adding servlet to display all deployed SOAP Services - soap related servlets are located in htroot/soap ) new serverContext class for soap git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3511 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	75d90834a2	*) adding additional file extension for powerpoint git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3507 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2cb16824e3	removed support for old database structures. The new collection index will be more generalized to support other indexes i.e. YBR block-rank computation. A clean-up of the many conditions to support the old database was necessary. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3506 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	81b4598487	*) peer profile can now be displayed as vcard e.g. http://localhost:8080/ViewProfile.vcf?hash=localhash git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3504 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	3688ec33e5	release 0.51 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3501 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	1f61c13697	*) RSS-parser extracts the author tags now git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3500 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	602ac42010	fix for OOM case when a kelondroTree Node cache grows See also: http://www.yacy-forum.de/viewtopic.php?p=33275#33275 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3499 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	b374812f01	*) adding rpm packager as author git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3498 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	beb772d6cd	fixed problem with broken notifier image, occurred only at initial start-up git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3497 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	40ce33e664	*) adding RSS feed for yacy news git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3496 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	589cbd8cbf	*) replacing all yacy-news-category strings with corresponding constants Note: please use these constants from now on git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3495 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	f4af360f7c	bugfix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3494 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7af188ff9a	fix for http://www.yacy-forum.de/viewtopic.php?p=33089#33089 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3491 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5bbf010107	removed synchronization of size() method from numerous classes to avoid thread locking git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3490 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6b9eea3932	- removed differentiation between longTitle and shortTitle; this cannot be used for search results, and it is difficult to get both types from all document types - added some author parsing git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3489 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	a738b57b31	added author tag to indexing content enhanced composition of title tag TODO: insert author information for external parsers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3488 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6be57983a8	another update to the crawl balancer can now alternate between top and bottom of the crawl stack git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3487 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	91cdc1493f	removed query to NAT or responder in case that no other peer is there. this is not needed any more, there are enough peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3486 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4783a30910	- fixed a flush problem in balancer - return to idle divisor in RWI RAM cache flush git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3485 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	91c2a042a7	*) bugfix for wrong proxy traffic accounting git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3484 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	861f41e67e	redesigned NURL-handling: - the general NURL-index for all crawl stack types was splitted into separate indexes for these stacks - the new NURL-index is managed by the crawl balancer - the crawl balancer does not need an internal index any more, it is replaced by the NURL-index - the NURL.Entry was generalized and is now a new class plasmaCrawlEntry - the new class plasmaCrawlEntry replaces also the preNURL.Entry class, and will also replace the switchboardEntry class in the future - the new class plasmaCrawlEntry is more accurate for date entries (holds milliseconds) and can contain larger 'name' entries (anchor tag names) - the EURL object was replaced by a new ZURL object, which is a container for the plasmaCrawlEntry and some tracking information - the EURL index is now filled with ZURL objects - a new index delegatedURL holds ZURL objects about plasmaCrawlEntry obects to track which url is handed over to other peers - redesigned handling of plasmaCrawlEntry - handover, because there is no need any more to convert one entry object into another - found and fixed numerous bugs in the context of crawl state handling - fixed a serious bug in kelondroCache which caused that entries could not be removed - fixed some bugs in online interface and adopted monitor output to new entry objects - adopted yacy protocol to handle new delegatedURL entries all old crawl queues will disappear after this update! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3483 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	9b5fb3908d	*) a peer-message are now created when a blog-comment is written git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3480 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	581db87237	more debug code for http://www.yacy-forum.de/viewtopic.php?p=33009#33009 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3479 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	81c4cc6bf7	better debugging of balancer failure git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3478 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	dd06d4cada	more logging to better trace bug http://www.yacy-forum.de/viewtopic.php?p=33001#33001 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3477 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	96b79bf86d	redesigned remove method in kelondroRowSet This should fix also numerous bugs like http://www.yacy-forum.de/viewtopic.php?p=31077#31077 (java.lang.ArrayIndexOutOfBoundsException in kelondroRowCollection.removeShift) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3476 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9f929b5438	better snippet handling in case of snippet load fail see also http://www.yacy-forum.de/viewtopic.php?p=31096#31096 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3475 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	d451ad48d3	*) improved peerloadgraphic: - unnecessary (0 %) pieces are removed - percent-values of each thread displayed in legend git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3474 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	a5d668c0c6	added speed-buttons for easy performance setting appears in crawl start and on indexing monitor page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3473 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5b0a84ce09	fix for synchronization deadlock with flushMissNameCache. see also: http://www.yacy-forum.de/viewtopic.php?p=32939#32939 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3472 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e2ac5f62bd	- Code hübscher machen [von NNs TODO] git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3471 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	f04097c3dd	integrated tor-patch for crawling, if yacyDebugMode is set. (replaces: http://yacy.deruwe.de/overlay/net-misc/yacy-tor/files/disable_dns_checks-svn3132.patch) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3470 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	22fe14f292	*) first version of Peerload-graphic git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3469 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	432d7d4e9c	better catch git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3468 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	8f7e8b6ee2	auto-delete for not-fixable db error in crawl stacker. see also http://www.yacy-forum.de/viewtopic.php?p=32906#32906 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3467 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7a52b07fcc	better memory protection during freemen cycle see also http://www.yacy-forum.de/viewtopic.php?p=32903#32903 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3466 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6faa262259	fix for NURL-fix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3465 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	909d7a8ae9	fixed wrong implemented row iterator in kelomdroFlexSplitTables this has no effect, until now this iterator was only used on the Index Administration page. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3464 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	a1fb8358b2	lets make a well-formed http link so that other crawlers don't have a problem to follow this link :-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3463 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4edb70f68b	added yacybot info-page from Roland git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3462 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	3ef77d2030	fix for http://www.yacy-forum.de/viewtopic.php?p=29878#29878 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3461 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	3bb3df3fc0	fix for http://www.yacy-forum.de/viewtopic.php?p=32298#32298 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3460 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	243a2f831b	fixed problem with not found NURL-hashes The cause for this problem could still not be found, but the effect is handled much better. The NURL-pop will continue automatically until it found a hash that can be found. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3458 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6ad39bae1e	fixed shutdown problem this fixes the 'inconsistency' messages during start-up git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3457 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	38b93f8cb8	bugfix for my last commit: iterator did not consider secondary start point in case of rotation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3456 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	264a82eec8	- fix for http://www.yacy-forum.de/viewtopic.php?t=3657 - fix for http://www.yacy-forum.de/viewtopic.php?p=32758#32758 - Diff takes any objects now, not only strings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3455 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d755a8026d	- better OOM protection - better memory allocation for FlexTable indexes - splitting between static index and dynamic index (only the dynamic part must grow) - to enable a merge-iteration of new splittet index, a huge number of classes needed to be adopted for new iterator classes - added new iterator classes that support cloneable iterators - adopted all iterator classes to implement cloneable itarators git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3453 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	23338d2070	small fix for RAM computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3447 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	33f97cff7a	changed startup initialization sequence slightly git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3446 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4e8eb1dbe3	some minor changes here and there git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3441 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	03c5906ae7	- minor bugfixes for url-fetcher & http://www.yacy-forum.de/viewtopic.php?t=3646 - PerformanceMemory_p.html is valid XHTML again git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3440 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	3499a364ef	a little bit better memory protection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3439 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	313f6a7680	fix for http://www.yacy-forum.de/viewtopic.php?p=31553#31553 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3438 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	958ebea5c5	fix for http://www.yacy-forum.de/viewtopic.php?p=32470#32470 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3437 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5d5e6ebfcc	fix for http://www.yacy-forum.de/viewtopic.php?p=32631#32631 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3436 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1cba31de43	redesigned ram organization for database caches - each cache can now allocate as much memory as is available - no more fixed limits - replaced old performance memory monitor by new one - added supervision methods as static functions into the classes that provide cache functionality - steering of ram allocation is done with two simple limits that are ram availability-relative git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3434 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	26450a1d9a	*) avoid nullpointerException on seed.getAddress() (reported by netbude) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3431 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	db235f2d61	added some memory protection in collection index multiple merge git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3429 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	c72605ecab	*) adding a function to determine if a given URL is bookmarkt git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3428 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	bd03c6b874	*) bugfix in bookmarksDB: - NullpointerException when trying to get an unknown bookmark - bookmarks can either start with http or https git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3427 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b466baa574	added some memory protection too large collection arrays are now avoided. By default, the biggest collection index is 7. larger collections are dumped into a commons directory, but cannot yet be used. Bevore doing a dump, the collection is splittet into a part which has only root-references, and stored back to the collection; the remaining part goes to commons git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3426 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	ce360ef43e	) no more HTML in plasmaCrawlProfile.java anymore ) <br> will not be displayed in items in Auto Filter Content on WatchCrawler_p.html anymore *) removed unnecessary replaceHTML() git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3425 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	88245e44d8	- improved version of robots.txt (delete your old htroot/robots.txt before updating): - robots.txt is a servlet now - no need to rewrite the whole file each time a section is added or removed - user-defined disallows, added manually, won't be overwritten anymore - new config-setting: httpd.robots.txt, holding names of the disallowed sections git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3423 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	9623bf7bbe	- removed call of java 1.5 method - added config servlet for local robots.txt - removed YPStats_p as it is of no use anymore - supertemplates use XHTML now - quick-fix for http://www.yacy-forum.de/viewtopic.php?p=32296#32296 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3422 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	51e12049fa	third generation of R/W head path optimization - data from collection arrays are read in order - merged data is written in order git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3419 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	a1d68fe092	- use .class rather than Class.forName for classes in class-path - added Bost's patch for Diff.findDiagonale() from: http://www.yacy-forum.de//files/patch_685.txt - fixed minor bugs in Blog git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3416 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	10a3c20b8d	some more enhancements to R/W Head path optimization git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3415 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f4cfd19835	second Generation of collection R/W head path optimization: - permanent cache flush is switched off. The optimized cache flush works better if it is a large number of collections that is flushed together - the flush size can be configured instead the flush divisor. There is only one size for all flushes - collection records that shall be removed during collection transition (jump from one collection file to another) are now not really removed but only marked in RAM. add-operations to the collection use these marked collection spaces - index bulk write operations are now separated for each file of a kelondroFlex git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3414 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1fda50fd3c	correct R/W head positioning in kelondroFlex and some enhancements git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3409 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	304412a049	first generation of collection index R/W head path optimization - collections are now hand-over as collection lists to collection index for merge opertations - collection index lists are separated into 'new' and 'extend' lists - lists are written separately - write operations are done into array sets and array indexes. These are now serialized - write operations into index files are sorted by index; that means that a R/W head does not need to go forward and backward, only forward More enhancements are possible git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3407 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	54fef3574f	*) missing files for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3406 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	cb89c74d52	) added blog-comments ) removed debug-output when deleting news git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3405 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	6fbe31425a	- some code-cleanup (no more syntax-warnings here) - added deletion from loadedURLs of URLs to be blacklisted in IndexControl_p git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3404 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	32867580ee	update to kelondroRecords needed fo last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3403 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e3480d4ad3	fix for warning in crawl balancer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3402 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	8668ac5d91	preparations for collection index cache flush optimization (hand-over commit, no functional change to current code) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3399 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	39a2000d8b	- added support for [[Bookmark:$bookmarkTag\|description]]-link-listings (requested by theli) to wiki-parser - added support for <pre>-tags to wiki-parser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3393 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	619653c054	- fix for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3392 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	26f5757b40	- added support for multiple paths per domain to default-blacklist warning: an interface-change had been neccessary: - remove(String, String) has been renamed to removeAll(String, String), because it removes all path-entries for the specified host - remove(String, String, String) has been added to delete only a path-entry - geBlacklistType(String) has been renamed to getBlacklistType(String) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3391 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	a5a36d9252	- hopefully last fix fo 1.5 methods (sorry for that, eclipse isn't that helpful in identifying those methods) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3387 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e97b6f0458	- we still use Java 1.4 ... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3386 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	0c7b8cf632	- added first version of new wiki-parser - added blacklist support to manual URLFetcher stack fill - fix for NPE: http://www.yacy-forum.de/viewtopic.php?t=3559 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3385 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f7803a6ce4	enhanced crawl balancer - new domains now get a chance to get crawled early - less IO operations - new balancing method - better dump order at shutdown time - bugfixes regarding not found url hashes (no more superfluous cache kill) - domain access time is now shared over all balancer stacks - viewing the stack does no more disturbish the balancing algorithm that much - intelligent selection of best next domain using domain access times - extra double-check (to double-check the double-check) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3384 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	801eea8849	*) Fixed bug where pairReplace() got caught in infinite recursion. (http://www.yacy-forum.de/viewtopic.php?t=3466 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3383 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c3e8c23f5d	fix for 'CANNOT FETCH ENTRY: hash is null' bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3380 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	badab8d924	fixed some more bugs in new db handling git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3379 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e72d253577	fixed problem with initial cache load git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3378 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2d8e472cfd	emergeny bugfix for last commit (kelondroTree should work again) the cache prefill is broken and will be fixed later git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3377 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	dc0c06e43d	PLEASE MAKE A BACK-UP OF YOUR COMPLETE DATA DIRECTORY BEFORE USING THIS redesign for better IO performance enhanced database seek-time by avoiding write operations at distant positions of a database file. until now, a USEDC counter was written at the head-section of a kelondroRecords database file (which is the basic data structure of all kelondro database files) to store the actual number of records that are contained in the database. Now, this value is computed from the database file size. This is either done only once at start-time, or continuously when run in asserts enabled. The counter is then updated only in RAM, and written at close of the file. If the close fails, the correct number can be computed from the file size, and if this is not equal to the stored number it is a strong evidence that YaCY was not shut down properly. To preserve consistency, the complete storage-routine had to be re-written. Another change enhances read of nodes in some cases, where the data-tail can be read together with the data-head. This saves another IO lookup during each DB node fetch. Includes also many small bugfixes. IF ANYTHING GOES WRONG, ALL YOUR DATA IS LOST: PLEASE MAKE A BACK-UP git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3375 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	c016fcb10f	- added streaming-support to CrawlURLFetchStack_p servlet - bug for NPE in list.java - use more constants git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3373 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	d114a0136e	- crawl profile: don't add null-values - added some settings and statistics for url-fetcher 'server'-mode - added own stack for fetchable URLs - added possibility to fill stack via shift from peer's queues, via POST (addurls=$count and url$num=$url) or via file-upload - added "htroot" to classpath of linux start-script git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3370 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	b2a9d32f29	why do I always forget some lines? sorry... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3368 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	e1edb23689	*) Bugfix for IllegalMonitorStateException See: http://www.yacy-forum.de/viewtopic.php?t=3522 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3358 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago

... 4 5 6 7 8 ...

2710 Commits (f7c5ccedc7c5f2b86c0739a85f51083e451a6dd1)