yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	84be912e90	fix for null pointer exception that occurred when missing user-agent in request header see also http://forum.yacy-websuche.de/viewtopic.php?f=6&t=78&hilit= git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3943 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e03fcf4627	SSI fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=29 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3936 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9bbd39b67c	- removed unfinished auto-updater from roland and martin - added new download-option for releases on the status page still mising: - thomas-style restart for linux/mac - untar/gunzip on shell basis (comes next) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3931 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	154ffd7c2c	fix for wrong http connection version and SSIs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3928 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1782ef57e5	- added SSI parser and include directive for <!--# include virtual="<file>" --> - added chunked file transfer for non-yacy clients - SSIs are streamed using chunked transfer, partly delivered pages can be seen in browser before transmission is finished - added client-side network unit identification - cleaned up code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3926 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0e57a8062b	added network definition for different YaCy networks (needs much more work) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3919 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6518bb6c08	changed release strategy: we will provide two different releases in the future, one standard release and one 'pro'-release. the 'pro'-release contains all additional parsers AND has different default performance values. The pro-version differs therefore from the previous 'all'-version by this default values. The pro-configuration is automatically choosen if the libx-folder exists. If a version is once initialized, its configuration stays independently from an existing libx folder. The ant targets had been changed. There are now 3 different targets to create standard and pro-releases, and one target to upgrade: - dist: creates a standard release (only, no libx target any more) - distPro: creates a pro-release (includes the libx) - distExt: creates a libx-release which includes the libx-folder only. It may be used to upgrade from standard to pro Furthermore, the naming of 'dev'-releases had been removed. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3902 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	465145cb6f	revert to insecure, but dau-proof defaults git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3898 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	7ad11ceaaa	security fix for peers without password. allow access only from localhost git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3897 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e4aa8f2a08	disabled more sleep(200) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3889 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	cb38e57622	reduced httpd final waiting time git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3888 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b4585ad67d	im Sommer 2005 wurden die ersten pings zwischen YaCy-Peers ausgetauscht. Das klappte aber merkwürdigerweise nicht immer. Um das Protokoll zu testen schrieb ich eine einfache message-Funktion, so wie sie heute noch in YaCy drin ist. Aber auch die Messages funktionierten nicht richtig. Alex und ich haben lange Zeit gesucht, und den Fehler nie gefunden. Es stellte sich heraus das ein Timing-Detail das Problem lösen konnte, die Ursache haben wir bis heute nicht gefunden. Die Lösung des Problems bestand aus einem kurzen sleep, kurz bevor der httpd Daten zum client zurück geschrieben hat. Das ist natürlich eine fürchterlich schlechte Lösung. Bis heute war diese Sache im httpd. Mit diesem Commit habe ich den sleep auskommentiert, und es steht zu befürchten das wieder irgendwas nicht geht. Wenn jetzt das Netz zusammenbricht, keine pings mehr ankommen oder so, war es dieses sleep, das es verhinderte. Vorschläge willkommen. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3887 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	669f840eab	- added ViewProfile / Impressum (default on) to local peer's robots.txt git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3874 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	66ec8b63c1	added a httpd access tracker: - all requests to the own httdp can now be listed in the access tracker menu - the search statistics had been renamed to access tracker and extended by this tracker git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3861 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	33ad0c8246	added a web structure computation and logging: - all web page parsing operations will now increase a web structure file - the file is computed in memory and dumped at shutdown-time to PLASMASB/webStructure.map in readable form (not a database) - the file can be used externally to analyse the link structure of the crawled pages - the web structure can also be retrieved using a xml-interface at http://localhost:8080/xml/webstructure.xml - the short-term purpose is the computation of a link-graph image (before linuxtag!) - a long-term purpose could be a decentralized computation of the citation rank git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3746 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	601fc7d1c5	- added source to J7Zip-modifed.jar and it's license (changelog is still to come) - moved HTML-*replace-methods from wikiCode to de.anomic.data.htmlTools - prepared use of different wiki parsers as suggested here: http://www.yacy-forum.de/viewtopic.php?p=34444#34444 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3741 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	26f05d1fd0	avoid division by zero if search is done for no words this case is relevant if the bluewords (yacy.blue) are used git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3698 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2fa8b50e54	reverting svn 3691+3692 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3696 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	22a0e9f117	more timeout-control git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3692 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	24db55a541	added timeout for httpd-sockets during read git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3691 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	111ba9e359	- fixed some width problems in new status page - fixed deadlock in dns cache - added termination security for DHT peer selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3660 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	29fe2beac7	possibly fixed a deadlock cannot find forum link now for that git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3593 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	c2e6afbd69	*) bugfix: setting mimeType properly for dir listing with e.g. "?format=xml" git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3516 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	f20b596dc0	) adding servlet to display all deployed SOAP Services - soap related servlets are located in htroot/soap ) new serverContext class for soap git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3511 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	81b4598487	*) peer profile can now be displayed as vcard e.g. http://localhost:8080/ViewProfile.vcf?hash=localhash git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3504 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	91c2a042a7	*) bugfix for wrong proxy traffic accounting git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3484 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5b0a84ce09	fix for synchronization deadlock with flushMissNameCache. see also: http://www.yacy-forum.de/viewtopic.php?p=32939#32939 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3472 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	a1fb8358b2	lets make a well-formed http link so that other crawlers don't have a problem to follow this link :-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3463 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4edb70f68b	added yacybot info-page from Roland git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3462 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d755a8026d	- better OOM protection - better memory allocation for FlexTable indexes - splitting between static index and dynamic index (only the dynamic part must grow) - to enable a merge-iteration of new splittet index, a huge number of classes needed to be adopted for new iterator classes - added new iterator classes that support cloneable iterators - adopted all iterator classes to implement cloneable itarators git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3453 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	88245e44d8	- improved version of robots.txt (delete your old htroot/robots.txt before updating): - robots.txt is a servlet now - no need to rewrite the whole file each time a section is added or removed - user-defined disallows, added manually, won't be overwritten anymore - new config-setting: httpd.robots.txt, holding names of the disallowed sections git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3423 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	a1d68fe092	- use .class rather than Class.forName for classes in class-path - added Bost's patch for Diff.findDiagonale() from: http://www.yacy-forum.de//files/patch_685.txt - fixed minor bugs in Blog git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3416 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	6fbe31425a	- some code-cleanup (no more syntax-warnings here) - added deletion from loadedURLs of URLs to be blacklisted in IndexControl_p git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3404 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	c016fcb10f	- added streaming-support to CrawlURLFetchStack_p servlet - bug for NPE in list.java - use more constants git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3373 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	bf69a721cb	more protection against mis-use of YaCyHop interface: - target must not be at port 80 - target access not more than every 3 seconds - requester may not access more than every 10 seconds git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3357 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c464157a6e	replaced some toString() see http://www.yacy-forum.de/viewtopic.php?p=31151#31151 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3345 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b4aa195c27	added user-agent check for yacy-hop proxy authentication git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3343 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d25caa07bf	redesigned some parts of http authentication added another access check for peer hops git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3340 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	2401e748a3	- fixed wrong replacement of POST-parameters in httpd ('<' and '>' are still replaced, don't know why): http://www.yacy-forum.de/viewtopic.php?t=3466 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3324 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e68cdeeeb3	- reverted parseArg(String) to use a byte-array to handle correct UTF-8 parsing - arguments aren't passed html-escaped to the servlets anymore, bug-fix for http://www.yacy-forum.de/viewtopic.php?p=30573 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3321 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	47ab83a7c0	added flag for YaCyHop - proxy access for all paths that start with /yacy/ git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3304 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	25c7d4e25e	fix for form (cookie) login git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3284 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	7c40197e42	- fixed error pages and <label>s for index.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3226 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	b4457763e5	fix for putSafeXML and supertemplates. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3223 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	0c81bd39d4	XSS-safe put as default. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3217 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5515571950	redesign of ymage classes - less memory usage - better usage of awt classes - drawing abstractions: preparations for movable objects for animation class - test applet for animations - known bugs: wrong colours for network picture git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3214 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	b873ad51ab	- fix for http://www.yacy-forum.de/viewtopic.php?t=3369 - merged netBude's alternative for tables in yacysearch.html & search results valid - added statistic info to index.html as proposed here: http://www.yacy-forum.de/viewtopic.php?p=29762#29762 - fixed error-log in httpTemplate git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3189 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	340dc52a9d	- ConfigProfile_p.html now transmits usable encoding for other than 7-bit ASCII charset, see TODO in httpd.parseArg(String) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3174 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	00aa9472d6	- added decode of HTML-entities in request lines - removed Bookmark symbol on search pages and surftips if not authenticated git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3172 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0a050bc043	enhanced ranking - redesign of data storage in plasmaSearchRankingProfile - profiles are extended by new ranking parameters - new RWI ranking parameters are considered during ranking - appearance attributes (i.e. emphasised text) is now considered - faster ranking - some attributes that had been checked during post-ranking can now be checked during pre-ranking phase - removed old ranking parameter on index.html page (will be replaced by profiles in the future) - ranking can now consider appearances of media content - snippet-loading for media types now work correctly (fetches only from the wanted media) - ranking-profiles can be handed over the remote peers and apply there also - re-search of same query with different domain now also re-triggers remote search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3105 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d0c32c6aeb	better protection against fraud peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3104 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e17591acc3	- parse HTML arguments as UTF-8 strings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3085 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	d30932c7d8	- fix for fix... sry git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3084 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	6118fb73ec	- added decode of UTF-16 escapes in url-arguments (%u0123), bugfix for http://www.yacy-forum.de/viewtopic.php?t=2762 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3083 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	fb7902aa68	fix for http://www.yacy-forum.de/viewtopic.php?p=26142#26142 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3033 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	984285bdd6	better organisation of dns hit/miss cache flush git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3016 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	73c63578ad	- activated the dns miss cache - added a cache-control for cache miss flush to the dns miss cache - better naming of cache variables to distinguish hit- and miss- cache git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3015 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	e3d75f42bd	final version of collection entry type definition - the test phase of the new collection data structure is finished - test data that had been generated is void. There will be no migration - the new collection files are located in DATA/INDEX/PUBLIC/TEXT/RICOLLECTION - the index dump is void. There will be no migration - the new index dump is in DATA/INDEX/PUBLIC/TEXT/RICACHE git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2983 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d34f10c63d	some tests with reverse dns lookup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2954 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	a51417d86b	Bugfix: language of ConfigLanguage_p.html was not changed properly when a different language was choosen here git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2948 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	f77d624b94	*) bugfix for persistent connection support on transfer-encoded requests git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2942 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	114a76a86e	- added flag to urlhash that shows that domain is a local domain - enhanced local domain detection - bugfixing for memory assignment in kelondroFlexSplit - automatic memory assignment to caches according to available RAM - bugfixes for details during search process git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2924 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	e59ff8b657	Bugfix: language of ConfigBasic.html was not changed properly when a different language was choosen here. Note: there's a similair bug on ConfigLanguage_p.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2921 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	29a1f132ec	*) some strings replaced by constants git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2910 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	215c4e65f1	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2887 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	532c23b5c7	*) soap handler - better errorhandling - adding support for outgoing transfer- and content-encoding - avoid holding outgoing messages into memory before sending them git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2872 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	777e39cea0	*) new template to display the dir-listing in xml format. This can e.g. be done by using the url http://localhost:8080/share/?format=xml git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2856 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	88cfdecd38	*) Bugfix: calling close must not close the wrapped input stream, otherwise keep-alive connections would terminate git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2853 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	8a5c2d0a19	fix for supertemplates, too. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2839 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	c35793fb46	fix for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2838 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	a831c83025	create servletProperties, with the servlet specific funktions from serverObjects git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2835 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	8b56887676	removed unused code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2820 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	68204ff729	*) Suppressing for bad client requests. See: http://www.yacy-forum.de/viewtopic.php?p=26918 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2814 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	df49724f28	*) better error handling for seed upload - test download - problems See: http://www.yacy-forum.de/viewtopic.php?p=26814#26814 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2812 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	b357a13e9a	*) adding synchronization block because SimpleDateFormat is not thread-safe See: http://www.yacy-forum.de/viewtopic.php?p=26906#26906 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2809 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	688cbfb776	- bugfixing for flextable bug - bugfixing for collection index bug - several other bugfixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2785 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	a29b4d4fb5	extended Supertemplates for Headerincludes. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2780 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	a7e11ada50	*) suppressing stacktrace for "server has closed connection" git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2779 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c8f3a7d363	added snippet-url re-indexing - snippets will generate an entry in responseHeader.db - there is now another default profile for snippet loading - pages from snippet-loading will be indexed, indexing depth = 0 - better organization of default profiles git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2733 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	226f2c5b2c	first version, of the Serverlet Debugger git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2717 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	ce7ee74316	*) better errorhandling in filehandler (try catch block now starts before argument parsing) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2704 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	f17ce28b6d	) plasmaHTCache: - method loadResourceContent defined as deprecated. Please do not use this function to avoid OutOfMemory Exceptions when loading large files - new function getResourceContentStream to get an inputstream of a cache file - new function getResourceContentLength to get the size of a cached file ) httpc.java: - Bugfix: resource content was loaded into memory even if this was not requested ) Crawler: - new option to hold loaded resource content in memory - adding option to use the worker class without the worker pool (needed by the snippet fetcher) ) plasmaSnippetCache - snippet loader does not use a crawl-worker from pool but uses a newly created instance to avoid blocking by normal crawling activity. - now operates on streams instead of byte arrays to avoid OutOfMemory Exceptions when operating on large files - snippet loader now forces the crawl-worker to keep the loaded resource in memory to avoid IO ) plasmaCondenser: adding new function getWords that can directly operate on input streams ) Parsers - keep resource in memory whenever possible (to avoid IO) - when parsing from stream the content length must be passed to the parser function now. this length value is needed by the parsers to decide if the parsed resource content is to large to hold it in memory and must be stored to file - AbstractParser.java: new function to pass the contentLength of a resource to the parsers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2701 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5a40ea7866	refactoring of wget string list generation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2692 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	310f1c41cd	added option to see ranking scores in surftipps and some cleanups git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2684 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	cd5f349666	) Better handling of large files during parsing Extracted text of files that are larger than 5MB is stored in a temp file instead of keeping it in memory ) plasmaParserDocument.java; getText now returnes an inputStream instead of a byte array ) plasmaParserDocument.java: new function getTextBytes returns the parsed content as byte array Attention: the caller of this function has to ensure that enough memory is available to do this to avoid OutOfMemory Exceptions ) httpd.java: better error handling if the soaphander is not installed ) pdfParser.java: - better handling of documents with exotic charsets - better handling of large documents - better error logging of encrypted documents ) rtfParser.java: Bugfix for UTF-8 support ) tarParser.java: better handling of large documents ) zipParser.java: better handling of large documents ) plasmaCrawlEURL.java: new errorcode for encrypted documents ) plasmaParserDocument.java: the extracted text can now be passed to this object as byte array or temp file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2679 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	df1629b05a	- code cleanup - version 0.471 - moved surftipps to own web page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2676 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	c665f6cddb	*) handling of quotes in charset string git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2674 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	009a33170b	*) Content-Location header added git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2658 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	1aa07a52cd	*) Bugfix for UnsupportedEncodingException if the media type contains multiple parameters See: http://www.yacy-forum.de/viewtopic.php?p=25832#25826 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2654 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	ec031eb993	first version of surftipps see http://localhost:8080/index.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2627 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	5afb0cbce8	) setting default charset (for unkown documents) to iso-8859-1 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2620 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	97d2a08ef1	*) restructuring needed to support parsing of documents using various charsets - serverFileUtils.java: -- adding methods to copy from stream to writer and readers to writers -- moving httpc writeX methods into serverFileUtils class - serverCharBuffer.java: removing inheritance from Writer class - replacing htmlFilterOutputStream by htmlFilterWriter class which handles content as char stream - htmlFilterContentTransformer.java: deactivating getText mode (still needs to be migrated to use char streams instead of byte streams) - changes in several classes to use htmlFilterWriter instead of htmlFilterOutputStream - changes in Scraper and Transformer classes to operate on chars instead of bytes - httpdProxyHandler.java: bugfix. clientTimeout setting was missing in config file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2617 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	fc594e8eda	) adding httpContentLengthInputStream.java class to allow reading of http response bodies until EOF even if a persistent connection is used ) httpdByteCountInputStream.java: adding skip method *) httpHeader.java: adding getCharacterEncoding function git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2616 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	2a06ce5538	*) next bugfix for UTF-8 - Sending UFT-8 messages to other peers did not work - httpd.java: minor corrections for UTF-8 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2570 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	bdc51591ae	*) UTF-8 Bug solved (hopefully) See: http://www.yacy-forum.de/viewtopic.php?p=25522 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2569 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	ef751b9d33	*) removing all string operations from the template engine - engine should fully operate on bytes now git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2567 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	fded1f4a5d	*) better handling of maximum file size limit in crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2543 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	63893003be	) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use. ) adding first version of maximum filesize check for the crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2534 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9340dbb501	fixed all possible problems with nullpointer exception for LURLs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2513 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	a5ed86105b	*) bugfix for handling of ResourceInfo object in proxy git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2512 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	59a5511dbb	*) added missing static Strings as requested by theli git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2505 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	6578564c9a	*) Ignore more hop by hop http headers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2504 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	dae763d8e3	git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	ffbf416e76	*) direct access to requestheader of htCache.Entry removed to make it more http independent git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2486 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	3870d615e3	*) setting htCache.Entry fields to private git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2485 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	393a7d10be	*) setting htCache.Entry fields to private git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2484 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	1c8300fcec	*) Bugfix for name resolution in proxy mode See: http://www.yacy-forum.de/viewtopic.php?p=25241 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2477 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	d78b824e85	fixed problem with default path after first start-up git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2440 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	6ad471ef96	* applied many compiler warning recommendations * cleaned up code * added unit test code * migrated ranking RCI computation to kelondroFlex and kelondroCollectionIndex git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2414 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	cf1186597b	utf fix from theli git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2412 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	eee44be602	*) adding an interface for customized blacklist classes - now it's possible to use a customized blacklist engine instead of the default one - this can be done by configuring the property BlackLists.class See: http://www.yacy-forum.de/viewtopic.php?t=2108 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2397 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	d2e8e76218	*) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler See: http://www.yacy-forum.de/viewtopic.php?t=2541 http://www.yacy-forum.de/viewtopic.php?p=24516 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2389 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	a52f36787f	better templatedebugging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2371 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	3480d36417	added some debug code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2369 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	d468d665c9	some changes that may help to prevent deadlocks that cause an OutOfMemoryError as described in http://www.yacy-forum.de/viewtopic.php?p=24359 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2353 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	6e676224d0	*) adding support for upnp A new port forwarding method for upnp was added. If this method is enabled, yacy automatically determines an UPnP capable internet gateway and configures the gateway port forwarding settings properly. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2328 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	97fa6788a1	added gettext support: automatic replacement of string appearances in html files by gettext quotes. see also: http://www.yacy-forum.de/viewtopic.php?p=23901#23901 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2309 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	67c486a023	some example Code, how supertemplates can be used. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2304 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	7b0e2521bb	Support for a supertemplate, which can do all thing, a normal template can do. Its a layer under the servlets, this means, #[page]# will be replaced by serverletcode, the rest can be set by you. (TODO: if we use this for layout, we need to read "TITLE" from the servlet's tp, to set it outside of the servlet.) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2302 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	8795875800	dirlisting for all empty directories. no problem to update dir.java anymore, because its only in htroot/htdocsdefault needed. migration to delete old dir.* files in the fileshare git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2294 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3879a0ecd0	replaced java.net.URL usage by use of new class de.anomic.net.URL This shall be seen as an experiment to exclude all cases where there could be a DNS lookup during URL comparisment. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2290 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	b594ee9a5a	*) Adding possibility to configure if the http proxy should send the X-forwarded-for header (requested by TeeSee) See: http://www.yacy-forum.de/viewtopic.php?t=2577 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2257 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	6866bc2758	be quiet! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2243 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	ed2cb040d1	*) Bugfix for http connection header validation - Connection header was not handled correctly if it contains multiple values, e.g. Connection: TE, close git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2219 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	0621106ef3	git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2214 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	12af69dd86	cosmetics git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2212 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	67a8c74be3	Fix for dynamic login with static password. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2210 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	6fe2fed87e	cookieauth works with static Admin. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2208 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	b23703f260	using cookieAuth. logout for httpauth seems to be broken :-( git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2202 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	7f51a43cba	disabled ipAuth for _p Pages (and broken Form-Login :-() for security reasons git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2201 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	bd22634c44	HTML-login, logout fixed. TODO: If you login with the form, then logout with the form, and then try to login with httpauth, the first try will fail. (should logged_out be resettet in ipAuth? but if there is ipAuth before proxyAuth, the logout would be broken. Maybe a combined method can help.) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2200 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hermens	3f1ebc097e	Limit the size of the DNS cache to 5000 and the age of the entries to one day. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2199 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	d7a3fdb18b	no white pages, when clicking cancel on the password-dialog git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2198 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	5625937d1c	Language improvements One very minor HTML fix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2181 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	26b6cddf51	synchronized the DNS cache, because the non-synchronized version resulted in deadlocks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2168 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	90d569d70f	refactoring of index management: url storage is part of index management; moved plasmaURL to indexURL git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2122 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	b4ab183518	*) Bugfix for NullpointerException if the seeds IP could not be resolved git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2099 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	9938c252dd	better Errorhandling for proxyAccounts git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2082 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	015d044c25	tried to fix some problems with latest changes to httpc very experimental! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2078 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	55c5b41bd0	modified kelondroDyn to work better with new object caches (removed own single object cache) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2077 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	fd7c17e624	added virtual host support: all yacy-to-yacy communication now send the <peer-hexhash>.yacyh virtual domain inside the http 'Host' property field. This shall enable running a yacy peer on a virtual host. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2074 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	727aac4768	*) Bugfix for Transparent-Proxy-Support <-> Port Forwarding problem See: http://www.yacy-forum.de/viewtopic.php?p=20358 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2039 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	cd4aeffea2	*) Bugfix: httpdFileHandler.java did not handle filenames with encoded chars correctly See: http://www.yacy-forum.de/viewtopic.php?t=2265 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2036 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	76ea16a6cb	*) Removing Keep-Alive header (is also a hopByHop header) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2034 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	b0036249c1	added some attributes to network picture git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2032 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	0604203bce	Updated and corrected German language file Changed Italian language file for an Italian/English interface and not Italian/German git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2024 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	14d6e476c9	tried to solve some problems with new picture viewer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2019 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	d8d0ac29c3	added image-viewer servlet that can do: - each image that is requested is stored in the cache - the image is taken from the cache if exists there - the image can be scaled The purpose of creation a scaled image is because of copyright problems In a further stept the retrieval of not-shrinked images is restricted to either access from localhost or with given authentication This servlet can be used for image-preview purpose after an image search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1989 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	42b0b10a95	-Adding Windows Media to types which are not sended compressed -Renaming writeandzip to writeandgzip to avoid confusion about type of compression -Adding new startup message to windows script -The usual language "enhancements" ;-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1953 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	77f3237de3	adapted for isListed() git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1942 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago

1 2 3 4 5 ...

510 Commits (70826bb5015481c450331902e545bde6eb3c523b)