yacy_search_server

Commit Graph

Author	SHA1	Message	Date
theli	e58e85363d	) Bugfix for ConcurrentModificationException while operating on seed properties ) Bugfix for YACY database inconsistency (no more elements available in db '...seed.new.db'), re-set of db. See: http://www.yacy-forum.de/viewtopic.php?p=11836#11836 http://www.yacy-forum.de/viewtopic.php?p=11814#11814 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@995 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	8d827cdb30	tried to fix problems with order of network list by last-seen (which could also improve the network picture) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@980 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	446e7e8bef	*) Bugfix for Seed-Upload - Permission denied problem See: http://www.yacy-forum.de/viewtopic.php?p=11648#11648 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@978 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	097009d910	experimental visualization of DHT access during global search (temporary) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@977 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	4dcbc26ef1	introduction of search profiles; very experimental git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@976 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	9a5ab62928	) Adding yacy specific X-YACY-Index-Control header which can be used by clients to disallow yacy to index the response that belongs to the request where X-YACY-Index-Contro is set to "no-index" ) Bugfix for Seed-List download via Remote Proxy. Now the pragma and cache-control http headers of the request are properly set to "no-cache" See: http://www.yacy-forum.de/viewtopic.php?p=11639#11639 *) Bugfix for http-Proxy yacy has ignored "no-cache"- pragma and cache-control http headers that were send in requests. Now, these request headers are evaluated properly TODO: Missing evaluation of "no-store" request headers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@971 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	5a25ad9109	*) Bugfix for useRemoteProxy4YACY and useRemoteProxy4SSL check git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@969 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	02d9af1a70	) Restructuring and extending of Remote Proxy Support - remote proxy configuration can now be "really" changed on the fly and takes effect immediately - adding possibility to disable remote proxy usage for yacy->yacy communication - adding possibility to disable remote proxy usage for ssl - restructuring proxy configuration so that it is stored in a single place now ) Adding possibility to import a foreign word DB (or even more of them in parallel) at runtime into the peers DB - this can be done by calling IndexImport_p.html - ATTENTION: please not that at the moment this thread must be aborted via gui before a normal server shutdown is done. - TODO: integrating IndexImport Thread into normal server shutdown - TODO: Adding posibility to import crawl-queues, etc. from foreign peers - TODO: removing old import function from yacy.java and calling the new routines instead git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@968 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	af3060938b	*) Bugfix for manual peer ping functionality git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@965 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	40777556c5	) Connection Tracking - adding automatic refresh - accepts new parameter nameLookup which can be used to deactivate yacy-peer name lookup (because we have problems with this on large seed-dbs) ) ViewFile New page that can be used to view - original content - plain text content - parsed content - parsed sentences of a webpage specified by there url hash Mainly for debugging purpose at the moment ) Robots.txt Bugfix for if-modified-since usage TODO: synchronization of downloads to avoid loading the same robots-file multiple times in parallel by different threads ) Shutdown Better abortion of transferRWI and transferURL sessions on server shutdown *) Status Page Adding icon to start/stop crawling via status page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@950 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	bcb0d6d5ff	changed setLastSeen(long rd) to setLastSeen(); git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@949 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	cdbaf637fb	added - getIP(), getJunior(), getSenior(), getPrincipal(); - setIP(), setJunior(), setSenior(), setPrincipal(), setLastSeen(long rd); - isPeerOK(), isOnline(String type); next try to remove hello.class java.util.ConcurrentModificationException:null ;) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@948 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	e642a5d8b7	more constants git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@947 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	d77b982083	small fix for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@944 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	b00cd5640b	bugfix for 'hello.class java.util.ConcurrentModificationException:null' finals git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@943 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	6260942590	changed search process: received indexes are now buffered and written to wordIndex after search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@934 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	bc56a88cc8	further refactoring of search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@925 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	c8a35a0130	) Adding new connection tracking page (currently only for incoming connections) ) Displaying statistic for incoming connections on status page ) Bugfix for Loop-Access Bug when trying to access the yacy page while yacy is configured as proxy See: http://www.yacy-forum.de/viewtopic.php?p=6826 ) Bugfix for Referer Bug See: http://www.yacy-forum.de/viewtopic.php?p=11098#11098 *) Adding reverse Name lookup for yacy-domain names (used by the connection tracking page) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@916 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	e85989510a	update to network image; added disconneced peers by disconnection time and changed colors git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@890 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	d666b61b83	fix for news-deletion, see also http://www.yacy-forum.de/viewtopic.php?p=11000#11000 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@885 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	4180c422e8	cleaned, finals, Properties git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@884 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	c1c94111b0	added new network picture at Network menu using the new image-servlet method git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@880 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	a2fa75e688	) Asynchronous queuing of crawl job URLs (stackCrawl) various checks like the blacklist check or the robots.txt disallow check are now done by a separate thread to unburden the indexer thread(s) TODO: maybe we have to introduce a threadpool here if it turn out that this single thread is a bottleneck because of the time consuming robots.txt downloads ) improved index transfer The index selection and transmission is done in parallel now to improve index transfer performance. TODO: maybe we could speed up performance by unsing multiple transmission threads in parallel instead of only a single one. ) gzip encoded post requests it is now configureable if a gzip encoded post request should be send on intex transfer/distribution ) storage Peer (very experimentell and not optimized yet) Now it's possible to send the result of the yacy indexer thread to a remote peer istead of storing the indexed words locally. This could be done by setting the property "storagePeerHash" in the yacy config file - Please note that if the index transfer fails, the index ist stored locally. - TODO: currently this index transfer is done by the indexer thread. To seedup the indexer a) this transmission should be done in parallel and b) multiple chunks should be bundled and transfered together ) general performance improvements - better memory cleanup after http request processing has finished - replacing some string concatenations with stringBuffers - replacing BufferedInputStreams with serverByteBuffer - replacing vectors with arraylists wherever possible - replacing hashtables with hashmaps wherever possible This was done because function calls to verctor or hashtable functions take 3 time longer than calls to functions of arraylists or hashmaps. TODO: we should take a look on the class serverObject which is inherited from hashmap Do we realy need a synchronization for this class? TODO: replace arraylists with linkedLists if random access to the list elements is not needed ) Robots Parser supports if-modified-since downloads now If the downloaded robots.txt file is older than 7 days the robots parser tries to download the robots.txt with the if-modified-since header to avoid unnecessary downloads if the file was not changed. Additionally the ETag header is used to detect changes. ) Crawler: better handling of unsupported mimeTypes + FileExtension ) Bugfix: plasmaWordIndexEntity was not closed correctly in - query.java - plasmaswitchboard.java *) function minimizeUrlDB added to yacy.java this function tests the current urlHashDB for unused urls ATTENTION: please don't use this function at the moment because it causes the wordIndexDB to flush all words into the word directory! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@853 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	e5f8163203	fixed a bug with news; news moving could lead to shurtcut loop / 100% CPU; appeared when clicked on a 'Profile' news in Network menu git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@845 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	fbb5e36b80	documentation update git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@843 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	0054d3b1a6	added age in network menu git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@809 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	7fc822a59b	changed handling of time-zones git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@801 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	70a5681a4f	*) Bugfix for inactive scp seed uploader git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@779 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	495bc8bec6	removed cache-control from low and medium priority caches which reduces memory use and computation overhead git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@774 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	1dc94e7753	) Adding support for gzip content-encoding of http post requests used to transferRWIs and transferURLs. See: http://www.yacy-forum.de/viewtopic.php?t=1167#10020 ) adding yacyVersion.java containing constants defining yacy versions that support a given feature. Needed to determine if a remote peer is able to decode gzip content-encoded http post bodies properly. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@772 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	a1f5027a88	finals; cleaned; Properties; git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@770 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	1dd7047af5	finals; cleaned; Properties; git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@767 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	96a5b6e8fb	removed yacy peer types from serverSwitch git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@758 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	b990dc1ad1	) Replacing jsch 0.1.19 lib with newer version 0.1.21 ) Replacing PDFBox 0.7.1 lib with newer version 0.7.2 ) Refactoring of classes httpd/httpc/httpHeaders to make many methods for httpHeader/Requestline parsing reusable for new icap implementation ) adding chunked input stream support - needed by new icap implementation - needed by future httpc HTTP/1.1 support ) httpd.java - moving all connection property contants to class httpHeader - moving readHeader function to class httpHeader - moving parseQuery function to class httpHeader - moving handleTransparentProxy function to class httpHeader ) httpHeader.java - adding new fuction to parse the http response line - adding new function to converte http headers to a string that can be send to the client - adding a function that generates a proper url using all parsed connection properties ) ICAP Support - yacy now supports handling of icap response modification requests - this feature can be used by other icap enabled proxies to contact yacy as icap server, and to handover the downloaded content to yacy.logging for indexing - functionality was successfully tested with squid 2.5Stable 10 + icap patch - further icap services e.g. URL filtering based on yacy's blacklists are possible ) plasmaSwitchboard.java - htcache entries that are still needed for indexing are now properly registered as in use after system restart - extended logging: log message now shows parsing and indexing time for each sb. entry git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@757 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	af9021e956	fixed bug with news caching git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@754 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	71a31f0902	integrated and extended new memory performance menu; found and fixed bug in DHT caching git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@752 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	fb52a82008	added new performance page for memory settings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@751 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	50a9500035	fixed 100% CPU bug with news queue deletion git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@735 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	2148c0cf49	replaced kelondro storage core; much less objects in kelondro cache now; less IO from DB git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@724 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	dff96601fe	*) Bugfix for transferURL: URL list index was not incremented properly. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@723 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	10e7d6f02b	Bugfix for http://www.yacy-forum.de/viewtopic.php?t=1053 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@713 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	2cb084d426	*) Complete Index Transfer See: http://www.yacy-forum.de/viewtopic.php?p=9622 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@707 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	801e902795	small change git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@698 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	019cc716db	*) Undoing last changes on yacySeed. Seems not to work properly. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@697 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	177e8af5b7	*) Bugfix for ConcurrentModification in kelondroAbstractRA.writeMap caused by yacySeed.getMap() See: http://www.yacy-forum.de/viewtopic.php?p=9523 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@695 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	d3c923e6b9	*) Bugfix for "ConcurrentModificationException in hello.class" See: http://www.yacy-forum.de/viewtopic.php?t=723 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@694 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	02c242ae22	minor changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@693 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	7c86c36210	undoing one part of the last commit. do not know, why it didn't work... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@681 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	a79913c6ea	updated german language file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@680 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	718950c5da	small change git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@679 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago

1 2 3

146 Commits (8194fde3409a0841bc29df63b40a6388923285eb)