yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	40965e183e	bugfix for minimizeurldb and urldbcleanup see http://www.yacy-forum.de/viewtopic.php?p=25539#25539 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2580 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b7e7808ea6	wordmigration now works also for new index database if the new database is switched on, no 'too big' messages appear, all the WORDS files can be completely migrated git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2553 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	005400a137	*) reverted last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2546 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	045ffebbd8	*) added debugline to versionstring-processing to find a possible bug in versiongeneration git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2537 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4866868c0e	added write cache for LURLs This was necessary to speed up the index receive process during global search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2498 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	b515d49f87	*) fix for new combinedVersionString2PrettyString by bost git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2466 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	24316ba937	*) improved implementation of combinedVersionString2PrettyString by bost git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2465 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	57dda1a92c	*)again fixing for wrong version display, now totally working with double instead of float git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2464 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	5e558fbaae	*) hopefully fixed the wrong display of yacy-version git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2462 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b7f4a1521b	added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2456 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	db1eae0227	* simplified initialization of database objects * replaced kelondroTree for NURLs by kelondroFlex * replaced kelondroTree for EURLs by kelondroFlex take care, may be very buggy please finish crawls before updating. crawls will be lost. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2452 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	1c99b5a484	)fixed logging for urldbcleanup )changed exception handling in urldbcleanup so that it shows NullPointerException correctly *)added more Blacklisting to urlcleaner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2436 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	0187c60010	because of a bug in the JRE 1.4.2 there was no memory protection see http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=4686462 this commit fixes the bug by using a memory-computation patch. All uses of Runtime.maxMemory had been replaced by serverMemory.max The bug is not present any more in Java 1.5 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2419 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	314021453f	* more logging * option in yacy.init to set useCollectionIndex usage git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2374 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	279b1d969d	Integrated new indexing data structure 'collections' into the main class for indexing, the plasmaWordIndex. The new data structure is ready-to-use, but currently disabled. It can be activated by setting the static plasmaWordIndex.useCollectionIndex to true. This shall be done for testing purpose. The new index is stored to DATA/INDEX/PUBLIC/TEXT The directory PLASMA shall be used only for crawler in the future. Attention: during testing the data structure in INDEX may change, and created indexes with the new data structure may get useless. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2348 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	c4e922885a	replaced indexURLEntry by new class that uses a kelondroRow.Entry object to store the index entry. This is another step to move to the new database structure. A side effect of this change is, that index storage uses much less RAM space, which affects the index RAM cache. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2341 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	6e676224d0	*) adding support for upnp A new port forwarding method for upnp was added. If this method is enabled, yacy automatically determines an UPnP capable internet gateway and configures the gateway port forwarding settings properly. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2328 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	417ed5102e	redesign of database iterators: an iteration of key elements in kelondroTree databases is no longer supported. this is now replaced by an iteration of kelondroRow.Entry objects from the database Iteration of keys from the database was mostly followed by retrieval of the row from the database, whcih caused unnecessary database load. The index selection was also redesigned to use the new row iteration methods. This affects many funktions, most important is the DHT selection routine which is now much faster. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2327 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	ad692fc6c7	implemented option to extract nurls from the database (plus some iteration enhancements for nurls) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2325 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	1ed3e2daef	added option to extract domains and/or urls from the eurl database when extracting from eurl, the html output format is recommended, since this format adds also the fail reason to the domain/url. The complete syntax for domain extraction is now java -Xmx<megabytes>m -classpath classes yacy -domlist [ -source { lurl \| eurl } ] [ -format { text \| zip \| gzip \| html } ] [ <path to DATA folder> ] git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2322 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	58df8b7bbf	a large collection of different changes * mainly for the transition to the new indexing database structure * a bugfix for an endless loop inside kelondroTree iteration * a bugfix for bulk read inside a kelondroTree iteration; the bug caused that some elements had been iterated twice * very strong speed enhancement for url/domain extraction git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2320 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	493b1cd2bf	better logging for domain extraction git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2319 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	685430a1b5	bugfix in new URL class, better loggin for domain extraction git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2317 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	c57b78722b	added some more logging to domain extraction git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2316 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	cc2be7fb43	fix for genurllist in case of bad urls git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2314 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	ff3f174a2d	case insentive commandline options git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2306 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	ff39a7a0d1	Overlay for welcome.* git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2299 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	8795875800	dirlisting for all empty directories. no problem to update dir.java anymore, because its only in htroot/htdocsdefault needed. migration to delete old dir.* files in the fileshare git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2294 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3879a0ecd0	replaced java.net.URL usage by use of new class de.anomic.net.URL This shall be seen as an experiment to exclude all cases where there could be a DNS lookup during URL comparisment. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2290 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	92f4cb4d73	added option to configure the start-up delay time for kelondro database files. the start-up delay is used to pre-load the database node cache git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2276 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	53077f5835	*)fixed paths to yacy.logging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2252 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	12af69dd86	cosmetics git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2212 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	45b39ee1be	*) solving unpacking problems with to long filename by a) renaming the parent folder in the tgz file to yacy (can be configured via build properties file) b) reconfiguring build file to throw an error if a file name is too long Please note that currently there is _no_ proplem with too long class names because of step a. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2207 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	f01bd25489	) Bugfix for OutOfMemory problem during minimizeUrlDB See: http://www.yacy-forum.de/viewtopic.php?t=2498 ) out of date import functions removed (can be done via web gui) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2189 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	4a907a570f	1st step to migrate kelondroTree to usage of kelondroRow instead of byte[][] git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2162 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	eaa6f012f0	refactoring: better naming for classic DB (files in WORDS) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2151 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	5041d330ce	refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2150 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	7b3b12888c	refactoring: integrated indexContainer abstraction layer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2149 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	cb295fbbdc	refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2147 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	bc94a714b2	Better explanation for the auto-dom-filter. Some javadoc. Small change to DetailedSearch. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2146 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	196b8abb30	refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2144 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	757ec28430	refactoring: better data capsulation for indexURL git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2131 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	8b7626f8d1	) Automatic redirection of browser if user changes port settings in ConfigBasic See: http://www.yacy-forum.de/viewtopic.php?t=2415 ) If ssl is available, the browser conntects to yacy via https on yacy startup See: http://www.yacy-forum.de/viewtopic.php?p=21649#21649 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2127 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	90d569d70f	refactoring of index management: url storage is part of index management; moved plasmaURL to indexURL git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2122 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	a930be4ba3	refactoring of index management: generalized the index entry git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2121 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	a474669338	start with refactoring of index management git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2110 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	015d044c25	tried to fix some problems with latest changes to httpc very experimental! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2078 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	fd7c17e624	added virtual host support: all yacy-to-yacy communication now send the <peer-hexhash>.yacyh virtual domain inside the http 'Host' property field. This shall enable running a yacy peer on a virtual host. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2074 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	49f3b56526	) URLCache in minizimeURLDB can be changed now (standart is 4mb) ) moved Exception Stackprints to loggingengine git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2028 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	0c9b61820e	enhanced re-crawl settings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1960 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	42b0b10a95	-Adding Windows Media to types which are not sended compressed -Renaming writeandzip to writeandgzip to avoid confusion about type of compression -Adding new startup message to windows script -The usual language "enhancements" ;-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1953 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	128e4ab199	- in serverSystem: maxPathLength is now a variable, not a method - upon startup the calculated maximum path length is shown git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1932 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	9c85820d35	added MIME-type for wmv and rm removed double copyright at startup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1922 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	488a0ed580	replaced old keyIterator and rowIterator by buffered iterators that are synchronized with database access Main change is done in kelondroTree, other classes are only adoptions git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1918 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	2b31f51896	bugfix for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1915 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3286b1f498	re-organisation of lurl-creation and -stacking this was necessary to prevent useless write to the database in case of blacklist appearance of the url git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1905 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	9f979d4fa5	Domain-lists gzip-compressable and sendable via cr-send/receive git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1883 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	f6452879d5	prevent nullpointer git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1858 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	a8fa9990aa	default skins support git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1825 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3173b5c9b3	fixed port parsing during shutdown for extended port format git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1812 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	1b9b8922d9	* fixed problems with new basic 1-2-3 configuration (now authentication required) * fixed graphics problem * fixed some other problems with default values * 1-2-3 config now appears automatically on start-up if no password is set * added new config menu * moved profile to new config menu git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1792 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3703f76866	- fixed re-search bug: after a search with several words, a second search could not find the same words as before. This was caused because indexContaines stored the url references with a hashtable. A tree was needed to work with the index conjunction-by-numeration - added permanent ram cache flush (again) - removed direct flush of ram cache after a large container is added. this happens especially during DHT transmission and therefore this fix should speed up DHT transmission on server side. - removed unused and out-dated methods git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1765 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	013b24ea0d	First version of Italian translation by Riccardo Lemmi Updated german language with bugfix. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1726 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	34341a868e	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1701 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	bfd37e34aa	using other XML Parser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1693 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	9b941fb773	*) bugfix for usage of yacy with extended port binding (e.g. #eth0:8080, 192.168.0.1:8080, etc.) - port was reported incorrectly to other peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1678 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	7eb10675b3	re-organization of index management this was done to be prepared for new storage algorithms git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1635 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	40199cea1f	migration with svn Numbers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1623 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	1e4578aab6	VERY EXPERIMENTAL removal of index ram cache flushing thread. The cache will fill up and flushed explicitely when it is full. This shall remove double-access of assortments (indexing and flush) during indexing process. Hopefully this should reduce IO. The main idea is: the cache shall mainly be flushed by DHT transfer, and only indexes that shall be hosted by the own peer are flushed to the assortments. This needs further work. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1617 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	f61161b90b	fix for translations on startup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1566 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	7bd61ab0e5	Locales will now be in DATA/HTDOCS. So it works with readonly htroot. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1527 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	1f3eaf9f8e	use DATA/HTDOCS for notifier.gif. Works even if htroot is readonly git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1526 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	0fbe1a4515	*) Adding additional shutdown method which is neede to run yacy als windows service See: http://www.yacy-websuche.de/wiki/index.php/De:WinService git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1507 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	ec5d88664a	tried too fix serverSwitch synchronization problems see also: http://www.yacy-forum.de/viewtopic.php?p=16110#16110 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1499 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3419b3bcdd	fix for bug that caused the peer-counter problem. See http://www.yacy-forum.de/viewtopic.php?p=16016#16016 The kelondroDyn now uses a generic fill character. kelondroDyn-Tables containing peer/word/url-hashes must not use '_' as fill character. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1498 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	48e302252e	*) adding possibility to build a distribution containing an exe file for windows users see: build file target "distWinExe" git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1494 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	fa90c3ca7a	- removed some usage of indexEntity - changed index collection process: indexes are not first flushed to indexEntity, but now collected directly from ram cache and assortments git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1489 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	09dc7bbcd7	*) Adding function to scan seed.DBs for peers affected by the "too short peer hash"-Bug. See: http://www.yacy-forum.de/viewtopic.php?p=16056 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1488 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	2a7c958877	*) Adding function to scan seed.DBs for peers affected by the "too short peer hash"-Bug. See: http://www.yacy-forum.de/viewtopic.php?p=16056 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1487 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	c69f7a39a3	*) adding a startup-test to avoid running into the unzip bug See: http://www.yacy-forum.de/viewtopic.php?t=1763 http://www.yacy-forum.de/viewtopic.php?t=715 http://www.yacy-forum.de/viewtopic.php?t=1674 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1420 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	b4e2efef10	*) first test of new iteration function ATTENTION: please don't use it at the moment git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1418 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	f4ffa9aee5	- implemented more attributes to index entries - implemented hand-over of new word index attributes during remote search - implemented word-distance computation during search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1382 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	b453199c68	first step for a special migration class. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1365 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	695dfb7eab	*) -rwihashlist can now write to a zip-file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1347 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	4f8127946e	inc Files are now translatable git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1345 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	fe2d983c3e	recursive Translations! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1341 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hermens	971247b78f	- rotate merged indexes after merging see: http://www.yacy-forum.de/viewtopic.php?t=1717 - fix -rwihashlist to correctly shutdown git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1336 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	21fac0b6da	small bugfix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1310 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	2028403670	- consolidated different orderings to kelondroNaturalOrder - added another iteration method to rwihash-enumeration git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1309 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	9544c47684	added some UTF-8 handling. hope this will help somehow.. for shure not THE solution to our UTF-8 problem git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1308 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	537a819824	extended RWIHashList DHT control method: it is now possible to select only assortments or only files in WORDS selection of words only from the ram cache is not yet possible. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1305 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	8b6d31763d	*)added function to create a list of all RWI hashs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1287 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	9086261476	refactoring of base64 encoding: the kelondro database needs specific information about the order of base64-encoded keys. Since no other package depends on base64 (only the httpd uses base64 for encryption, but does not need to encode these strings) it is good to move base64 encoding to the new ordering classes in kelondro. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1284 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	d0c2c67f4c	Update YaWoStat version. See http://www.yacy-forum.de/viewtopic.php?p=14215#14215 for possible use. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1236 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	9b617bcb65	*)compression of -domlist now optional (-format zip git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1230 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	2bd4a66133	*)-domlist now creates a zipped txt-file. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1229 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	4500506735	fixed some bugs concerning url entry retrieval and intexControl interface git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1212 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	bb79fb5d91	- changed handling of error cases retrieving urls from database (no more NULL values are returned, instead, an IOException is thrown) - removed ugly damagedURLS implementation from plasmaCrawlLURL.java (this inserted a static value into the Object which is not really a good style) - re-coded damagedURLS collection in yacy.java by catching an exception and evaluating the exception message to do: - the urldbcleanup feature must be re-tested git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1200 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	5a627a690f	*) Extending hydrox urlDbCleanup function - now the function tries to correct the URL first - if the url can not be corrected it will be deleted See: http://www.yacy-forum.de/viewtopic.php?p=13898 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1197 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	96930f0d2b	*)added function to removed malformed URLs from urlHash.db git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1182 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	d007d14905	re-insert of migrateSwitchConfigSettings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1180 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	0e88ba997e	* added option to generate url-lists as plain text file or in html * modified generation of dom-lists so that they can be also generated as html these options can be called as: java -classpath classes yacy -domlist -format html java -classpath classes yacy -domlist -format html . java -classpath classes yacy -domlist -format text . java -classpath classes yacy -urllist -format html . java -classpath classes yacy -urllist -format text . the -format <type> can be ommitted. The text is default a home path can be asserted or omitted at the end of the parameters git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1178 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	37f88b4017	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1176 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	ec2b39c1ce	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1175 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	76618442e0	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1173 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	7920e1547d	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1163 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	1d6a6d1f85	code cleanup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1159 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	bfe51c7228	added generation of domain-list git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1112 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	8e308cf50e	*) Possibility to change the server port on-the-fly. - Now it's possible to change the server port without the need to restart the whole server. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1089 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	3631cb1f6d	*) deleting empty entities during index selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1086 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	ca26aab9b1	*) More debugging output for migrateWords git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1085 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	3c11d7b81c	*) Bugfix for minimizeUrlDB - function didn't work correctly because of new url hash structure See: http://www.yacy-forum.de/viewtopic.php?p=12753#12753 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1080 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	9913049009	fixed outOfMemory bug caused by loops in kelondroTree during enumeration git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1079 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	fd58d5f8e6	*) Adding possibility to specify the interface / IP-Address where YaCy should bind to. - e.g. Port = 192.168.0.1:8080 Port = #eth0:8080 Port = 8080 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1071 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	889de6686c	Migration in yacyVersion git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1070 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	79818a320f	introduced citation-rank transmission protocol and activate transport for anonymisation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1055 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	02f8013013	auto-delete of corrupted word files during word-migration git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1047 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
hydrox	56b9f34411	*)removed unused imports git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1015 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	4d1e56e4d9	fixed intermission-bug (removed 'break for intermission' of httpd-thread) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1009 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	4dcbc26ef1	introduction of search profiles; very experimental git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@976 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	02d9af1a70	) Restructuring and extending of Remote Proxy Support - remote proxy configuration can now be "really" changed on the fly and takes effect immediately - adding possibility to disable remote proxy usage for yacy->yacy communication - adding possibility to disable remote proxy usage for ssl - restructuring proxy configuration so that it is stored in a single place now ) Adding possibility to import a foreign word DB (or even more of them in parallel) at runtime into the peers DB - this can be done by calling IndexImport_p.html - ATTENTION: please not that at the moment this thread must be aborted via gui before a normal server shutdown is done. - TODO: integrating IndexImport Thread into normal server shutdown - TODO: Adding posibility to import crawl-queues, etc. from foreign peers - TODO: removing old import function from yacy.java and calling the new routines instead git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@968 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	a98bafb939	Changes to german language file git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@941 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	61502b33de	*) small modifications to importDB function - making it more failsafe - avoiding unnecessary exports of index word entries to string format and reimporting it afterwards git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@935 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	6260942590	changed search process: received indexes are now buffered and written to wordIndex after search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@934 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	b7e21ec107	*) Adding DB import function which allows to import an foreign yacy DB (from directory PLASMADB) into the DB of an other peer. ATTENTION: not tested very well. please use this with care and always make a db backup git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@932 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	371fd67ecf	headless awt mode git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@922 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	6d93ecf947	Thread.getAllStackTraces() removed, needs java 1.5 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@915 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	52036caeac	changed restart message git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@913 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	68aa215479	cleaned git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@866 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	fb27428674	added restart to Status.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@863 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	a2fa75e688	) Asynchronous queuing of crawl job URLs (stackCrawl) various checks like the blacklist check or the robots.txt disallow check are now done by a separate thread to unburden the indexer thread(s) TODO: maybe we have to introduce a threadpool here if it turn out that this single thread is a bottleneck because of the time consuming robots.txt downloads ) improved index transfer The index selection and transmission is done in parallel now to improve index transfer performance. TODO: maybe we could speed up performance by unsing multiple transmission threads in parallel instead of only a single one. ) gzip encoded post requests it is now configureable if a gzip encoded post request should be send on intex transfer/distribution ) storage Peer (very experimentell and not optimized yet) Now it's possible to send the result of the yacy indexer thread to a remote peer istead of storing the indexed words locally. This could be done by setting the property "storagePeerHash" in the yacy config file - Please note that if the index transfer fails, the index ist stored locally. - TODO: currently this index transfer is done by the indexer thread. To seedup the indexer a) this transmission should be done in parallel and b) multiple chunks should be bundled and transfered together ) general performance improvements - better memory cleanup after http request processing has finished - replacing some string concatenations with stringBuffers - replacing BufferedInputStreams with serverByteBuffer - replacing vectors with arraylists wherever possible - replacing hashtables with hashmaps wherever possible This was done because function calls to verctor or hashtable functions take 3 time longer than calls to functions of arraylists or hashmaps. TODO: we should take a look on the class serverObject which is inherited from hashmap Do we realy need a synchronization for this class? TODO: replace arraylists with linkedLists if random access to the list elements is not needed ) Robots Parser supports if-modified-since downloads now If the downloaded robots.txt file is older than 7 days the robots parser tries to download the robots.txt with the if-modified-since header to avoid unnecessary downloads if the file was not changed. Additionally the ETag header is used to detect changes. ) Crawler: better handling of unsupported mimeTypes + FileExtension ) Bugfix: plasmaWordIndexEntity was not closed correctly in - query.java - plasmaswitchboard.java *) function minimizeUrlDB added to yacy.java this function tests the current urlHashDB for unused urls ATTENTION: please don't use this function at the moment because it causes the wordIndexDB to flush all words into the word directory! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@853 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	7fc822a59b	changed handling of time-zones git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@801 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	4aa04972ac	bugfix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@777 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	7991c05b49	homePath instead if RootPath git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@775 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
allo	5bf7d74114	permanent yacy.logging see http://www.yacy-forum.de/viewtopic.php?p=10020 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@773 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	2f732e32a2	enhancements to memory menue git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@762 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	b5337a122c	some more information about available memory in PerformaceMemory menu git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@759 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	e748ba3f6e	super(), finals; other; cleaned; Properties; git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@755 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	fb52a82008	added new performance page for memory settings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@751 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
borg-0300	6d43a4970c	small changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@631 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	4fd5b95b1f	*) Renaming Logger function names to reflect the proper Java Logging API Loglevels - please use logFine instead of logDebug - please use logSevere instead of logFailure and logError See: http://www.yacy-forum.de/viewtopic.php?p=8726#8726 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@615 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	6adf8a4bde	*) Renaming Logger function names to reflect the proper Java Logging API Loglevels - please use logFine instead of logDebug - please use logFailure instead of logError See: http://www.yacy-forum.de/viewtopic.php?p=8726#8726 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@614 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	0dfa8b62e2	*) Changing Proxy-Useragent string according to thread http://www.yacy-forum.de/viewtopic.php?p=8183#8183 A typical useragent string now e.g. looks like: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.7.10; YaCy 0.401/00602; yacy.net) Gecko/20050716 Firefox/1.0.6 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@607 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	ba0a486328	moved printStackTrace() to logging git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@539 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago
allo	a223faace1	not recursive, but it should work on Windows. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@516 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago
allo	ee0a9a2d9b	recursive Translations. You can now translate the Menu and other things in subfolders, too git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@508 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago
orbiter	60eaf3dcde	fix for notifer.gif appearance git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@506 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago
theli	e6aced0162	*) Setting higher priority for session threads See: http://www.yacy-forum.de/viewtopic.php?p=6120#6120 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@490 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago
orbiter	2d8557cb10	minor changes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@487 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago
jerri	3334546340	Started the quest for in-source documentation with javadoc. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@483 6c8d7289-2bf4-0310-a012-ef5d649a1542	20 years ago

1 2 3 4 5 ...

301 Commits (45ae3da7e7dc5480ba005f119427914c64c472d5)