yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	2cb16824e3	removed support for old database structures. The new collection index will be more generalized to support other indexes i.e. YBR block-rank computation. A clean-up of the many conditions to support the old database was necessary. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3506 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	3688ec33e5	release 0.51 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3501 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4783a30910	- fixed a flush problem in balancer - return to idle divisor in RWI RAM cache flush git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3485 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1cba31de43	redesigned ram organization for database caches - each cache can now allocate as much memory as is available - no more fixed limits - replaced old performance memory monitor by new one - added supervision methods as static functions into the classes that provide cache functionality - steering of ram allocation is done with two simple limits that are ram availability-relative git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3434 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	304412a049	first generation of collection index R/W head path optimization - collections are now hand-over as collection lists to collection index for merge opertations - collection index lists are separated into 'new' and 'extend' lists - lists are written separately - write operations are done into array sets and array indexes. These are now serialized - write operations into index files are sorted by index; that means that a R/W head does not need to go forward and backward, only forward More enhancements are possible git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3407 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	dc0c06e43d	PLEASE MAKE A BACK-UP OF YOUR COMPLETE DATA DIRECTORY BEFORE USING THIS redesign for better IO performance enhanced database seek-time by avoiding write operations at distant positions of a database file. until now, a USEDC counter was written at the head-section of a kelondroRecords database file (which is the basic data structure of all kelondro database files) to store the actual number of records that are contained in the database. Now, this value is computed from the database file size. This is either done only once at start-time, or continuously when run in asserts enabled. The counter is then updated only in RAM, and written at close of the file. If the close fails, the correct number can be computed from the file size, and if this is not equal to the stored number it is a strong evidence that YaCY was not shut down properly. To preserve consistency, the complete storage-routine had to be re-written. Another change enhances read of nodes in some cases, where the data-tail can be read together with the data-head. This saves another IO lookup during each DB node fetch. Includes also many small bugfixes. IF ANYTHING GOES WRONG, ALL YOUR DATA IS LOST: PLEASE MAKE A BACK-UP git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3375 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1f1f398bfa	enhanced speed of RAM cache flush by factor 20 (twenty times faster) - the speed was doubled by avoiding read access during the dump - the speed was dramatically increased at least by factor 10 by using a temporary ram-file where the structures are flushed to before it is dumped then as a whole byte-chunk to the file system. The speed enhancements also affects some other parts of the database. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3353 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b2f4087400	redesign of last-seen fieln inside seed: the field contains now a time in UDC-0 (instead relative to local UDC offset) this fixes a bug in peer selection, where an iteration over all seeds ordered by lastseen did not work correctly. Problems may occur because the new meaning of this field may mix with the different meaning of that field in older peers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3322 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b123a404b0	added mime types added peer name in search statistics git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3285 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	cf49d5b0a7	Version switch to 0.501 by /me as Orbiter is at 23C3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3139 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	9b726ac366	release 0.50 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3132 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	0a050bc043	enhanced ranking - redesign of data storage in plasmaSearchRankingProfile - profiles are extended by new ranking parameters - new RWI ranking parameters are considered during ranking - appearance attributes (i.e. emphasised text) is now considered - faster ranking - some attributes that had been checked during post-ranking can now be checked during pre-ranking phase - removed old ranking parameter on index.html page (will be replaced by profiles in the future) - ranking can now consider appearances of media content - snippet-loading for media types now work correctly (fetches only from the wanted media) - ranking-profiles can be handed over the remote peers and apply there also - re-search of same query with different domain now also re-triggers remote search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3105 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c500178fd7	redesign of index creation interface - the input remains in the IndexCreation menu point - after pressing the submit button, the IndexingMonitor is called - the code for creation of new indexing starts was moved to the indexingMonitor - Existing crawl profiles can be monitored in the Indexing Monitor - the code for creation of crawl profile data was shifted from indexing start to indexing monitor - existing crawl profiles can be deleted on the crawl monitor page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3095 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	7ff86d6ba6	- image search now shows thumbnails (in bad order, but it works) - repaired DHT selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3081 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	10d888e70c	- added a media search for images, audio, video and applications - new search options on search page - new option in ViewInfo to display all links of a file - enhanced collection data structure git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3054 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	052f28312a	removed assortments from indexing data structures removed options to switch on assortments git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3041 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2372b4fe0c	release 0.49 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3040 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f8efb3c948	fixed a null pointer exception problem reported in the forum. I cant find the forum entry any more because my girlfriend switched off the power while the forum window was open. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3039 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	30888e7a2f	implementation of search constraints Such constraints may formulate specific restrictions to web searches This is implemented by scraping information for constraints from a web page during parsing, and storing flags to the pages within the web index. In this first step, only information for index pages ("index of", directory listings) are scraped and stored in flags - added new flag class kelondroBitfield - added scraper method in condenser - added bitfield structure for all scrape types (see also condenser) - added bitfield structure for appearance locations (see RWIEntry) - added handover protocol for remote search and index distribution - extended kelondroColumn class to hold bitfield types - added another search attribute on search page (index.html) - extended search-filter to enable filtering of non-matching constraints - set all new database types to be default - refactoring: moved word hash generation to condenser class git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2999 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	09bcc10344	bugfix for some problems of last change with assortments git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2986 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1751a799ac	- deactivated all write buffers - fixed a storage bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2933 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	147d88cf23	re-design of database caching this should reduce IO a lot, because write caches are now actived for all databases - added new caching class that combines a read- and write-cache. - removed old read and write cache classes - removed superfluous RAM index (can be replaced by kelonodroRowSet) - addoped all current classes that used the old caching methods - more asserts, more bugfixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2865 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	4e363108e1	- removed bad debug code that caused a large and unnecessary delay during global search - fixed problem that global search results disappear after a search - removed some stopwords git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2861 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	77a59a115d	refactoring of indexing methods git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2787 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	6396f5971e	bugfixes and migration attempt toward new kelondroFlex db - more synchronization - bugfix for remove in collections - bugfix in kelondroFlex (wrong exception condition!) - options to use RAM, FLEX and TREE tables for Crawl URL stacker - default for Crawl URL stacker is now FLEX (!) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2746 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	97e24b63c7	release 0.48 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2743 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	86047f439d	removed very bad bug that prevented production of any remote search result :-((( Please update! git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2724 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b59d4576af	increased version number to emphasise that the snippet fix _dramatically_ increased search speed git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2690 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	df1629b05a	- code cleanup - version 0.471 - moved surftipps to own web page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2676 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2463e5624a	'quick' release 0.47 - documentation update - necessary bugfixes (missing css for new peers) - reduced effect of search result redundancy filter - removed some debug output, but not all git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2665 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	2d3b96eeba	bugfixes for surftipps - added missing authorization check for votes - second vote on same entry was possible after complete publishing of current vote git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2645 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	c89d8142bb	replaced old 'kCache' by a full-controlled cache there are now two full-controlled caches for incoming indexes: - dhtIn - dhtOut during indexing, all indexes that shall not be transported to remote peers because they belong to the own peer are stored to dhtIn. It is furthermore ensured that received indexes are not again transmitted to other peers directly. They may, however be transmitted later if the network grows. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2574 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1137605edf	- small change to DetailedSearch layout - version 0.463 for new xhtml interface git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2548 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	ae4e8ce03e	- cut for 'probably last html-interface version': version number update - small enhancement to ranking git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2536 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
rramthun	e34e07e0a1	- Changed back to dev namescheme and new 0.461 - Corrected some errors in News.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2450 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	27a159b401	* documentation update * removed doc from release * release information in doc/News.html * release 0.46 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2442 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	23dd972608	fixed memory calculation in performanceMemory web page fixed also maximum cache size computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2429 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	d468d665c9	some changes that may help to prevent deadlocks that cause an OutOfMemoryError as described in http://www.yacy-forum.de/viewtopic.php?p=24359 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2353 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	417ed5102e	redesign of database iterators: an iteration of key elements in kelondroTree databases is no longer supported. this is now replaced by an iteration of kelondroRow.Entry objects from the database Iteration of keys from the database was mostly followed by retrieval of the row from the database, whcih caused unnecessary database load. The index selection was also redesigned to use the new row iteration methods. This affects many funktions, most important is the DHT selection routine which is now much faster. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2327 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	dd2865178a	major bugfix (searched a whole week for the bug) for the kelondroRowBuffer, which has effect mostly to the kelondroFlexTable but also to all other database functions git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2260 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	66964dc015	removed high/med/low from kelondroRecords cache control. this was done because testing showed that cache-delete operations slowed down record access most, even more that actual IO operations. Cache-delete operations appeared when entries were shifted from low-priority positions to high-priority positions. During a fill of x entries to a database, x/2 delete situation happen which caused two or more delete operations. removing the cache control means that these delete operations are not necessary any more, but it is more difficult to decide which cache elements shall be removed in case that the cache is full. There is not yet a stable solution for this case, but the advantage of a faster cache is more important that the flush problem. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2244 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	3a8aecb2ed	reflect new database and authentication methods with version number git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2216 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
theli	45b39ee1be	*) solving unpacking problems with to long filename by a) renaming the parent folder in the tgz file to yacy (can be configured via build properties file) b) reconfiguring build file to throw an error if a file name is too long Please note that currently there is _no_ proplem with too long class names because of step a. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2207 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	572d53506c	new kelondroRow objects now replace byte[][] objects in object cache git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2161 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	338047e056	replaced kelondroDyn write methods for properties and maps by faster version this affects news, robots?, bookmarks?, blogs, the wiki, seed-db, news etc. this all should create less IO git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2107 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
rramthun	0bf51edb40	Changing version number according to http://www.yacy-websuche.de/wiki/index.php/De:Versionsnummern git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2051 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	93eb4f14e6	release 0.45 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2047 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	0f005cbc28	code freeze please for next release git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2033 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	00a5d435e2	- fixed some bugs with domain filter - added new ranking filter "prefermask": urls that match the filter are ranked better git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2022 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago
orbiter	f0833b0328	introduced simple search interface git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2007 6c8d7289-2bf4-0310-a012-ef5d649a1542	19 years ago

1 2 3

103 Commits (8463e29b148c65edea9900117d8c7cd0c4736197)