yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	90a02990d2	NPE fix, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3383#p3383 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4230 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	2fcd18a972	- fixed bad behaviour of search event worker processes - fixed export of url lists in xml git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4229 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	445c0b5333	added domain list extraction and html export format to URL administration menu http://localhost:8080/IndexControlURLs_p.html git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4228 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	d8d77fc4b2	fix for NPE, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3368#p3368 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4227 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	bf6952abe7	- added url export to http://localhost:8080/IndexControlURLs_p.html - removed command-line option to export urls git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4226 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	af10f729df	fixed image search and favicon loading git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4225 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	edba2b7bcc	fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=543 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4224 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	c48b73cda2	redesign of ranking data structure - the index administration now uses the same code base for url selection and collection as the search interface. The index administration is therefore a good test environment for ranking order control - removed old postsorting-algorithms, will be replaced with new one - fixed many bugs that occurred during ranking; especially the constraint filtering method removed too many links - fixed media search flags; had been attached to too many urls. The effect should be a better pre-sorting before media load within snippet fetch git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4223 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	6f1308da2f	- some enhancements to IndexControlURLs (shows more links, connects referrer to another query) - some refactoring to search process git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4222 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	bf9a9e4e5e	fix for NPE in IndexControlRWIs_p.java git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4221 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	c527969185	- enhanced monitoring of ranking parameters; for details, please try http://localhost:8080/IndexControlRWIs_p.html - fixed computation of ranking ordering in some cases git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4220 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	bd5673efbe	added cleaning of search event before opening the index administration git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4219 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	55da871211	preparations for better ranking: better debugging of index properties. To do this, the index administration interface was extended. It is now possible to select parts of an index. See properties shown in interface after a word search for details. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4218 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	383dc815d2	*) fix for commit 4212 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4217 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	3491531cea	- fixed 'appears in url' flag in index generation - extended index administration page, shows some properties to the web links now git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4216 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
daburna	19176e12e2	-corrected typo made in 4213 -updated translation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4215 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	ec7ba0d3d0	- fixed problem with too small sort fields (sortbound was not set) - slightly changed handling of date in indexURLEntry git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4214 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	ca4ca79eba	removed wrong hints to installation page. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4213 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	a01c42575d	*) 404 error pages will be displayed with correct CSS and favicon now (http://forum.yacy-websuche.de/viewtopic.php?t=482 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4212 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	2b57d64de3	*) typo (http://forum.yacy-websuche.de/viewtopic.php?t=428 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4211 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	bc2368e907	fix for problem with remote crawl referrers git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4210 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	875096552f	fix for NPE in case that remote search results are empty git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4209 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	64b3b79e44	- fix for termination problem with uniq() - addition to seed dna interpretation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4208 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	0abf33ed03	- tried to remove deadlock - enhanced searchtime in kelondroRowSets - enhanced uniq() - reverse enumeration causes less time in case of mass removal of doubles git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4207 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	a4010f7dc8	*) fixed bug where dots were added after numbers < 1000: "123" was transformed to "123." which is undesirable git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4206 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
daburna	da73cde86e	#German language file - some cosmetics git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4205 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	2421127612	fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=513&hilit= git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4204 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	2e91b724ad	fix for yacysearch/rss-feed bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4203 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	d0d2771883	disabled multiprocessing of rowCollection.sort for testing purposes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4202 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	edc4da5317	fix for division by zero in test routine git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4201 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	df38aaf7bd	update to RowCollection sort speed-enhancements: - better handling of small collections (less overhead) - usage of pre-sorted limits - different re-sort limit - more testing procedures git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4200 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	0eb60cfe6f	better handling of seed properties git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4199 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	ecba35de72	enhanced computing speed of kelondro core function: sorting. The enhancement was made by using better organized data structures and multi-threading during the sort. A sort can be divided into two separate processes once the first partition of the quicksort algorithm is done. Generating a separate thread and starting the thread takes only 10 milliseconds, so using a separate thread only makes sense if the data amount is large. Statistics about the speed-up: without enhancement: 250 milliseconds for 100000 entries; with data structure enhancement: 170 milliseconds for 100000 entries; with additional second thread (if second processor is present): 130 milliseconds. For dual-processor systems, this means about 100% speed-up. A test can be made with the following command: java -classpath classes de.anomic.kelondro.kelondroRowCollection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4198 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	6eaa5a0e64	enhanced local search speed. The ranking process is now 6 times faster than before. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4197 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
fuchsi	425e4ead66	Allow absolute paths in configuration settings. - before absolute paths would be expanded incorrectly, e.g.: fooPath=/a/b/c would become /path/to/yacy/root/a/b/c. Now you can put nearly every dynamically generated data with a configurable path to a location outside of yacys root dir without having to use symlinks (probably good for third party distribution packaging). - abstractServerSwitch.getConfigPath(setting, default) returns a File instance, either with an absolute path or relative to the applications root path. - exceptions (hardcoded): DATA/LOG/yacy.logging DATA/SETTINGS/httpProxy.conf DATA/SETTINGS/user.db TODO: all of these are the global configuration files and they should probably be put into _one_ command line configurable settings path, so it would be possible to package them in /etc/ for example. - add missing workPath to yacy.init (it was used in code, but there was no default in the file) - fix broken skinPath (was skinsPath in yacy.init but skinsPath in the code) + a few other broken config reading caused by typos. - replaced path setting names and their default values with the related static fields in plasmaSwitchboard where not already done/existing git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4196 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
borg-0300	e8d32d9f62	other loglevel git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4195 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
borg-0300	a5d28785b1	less OOM (works for me) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4194 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	794d296129	project link update git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4193 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	ccbfb15b6b	enhancement to crawl stacker enqueue order git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4192 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	93905e5c7b	fix for show-more bug git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4191 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
hermens	5c5344ae97	Beautify log git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4190 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
hermens	35cf196204	transferRanking(): Do not flush more ranking files than requested by caller. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4189 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
hermens	d0aa8cf25d	Only update handshaked peer's last seed date if it has not been updated recently. Until now the newer data was overwritten by old data from before the handshake. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4188 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
hermens	8f9d65da67	Small corrections to dhtFlushControl() - Test wCacheMaxChunk against maxURLinCache(), not getMaxWordCount(). This triggered a flush every time dhtFlushControl() was called. - If triggered, flush at least 1 entry. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4187 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	55c87b3b12	changed behavior of crawl stacker - final flush only when tabletype = RAM - prestacker (dns prefetch) only if tabletype = RAM and busytime <= 100 - number of maximum entries in stacker is configurable in yacy.init (stacker.slots) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4186 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
hermens	18144043e6	Correct UTC Offset at beginning/end of daylight savings time git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4185 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	4fefa53135	removed parser object pool, see also svn 4106 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4184 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	35b1bd66cd	The (old) yacy web page does not need to be part of the yacy distribution. The old yacy home page will be replaced by a new one. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4183 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	87b297b4d2	update of link to english forum git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4182 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	a31b9097a4	preparations for mass remote crawls: two main changes must be implemented to enable mass remote crawls: - shift control of robots.txt to crawl queue (away from stacker). This is necessary since remote crawls can contain unchecked urls. Each peer must check the robots to prevent that it is misused as crawl agent for unwanted file retrieval - implement new index files that control double-check of remotely crawled urls After removal of robots.txt checking from stacker threads, the multi-threading of this process is void. Multithreading has been removed. Also the thread pools for the crawl threads had been removed, since creation of these threads is not resource-consuming, for a detailed explanation see svn 4106 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4181 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
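
Commit 425e4ead66 in the list above describes configuration paths that may be either absolute or relative to the application's root path. The following is a minimal sketch of that resolution rule, not the project's actual code: the class name `ConfigPathSketch`, the method `resolveConfigPath`, and the default value `DATA/WORK` are hypothetical and chosen only for illustration. It assumes a Unix-like system, where a path starting with `/` is absolute.

```java
import java.io.File;

public class ConfigPathSketch {

    /**
     * Hypothetical helper illustrating the rule from the commit message:
     * an absolute configured path is used as-is, anything else is taken
     * relative to the application's root path.
     */
    static File resolveConfigPath(File rootPath, String configured, String defaultValue) {
        // Fall back to the default when the setting is missing or empty.
        String value = (configured == null || configured.isEmpty()) ? defaultValue : configured;
        File candidate = new File(value);
        // Absolute paths must not be prefixed with the root path.
        return candidate.isAbsolute() ? candidate : new File(rootPath, value);
    }

    public static void main(String[] args) {
        File root = new File("/path/to/yacy/root");
        // Absolute setting: stays outside the root directory.
        System.out.println(resolveConfigPath(root, "/a/b/c", "DATA/WORK"));
        // Missing setting: the (hypothetical) default is resolved against the root.
        System.out.println(resolveConfigPath(root, null, "DATA/WORK"));
    }
}
```

The check relies on `File.isAbsolute()`, so the outcome is platform dependent: on Windows, `/a/b/c` is not considered absolute and would be resolved against the root path.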

4025 Commits (814aff60bdf4cf2e83d2651cda88eb19ca35e1cd)
