yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	441fbc26e2	security patch for WeakPriorityBlockingQueue (produced a deadlock) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7307 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	5dcb838293	- removed thread overhead when calling dns services - fixed localsearch (changed it by accident) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7306 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4c50d3428e	smaller file size for array stacks to support smaller deletion sizes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7305 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	becc463d8a	enhanced did-you-mean git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7300 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	93c535d111	fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113 fixed a concurrent modification exception during search and a time-out problem git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7298 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	04932dc268	added rdf data structure for rss feeds git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7297 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	84f2953cd8	fix for rss loader / rss type recognition git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7296 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	4c72885cba	added a sitemap entry parser and loader for sitemaps (a recursion if a sitemap refers to another sitemap) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7295 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	445619f3ec	added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7291 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
sixcooler	85c65475fa	smal but important correction of last commit @ HTTPClient (if there is a response it really should be taken to its end) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7290 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	acd93b1b31	* add failsafe mechanisme to domainlist retrieval domainlist is saved locally, if none of the given urls in network.unit.domainlist could be retrieved, the file from the last boot is used instead git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7289 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	70c95608d4	Added CORS Access header for yacysearch.rss output used some of the recommendations from Copro: http://forum.yacy-websuche.de/viewtopic.php?p=21015#p21015 Original Request: http://forum.yacy-websuche.de/viewtopic.php?p=20829#p20829 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7288 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	def4253555	* add option to network definition to provide a domainlist (syntax like in blacklists) * crawler and search allow only urls matching one in domainlist (if list is provided) * this may be useful to prevent dedicated networks from being "polluted" * FilterEngine is improved Backlist-object, Blacklist may inherit from FilterEngine in the future git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7285 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	fb92f9ae8e	added mime type image/jpeg (image/jpg is wrong but it is left here because it does not harm and this error also exists in configuration of web servers) see also: http://forum.yacy-websuche.de/viewtopic.php?p=21129#p21129 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7279 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	155d556568	- better memory protection - more logging - little bit of refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7278 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
f1ori	7d8de34778	* add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7276 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	e3e3b49d52	- enhanced main release recognition - yacybot user agent now includes the yacy network name (not the peer name!) - refactoring and clean-up (mostly turned tab into spaces) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7266 6c8d7289-2bf4-0310-a012-ef5d649a1542	14 years ago
orbiter	58e74282af	added a word counter statistic in condenser which is used by the did-you-mean to calculate best matches for given search words. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7258 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	863065abc4	added user agent logging to access tracker git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7256 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	ed4371dcf3	enhanced navigation implementation and enhanced tag cloud computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7252 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	ca738ac924	- added a tag cloud to search results (using the topics) - some refactoring of score classes - added default package for new classes add_ymark and delete_ymark git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7251 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	e4d561971e	added more score cluster options and made score cluster usage more transparent git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7248 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	7cd9d9d22a	- enhanced DidYouMean computation using a faster count on index entries; this causes that results can be ranked better - added limitations on DidYouMean result sets according to input and output string length git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7246 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	de722090b5	enhancements in did-you-mean guessing git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7243 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	24f1cba7b2	performance hacks: - faster generation of index abstract compression during remote search - less synchronization in IO record reading - request index abstract generation only if necessary and faster time-out in remote search git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7239 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	d607b30b6a	performance enhancements for search and code review for database functions - removed read cache from Records data structure because the read cache had no cache hit during search operation - copied old read-cache class to CachedRecords and the old, now new Records class does not have the cache any more and a code review checked that data structures and synchronization is clean - removed unnecessary synchronization from Table class during get() git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7237 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	fcd40cd30f	- disabled domZones (buggy, must think about better solution) - increased time-out for dns resolver and isLocal property git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7233 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	ec38eca278	fix for new URI equal method git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7232 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	0d363a94d7	more performance hacks this makes YaCy search results VERY fast for all verify=false search cases and it enhances the search speed also for all other snippet-fetch cases. With this change my peer performed 100 Queries Per Second (!!!) while doing 10 queries simultanously (!!!) in an intranet index of 20000 URLs on my 16-core Mac Check this yourself by doing: cd bin ./searchtestmulti.sh after finishing the run, divide 1000 by the given time per query (which is the qps for one thread) and then multiply again by 10 (because 10 search threads has been started) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7231 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	b8aee6d402	performance hacks for better search performance git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7230 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	091dd3f6ec	- enhanced intranet search speed - enhanced intranet portscan speed (better time-out) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7227 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	aacf572a26	- enhancements for search speed - bug fixes in many classes including basic data structure classes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7217 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
sixcooler	61c82f3105	gzip-compresson @ transferRWI & transferURL back again This reduce upload-volume to suit limited bandwidth of home-users like me :-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7215 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	2c549ae341	fixed a number of small bugs: - better crawl star for files paths and smb paths - added time-out wrapper for dns resolving and reverse resolving to prevent blockings - fixed intranet scanner result list check boxes - prevented htcache usage in case of file and smb crawling (not necessary, documents are locally available) - fixed rss feed loader - fixes sitemap loader which had not been restricted to single files (crawl-depth must be zero) - clearing of crawl result lists when a network switch was done - higher maximum file size for crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7214 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	3057a0b939	- intranet scanner now produces urls with host names, not ips if possible - CrawStartIntranet servlet shows IPs and host names git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7210 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	e63896f2a8	added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7203 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	e54cb7fb0c	more bugfixes (also for latest commit) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7202 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	be6b48311c	misc bugfixes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7201 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	d2fd93135c	- moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed - migrated the 'yacy' user agent to 'yacybot' in many client methods since the 'yacy' user agent is only used for the proxy git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7199 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	48c0d508ac	fixes for crawling of smb links (file length not always available) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7190 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
f1ori	e670e1ef8e	add charset auto-detection for htmlParser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7186 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
f1ori	ddcd5ae78c	fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2989 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7185 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
f1ori	8fe1102452	fix http://forum.yacy-websuche.de/viewtopic.php?p=20889#p18426 reuse code from htmlParser git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7184 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	10a9cb1971	simplified snippet computation process and separated the algorithm into two classes also enhances selection criteria for best snippet line computation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7182 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	84a023cbc8	fixed several search bugs git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7180 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	09c208a3ab	patch for corrupted database files (just work on and forget key) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7177 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	97ee278931	enhanced search speed: - better control of number of running search threads - no time-out waiting time when no ranking feeding takes place - local search queries by a remote peer may be faster up to 300 milliseconds - a local search may even be faster git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7176 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
f1ori	b392ca5024	* add option to show YaCy version, usage: java -cp lib/yacycore.jar net.yacy.yacy -version git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7174 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	ac73072924	added a demonstration class: integrate the YaCy search results in own applications This class requests a YaCy peer remotely and produces search result objects. The class was implemented in such a way that it is as short as possible. To get a better integration of search results, use the cora package. This class is fully stand-alone, it does not need any other external library other than already contained in JRE. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7173 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago
orbiter	8da4eb5de6	addition to patch in SVN 7111 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7170 6c8d7289-2bf4-0310-a012-ef5d649a1542	15 years ago

1 2 3 4 5 ...

415 Commits (2c539b514a320ba469b779ffdcbbab3640d7e7c0)