yacy_search_server

Commit Graph

Author	SHA1	Message	Date
low012	1af728ae09	*) regex for site operator changed as proposed by Lotus git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5441 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	9e58ae036d	*) added site operator which can be used to only show results from a certain domain. example: "test site:edu" shows only documents which contain the word test and which come from an edu domain git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5439 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	19e7c56f7f	*) apply filter to dir list to only show .black files as blacklists git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5438 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	c4c4c223b9	fixed a problem with attribute flags on RWI entries that prevented proper selection of index-of constraint git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5437 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	4bffe664ca	) moved entry field for new expressions to top of the list as requested in forum (http://forum.yacy-websuche.de/viewtopic.php?f=9&t=1678 ) ) added some Javascript to disable list selection on bottom of list in cases it is not needed (edit, delete) and only enable it if needed (move), if JS is turned off everything will work as usual git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5435 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	9d5d30f877	*) http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1672 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5422 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	5448aad328	removed unused code git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5421 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	28d2d28573	added support for filetype search (just use filetype:<type> in the search query) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5418 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	78c568331e	added test channel to /xml/feed.rss can be obtained with http://localhost:8080/xml/feed.rss?set=TEST returns always a single feed entry with a fresh date git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5416 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	e004da48d3	- added fast fingerprint computation for files (any). Will be used in new index dump method - refactoring git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5415 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	eab72424df	*) Fixed small bug: When adding new elements to blacklist via import, the blacklist which the elements were added to was supposed to be displayed, which did not work correctly. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5414 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	0e56675596	*) cleaning up ;-) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5413 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	cf69557ea2	) blacklists can be exported as XML or plain text now ) blacklist import via file upload works now git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5412 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	1594a15be9	*) explicit mentioning of blacklist in blacklist cleaner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5411 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	5a89266598	*) new parameters for future use (better blacklist handling for im- and export) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5403 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	e34ac22fbd	- added new monitoring servlet at http://localhost:8080/PerformanceConcurrency_p.html - used the new monitoring to do some fine-tuning of the indexing queue git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5402 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	d376d81fc4	replaced busy thread control of crawl stacker by blocking threads git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5400 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	7535fd7447	- refactoring of CrawlEntry and CrawlStacker - introduced blocking queues in CrawlStacker to make it ready for concurrency - added a second busy thread for the CrawlStacker The CrawlStacker is multithreaded. It shall be transformed into a BlockingThread in another step. The concurrency of the stacker will hopefully solve some problems with cases where DNS blocks. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5395 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	6569cbbec1	npe fix: http://forum.yacy-websuche.de/viewtopic.php?t=1646 (break to avoid bad side effects) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5394 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	2802138787	- refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck) - fix of shallBeOwnWord target computation heuristic git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5392 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	b1e211b258	no error-alert: http://forum.yacy-websuche.de/viewtopic.php?t=1639 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5391 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	13cb0916ee	changes to statistics and content of thread dump servlet (points now more directly to performance leaks without mentioning class calls inside of sun/java calls that cannot be changed anyway) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5390 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	e1acdb952c	fix for problem with userDB and bookmarksDB which was caused by changes in kelondroRA in SVN 5376 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5385 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	e918d64c23	show hand-cursor an labels git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5383 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	4a2dac659e	more speed hacks: - modified and activated write buffer - increased cache flush factor - fixed a problem with deadlocking of indexing process git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5382 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	1fb518a5b4	display <String> etc. git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5380 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	47292e696a	more performance hacks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5379 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	bd1dc9cd5d	thread dump with statistics, a little bit of profiling git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5377 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	d39d420b39	performance hacks git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5376 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	5280ad638d	added basic performance page other performance settings can be found on advanced settings git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5375 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	1a51d9fcfd	display proper values git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5374 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	0b4808ba3d	added new interactive search feature: - during the user types search queries, the local database is searched - results are presented interactively This was implemented using a new JSON result format for search results in YaCy - added JSON as file format for servlets - refactoring of current search servlets (xml and html) - added JSON output format for search results - added AJAX-based search page, that uses the yacysearch.json selrvlet to print results as a query is typed git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5373 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	fea82b54ef	more contrast on search snippets git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5370 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	1951d30a62	addendum to last commit handle words with length < 3 correctly git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5369 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	325ba7bfb8	only query words with length > 2 this is not complete, yet git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5368 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	489edb4473	improved pattern selection git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5367 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	e423fa9846	) added method to only get file names in directory listing which match a filter ) only files which end with .black will be listed as blacklists *) added a little bit of Javadoc git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5366 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	577b53aee6	added more search engines git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5365 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	7f4d411c0d	npe-fix git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5364 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	1545e5440a	* index deletion: checkbox-confirmation * watch crawler: less load on exhausted peers; wait for data before reloading again git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5359 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	10f5ec1040	reverted last commit (more testing needed) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5356 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
daburna	ba5b274b8c	#translation update: -blacklist -crawlstart ... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5353 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	0ca4bc7b79	- added reader and visualization for mediawiki-export files: files exported from mediawiki using the xml schema according to http://www.mediawiki.org/xml/export-0.3/ can be processed to be viewed in a YaCy servlet. To acces such a file, place it into DATA/HTCACHE/mediawiki/ i.e. the export from german wikipedia would be: DATA/HTCACHE/mediawiki/wikipedia.de.xml This file can then be accessed using the URL http://localhost:8080/mediawiki_p.html?dump=wikipedia.de.xml&title=YaCy if this is done the first time, an index file is created (for this case: more than 4 million lines must be written, this takes about 15 minutes) Then try the same url again. - enhanced also the md5 computation speed git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5352 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	4f996a7651	fix for logparser pattern git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5349 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	867d0f2f56	removed some unnecessary pause delays git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5346 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	fd83e59f8e	new remote search average git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5343 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	dba7ef5144	extended crawling constraints: - removed never-used secondary crawl depth - added a must-not-match filter that can be used to exclude urls from a crawl - added stub for crawl tags which will be used to identify search results that had been produced from specific crawls please update the yacybar: replace property name 'crawlFilter' with 'mustmatch'. Additionally, a new parameter named 'mustnotmatch' can be used, which should be by default the empty sring (match-never) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5342 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
orbiter	0ae84f4f8e	set some default values for a crawl start that should cause less confusion and mistakes git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5334 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
lotus	4745e89451	auto-choose crawl type git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5331 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago
low012	421d056550	) changed layout of blacklist adminstration (less cluttered) ) it is possible to move/edit/delete more than one entry at a time now ) it is easier to choose a target for blacklist import now ) fixed several bugs *) to be continued... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5330 6c8d7289-2bf4-0310-a012-ef5d649a1542	17 years ago

1 2 3 4 5 ...

2318 Commits (1af728ae09a9a2344a900de28540da81c43e96e3)