yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	67edfd991c	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
orbiter	d9173ba7ed	added more solr fields to integrate values from URIMetadataRow. All writes to the metadata DB are now also written to solr. This includes metadata transfer during search and rwi transfer. The new solr fields are: load_date_dt (time when the resource was loaded); fresh_date_dt (date until which the resource shall be considered fresh); host_id_s (id of the host, a 6-byte hash that is part of the document id); referrer_id_ss (ids of referrers to this document); md5_s (the md5 of the raw source); publisher_t (the name of the publisher of the document); language_ss (the languages used in the document, starting with the primary language); ranking_i (an external ranking value); size_i (the size of the raw source); audiolinkscount_i (number of links to audio resources); videolinkscount_i (number of links to video resources); applinkscount_i (number of links to application resources)	12 years ago
Michael Peter Christen	70b10e8316	added the JSON response writer to solr interface, add &wt=json to the servlet GET properties to use this format	12 years ago
Michael Peter Christen	3276508d1b	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	3ce04cecf3	bad hack to prevent a bug appearing in solr	12 years ago
sixcooler	f32aa9a49c	prevent merge of blobs that can't be handled in memory	12 years ago
Michael Peter Christen	bbd242afb4	fix for a NPE	12 years ago
Michael Peter Christen	8d944f6517	nowrap from gaston in forum http://forum.yacy-websuche.de/viewtopic.php?p=26815#p26815	12 years ago
Michael Peter Christen	24d9db1613	snippet retrieval loading processes may use a smaller minimum load time value than crawling processes. This speeds up the search result preparation dramatically.	12 years ago
Michael Peter Christen	ef488a15f7	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	1687737771	Abstraction of HandleMap and HandleSet	12 years ago
sixcooler	76b037a20a	check content domain fix: search image/media should not show pages containing image/media; search text should show all/text but image/media	12 years ago
sixcooler	9cd409682f	close augmented stream if filled from cache to get its content; use augmented stream only if proxyAugmentation is set	12 years ago
Michael Peter Christen	e432bb9cd9	better calculation of possible saving in HeapReader index data structure	12 years ago
Michael Peter Christen	9549984c65	documentation/comments	12 years ago
Michael Peter Christen	beb6425f0c	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
sixcooler	83c93e9209	no translation of queue-links	12 years ago
Michael Peter Christen	3bcd9d622b	cleaned up classes and methods which are either superfluous at this time or will become superfluous or subject to a complete redesign after the migration to solr. Removing these things now will make the transition to solr simpler.	12 years ago
Michael Peter Christen	6f1ddb2519	Moved solr index-add method to the same method where the YaCy index is written. Also did some code cleanup.	12 years ago
Michael Peter Christen	315d83cfa0	cleanup	12 years ago
Michael Peter Christen	1f41d9c6f5	bugfix for a NPE	12 years ago
Michael Peter Christen	76202f068e	extended abstraction of local and remote solr index using one front-end for index administration and querying.	12 years ago
Michael Peter Christen	d3f243e2e1	fixed node type calculation for principal peers	12 years ago
Michael Peter Christen	7ec7341f60	added user-authentication protection to solr search (same as implemented for yacysearch)	12 years ago
Michael Peter Christen	e2a97ef8f6	better explain how to access the embedded solr	12 years ago
Michael Peter Christen	826967513b	changed options in IndexFederated_p to switch on/off parts of the index individually. The settings are experimental and the values of the settings will be overwritten when an index migration from urldb to solr starts.	12 years ago
Michael Peter Christen	cba4ab862e	fix for http://bugs.yacy.net/view.php?id=202	12 years ago
Michael Peter Christen	b76836db7b	Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1	12 years ago
reger	36c9875b6e	removed localized number formatting from num-results_totalcount response (this is only used in xml and json where localized format is not valid)	12 years ago
Michael Peter Christen	0640a6f7e6	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
orbiter	69e743d9e3	- more abstraction for the RWI index as preparation for solr integration - added options in search index to switch parts of the index on or off	12 years ago
orbiter	6cc5d1094e	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
orbiter	05a3ffd03a	patches to ensure that solr connectors are active only if they have a solr object assigned and vice versa	13 years ago
orbiter	5a3c829872	embedded solr is only initiated if it is activated with IndexFederated_p.html	13 years ago
Michael Peter Christen	161005ceaa	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
Michael Peter Christen	bf4968d748	source change in classpath	13 years ago
Lotus	3a350a2f83	partial html fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4454	13 years ago
orbiter	49ee31f837	added classpath for htroot/solr	13 years ago
Michael Peter Christen	97b7bcf2a6	added a solr search index: by default, an (empty) solr storage instance is created at SEGMENTS/solr_36; the index is written if in /IndexFederated_p.html the flag "embedded solr search index" is switched on; a standard solr query interface is now available with a new servlet at http://127.0.0.1:8090/solr/select. To test this, do the following: (1) switch to webportal mode; (2) switch on the feature as described; (3) do a crawl, which fills the solr index (the normal YaCy search will NOT work now!); (4) do a solr query, like http://127.0.0.1:8090/solr/select?q=*:* or http://127.0.0.1:8090/solr/select?q=text_t:Help. Play with the different search fields listed in /IndexFederated_p.html. You can use the standard solr query attributes as described in http://wiki.apache.org/solr/SearchHandler	13 years ago
Michael Peter Christen	f0a079ac9f	allow larger log entries	13 years ago
Michael Peter Christen	9b48c9fe2e	removed a crawler overhead (terminated loop which searches greatest stack that has zero-waiting urls). This should cause a slightly faster crawl for crawl stacks with many different domains in the crawl queue.	13 years ago
Michael Peter Christen	784a4abb18	enhancement in internal data organization which should generate less synchronizations in database access	13 years ago
Michael Peter Christen	f78ce93a80	collection of speed and memory saving hacks	13 years ago
orbiter	c00a3cf74d	less usage of generic logger to avoid logger generation overhead	13 years ago
orbiter	a196f24f60	prevent enqueueing of non-loggable logging entries	13 years ago
orbiter	482afed07c	reduced logging overhead (a bit)	13 years ago
orbiter	e76159040b	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
orbiter	bbfa497a3c	replaced more size() > 0 by !isEmpty()	13 years ago
Michael Peter Christen	58e7d1952f	reduction of logging to prevent too much IO caused by logging	13 years ago
Michael Peter Christen	83da68c4c1	fixed a memory leak inside the logger which appeared if the log was written faster than the logger is able to print it to its output stream. A very large collection of unwritten log outputs had been seen during heavy crawling. The new ArrayBlockingQueue is limited to prevent this case.	13 years ago
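
Commit 83da68c4c1 above replaces an unbounded log buffer with a size-limited ArrayBlockingQueue. The following is a minimal sketch of that idea, not the code from the commit: the class name BoundedLogQueue, the drop counter, and the choice to drop entries (rather than block the producer) when the queue is full are all assumptions made for illustration.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

// Hypothetical illustration; the real YaCy logger may block instead of dropping.
class BoundedLogQueue {
    private final BlockingQueue<String> queue;
    private long dropped = 0;

    BoundedLogQueue(int capacity) {
        // The fixed capacity caps how many unwritten entries can pile up in memory.
        this.queue = new ArrayBlockingQueue<>(capacity);
    }

    /** Tries to enqueue an entry; returns false (and counts a drop) if the queue is full. */
    synchronized boolean enqueue(String entry) {
        boolean accepted = queue.offer(entry); // offer() never blocks and never grows the queue
        if (!accepted) dropped++;
        return accepted;
    }

    /** Removes and returns the oldest entry, or null if the queue is empty. */
    String poll() {
        return queue.poll();
    }

    int size() {
        return queue.size();
    }

    synchronized long dropped() {
        return dropped;
    }

    public static void main(String[] args) {
        BoundedLogQueue q = new BoundedLogQueue(2);
        q.enqueue("a");
        q.enqueue("b");
        q.enqueue("c"); // rejected: the queue already holds 2 entries
        System.out.println(q.size() + " queued, " + q.dropped() + " dropped"); // prints "2 queued, 1 dropped"
    }
}
```

Using put() instead of offer() would make a fast producer wait for the writer rather than lose entries; which trade-off the commit chose is not stated in the message.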

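Commits 97b7bcf2a6 and 70b10e8316 above describe a solr select servlet at http://127.0.0.1:8090/solr/select that accepts a q parameter and, optionally, wt=json. A small sketch of building such a request URL follows. The class SolrSelectUrl and its build method are hypothetical helpers and not part of YaCy; only the endpoint, the text_t field, and the q and wt parameters come from the commit messages.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only.
class SolrSelectUrl {
    /**
     * Builds a select URL from a base endpoint, a solr query string and an
     * optional response writer type (pass null to omit the wt parameter).
     */
    static String build(String base, String query, String writerType) {
        StringBuilder sb = new StringBuilder(base)
                .append("?q=")
                .append(URLEncoder.encode(query, StandardCharsets.UTF_8));
        if (writerType != null) {
            sb.append("&wt=").append(URLEncoder.encode(writerType, StandardCharsets.UTF_8));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Endpoint and field name are taken from the commit message above.
        System.out.println(build("http://127.0.0.1:8090/solr/select", "text_t:Help", "json"));
    }
}
```

The colon in the query is percent-encoded as %3A, which a servlet container decodes back to text_t:Help before solr sees it. The encode(String, Charset) overload requires Java 10 or later.
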
8690 Commits (67edfd991c922a03edf4b0cda17b784468654263)
