yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	840fa22135	disabled clickdepth computation during craling since that is repeated during clean-up phase.	12 years ago
orbiter	a734fbc4a5	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
orbiter	d74472f562	corrected result counter	12 years ago
orbiter	2555542f7a	removed the dns prefetch because that was not soo useful	12 years ago
orbiter	aa3c26c62e	added recrawl/reload to CrawlStartSite for a timeout of 3 days	12 years ago
orbiter	c1b7e61882	added option to create empty vocabularies	12 years ago
bubu	e0edad689d	fix link to IndexSchema_p.html	12 years ago
Michael Peter Christen	d957739441	removed size request	12 years ago
Michael Peter Christen	c95a84103a	complete redesign of search process: - removed 'worker' processes - no internal time-out behaviour: methods either are successful or return null - waiting is only done on top-level - removed snippet-production; this is replaced by solr snippets - removed statistics based on solr size queries (they had been VERY long); the statistics (like suggestions or tag cloud) are now again based on the old but very fast RWI index. In portal or intranet mode the RWI index is usually switched off; if you like to have statistics again then you must switch on the rwis again in this mode. - fixed many bugs regarding correct page counter	12 years ago
Michael Peter Christen	35fa718b77	testing to use solr for portalsearch caused some bugfixing but no full success: try to comment out the solr search request in yacy-portalsearch.js	12 years ago
Michael Peter Christen	008288719c	fix for schema export to consider also automatically generated coordinate fields	12 years ago
Michael Peter Christen	089dee1770	- generalized SchemaConfiguration into super-class Configuration and adopted other classes which used the configuration-only access for that class - removed many warnings - adjusted logging	12 years ago
Michael Peter Christen	c16de49f64	fix for webgraph delete query	12 years ago
Michael Peter Christen	56d5946a59	- added flags in IndexFederated_p.html to switch on or off the webgraph index (new solr core webgraph) .. this is now off by default - completely redesigned this servlet - added description how to attach a remote solr - adjusted naming of servlet and menues - moved 'lazy initialization' attribut from IndexSchema to IndexFederated (this is a general option) back again.	12 years ago
Michael Peter Christen	461d46101d	- Removed log4j from libraries. This can be removed because the package log4j-over-slf4j is there. From slf4j all loggings are routed to the jdk logger. Now all loggings are consistently done to the jdk logger. - added some lines to the logging properties to suppress many solr logging statements. The number of the logging entries had already become a performance issue, therefore removing these from the log should increase performance.	12 years ago
Michael Peter Christen	b349c8145b	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
orbiter	253a7aee88	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
orbiter	36f9b0fc16	updated wstx-asl to 3.2.9	12 years ago
Michael Peter Christen	14cceb6b17	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Conflicts: htroot/IndexFederated_p.html source/net/yacy/cora/federate/solr/YaCySchema.java source/net/yacy/peers/Protocol.java source/net/yacy/search/Switchboard.java source/net/yacy/search/index/Segment.java also moved portalsearch-dev to yacy-portalsearch to be able to fix problems with new attachment to solr of the search widget	12 years ago
Michael Peter Christen	58e1e6fa2b	fixes to schema	12 years ago
reger	f291d60c5f	on remote Solr search take only locally enabled schema fields from remote solrdocument for the inputdocument added to local index	12 years ago
reger	d31a109efe	remove obsolete Solr "commit within" input field from IndexFederated see `4111606654`	12 years ago
Michael Peter Christen	788288eb9e	added the generation of 50 (!!) new solr field in the core 'webgraph'. The default schema uses only some of them and the resting search index has now the following properties: - webgraph size will have about 40 times as much entries as default index - the complete index size will increase and may be about the double size of current amount As testing showed, not much indexing performance is lost. The default index will be smaller (moved fields out of it); thus searching can be faster. The new index will cause that some old parts in YaCy can be removed, i.e. specialized webgraph data and the noload crawler. The new index will make it possible to: - search within link texts of linked but not indexed documents (about 20 times of document index in size!!) - get a very detailed link graph - enhance ranking using a complete link graph To get the full access to the new index, the API to solr has now two access points: one with attribute core=collection1 for the default search index and core=webgraph to the new webgraph search index. This is also avaiable for p2p operation but client access is not yet implemented.	12 years ago
Michael Peter Christen	89ede0fe84	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	91a0401d59	introduced a second core named 'webgraph'. This core will hold the link structure, but is not filled yet. To have the opportunity of a second core, multi-core functionality had to be implemented to the deep-embedded solr: - migrated the solr_40 directory content to a subdirectory 'collection1'; the previously used default core is now called collection1 - added solr_40/webgraph subdirectory as second core - added a servlet configuration for the second core 'webgraph' in /IndexSchema_p.html - added instance handling as addition to solr connections: all solr connectors are now instances of an solr 'instance' object; this required a complete re-design of the solr embedding - migrated also caching and sharding ontop of new instance handling - migrated the search apis to handle now the access to a specific core, the default core named 'collection1' - migrated the remote solr search interface to access shards of cores; for the yacy remote search the default core is now called 'solr'; using the peer address as solr address - migrated the solr backup and restore process: old backups cannot be used after this migration! - redesign of solr instance handling in all methods which access the instances: they cannot hold copies of these instances any more; the must retrieve the actuall connection object every time they want to write to it (this solves also some bugs when switching the index/network) - added another schema 'solr.webgraph.schema', the old solr.keys.list is replaced by solr.collection.schema	12 years ago
reger	1951ba61ae	remove CPGEN from Windows batch files (classpath for all needed libraries is defined in manifest of yacycore.jar)	12 years ago
orbiter	594ed63f2a	fixed interactive search which caused an error if pubDate is not present in a search result	12 years ago
Michael Peter Christen	33bc255e85	prevent that crawl starts with very large url lists cause a time-out in the user front-end	12 years ago
Michael Peter Christen	98a4a4aa97	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	b6de1f42dc	Full redesign of solr connection architecture. This was done to support multiple solr cores instead of just one. Therefore it is now necessary to distuingish between solr server connections (called an 'Instance') and a connection to a single solr core. One Instance may now have multiple connector classes assigned to it, each connecting to a single core. To support multiple cores it is also necessary to distinguish between the connection configuration and the configuration of the index schema. We will have multiple schema configurations in the future, each for every solr core. This caused that the IndexFederated servlet had to be split into two parts, the new Servlet for the Schema editor is now in the IndexSchema Servlet.	12 years ago
Marc Nause	efb6cf7d21	Merge branch 'master' of git@gitorious.org:yacy/rc1.git	12 years ago
Marc Nause	ce5b7afab2	) removed Skype online indicator (was not working anymore) ) updated ICQ URLs	12 years ago
Michael Peter Christen	4111606654	removed the commitWithin attribute because that is not the way how the index is updated the right way for us. May also be be superfluous with the solr 4.0 softcommit.	12 years ago
Michael Peter Christen	c20fa3640d	fix to unbalanced tag and license for null objects	12 years ago
Michael Peter Christen	3a6097966d	added jsonp option to yjson result writer	12 years ago
Michael Peter Christen	de58043205	Added image license generation for solr image search results when results are generated within yjson result writer. This makes it possible to view images in yacyinteractive from solr.	12 years ago
Michael Peter Christen	d3508fa8ff	fixed json search, quotes, auto-facets, urls etc. for yacyinteractive.html	12 years ago
Michael Peter Christen	1db23e9eac	Moved methods from SolrServerConnector to AbstractSolrConnector with the result that most of these methods become superfluous in other classes. This is a generalization step towards multi-indexes in Solr.	12 years ago
Michael Peter Christen	02fa31b5bf	better filesearch layout	12 years ago
Michael Peter Christen	e55ec3071d	reduced number of facets in yacyinteractive (only filetype necessary)	12 years ago
Michael Peter Christen	16d90859b7	reverted put-semantics back to as-usual in serverObjects and introduced an add-method to put in several objects for the same key	12 years ago
Michael Peter Christen	0d888ff69e	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	c34af7fe94	extended JSON Response Writer and Opensearch Response Writer for the Solr search interface in such way that it is possible to use this interface for the yacyinteractive search. This search interface is now much faster using the Solr search directly. For the Solr interface it was necessary to create a translation from the YaCy search modifiers to the Solr facet selection. This was added in such a way that it becomes generic for the normal YaCy search and as a on-top evaluation for Solr queries.	12 years ago
reger	c37d718f16	make sure yacy.running is deleted if not running (catch exception) - to prevent following log if YaCy was previously not properly shutdown E ... STARTUP WARNING: the file C:\src\git\yacy-rc1\DATA\yacy.running exists, this usually means that a YaCy instance is still running E ... STARTUP FATAL ERROR: java.util.concurrent.TimeoutException java.util.concurrent.ExecutionException: java.util.concurrent.TimeoutException at net.yacy.cora.protocol.TimeoutRequest.call(TimeoutRequest.java:91) at net.yacy.cora.protocol.TimeoutRequest.ping(TimeoutRequest.java:112) at net.yacy.yacy.startup(yacy.java:200) at net.yacy.yacy.main(yacy.java:638) Caused by: java.util.concurrent.TimeoutException - adjust Netbeans path (to solr4.1.jars)	12 years ago
Michael Peter Christen	762b687e47	extended the serverObjects to be able to hold multipel values for a single key. This is done using the solr class MultiMapSolrParams. That class is needed in the OpensearchResultWriter to get multiple facet requests.	12 years ago
Michael Peter Christen	d70d99fab5	added more metadata fields and facets to OpensearchResponseWriter. This should make it possible to replace the original and enriched yacy opensearch result with a solr output in opensearch format.	12 years ago
Michael Peter Christen	6a4878940b	fix in html parser and bookmark generation	12 years ago
Michael Peter Christen	51e7ab4f70	moved bookmarks back to more prominent location (even if this does not fit to the 'Search Interfaces' headline)	12 years ago
Michael Peter Christen	dee8b24d3c	better error handling for bookmarks	12 years ago
Michael Peter Christen	e1da39245a	when searching the network, do not search on robinson peers with the old DHT search interface. Now use the solr interface.	12 years ago

... 3 4 5 6 7 ...

9544 Commits (0c1a018bbde9c9e67bc000b6a3fd8dbd1706a6f3) All Branches Search

9544 Commits (0c1a018bbde9c9e67bc000b6a3fd8dbd1706a6f3)

All Branches