yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	fdaeac374a	- enhanced postprocessing speed and memory footprint (by using HashMaps instead of TreeMaps) - enhanced memory footprint of database indexes (by introduction of optimize calls) - optimize calls shrink the amount of used memory for index sets if they are not changed afterwards any more	11 years ago
reger	ba49ff81ed	little more verbose proxy 403 error message	11 years ago
Michael Peter Christen	d325cb8912	fixes and enhancements for postprocessing	11 years ago
Michael Peter Christen	7c1b968378	another fix for the shutdown exceptions	11 years ago
orbiter	133d41386c	(again) full redesign of ConcurrentUpdateSolrConnector to remove out-of-order transactions regarding add and delete operations. Now all operations (add and delete) are executed concurrently in-order.	11 years ago
Michael Peter Christen	a632b0d2a4	added a forced commit to index deletion to enable synchronized index updates	11 years ago
Michael Peter Christen	1d069c5861	make sure that postprocessed documents are overwritten	11 years ago
Michael Peter Christen	0d2342575e	Merge branch 'master' of ssh://gitorious.org/yacy/rc1	11 years ago
Michael Peter Christen	3cc5c0ffdd	a concurrency enhancement which was not used because tests showed worse indexing speed. I leave the code there since it may be useful in SolrCloud environments.	11 years ago
Michael Peter Christen	e644981697	added one more postprocessing low memory check	11 years ago
reger	5e645f4449	Merge origin/master	11 years ago
reger	3b89176b9f	use config value htroot in Jetty init (was hardcoded) - move htroot exist check from old httpdfilehandler to startup, remove from filehandler and legacy proxyhandler - use SwitchboardConstant.htroot where appropriate	11 years ago
Michael Peter Christen	e1bf65c892	added short memory protection during postprocessing	11 years ago
Michael Peter Christen	90b47e83e6	fixed shutdown error when closing solr connectors	11 years ago
Michael Peter Christen	7640834b37	removed double concurrency to put Solr documents into the index. The writings to the solr index are also buffered in ConcurrentUpdateSolrConnector	11 years ago
Michael Peter Christen	0f6b72f24b	do not use luke requests for remote solr servers if the result is different from normal requests. This happens if the remote solr is actually a solrCloud; in such cases the luke request returns only the result of the single solr peer, not the whole cloud. also done: some refactoring.	11 years ago
Michael Peter Christen	c57026e242	recover from OOM	11 years ago
Michael Peter Christen	907db8b7a6	fix for bad query shortcut hack	11 years ago
Michael Peter Christen	a2b66fe2eb	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	9f6be762a6	- better logging for postprocessing - fixed collection bug in postprocessing	11 years ago
orbiter	da5d4128bf	prevent npe	11 years ago
orbiter	a878c7982c	prevent npe	11 years ago
orbiter	e4eb87d924	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
orbiter	ced1a96f9c	fixed error cache	11 years ago
reger	3ba81bd08a	Merge origin/master	11 years ago
reger	4d896383db	fix: use timeout = proxy.ClientTimeout in ProxyHandler (was 10sec fix) see http://bugs.yacy.net/view.php?id=236	11 years ago
orbiter	cfb647db6e	- introduced a miss cache in ConcurrentUpdateSolrConnector - better usage of cache - bugfix for postprocessing	11 years ago
orbiter	a87d8e4a8e	changed caching of ConcurrentUpdateSolrConnector: it caches now also the url along with the load date. While this takes much more memory, it eliminates database lookups for getURL() requests, which happen equally often. This speeds up remote solr configurations.	11 years ago
orbiter	f6e441dd77	refactoring	11 years ago
orbiter	76c53faeb2	removed unused code (HostStat)	11 years ago
orbiter	d3a88eaecb	introducing ConcurrentUpdateSolrServer for remote solr servers. Scaling of write buffers and update queue size is made according to assigned memory.	11 years ago
reger	809e976578	remove unused java imports form yacy.java	11 years ago
reger	a9b06f8719	add a -config command line parameter e.g. -config "port=9090" "port.ssl=8043" - useful for remote installation to set any config file property - multipe parameter can be set at once, on Windows enclose parameter in doublequotes - special handling "adminAccount=adminuser:adminpwd" sets adminusername and md5 encoded admin-pwd - adjusted windows startbatch to allow command line parameter handling - remove not needed classpath calculation from startYACY_debug.bat	11 years ago
reger	0923b09216	fix: allow 4 character admin user name (was min 5 char)	11 years ago
Michael Peter Christen	254a7ac66c	fixed cleaning of index	11 years ago
Michael Peter Christen	28a7b42e6b	removed warning "sun.misc.BASE64Encoder is internal proprietary API and may be removed in a future release"	11 years ago
Michael Peter Christen	046f5a03cb	one more SolrIndexSearcher bugfix	11 years ago
sixcooler	78c01b3eff	fix for 'AlreadyClosedException: this IndexReader is closed'	11 years ago
Michael Peter Christen	1b5e3d523a	better control over close-state of remote solr connections	11 years ago
Michael Peter Christen	1a364572a5	fix for "org.apache.solr.core.SolrCore Too many close [count:-1] on org.apache.solr.core.SolrCore@51af7c57" -error	11 years ago
Michael Peter Christen	69391e5d9e	changed strategy to test existence of documents in Solr: using the update time. The reason for that is a better caching for the crawler double-check, which needs the update time for crawler steering.	11 years ago
Michael Peter Christen	790f103f32	delete fail-docs during postprocessing to prevent that they will appear again and stay in postprocessing forever.	11 years ago
Michael Peter Christen	ff656ce860	explicit call to optimize to add a expungeDeleted flag	11 years ago
Michael Peter Christen	9eb668e951	enhanced the resource observer The resource observer is now able to recognize free disk space AND available space for YaCy. The amount of space which is assigned for YaCy are defined in new settings in the configuration file. Furthermore, there is now a cleanup process which deletes files in case that an autodelete is activated. The autodelete is now BY DEFAULT ON if the disk space is low, which means that YaCy starts to delete documents when the disk is full!	11 years ago
Michael Peter Christen	fbee98c06f	fixed shortcut self-reference bug	11 years ago
Michael Peter Christen	e7a29a2851	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	bf97e38b83	removed clearURLIndex, which is a stub remaining from the old metadata database and not needed any more	11 years ago
orbiter	14764632b5	clear solr caches in case that an exception occurrs. The reason behind this hack is the occurrence of Exceptions like: W 2014/02/11 18:51:33 ConcurrentLog GC overhead limit exceeded java.io.IOException: GC overhead limit exceeded at net.yacy.cora.federate.solr.connector.AbstractSolrConnector.getDocumentById(AbstractSolrConnector.java:334) at net.yacy.cora.federate.solr.connector.MirrorSolrConnector.getDocumentById(MirrorSolrConnector.java:173) at net.yacy.cora.federate.solr.connector.ConcurrentUpdateSolrConnector.getDocumentById(ConcurrentUpdateSolrConnector.java:415) at net.yacy.search.index.Fulltext.getMetadata(Fulltext.java:331) at net.yacy.search.index.Fulltext.getMetadata(Fulltext.java:317) at net.yacy.search.query.SearchEvent.pullOneRWI(SearchEvent.java:1024) at net.yacy.search.query.SearchEvent.pullOneFilteredFromRWI(SearchEvent.java:1047) at net.yacy.search.query.SearchEvent$3.run(SearchEvent.java:1263) Caused by: java.lang.OutOfMemoryError: GC overhead limit exceeded at java.util.Arrays.copyOfRange(Arrays.java:3077) at java.lang.StringCoding.decode(StringCoding.java:196) at java.lang.String.<init>(String.java:491) at java.lang.String.<init>(String.java:547) at org.apache.lucene.codecs.compressing.CompressingStoredFieldsReader.readField(CompressingStoredFieldsReader.java:187) at org.apache.lucene.codecs.compressing.CompressingStoredFieldsReader.visitDocument(CompressingStoredFieldsReader.java:351) at org.apache.lucene.index.SegmentReader.document(SegmentReader.java:276) at org.apache.lucene.index.BaseCompositeReader.document(BaseCompositeReader.java:110) at org.apache.lucene.index.IndexReader.document(IndexReader.java:436) at org.apache.solr.search.SolrIndexSearcher.doc(SolrIndexSearcher.java:657) at net.yacy.cora.federate.solr.connector.EmbeddedSolrConnector.SolrQueryResponse2SolrDocumentList(EmbeddedSolrConnector.java:230) at net.yacy.cora.federate.solr.connector.EmbeddedSolrConnector.getDocumentListByParams(EmbeddedSolrConnector.java:320) at net.yacy.cora.federate.solr.connector.AbstractSolrConnector.getDocumentById(AbstractSolrConnector.java:330) ... 7 more This problem was analysed with the Eclipse Memory Analyser after a heap dump, where the following problem was reported as the main Problem Suspect: One instance of "org.apache.solr.util.ConcurrentLRUCache" loaded by "sun.misc.Launcher$AppClassLoader @ 0x42e940a0" occupies 902.898.256 (61,80%) bytes. The memory is accumulated in one instance of "java.util.concurrent.ConcurrentHashMap$Segment[]" loaded by "<system class loader>". This memory is part of the result cache of Solr. Flushing this cache appears the most appropriate solution to that problem.	11 years ago
Michael Peter Christen	bc28247089	Added methods in resource observer to calculate the available and the occupied disc space. These values are also shown on the status page. The disc space calculation shall be used for a disk-limitation of the search index.	11 years ago
Michael Peter Christen	0dda979801	adopted network image drawing to increased number of peers	11 years ago

1 2 3 4 5 ...

6942 Commits (fdaeac374a0b804684e9aa9b4f50020b363dbf87)