Commit Graph

2064 Commits (70ba74b23a8e470635c5f6fc1324e38898e696a3)

Author SHA1 Message Date
reger 02fe8b43ba Field Re-Indexing: display list of fields in reindex queue
11 years ago
sixcooler 7f501b7c38 clear some caches before reporting low Memory
11 years ago
reger b355dd52c6 Index Administration - Field Re-Indexing: exclude internal Solr _version_ field from obsolete field check
11 years ago
sixcooler 8a96140f92 fix / workaround for
11 years ago
Michael Peter Christen 2857499467 fix to collection schema; bug appeared for _txt fields with empty String
11 years ago
Michael Peter Christen dbfa865700 added a stub of a class for crawler redesign
11 years ago
Michael Peter Christen 76afcccaaf fix for default boolean post values: the default value MUST NOT be TRUE,
11 years ago
orbiter 252c525709 fixed feed api servlet and and enhanced RSSReader class
11 years ago
orbiter d38c3c14d8 fix for CGI test
11 years ago
Michael Peter Christen 31902f54df fix for NPE which happens within solr code at MultiMapSolrParams.java,
11 years ago
Michael Peter Christen f13df9dbb6 migration to solr 4.4.0
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
sixcooler 7d53ac86a3 fix for Blacklist (-Administration)
11 years ago
reger f2d99053ed Field Re-Indexing: prevent endless error loop in ReindexSolrBusyThread on Solr exception (by skipping query causing the exception)
11 years ago
reger 92d3f71b16 htmlParser: closes input stream -> changed it to leave it open for a reset (used by AugmentParser - even if this is practically not used),
11 years ago
orbiter 87cfeaa4f3 fix for npe
11 years ago
orbiter 268a36aaff emergency fix for crawler: this will otherwise cause loss of complete
11 years ago
orbiter d05e0c5368 wait a bit longer before doing the first peer ping
11 years ago
orbiter b8f57f7703 don't be noisy when doing background tasks that may be allowed to fail
11 years ago
Roland Haeder 0343f0668c Fix for NPE:
11 years ago
Roland Haeder b58ca8622d Some cleanups:
11 years ago
Roland Haeder 7263bb82fb Fix for NPE on shutdown:
11 years ago
Roland Haeder 13433d41a1 Log this exception better
11 years ago
orbiter 080d80c9de do not write an empty failreason in case that there is no fail. Because
11 years ago
Michael Peter Christen 4c242f9af9 always use a default value for boolean options to have transparency for
11 years ago
Michael Peter Christen 61e015268b fix in forced deletion: forced commit needed
11 years ago
Michael Peter Christen 83e2921b39 new test case for http://bugs.yacy.net/view.php?id=141
11 years ago
Michael Peter Christen 304aacb2cc fix for http://bugs.yacy.net/view.php?id=267
11 years ago
Michael Peter Christen c3b2301b2f fix for http://bugs.yacy.net/view.php?id=268
11 years ago
reger aa1a1f1d2c - small adjustment to make sure genericParser is tried last
11 years ago
orbiter 3e901dcb06 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter f50b596e0b do not run dht ditribution if system load is over 2.5
11 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
11 years ago
orbiter 6fb2811e68 fixes for problems with remote solr and non-activated webgraph index
11 years ago
sixcooler af740f3058 changed optimization to a segment-size of index-size/5.000.000
11 years ago
Michael Peter Christen 336f86394c replaced StringBuffer with StringBuilder
11 years ago
Michael Peter Christen aeac2fb763 replaced more containsKey() -> get() usages by a simple get(), followed
11 years ago
orbiter 5364c4dcc9 delayed first peer-ping to send the first ping out after the http got
11 years ago
orbiter e24016e30a added the property federated.service.solr.indexing.timeout to yacy.init
11 years ago
orbiter c124037f19 removed forced non-soft commits to prevent index fragmentation
11 years ago
Michael Peter Christen 31483c47e1 fixed problem with remote luke requests
11 years ago
Michael Peter Christen c15aa758dc removed failreason_t removal patch because that causes too much
11 years ago
reger 2b7a38640a extend content type detection on file extension for .tif .tiff .htm
11 years ago
Michael Peter Christen ac1aad5064 added a getSegmentCount method and use it to disable optimize if wanted
11 years ago
Michael Peter Christen 36035e0a0a - used reger's LukeRequest to generalize the index info in
11 years ago
Michael Peter Christen 39fceb5ccf fix for NPE & bug #264
11 years ago
Michael Peter Christen 735a66eff3 enhancements to crawler
11 years ago
Roland Haeder be0ff6018f Removed trailing spaces + some more final
11 years ago
Roland Haeder aaedc0405d Fixes and avoid of catching bad exceptions (some):
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Felix Ableitner 03044589dd Fixed (?i) appearing in entries, fixed multiple equal lines in file.
11 years ago
Michael Peter Christen 89c0aa0e74 added collection_sxt to error documents
11 years ago
Michael Peter Christen 0df5195cb0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 1fd006cc56 fixes using the embedded connector
11 years ago
orbiter d0dc86cf3d logging of deadlocks (if any) during cleanup process
11 years ago
Michael Peter Christen c6a6f159e8 fix for crawl stack domain counter
12 years ago
Michael Peter Christen 93d1bac140 do a more frequent optimization, reduces IO after optimization
12 years ago
orbiter b71d13a014 added load and deadlock detector in Memory util
12 years ago
orbiter 290e24564b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 5533fc8e01 fix for bug 260
12 years ago
Michael Peter Christen b79471ee67 grr
12 years ago
Michael Peter Christen a79f288ac1 automatically running optimize on solr if user/search is idle for some
12 years ago
orbiter a9c8046c87 do a light optimization at the end of a crawl postprocessing
12 years ago
orbiter a548354c71 replaced type of solr schema object sku of text_en_splitting_tight by
12 years ago
orbiter 2f1ec8d4a2 npe fix
12 years ago
Michael Peter Christen bcc623a843 refactoring of load_delay: this is a matter of client identification
12 years ago
orbiter 0d0b3a30f5 activate api actions after postprocessing of crawls
12 years ago
orbiter 3978c5ca5d fix for http://bugs.yacy.net/view.php?id=255
12 years ago
orbiter 2be456e7fb added a postprocessing field into api/status_p.xml to show if the
12 years ago
orbiter dac88561ae minimum access time has a tight connection to ClientIdentification,
12 years ago
Michael Peter Christen 9a29ab469e another patch to prevent CLOSE_WAIT status on solr connections
12 years ago
Michael Peter Christen 5091d627bc fixed parsing of peer flags
12 years ago
Michael Peter Christen 87e9052081 added Connection:close to all http requests in our http client to
12 years ago
Michael Peter Christen 5c6946dd5f replaced usage of log4j by ConcurrentLog where possible
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
orbiter f4f6551c66 better handling of time-out at solrj in case that a commit is done in a
12 years ago
Michael Peter Christen 07261fe274 Merge remote-tracking branch 'nutomics/blacklist_structure'
12 years ago
Michael Peter Christen dea71851d2 - better concurrency for network scanner
12 years ago
Michael Peter Christen a34e137e27 fix for citation index generation in case that entry.referrerhash() is
12 years ago
Michael Peter Christen a2c8116a8f accept (but ignore) a '+' sign in front of search words
12 years ago
orbiter 9f0cc9b401 enhanced network scanner
12 years ago
sixcooler 308d73f855 do not use remote proxy if not switched on - regardless of the proto
12 years ago
sixcooler 69906b1d2e Revert "do not use remote proxy if not switched on - regardless of the proto"
12 years ago
sixcooler 20f452d228 do not use remote proxy if not switched on - regardless of the proto
12 years ago
sixcooler 9551720d5c re-enable saved setting for proxy-crawl-profile
12 years ago
sixcooler d5d8936f9d For indexes that are changing rapidly in NRT situations, fcs (stands for
12 years ago
Felix Ableitner 44f8fcf62e Changed class structure of Blacklist.
12 years ago
Michael Peter Christen 57ffdfad4c added a crawl option to obey html-meta-robots-noindex. This is on by
12 years ago
Michael Peter Christen 5a5d411ec0 new robots_i attribute fields
12 years ago
Michael Peter Christen fa08bd9d5a hack to prevent long waiting times in crawler
12 years ago
Michael Peter Christen f1c5338210 prepartion for greedy crawl profiles and refactoring
12 years ago
Michael Peter Christen e6f361f474 adding the canonical tag to crawl queues
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrival (Double.NaN check)
12 years ago
Michael Peter Christen 203921006a redesign of citation index storage
12 years ago
reger 83763ee4a4 jpeg parser: extract GPS location from meta data
12 years ago
Michael Peter Christen 32aa1d4569 removed unused option for queries
12 years ago
Michael Peter Christen 9d291764d1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
sixcooler e5abccdfe4 added optimize-option
12 years ago
Michael Peter Christen 64140f35cd fix for solr requests if no query part is given (prevent npe)
12 years ago