Michael Peter Christen
47b1c81d08
- refactoring
...
- generalized writing of url attributes to solr documents
- added more url attributes to error documents
12 years ago
Michael Peter Christen
697613170d
less logging for postprocessing (this was a debugging logging with high
...
CPU load)
12 years ago
reger
a5019bc470
make Vocabulary Navigator tags a hard result entry filter
...
by checking vocabulary tags also for rwi results (currently a filter is applied to the solr query)
TODO: as vocabularies are only locally valid, auto-switch to Searchdom.LOCAL could be considered.
12 years ago
reger
a67a4b7d86
improve tld: query modifier filter pattern (to prevent tld:net accepting www.abcinet.org)
12 years ago
reger
02fe8b43ba
Field Re-Indexing: display list of fields in reindex queue
...
change servlet to display statistic on 1st click (instead after refresh)
12 years ago
sixcooler
7f501b7c38
clear some caches before reporting low Memory
...
do not break lines in Network-table-rows
12 years ago
Michael Peter Christen
2857499467
fix to collection schema; bug appeared for _txt fields with empty String
...
as content
12 years ago
Michael Peter Christen
58fe986cca
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
cf12835f20
replaced the single-text description solr field with a multi-value
...
description_txt text field
12 years ago
reger
f2d99053ed
Field Re-Indexing: prevent endless error loop in ReindexSolrBusyThread on Solr exception (by skipping query causing the exception)
...
(occured during testing while working on q=store:[* TO *])
12 years ago
orbiter
d05e0c5368
wait a bit longer before doing the first peer ping
12 years ago
orbiter
b8f57f7703
don't be noisy when doing background tasks that may be allowed to fail
12 years ago
Roland Haeder
0343f0668c
Fix for NPE:
...
E 2013/07/26 20:29:29 BUSYTHREAD Runtime Error in
serverInstantThread.job, thread
'net.yacy.search.Switchboard.cleanupJob': null; target exception: null
java.lang.NullPointerException
at
net.yacy.search.schema.CollectionConfiguration.convergenceStep(CollectionConfiguration.java:1116)
at
net.yacy.search.schema.CollectionConfiguration.postprocessing(CollectionConfiguration.java:897)
at net.yacy.search.Switchboard.cleanupJob(Switchboard.java:2296)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at
net.yacy.kelondro.workflow.InstantBusyThread.job(InstantBusyThread.java:107)
at
net.yacy.kelondro.workflow.AbstractBusyThread.run(AbstractBusyThread.java:165)
Conflicts:
source/net/yacy/search/schema/CollectionConfiguration.java
12 years ago
Roland Haeder
b58ca8622d
Some cleanups:
...
- added SKINS_PATH_DEFAULT as same as LISTS_PATH_DEFAULT was added
- Added 'final' keyword to a string
12 years ago
Roland Haeder
7263bb82fb
Fix for NPE on shutdown:
...
java.lang.NullPointerException
at net.yacy.search.Switchboard.storeDocumentIndex(Switchboard.java:2732)
at net.yacy.search.Switchboard.access00(Switchboard.java:207)
at net.yacy.search.Switchboard.run(Switchboard.java:3049)
12 years ago
orbiter
080d80c9de
do not write an empty failreason in case that there is no fail. Because
...
of the lazy instantiation rule this value was not actually written, but
if lazy instantiation is switched on, then this causes that all crawl
starts delete all crawl-start-hosts completely because this looks for
filled error reasons.
12 years ago
Michael Peter Christen
61e015268b
fix in forced deletion: forced commit needed
12 years ago
Michael Peter Christen
c3b2301b2f
fix for http://bugs.yacy.net/view.php?id=268
12 years ago
orbiter
3e901dcb06
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter
f50b596e0b
do not run dht ditribution if system load is over 2.5
12 years ago
orbiter
056b42f5aa
- added information about segment count to status_p.xml
...
- also moved this information from the old index structure, which is
still in use for the RWI/DHT index to that front-end
12 years ago
orbiter
6fb2811e68
fixes for problems with remote solr and non-activated webgraph index
12 years ago
sixcooler
af740f3058
changed optimization to a segment-size of index-size/5.000.000
...
+ one if not idle
+ one (and force) if postprocessing
12 years ago
orbiter
5364c4dcc9
delayed first peer-ping to send the first ping out after the http got
...
up; if the ping comes before the http is up, it cannot be recognized as
senior peer (if at all). See also: http://bugs.yacy.net/view.php?id=266
12 years ago
orbiter
e24016e30a
added the property federated.service.solr.indexing.timeout to yacy.init
...
to provide a configurable time-out for solr; see also:
http://bugs.yacy.net/view.php?id=254
12 years ago
orbiter
c124037f19
removed forced non-soft commits to prevent index fragmentation
12 years ago
Michael Peter Christen
c15aa758dc
removed failreason_t removal patch because that causes too much
...
confusion using an external solr. to clean up the index after a schema
change, use the index cleaner function from the online servlet
12 years ago
Roland Haeder
be0ff6018f
Removed trailing spaces + some more final
12 years ago
Roland Haeder
841a28ae76
Added 'final' for all exception blocks as this helps the Java compiler
...
to optimize memory usage
Conflicts:
source/net/yacy/search/Switchboard.java
12 years ago
Michael Peter Christen
89c0aa0e74
added collection_sxt to error documents
12 years ago
Michael Peter Christen
0df5195cb0
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
1fd006cc56
fixes using the embedded connector
12 years ago
orbiter
d0dc86cf3d
logging of deadlocks (if any) during cleanup process
12 years ago
Michael Peter Christen
c6a6f159e8
fix for crawl stack domain counter
12 years ago
Michael Peter Christen
93d1bac140
do a more frequent optimization, reduces IO after optimization
12 years ago
orbiter
290e24564b
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter
5533fc8e01
fix for bug 260
12 years ago
Michael Peter Christen
b79471ee67
grr
12 years ago
Michael Peter Christen
a79f288ac1
automatically running optimize on solr if user/search is idle for some
...
time
12 years ago
orbiter
a9c8046c87
do a light optimization at the end of a crawl postprocessing
12 years ago
orbiter
a548354c71
replaced type of solr schema object sku of text_en_splitting_tight by
...
string
12 years ago
orbiter
2f1ec8d4a2
npe fix
12 years ago
Michael Peter Christen
bcc623a843
refactoring of load_delay: this is a matter of client identification
12 years ago
orbiter
0d0b3a30f5
activate api actions after postprocessing of crawls
12 years ago
orbiter
2be456e7fb
added a postprocessing field into api/status_p.xml to show if the
...
postprocessing task is running at that time (status: busy) or not
(status:idle)
12 years ago
Michael Peter Christen
5878c1d599
- refactoring of log to ConcurrentLog:
...
jdk-based logger tend to block
at java.util.logging.Logger.log(Logger.java:476) in concurrent
environments. This makes logging a main performance issue. To overcome
this problem, this is a add-on to jdk logging to put log entries on a
concurrent message queue and log the messages one by one using a
separate process.
- FTPClient uses the concurrent logging instead of the log4j logger
12 years ago
Michael Peter Christen
a2c8116a8f
accept (but ignore) a '+' sign in front of search words
12 years ago
sixcooler
d5d8936f9d
For indexes that are changing rapidly in NRT situations, fcs (stands for
...
Field Cache per Segment) may be a better choice than the default fc.
(saves memory)
see: http://wiki.apache.org/solr/SimpleFacetParameters#facet.method
12 years ago
Michael Peter Christen
57ffdfad4c
added a crawl option to obey html-meta-robots-noindex. This is on by
...
default.
12 years ago
Michael Peter Christen
5a5d411ec0
new robots_i attribute fields
12 years ago