Michael Peter Christen
31902f54df
fix for NPE which happens within solr code at MultiMapSolrParams.java,
...
line 52 in case that the array arr.length == 0
11 years ago
Michael Peter Christen
5b7c0d0745
update to pdfbox 1.8.2
11 years ago
Michael Peter Christen
f13df9dbb6
migration to solr 4.4.0
11 years ago
Michael Peter Christen
dc1002e511
cleaned sourcepaths from eclipse classpath
11 years ago
Michael Peter Christen
1b09362949
next development cycle
11 years ago
Michael Peter Christen
58fe986cca
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
cf12835f20
replaced the single-text description solr field with a multi-value
...
description_txt text field
11 years ago
sixcooler
7d53ac86a3
fix for Blacklist (-Administration)
11 years ago
reger
f2d99053ed
Field Re-Indexing: prevent endless error loop in ReindexSolrBusyThread on Solr exception (by skipping query causing the exception)
...
(occured during testing while working on q=store:[* TO *])
11 years ago
reger
92d3f71b16
htmlParser: closes input stream -> changed it to leave it open for a reset (used by AugmentParser - even if this is practically not used),
...
note: stream.close is done by caller (Textparser.parseSource)
- removed unnecessary reset in AugmentParser
- added stream.mark in tdfatripleimpl. to make stream.reset work here
11 years ago
orbiter
f117ea0492
reverted start script options - yacy on windows did not start with the
...
given values
11 years ago
orbiter
87cfeaa4f3
fix for npe
11 years ago
orbiter
268a36aaff
emergency fix for crawler: this will otherwise cause loss of complete
...
crawl queue if latency of remote system is too low
11 years ago
orbiter
743e4878a8
Release 1.6
11 years ago
orbiter
e7fcb81cea
we should not do too much greedylearning at this time as we don't have
...
enough experience with it. set greedylearning.limit.doccount to a much
lower limit.
11 years ago
orbiter
d05e0c5368
wait a bit longer before doing the first peer ping
11 years ago
orbiter
f425b2c61c
re-try to fetch url after a soft commit
11 years ago
orbiter
b8f57f7703
don't be noisy when doing background tasks that may be allowed to fail
11 years ago
orbiter
bf0ad04e1b
apply load limitation also to dht-in
11 years ago
Roland Haeder
0343f0668c
Fix for NPE:
...
E 2013/07/26 20:29:29 BUSYTHREAD Runtime Error in
serverInstantThread.job, thread
'net.yacy.search.Switchboard.cleanupJob': null; target exception: null
java.lang.NullPointerException
at
net.yacy.search.schema.CollectionConfiguration.convergenceStep(CollectionConfiguration.java:1116)
at
net.yacy.search.schema.CollectionConfiguration.postprocessing(CollectionConfiguration.java:897)
at net.yacy.search.Switchboard.cleanupJob(Switchboard.java:2296)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at
net.yacy.kelondro.workflow.InstantBusyThread.job(InstantBusyThread.java:107)
at
net.yacy.kelondro.workflow.AbstractBusyThread.run(AbstractBusyThread.java:165)
Conflicts:
source/net/yacy/search/schema/CollectionConfiguration.java
11 years ago
Roland Haeder
b58ca8622d
Some cleanups:
...
- added SKINS_PATH_DEFAULT as same as LISTS_PATH_DEFAULT was added
- Added 'final' keyword to a string
11 years ago
Roland Haeder
e2ee412160
Use SwitchboardConstants.LISTS_PATH_DEFAULT instead of 'DATA/LISTS'
...
Conflicts:
htroot/api/blacklists_p.java
11 years ago
Roland Haeder
ae19401af0
Removed another duplicate occurance of Blacklist.BLACKLIST_FILENAME_FILTER
11 years ago
Roland Haeder
59225487ea
Fix for blacklist export, also applied the filename filter here
11 years ago
Roland Haeder
952fc0e7bd
Removed superfluous check for files ending '.black' as the previous commit already excluded all other files (e.g. .ser dumps), added logging in catch-all block
11 years ago
Roland Haeder
060fec1577
Reuse Blacklist.BLACKLIST_FILENAME_FILTER
11 years ago
Roland Haeder
29049c71f5
Possible fix for ticket http://bugs.yacy.net/view.php?id=270 , the filter for only including *.black must be applied
11 years ago
Roland Haeder
7263bb82fb
Fix for NPE on shutdown:
...
java.lang.NullPointerException
at net.yacy.search.Switchboard.storeDocumentIndex(Switchboard.java:2732)
at net.yacy.search.Switchboard.access00(Switchboard.java:207)
at net.yacy.search.Switchboard.run(Switchboard.java:3049)
11 years ago
Roland Haeder
13433d41a1
Log this exception better
...
Conflicts:
source/net/yacy/kelondro/blob/Tables.java
11 years ago
orbiter
080d80c9de
do not write an empty failreason in case that there is no fail. Because
...
of the lazy instantiation rule this value was not actually written, but
if lazy instantiation is switched on, then this causes that all crawl
starts delete all crawl-start-hosts completely because this looks for
filled error reasons.
11 years ago
Michael Peter Christen
4c242f9af9
always use a default value for boolean options to have transparency for
...
the outcome if the attribute is missing in servlets
11 years ago
Michael Peter Christen
61e015268b
fix in forced deletion: forced commit needed
11 years ago
Michael Peter Christen
83e2921b39
new test case for http://bugs.yacy.net/view.php?id=141
11 years ago
Michael Peter Christen
304aacb2cc
fix for http://bugs.yacy.net/view.php?id=267
11 years ago
Michael Peter Christen
c3b2301b2f
fix for http://bugs.yacy.net/view.php?id=268
11 years ago
reger
aa1a1f1d2c
- small adjustment to make sure genericParser is tried last
...
-- for some documents genericParser grabs document instead of specific available parser due to unordered pick of 1st to try parser
(like .ps .rdf files and other)
- remove redundant file extension registration
11 years ago
orbiter
3e901dcb06
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
f50b596e0b
do not run dht ditribution if system load is over 2.5
11 years ago
orbiter
9c681cc00d
added segment sizes, postprocessing status and cpu load to crawler
...
monitor
11 years ago
orbiter
86b514cf46
added load info to status_p.xml
11 years ago
orbiter
056b42f5aa
- added information about segment count to status_p.xml
...
- also moved this information from the old index structure, which is
still in use for the RWI/DHT index to that front-end
11 years ago
orbiter
6fb2811e68
fixes for problems with remote solr and non-activated webgraph index
11 years ago
sixcooler
af740f3058
changed optimization to a segment-size of index-size/5.000.000
...
+ one if not idle
+ one (and force) if postprocessing
11 years ago
Michael Peter Christen
336f86394c
replaced StringBuffer with StringBuilder
11 years ago
Michael Peter Christen
aeac2fb763
replaced more containsKey() -> get() usages by a simple get(), followed
...
by a test for NULL. This should increase the application speed and
reduces the lookup time for the affected methods by 50%
11 years ago
orbiter
5364c4dcc9
delayed first peer-ping to send the first ping out after the http got
...
up; if the ping comes before the http is up, it cannot be recognized as
senior peer (if at all). See also: http://bugs.yacy.net/view.php?id=266
11 years ago
orbiter
e24016e30a
added the property federated.service.solr.indexing.timeout to yacy.init
...
to provide a configurable time-out for solr; see also:
http://bugs.yacy.net/view.php?id=254
11 years ago
orbiter
c124037f19
removed forced non-soft commits to prevent index fragmentation
11 years ago
Michael Peter Christen
31483c47e1
fixed problem with remote luke requests
11 years ago
Michael Peter Christen
c15aa758dc
removed failreason_t removal patch because that causes too much
...
confusion using an external solr. to clean up the index after a schema
change, use the index cleaner function from the online servlet
11 years ago