Michael Peter Christen
fdcd4e6a6f
fixes to index deletion: quoting of host name (a '-' may be part of the
...
url) and disabling the engage button when changing the url field at
'Delete by URL matching'
12 years ago
reger
d367b1f4d9
add null pointer check to stopword fix
12 years ago
reger
7480e87386
- fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247
...
- append language setting specific stopword list
- remove unused OVERHANG stack type
12 years ago
orbiter
5c7ddc67fe
in GSA api enable usage of solr fq-attribute together with GSA
...
site-attribute
12 years ago
Michael Peter Christen
9fc0c4df98
fix for bad exists 'enhancement'; see bug:
...
http://bugs.yacy.net/view.php?id=245
12 years ago
reger
9ef1fd9bac
fix: enable use of solrcore.properties for property substitution of solrconfig.xml
12 years ago
reger
8a7fcb391d
enable use of solrcore.properties for property substitution of solrconfig.xml
...
- move setting of system property solr.directoryFactory=solr.MMapDirectoryFactory to solrcore.properties
- add check of os.arch for 64bit system, if it fails use default/solrcore.x86.properties (if exists) as solrcore.properties
reason: on 32bit MMapDirectoryFactory may fail with.....
Caused by: java.io.IOException: Map failed
at sun.nio.ch.FileChannelImpl.map(FileChannelImpl.java:849)
at org.apache.lucene.store.MMapDirectory.map(MMapDirectory.java:283)
12 years ago
Michael Peter Christen
f7e887bf49
added missing class
12 years ago
Michael Peter Christen
eb9d0ba5b1
ranking and boost function update, small bugfixes, better default search
...
field for solr
12 years ago
Michael Peter Christen
5f92c68f1f
removed block rank ranking and all YBR files in /ranking
12 years ago
Michael Peter Christen
164603b946
cleanup
12 years ago
Michael Peter Christen
ba793a32c0
added timeout for remote searches of 10 seconds
12 years ago
Michael Peter Christen
1c4c1c0345
try to commit in case of failure which hopefully frees up some RAM
12 years ago
Michael Peter Christen
409d6edf53
Store node/solr search threads to be able to send them an interrupt
...
signal in case that a cleanup process wants to remove the search
process. Added also a new cleanup process which can reduce the number of
stored searches to a specific number which can be higher or lower
according to the remaining RAM. The cleanup process is called every time
a search ist started.
12 years ago
Michael Peter Christen
2a8b99ea82
remove text_t in search result after snippet has been computed to save
...
space in search result cache
12 years ago
Michael Peter Christen
a1644ca0fd
new workflow processor in Segment to enqueue indexing documents to solr
12 years ago
Michael Peter Christen
a8dc4346e8
default configuration of MMapDirectoryFactory for solr, increased lock
...
timeout, less documents from remote searches (too many results had
easily blocked a peer)
12 years ago
Michael Peter Christen
0c1a018bbd
removed 'later' tactic because it used too much RAM, reduced number of
...
soft commits, reduced caching size of search events, ensured that solr
results are processed before connection is closed to keep that stuff not
too long in RAM
12 years ago
Michael Peter Christen
5344a1c5f7
getting the trash out
12 years ago
Michael Peter Christen
709e9b8ce7
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
9e07447d47
added new link for SMW
12 years ago
Michael Peter Christen
3c04dd11de
removed dead link
12 years ago
Michael Peter Christen
1eb9626cca
less logging
12 years ago
Michael Peter Christen
536fd1450e
added new keys for update locations
12 years ago
Michael Peter Christen
281959a2d7
added option to re-boot the embedded solr during run-time. Added also
...
API recording for this method so it can be repeated automatically. The
index dump generation is now also available for API recording. Added
some synchronization in backend which was necessary for this.
12 years ago
Michael Peter Christen
80a7989e8c
fixed ClassCastException: [Ljava.lang.Object; cannot be cast to
...
[Ljava.util.List; in robots.txt servlet
12 years ago
orbiter
da621e827e
prevent NPE in case RWI is disabled
12 years ago
Michael Peter Christen
c2bcfd8afb
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
67757b425a
use a retry handler with retryCount=0 because we usually expect requests
...
to fail if we access non-permanently available resources (peers, web
pages) and want to fail fast without repeating the same request which is
doomed to fail. The previous appearance of http client connection had a
1-2-4-8-second timeout scheme, which caused that connection attempts
lasted for 16 seconds.
12 years ago
Michael Peter Christen
7300d81f40
include API Table deletion requests to the API recorder
12 years ago
Michael Peter Christen
c2b1075dcf
activating pollImmediately in case that DHT receive is off. This will
...
cause a much faster search result when running in public robinson mode.
12 years ago
Michael Peter Christen
d2ade87b49
fixed missing thisaddress in yacysearch.html which caused that the
...
opensearch link was not working
12 years ago
Michael Peter Christen
179d032181
added a (badly formatted) delete button for process scheduler entries
12 years ago
orbiter
888a985dc6
set a higher limit for table copy usage
12 years ago
Michael Peter Christen
2b563debbf
javadoc of new multiple-exist test
12 years ago
reger
c03f75ebc3
fix DHT url receive see http://bugs.yacy.net/view.php?id=242
12 years ago
Marc Nause
8fb1b1e290
*) simplified banner creation code
12 years ago
Marc Nause
cd0b5f31b4
*) updated links to description of regex
12 years ago
Michael Peter Christen
8f2d3ce2f9
reduced locking situation in crawler: shifted synchronized location and
...
reduced time-out of robots.txt load limit
12 years ago
Michael Peter Christen
f93501e6e0
nice crawl name if crawl is started with file:// (was: null)
12 years ago
Michael Peter Christen
b4f0cac102
added the reindexing job servlet to the submenu structure
12 years ago
reger
97ab5b90e8
- odt & ooxml (office document) parser correction to add content to fulltext index
...
- adjust Junit yacyVersionTest & ParserTest
- update yacyVersion.combined2prettyVersion to the default 4-digit minor ver.
12 years ago
Michael Peter Christen
b68fbe7d21
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
...
Conflicts:
source/net/yacy/migration.java
12 years ago
Michael Peter Christen
06d3063dc9
- no downcase when using collection modifier
...
- removed warnings
12 years ago
Michael Peter Christen
8dbc80da70
redesign of index.exist-test: this shall now not be done using a single
...
id to be tested, but with a collection of ids. This will cause only a
single call to solr instead of many. The result is a much better
performace when testing the existence of many urls. The effect should
cause very much less IO during index transmission, both on sender and
receiver side.
12 years ago
reger
7f63d3747d
more generic field selection for reindex option of documents with disabled fields
...
using Luke request to compare config with actual fields in index
12 years ago
Michael Peter Christen
c91c67c3cd
reject bad solr requests
12 years ago
Michael Peter Christen
44e363f37f
refactoring of WorkflowProcessor, added process counter, update of
...
process counter if an blocking thread dies. Added also a new column in
PerformanceConcurrency_p servlet to show the actual number of concurrent
processes.
12 years ago
Michael Peter Christen
4058369288
fixed query expressions for collection selection (added quotes)
12 years ago
Michael Peter Christen
f2e36fbd06
enhanced deletion process for very large number of documents
12 years ago