Michael Peter Christen
06d3063dc9
- no downcase when using collection modifier
...
- removed warnings
12 years ago
Michael Peter Christen
8dbc80da70
redesign of index.exist-test: this shall now not be done using a single
...
id to be tested, but with a collection of ids. This will cause only a
single call to solr instead of many. The result is a much better
performace when testing the existence of many urls. The effect should
cause very much less IO during index transmission, both on sender and
receiver side.
12 years ago
reger
7f63d3747d
more generic field selection for reindex option of documents with disabled fields
...
using Luke request to compare config with actual fields in index
12 years ago
Michael Peter Christen
c91c67c3cd
reject bad solr requests
12 years ago
Michael Peter Christen
44e363f37f
refactoring of WorkflowProcessor, added process counter, update of
...
process counter if an blocking thread dies. Added also a new column in
PerformanceConcurrency_p servlet to show the actual number of concurrent
processes.
12 years ago
Michael Peter Christen
4058369288
fixed query expressions for collection selection (added quotes)
12 years ago
Michael Peter Christen
f2e36fbd06
enhanced deletion process for very large number of documents
12 years ago
reger
79401cb938
added reindex option for documents with disabled or obsolete fields to Solr Schema Editor page (IndexSchema_p.html)
...
this allows to remove obsolete fields from the index (according to current schema config)
by selecting all documents containig disabled fields.
12 years ago
orbiter
cf36c1614f
prevent that concurrent deletion process causes wrong double-check in
...
crawl start
12 years ago
orbiter
aeff31cd44
fix for workflow processor (cause: latest redesign for less threads)
12 years ago
Michael Peter Christen
77faeada4d
small memory leak patch
12 years ago
Michael Peter Christen
b24d1d18e4
removed synchronization and concurrency in Fulltext class, concurrent
...
deletions are now handled in ConcurrentUpdateSolrConnector
12 years ago
Michael Peter Christen
f965d04496
added new peer icons for Mentor peers and Mentee peers (not used yet)
12 years ago
Michael Peter Christen
b9b446bca6
- added ssl configuration sign (a lock) to network statistic/table
...
- fixed a bug in bitfield
12 years ago
Michael Peter Christen
7095446ad3
added checkbox (near port) to switch on ssl support (https access) to
...
the admin interface.
12 years ago
Michael Peter Christen
e6c8b545c2
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter
a83c2fe833
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter
4baa0d4a97
Added a default keystore for ssl encryption of the YaCy web interface.
...
This will enable https-access to YaCy, but this feature is disabled by
default using the new server.https=false attribute. This has two
purposes:
- make it easier for everyone to use https (just set server.https=true)
- provide the basis for secure yacy-to-yacy communication in the future
12 years ago
Michael Peter Christen
0aef60f66e
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
reger
da191c839d
reduce SolrConnectorLogging setting (from default ALL to INFO)
12 years ago
Michael Peter Christen
aaddb4809c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
038f956821
fix for sitemap detection: the sitemap url was not visible if it
...
appeared after the declaration of robots allow/deny for the crawler
because the sitemap parser terminated after the allow/deny rules had
been found. Now the parser reads the robots.txt until the end to
discover also sitemap rules at the end of the file.
12 years ago
reger
4fc6837690
- fix monitor url of crawl job in PerformanceQueues_p.html
...
- reduce logging of every index add (switch embeddedsolr.add from info to debug)
12 years ago
Michael Peter Christen
442ed50be0
removed some unnecessary synchronizations
12 years ago
Michael Peter Christen
9bd2aee180
migrated to solr 4.3.0
12 years ago
Michael Peter Christen
ad050ec88d
- upgraded httpclient, httpcore and httpmime
...
- removed httpclient 3.1 which has been used by solrj < 4.x.x and is now
not used any more
- fixed some parts in YaCy which used methods from httpclient 3.1
12 years ago
Michael Peter Christen
4b100f8b48
Merge branch 'master' of ssh://gitorious.org/yacy/rc1
12 years ago
Michael Peter Christen
3abf516ca7
merged classpath
...
Bitte geben Sie eine Versionsbeschreibung für Ihre Änderungen ein. Zeilen,
12 years ago
orbiter
a1c989002b
fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4652
...
generate dht data even if dht receive and dht transmission is switched
off
12 years ago
orbiter
48e9a54e80
updated pdf parser
12 years ago
Michael Peter Christen
e26bdd4a52
fixes to deletion methods (removed unnecessary concurrency and added
...
removal of crawl queue entries)
12 years ago
Michael Peter Christen
f2c9b0b5f2
better robustness of Concurrent Solr Connector against update/deletion
...
thread failure
12 years ago
Michael Peter Christen
f7f3e28c5e
prevent that the size of the index is computed too many times.
...
Because the index size is now provided by solr, and the only way to do
that is a match for [* TO *], a size computation is quite complex and
time-consuming. Therefore this patch prevents that the method is called
at all and if necessary puts a DOS-preventing barrier in front of it.
12 years ago
Michael Peter Christen
cca19d94d4
re-declared some fields to be of type string rather than text which
...
makes them more efficient and less large
12 years ago
Michael Peter Christen
cc90f82dbb
increased default proxy client timeout to one minute
12 years ago
Michael Peter Christen
ed1d5bace6
draw the names of other peers which receive/send dht into the network
...
graphic
12 years ago
Michael Peter Christen
b528448332
enlarge network graph circle according to image height and reduce the
...
image height in the Network servlet. Overall, the image is now larger
but takes less space on the web page.
12 years ago
Michael Peter Christen
58d85b5b80
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
reger
24d2b4baee
remove pre 1.0 migration statement which possibly overwrites user navigator setting
12 years ago
Michael Peter Christen
f1bb54943e
typo
12 years ago
Michael Peter Christen
d7fd346917
- added regular-expression based deletions
...
- on-demand collection-list generation for collection-based deletions
instead of a default collection-list presentation (this makes calling
the interface much faster since the computation of collections lists for
large indexes may take some seconds)
12 years ago
Michael Peter Christen
3841854c97
abstraction of catchall term
12 years ago
Michael Peter Christen
ea85674be2
added the date to error documents
12 years ago
reger
72003b109b
Merge branch 'master' of git://gitorious.org/yacy/rc1.git
12 years ago
reger
4fec35a665
adjust Test case EmbeddedSolrConnector
12 years ago
Michael Peter Christen
6fafed2180
fix for solr cache when a delete buffer is filled and a document, which
...
is the delete queue, is replaced with a new one.
12 years ago
Michael Peter Christen
20b767f35e
preventing score computation in solr where applicable
12 years ago
orbiter
7de5b9cfa0
fix for http://bugs.yacy.net/view.php?id=233
...
- check geolocation coordinates and accept only those, which are
well-formed
- the solr push process does not stop crawling any more if after 20
requests to Solr Solr does not accept the record. Instead, a severe log
entry asks the user to create a bug request
12 years ago
sixcooler
e145afb8d6
fix for PerformanceMemory showing UNRESOLVED_PATTERN by removing
...
solr-cache-stuff, which is not available anymore
12 years ago
Michael Peter Christen
ee217dbdee
remove sort order in all cases where not needed
12 years ago