Michael Peter Christen
a2b66fe2eb
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
9f6be762a6
- better logging for postprocessing
...
- fixed collection bug in postprocessing
11 years ago
Michael Peter Christen
de8f7994ab
as crawling has a low-cpu demand, we want it to run even if the CPU load
...
is VERY high. This applies also if the CPU load is high because of
in-cache crawling; in that case we want to experience a high-CPU load as
much as possible
11 years ago
Michael Peter Christen
d8e79731df
fixed wrong used memory display
11 years ago
orbiter
da5d4128bf
prevent npe
11 years ago
orbiter
a878c7982c
prevent npe
11 years ago
orbiter
e4eb87d924
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
ced1a96f9c
fixed error cache
11 years ago
reger
3ba81bd08a
Merge origin/master
11 years ago
reger
4d896383db
fix: use timeout = proxy.ClientTimeout in ProxyHandler
...
(was 10sec fix) see http://bugs.yacy.net/view.php?id=236
11 years ago
Michael Benz
072d4aa0c0
Updated German translation and Blacklist_p.html
11 years ago
orbiter
163cbceca5
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
cfb647db6e
- introduced a miss cache in ConcurrentUpdateSolrConnector
...
- better usage of cache
- bugfix for postprocessing
11 years ago
reger
2c8c51ce4b
make use of new -config cmd-line parameter in reconfgureYACY.sh
...
to asure pwd encoding is compatible with DIGEST auth. in future.
11 years ago
orbiter
a87d8e4a8e
changed caching of ConcurrentUpdateSolrConnector: it caches now also the
...
url along with the load date. While this takes much more memory, it
eliminates database lookups for getURL() requests, which happen equally
often. This speeds up remote solr configurations.
11 years ago
orbiter
f6e441dd77
refactoring
11 years ago
orbiter
76c53faeb2
removed unused code (HostStat)
11 years ago
orbiter
d3a88eaecb
introducing ConcurrentUpdateSolrServer for remote solr servers.
...
Scaling of write buffers and update queue size is made according to
assigned memory.
11 years ago
orbiter
c3f6c06f2c
removed host increment on stored documents from crawler (that was wrong)
11 years ago
Michael Peter Christen
f97428fe5d
Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
11 years ago
malykhin.dmitry
746aa32ad5
edit russian locale
11 years ago
reger
809e976578
remove unused java imports form yacy.java
11 years ago
reger
a9b06f8719
add a -config command line parameter e.g. -config "port=9090" "port.ssl=8043"
...
- useful for remote installation to set any config file property
- multipe parameter can be set at once, on Windows enclose parameter in doublequotes
- special handling "adminAccount=adminuser:adminpwd" sets adminusername and md5 encoded admin-pwd
- adjusted windows startbatch to allow command line parameter handling
- remove not needed classpath calculation from startYACY_debug.bat
11 years ago
Michael Benz
edc8e1c4de
Finished translation of changed CrawlStartExpert_p.html
11 years ago
reger
0923b09216
fix: allow 4 character admin user name
...
(was min 5 char)
11 years ago
Michael Peter Christen
7253ca4607
Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
11 years ago
malykhin.dmitry
f8f0f6363d
edit russian locale
11 years ago
Michael Peter Christen
a86c2fe77d
fixed usage of media flag when started by automated process
11 years ago
Michael Peter Christen
254a7ac66c
fixed cleaning of index
11 years ago
Michael Peter Christen
28a7b42e6b
removed warning "sun.misc.BASE64Encoder is internal proprietary API and
...
may be removed in a future release"
11 years ago
Michael Peter Christen
046f5a03cb
one more SolrIndexSearcher bugfix
11 years ago
sixcooler
78c01b3eff
fix for 'AlreadyClosedException: this IndexReader is closed'
11 years ago
Michael Benz
f11314aae7
Improved German de.lng translation and fixed adresses -> addresses in \htroot\CrawlStartScanner_p.html
11 years ago
Michael Peter Christen
f0eec6d0f3
Merge branch 'master' of git://gitorious.org/~copro/yacy/copros-rc1
11 years ago
Michael Peter Christen
1b5e3d523a
better control over close-state of remote solr connections
11 years ago
Michael Benz
6278af4993
Edit German de locale and improved translation
11 years ago
Michael Peter Christen
1a364572a5
fix for
...
"org.apache.solr.core.SolrCore Too many close [count:-1] on
org.apache.solr.core.SolrCore@51af7c57"
-error
11 years ago
Michael Peter Christen
69391e5d9e
changed strategy to test existence of documents in Solr: using the
...
update time. The reason for that is a better caching for the crawler
double-check, which needs the update time for crawler steering.
11 years ago
Michael Peter Christen
790f103f32
delete fail-docs during postprocessing to prevent that they will appear
...
again and stay in postprocessing forever.
11 years ago
Michael Peter Christen
745d6d1c64
Merge branch 'master' of ssh://gitorious.org/yacy/rc1
11 years ago
malykhin.dmitry
ec598991a4
edit russian locale
11 years ago
reger
a02e33dcb6
add edit-link to PK field of table admin
11 years ago
r
c69630c522
edit russian locale
...
edit russian locale
11 years ago
Anatoliy Evladov
66639cb703
Fixed my error
11 years ago
Anatoliy Evladov
baaea6dedc
Edit ru locales
11 years ago
Michael Peter Christen
ff656ce860
explicit call to optimize to add a expungeDeleted flag
11 years ago
Michael Peter Christen
9eb668e951
enhanced the resource observer
...
The resource observer is now able to recognize free disk space AND
available space for YaCy. The amount of space which is assigned for YaCy
are defined in new settings in the configuration file.
Furthermore, there is now a cleanup process which deletes files in case
that an autodelete is activated. The autodelete is now BY DEFAULT ON if
the disk space is low, which means that YaCy starts to delete documents
when the disk is full!
11 years ago
Michael Peter Christen
fbee98c06f
fixed shortcut self-reference bug
11 years ago
Michael Peter Christen
e7a29a2851
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
cb2c25d930
in case that the crawler is running and the search user is the peer
...
admin, we expect that the user wants to check recently crawled document
to ensure that recent crawl results are inside the search results, we do
a soft commit here.
11 years ago