orbiter
d7d38f9135
made number of open files in crawler configurable and increased default
...
maximum number of open files from 100 to 1000. This number can be
changed with the attribut crawler.onDemandLimit
11 years ago
Michael Peter Christen
20cffa34bf
Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
11 years ago
malykhin.dmitry
873f8c2d2c
Update russian translation
11 years ago
Michael Peter Christen
c43acb0e80
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
8ad41a882c
fixed several problems with postprocessing:
...
- unique-postprocessing was destroying results from other
postprocessings; removed cross-updates as they had been not necessary
- unique-postprocessing did not restrict on same protocol
- inefficient concurrent update cache was redesigned completely
- increased limits for concurrent blocking queues to prevent early
time-out
11 years ago
sixcooler
370f1c408e
Changed Windows Firewall Rules to just honor the default Port 8090, but
...
not use any programm-path.
This should match more installations in different paths and also running
YaCy as service (prunsrv).
This commit was contributed and tested on Windows7 by René.
11 years ago
Michael Peter Christen
640b684bb6
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
2f5477ea59
a try to fix the mixed up terms 'Active' -> 'Senior' and 'Passive' ->
...
'Junior'
11 years ago
reger
ca5437dd50
fix crawl of file:// , also http://mantis.tokeek.de/view.php?id=149
...
local files can be crawled (intranet mode) url parsing fixed according to RFC 1738 (for unix and windows)
for win like file:///c:/tmp or file://localhost/c:/tmp
for linux like file:///tmp or file://localhost/tmp
Host is ignored and path must be absolute
11 years ago
Michael Peter Christen
9b4282344b
changed debian dependency to openjdk-7-jre-headless
11 years ago
Michael Peter Christen
ff5b3ac84d
added new fields http_unique_b and www_unique_b which can be used for
...
ranking to prefer urls containing a www subdomain or using the https
protocol
11 years ago
reger
66f6797f52
make config search page layout closer to actual page appearance
11 years ago
reger
9ecf28b708
- upd pom to Solr 4.8.1 and latest jar updates
...
- upd nsis java autodownload package to jre 7u55
11 years ago
Michael Peter Christen
06f2eeda22
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
5d5896b3f6
fixed dependency in debian package on java 7
11 years ago
sixcooler
5b1c4ef191
Monitoring and limit connection-count for Jetty
11 years ago
sixcooler
046e41e376
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
ee7416816b
upgraded poi library (office document format parser) from 3.9 to 3.10
11 years ago
orbiter
ce1dbfeb0f
fix appearance of image search thumbnails.
11 years ago
orbiter
6daae59479
switch on core.service.rwi when switching back from portal mode to p2p
...
mode
11 years ago
orbiter
a12701ddf6
upgraded bouncy caste libraries (needed for encrypted pdfs, dependency
...
in pdfbox) to 1.46
removed the activation.jar library; I don't know which other library
depends on it.
11 years ago
Michael Peter Christen
f0db501630
better handling of ranking parameters and new default values for date
...
navigation which is done using ranking in solr.
11 years ago
Michael Peter Christen
53948da7d0
tried to make last_modified recognition smarter
11 years ago
Michael Peter Christen
2d03037965
'Last-Modified', not 'Last-modified' according to
...
http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html
11 years ago
Michael Peter Christen
2520590b45
migrated from pdfbox 1.8.4 to 1.8.5. They have a very long bugfix list
...
for that update:
http://www.apache.org/dist/pdfbox/1.8.5/RELEASE-NOTES.txt
11 years ago
sixcooler
2d508618a4
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
3dc5fb0050
fix for operator precedence bug (cast binds stronger than bitwise AND)
...
in peer hash hashing. This should not change anything if java casts long
to int by masking with 0xFFFFFFFFL but you never know. The important
thing is, that the hashCode() should not return numbers that have the
same order as the hash code order because hashing of seeds is used to
remove the order in some places.
11 years ago
Michael Peter Christen
6634b5b737
debug code for index distribution testing
11 years ago
Michael Peter Christen
89e13fa34e
fixed bug in test function
11 years ago
sixcooler
bf2ae57126
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
sixcooler
275def648b
Revert "manual merge"
...
This reverts commit 3bfab8566c
.
11 years ago
sixcooler
3bfab8566c
manual merge
11 years ago
sixcooler
3349a8f363
Merge branch 'master' into foobar
...
Conflicts:
source/net/yacy/http/Jetty9HttpServerImpl.java
11 years ago
Michael Peter Christen
d4157184ec
migration to Solr 4.8.1
...
This includes also an update to zookeeper 3.4.6 and a new library that
Solr initializes by default: org.restlet from
http://restlet.com/download/current#release=stable&edition=jse&distribution=zip
which is included in version 2.2.1 from may 6th 2014
11 years ago
orbiter
49e344e8d9
Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter
7705e36703
fix for latest generic warning fix
11 years ago
sixcooler
10326892a8
avoid erros from ConnectHandler, correction for #6d16fa9
11 years ago
orbiter
97983ba89f
fixed generics warnings for generic array instantiation that appeared
...
after migration to Java 7
11 years ago
orbiter
2944822bb0
updated bootstrap seed list
11 years ago
Marc Nause
4723329e29
Improved blacklist XML/JSON API.
11 years ago
sixcooler
830057d788
lower Segment-size (hope to get Segments of 10GB)
...
see:
http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5216&p=30036#p30034
11 years ago
reger
f91b2f51ae
fix: load_Rss remove feed to many parameter for get
...
use form post methode
11 years ago
orbiter
c028ae9b09
Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger
e31493e139
"Use remote proxy for yacy" has no function, remove option and related config item
...
see/fix bug http://mantis.tokeek.de/view.php?id=23
http://mantis.tokeek.de/view.php?id=189
11 years ago
reger
89e2c5e884
fix: allow enable of CrawlStartExpert.html #file
11 years ago
reger
1b37b12998
fix: CrawlStartExpert.html # From File with missing filename
...
- crawlName must not be empty
- crawlingFile must not be empty
11 years ago
orbiter
181784a5cb
Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger
0587077d06
cleanup obsolete and not used serverswitch Authentify code
...
as auth is mostly delegated to Jetty container.
11 years ago
orbiter
c9f66be20b
move unnecessary nested else out of condition
11 years ago
orbiter
0d8072aa99
removed warnings
11 years ago