reger
3760e2616b
bump up lib/metadata-extractor-2.6.2.jar (used for image parser) with needed code adjustments
12 years ago
Michael Peter Christen
5f92c68f1f
removed block rank ranking and all YBR files in /ranking
12 years ago
Michael Peter Christen
9bd2aee180
migrated to solr 4.3.0
12 years ago
Michael Peter Christen
ad050ec88d
- upgraded httpclient, httpcore and httpmime
...
- removed httpclient 3.1 which has been used by solrj < 4.x.x and is now
not used any more
- fixed some parts in YaCy which used methods from httpclient 3.1
12 years ago
orbiter
48e9a54e80
updated pdf parser
12 years ago
Michael Peter Christen
27907c9739
added missing library after solr upgrade
12 years ago
Michael Peter Christen
cf0acd2cb4
upgrade to solr 4.2.1
12 years ago
Michael Peter Christen
461d46101d
- Removed log4j from libraries. This can be removed because the package
...
log4j-over-slf4j is there. From slf4j all loggings are routed to the jdk
logger. Now all loggings are consistently done to the jdk logger.
- added some lines to the logging properties to suppress many solr
logging statements. The number of the logging entries had already become
a performance issue, therefore removing these from the log should
increase performance.
12 years ago
orbiter
36f9b0fc16
updated wstx-asl to 3.2.9
12 years ago
reger
1951ba61ae
remove CPGEN from Windows batch files
...
(classpath for all needed libraries is defined in manifest of yacycore.jar)
12 years ago
Michael Peter Christen
09a2b09c48
guava update
12 years ago
Michael Peter Christen
80fe3d7860
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
...
Conflicts:
source/net/yacy/cora/federate/solr/connector/EmbeddedSolrConnector.java
12 years ago
Michael Peter Christen
4323621a76
update to Solr 4.1.0
12 years ago
reger
160ce568b3
move testing SolrServlet.main to test, making include of jetty*.jar in distribution and classpath obsolete
...
- move jetty*.jar to test library
- move SolrServlet.main as is to test, add also a junit test simulating main
- add build.xml cleanup for EmbeddedSolrConnectorTest created test/DATA
- adjust some test compile errors
12 years ago
reger
be5d3a1066
adding classpath to Manfiest of yacycore.jar
...
- this allows to start w/o giving explicite java -cp (just java -jar lib/yacycore.jar works)
- especially helpful while running YaCy as Win service,
making it obsolete to adjust classpath cfg of the service wrapper on upgrades of lib/*.jar's
12 years ago
sixcooler
f3e705c4fe
bump to httpclient / httpcore 4.2.3 (bugfix-release)
12 years ago
Michael Peter Christen
9dfc9c95d8
updated slf4j and log4j
12 years ago
Michael Peter Christen
95712fdc8b
update to pdf parser
12 years ago
Michael Peter Christen
e2c4c3c7d3
migration to solr 4.0.0
12 years ago
Michael Peter Christen
69aa39d664
update to libraries required by solr 4.0.0
12 years ago
sixcooler
9d062873d2
bump to httpclient-4.2.2
12 years ago
sof
5cb244b79b
Merge remote branch 'origin/master'
12 years ago
apfelmaennchen
88b062210c
Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based
...
on the jaudiotagger library. The parser is disabled by default as it
needs to store temporary files for non file:// protocols, which might be
disliked. For your local MP3-collection it loads nicely Artist,
Title, Album etc. from the audio files meta data.
12 years ago
sixcooler
9aa21506be
bump to httpcore-4.2.2 (maintenance release)
12 years ago
Michael Peter Christen
d0015df61c
added lucene memory library which is now necessary as solr has to
...
process more complex queries
13 years ago
Michael Peter Christen
80edd8ecd7
some more after-refactoring fixes
13 years ago
Michael Peter Christen
bc865ab816
more cleaning (yacy-cora)
13 years ago
Michael Peter Christen
e65cecc419
- updated lucene libraries to 3.6.1
...
- added lucene-grouping which enables faceted search; try this:
http://localhost:8090/solr/select?q=*:*&start=0&rows=3&facet=true&facet.field=host_s
13 years ago
Michael Peter Christen
2ccf1dba71
upgrade to solr 3.6.1
13 years ago
cominch
e74d66e28c
augmented browsing: remove htmlparser library
13 years ago
cominch
e2119f4e76
augmented browsing: replace htmlparser by jsoup, which is more stable
...
and reliable
13 years ago
sixcooler
a99ef68422
bump to httpclient-4.2.1
13 years ago
Michael Peter Christen
65f56b1fd4
Merge branch 'master' of ssh://gitorious.org/yacy/rc1 into jetty
...
Conflicts:
.classpath
build.xml
htroot/Status.java
source/de/anomic/http/server/HTTPDProxyHandler.java
source/net/yacy/yacy.java
13 years ago
Michael Peter Christen
7b53be141f
upgraded to pdfbox 1.7.0
...
changes in http://www.apache.org/dist/pdfbox/1.7.0/RELEASE-NOTES.txt
with many bugfixes, including performance related
13 years ago
Michael Peter Christen
fad3b14813
added jetty libraries, needed for future use as web server and as
...
application server for the solr search interface
13 years ago
Michael Peter Christen
b9d42fd9c8
using com.google.common.io.Files instead of homebrew methods
13 years ago
Michael Peter Christen
1be0025a9c
- added test for EmbeddedSolrConnector
...
- added needed libraries for this test
this includes most (all) files needed for an embedded solr
13 years ago
Michael Peter Christen
dbdd697f4d
moved RDFaParser.xsl configuration file to defaults
13 years ago
Michael Peter Christen
e12bb254b4
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen
3f55dc7c1e
- added solr core and libraries that solr needs (lucene is missing, will
...
follow later)
- added embedded solr connector which can connect to solr
programmatically (without using a server in between)
13 years ago
Michael Peter Christen
786be7d175
better integration of RDFaParser
13 years ago
cominch
5d20cd324a
Add Triplestore and RDF query interface
...
Conflicts:
build.xml
defaults/yacy.init
source/net/yacy/interaction/AugmentHtmlStream.java
13 years ago
cominch
b21048892b
augmentedParser add features and integrate external html parser to
...
modify existing web pages
Conflicts:
addon/YaCy.app/Contents/Info.plist
build.xml
13 years ago
sixcooler
56087c1f23
bump to httpclient- httpcore-, httpmime- 4.2
13 years ago
Michael Peter Christen
7acd7e88b3
added all shell scripts in /bin to add also latest passwd.sh file
13 years ago
Michael Peter Christen
4d3cc02168
replaced old bzip2 library against better documented commons-compress
...
package from http://commons.apache.org/compress/
13 years ago
Michael Peter Christen
ca7de1dbd0
moved files to defaults
13 years ago
Michael Peter Christen
6c4f8fdc44
removed superfluous files
13 years ago
Michael Peter Christen
62f2554a01
- fixed build problems (deprecated methods using httpclient 3.1)
...
- removed httpclient 3.1 lib which was used by solrj (solrj now uses
httpclient 4)
13 years ago
Michael Peter Christen
f838997126
updated commons io from 2.0.1 to 2.1
13 years ago
Michael Peter Christen
eeb57ae824
updated http client libraries
13 years ago
Michael Peter Christen
ffb72249ea
added missing apicat.sh
13 years ago
Michael Peter Christen
a30b028cc0
updated libraries
13 years ago
Marek Otahal
a231d0eeb9
Run from Java the whole app YACY
...
start for java webStart
allow for better integration with IDE
Conflicts:
source/net/yacy/gui/framework/Browser.java
13 years ago
Michael Peter Christen
e1434635d4
changed required setting for package signing
13 years ago
admin
d171a2fa3e
fixed ant build for deb target: no more svn numbers
13 years ago
sixcooler
d14ee8e464
Revision 9000+ hack
...
do not handle the revision in build.properties anymore
(9000 as fallback)
build-date from git-HEAD (instead when build is fired)
(orginal build-date as fallback)
13 years ago
Michael Christen
7afcdcd573
release 1.01 - now with virtual svn number 9000
13 years ago
sixcooler
b79da58eac
Ant-Task for getting version from git
...
tries to find svn-version or any tag - what ever comes first
be careful using this with non-numerical tags!
13 years ago
sixcooler
69570fda24
bring my master to stuff from remote
13 years ago
sixcooler
d9c56aa37a
Ant-Task for getting version from git
13 years ago
sixcooler
9f8240b350
script for clean copy of URL-tables
13 years ago
apfelmaennchen
9067ab20b2
- included missing image for portalsearch.tar.gz in build.xml
...
- compressed (minify) yacy-portalsearch.js for better performance
- removed language selector, as it doesn't work really well (at least for me)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8026 6c8d7289-2bf4-0310-a012-ef5d649a1542
13 years ago
apfelmaennchen
a425fbd8d6
- created new target 'portalsearch' in build.xml to generate yacy-portalsearch.tar.gz for static hosting
...
- some refactoring for search widget and jquery
- update for ConfigLiveSearch.html to refelct latest changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@8023 6c8d7289-2bf4-0310-a012-ef5d649a1542
13 years ago
orbiter
3f606407bc
added new scripts to bin in build
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7991 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
d2ea250d99
refactoring:
...
- moved many classes from de.anomic to net.yacy
- made more sub-packages for search classes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7973 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
65ab067491
migration to solrj 3.4.0
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7952 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
dc25c48fc9
added more libraries that are needed by solrj
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7922 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
sixcooler
52b477cf6f
bump to httpclient-4.1.2, httpcore-4.1.3 - bugfixrelease
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7876 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
sixcooler
48560a44a9
bump to httpcore-4.1.2: a bugfixrelease
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7853 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
d3c89b90ce
temporary adding the old httpclient-3.1 again because the solrj classes need them. should be removed as soon solrj supports httpclient-4
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7831 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
768c59740c
- replaced solrj 3.1 with solrj 3.3
...
- updated also slf4j
- added authentication for solrj
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7829 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
528b59e078
replaced xerces.jar library that was originally added 2005 with SVN 126 to the libx directory and that was moved to lib in SVN 5781
...
the new replacement is taken from http://xerces.apache.org and has the version 2.11.0 and was inside the file Xerces-J-bin.2.11.0.tar.gz
and consists of two files named xercesImpl.jar and xml-apis.jar
The original purpose of that library was to support:
- content parsers
- optional seed uploader
- SOAP API (which will be committed later)
Since the SOAP API does not exist any more the purpose is to support content parser and an optional seed uploader
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7819 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e7e1a0f328
replaced commons-io v1.4 with v2.0.1
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7818 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
5092a14bcb
replaced fontbox, jempbox, pdfbox v 1.5 with v1.6
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7817 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
suessthomas
ccad615f58
The Java-XMS and Xmx values for the target of "run" (run YaCy) inserted.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7777 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
77fe69395d
added jempbox-1.5.0.jar which is required by pdfbox-1.5 as stated in http://pdfbox.apache.org/dependencies.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7774 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
sixcooler
efcd21e0ed
new httpclient, httcore (bugfixrelease)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7769 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
761b1c71dc
added latest pdfbox
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7761 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
3b578a28ef
some patches to prevent that empty or bad IP information is broadcasted
...
- on client-side: fix bad IP reports from remote Peers by replacing their reported IP with their server IP if the reported IP is bad, broken or disallowed
- on server-side: the same during a peer ping (here the ping'ed server acts also as client during the back-ping) and also when receiving a message or a search where the client sends also its seed. Here the IP is replaced by the client IP if the reported IP is broken or bad
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7687 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
c493f101c0
added one more script file to release build script
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7681 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
f6077b3cc0
added more attributes for html parser and enhanced data structures
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7679 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
a0e4960a4d
YMark:
...
- first attempt for a firefox json bookmark importer
- added JSON library json-simple-1.1.jar
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7658 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
19fd13d3bc
Added federated index storage to solr.
...
YaCy supports now the storage to remote solr indexes.
More federated storage (and search) methods may follow.
The remote index scheme is the same as produced by the SolrCell; see
http://wiki.apache.org/solr/ExtractingRequestHandler
Because this default scheme is used, the default example scheme can be used as solr configuration
This is also the same scheme that solr uses if documents are imported with apache tika.
federated solr storage is switched off by default.
To use this, do the following:
- set federated.service.solr.indexing.enabled = true
- download solr from http://www.apache.org/dyn/closer.cgi/lucene/solr/
- extract the solr (3.1) package, 'cd example' and start solr with 'java -jar start.jar'
- start yacy and then start a crawler. The crawler will fill both, YaCy and solr indexes.
- to check whats in solr after indexing, open http://localhost:8983/solr/admin/
Until now it is not possible to use the solr index to search with YaCy in that solr index.
This functionality is now available for two reasons:
1) to compare the functionality of Solr and YaCy and to compare the search speed
2) to use YaCy as a search appliance for people who need a crawler or other source harvesting methods
that YaCy provides (like dublin core reading, wikimedia dump reading, rss feed reader etc) if people still
want to use solr instead of YaCy.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7654 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
4c013d9088
more UTF8 getBytes() performance hacks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7649 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
399d7d6878
* fix permissions of bin/-folder in debian package
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7647 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
21fe5e6c6a
* add bin-folder to debian package
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7638 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
Florian Richter
351d264a48
* yacy domain handler for jetty
...
* rewrite from / to /index.html
14 years ago
orbiter
18ec7fe53c
added a clearall.sh script that deletes the complete index and everything else that belongs to crawling
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7611 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
d98884f1d5
added script for importmediawiki.sh in build.xml
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7610 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
Florian Richter
68ca0fbb2e
* add copyright info
...
* implement basic authentication
* update jetty to 7.3.0
14 years ago
suessthomas
9956dc9dce
Update jcifs-library to Version 1.3.15. Small Changes, read: http://jcifs.samba.org/ - "Minor adjustments have been applied to DcerpcHandle locking routines in the SID class to fix sporadic occurances of "All pipe instances are busy" errors under high load."
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7587 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
Florian Richter
2d2df64870
* jetty works for many pages now
14 years ago
Florian Richter
1989ba64c0
* jetty
14 years ago
sixcooler
45dcfa3460
update to httpclient-4.1
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7473 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
d38c739b31
* add ant-target deleteData, to simulate fresh yacy installation
...
* useful in IDEs like eclipse
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7472 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
19711ff6d6
added target that compiles only the cora package
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7423 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
lotus
2db6ca8104
- do not allow user to disable java installation
...
- java version update
- build cleanup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7284 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
fc3ee9d8a1
changed paths for new libs also in Info.plist
...
when changing paths in build.xml, please also update the paths in /addon/YaCy.app/Contents/Info.plist
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7198 6c8d7289-2bf4-0310-a012-ef5d649a1542
15 years ago
sixcooler
f4357dff03
bump to httpclient-4.0.3 which fixes a number of bugs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7197 6c8d7289-2bf4-0310-a012-ef5d649a1542
15 years ago