Michael Peter Christen
b3aad6cc35
bugfix for remote search when search is done to solr
12 years ago
Michael Peter Christen
ff3eaa21b0
added remote search to solr on YaCy peers!
...
- when doing a remote search, node peers are selected for solr queries
- the solr query is done concurrently to the standard YaCy rwi search
- the solr search result is feeded into the same data structure that
prepares the rwi search result
- the same remote seach that is done to several outside peers is done to
the local solr index
- the search process works now also without any 'old' RWI data using
solr
12 years ago
Michael Peter Christen
a06123aec6
more abstraction and less parameter overhead for remote search
12 years ago
Michael Peter Christen
f00733186b
code simplifications
12 years ago
Michael Peter Christen
755f5e76cf
removed strange assert statements and simplified code in metadata
...
transformation
12 years ago
Michael Peter Christen
db0d438709
fix for http://bugs.yacy.net/view.php?id=206
12 years ago
orbiter
404b0aab09
refactoring in remote search and stub for remote node peer selection
12 years ago
orbiter
d7ea45f698
- get nice text_t values from metadata conversions that are stored into
...
solr as fulltext search index.
- added slow migration from old metadata to solr index entries: each
entry from the old metadata is removed from that data structure and
written into solr.
12 years ago
orbiter
99ef57f103
reduced sleep times
12 years ago
orbiter
780f8974e7
added ramaining iteration methods for solr in fulltext class
12 years ago
orbiter
acd2dc3575
hack to removed StringBuilder overhead in query construction
12 years ago
orbiter
db6863db77
reduced solr cache sizes to check if that solves memory problems a bit
12 years ago
orbiter
6f01542aaa
explicit double-check in transferURL
12 years ago
orbiter
ee01c12e56
fixes for putDocument and putMetadata
12 years ago
orbiter
cc47a0876e
reverted bf55f69176
...
to have a fall-back option in case that memory problems as reported in
http://forum.yacy-websuche.de/viewtopic.php?p=26901#p26901
for full-solr installation are too strong and we have to work with an
'small memory footprint' peer system.
12 years ago
Michael Peter Christen
0904afe8fb
added concurrent iterator methods to the solr connectors
12 years ago
Michael Peter Christen
d54b80327a
refactoring
12 years ago
Michael Peter Christen
f9fc5cfaba
better check for bad urls in url transmission
12 years ago
Michael Peter Christen
d39463a85c
added deleteByQuery to solr connectors
12 years ago
Michael Peter Christen
0cab06c47c
refactoring
12 years ago
Michael Peter Christen
bf55f69176
removed write methods to old metadata file type; all metadata now goes
...
to solr
12 years ago
Michael Peter Christen
40c0856489
refactoring
12 years ago
Michael Peter Christen
2ccf1dba71
upgrade to solr 3.6.1
12 years ago
Michael Peter Christen
e651d3e320
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
06a78eecb7
code simplification
12 years ago
Michael Peter Christen
54bea21c02
bugfix for solr connector, possibly a cause for
...
http://forum.yacy-websuche.de/viewtopic.php?p=26893#p26893
12 years ago
Michael Peter Christen
9bece5ac5f
enhanced snippet fetch - removed a bug that caused documents to be
...
parsed even if a solr text was available
12 years ago
cominch
8a91f4fa42
local robots.txt: disallow external crawlers to follow the URL proxy
12 years ago
Michael Peter Christen
18f989dfb1
- refactoring (load -> getMetadata)
...
- added getDocument to retrieve Solr documents which shall replace
getMetadata
12 years ago
Michael Peter Christen
395b78a0d8
using the solr search index to concurrently search within solr and the
...
rwis during local search requests.
12 years ago
Michael Peter Christen
6197caf698
added clear-text search words in query params
12 years ago
Michael Peter Christen
efafa79db5
- added a content-encoding: gzip to streamed http server responses
...
- finish and close streamed http responses immediately
- this applies only to the solr interface which should be much faster
now!
12 years ago
Michael Peter Christen
23226676c6
FOR THE BRAVE.. this is a forced migration to solr which is now ready
...
for production as a replacement of the metadata-db.
This intermediate release 1.041 will switch on the previously optional
solr index and the old metadata-db will still work as it did before.
Solr+metadata are accessed in mixed mode, no migration is done yet.
If this causes not a catastrophe until the end of the weekend, we will
do a YaCy 1.1 main release containing this as default.
12 years ago
Michael Peter Christen
a1b2c9a67d
doctype2mime fix, influences metadata conversion between old metadata
...
and solr
12 years ago
Michael Peter Christen
7c31be1c80
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
cominch
6456a1656a
changed local robots.txt to prevent external crawlers to submit random
...
search queries
12 years ago
Michael Peter Christen
a16206e38b
more attempts to clean the index (cleaning is faster then)
12 years ago
Michael Peter Christen
703f427303
fixed some peer-ping connection details
...
- larger time-out
- removed too old seedlist
- fixed a bug in connection test
12 years ago
Michael Peter Christen
597bb76e4f
get the peer location more quickly
12 years ago
orbiter
156d457aec
fix for Index out of bounds exception in Network servlet
12 years ago
orbiter
da93addec3
addon to e74d66e28c
...
(removed htmlparser.jar): for Mac App
12 years ago
Lotus
ae9cd7a118
fix xss bug #204
12 years ago
Michael Peter Christen
1641835fef
replaced yacy xml encoding by solr xml encoding
12 years ago
Michael Peter Christen
89fe13e73d
enhanced GSA and RSS output format: corrected date, added some missing
...
fields, added xml encoding for utf8
12 years ago
Michael Peter Christen
ea49a8aa8c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
d988ba50cf
added a very rudimentary, incomplete, non-verified GSA response writer
...
for solr. Try this:
http://localhost:8090/gsa/searchresult?q=pdf&site=col1&num=10
12 years ago
Michael Peter Christen
aab0b680c3
- added xslt support for solr result formats.
...
try i.e.
http://localhost:8090/solr/select?q=*:*&start=0&rows=10&wt=xslt&tr=json.xsl
- added servlet-side mime-type configuration for streamed servlets. this
is used for the result formatters in solr result formats
12 years ago
cominch
e74d66e28c
augmented browsing: remove htmlparser library
12 years ago
cominch
e2119f4e76
augmented browsing: replace htmlparser by jsoup, which is more stable
...
and reliable
12 years ago
cominch
ad62609ec7
added a possibility to define a custom network definition URL for remote
...
management
12 years ago