Michael Peter Christen
7ab5093321
added new solr title_exact_signature_l and
description_exact_signature_l to be able to identify unique title and
unique description fields.
12 years ago
Michael Peter Christen
f24ac518e6
redesign of exists()-query (can now be called with query) and the
CachedSolrConnector which based its cache on the key value. This will be
used to correct the title_unique_b and description_unique_b field.
12 years ago
Michael Peter Christen
27d6222880
added new field host_extent_i which, after a crawl and postprocessing,
holds the number of documents for the host where the document is hosted.
This is necessary for ranking and the norming of references per local
host in the ranking computation.
12 years ago
Michael Peter Christen
579eb01a49
showing now the details of references count in host browser:
external (ext), internal (int) and external hosts (hosts) for each
indexed document.
12 years ago
reger
0f4237d8e5
add admin option to delete load errors from index
12 years ago
reger
518b20147c
skip postprocessing during document.store if no citation index connected (prevent null pointer exception)
12 years ago
Marc Nause
ac478384d3
*) did some long overdue refactoring
12 years ago
Marc Nause
e99c8789ff
*) fixed encoding of query in link to map (in case geolocalization is
enabled, "Show search results for "köln" on map")
*) applied suggestions of Checkstyle plugin
12 years ago
Michael Peter Christen
ada3f27de7
added three new fields for a better ranking: references_internal_i,
references_external_i and references_exthosts_i. These can be used to
count and evaluate the number of external links to every web page. An
experimental ranking function could be, e.g.:
div(add(references_internal_i,product(references_external_i,references_exthosts_i)),add(clickdepth_i,1))
12 years ago
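To make the experimental ranking expression above easier to read, here is a minimal sketch of the same arithmetic in plain Python. It is only an illustration: the function name and its parameters are hypothetical stand-ins for the field values, and it assumes that clickdepth is never negative, so the denominator is at least 1. How the search server treats missing field values is not covered here.

```python
def experimental_rank(references_internal, references_external,
                      references_exthosts, clickdepth):
    """Mirror div(add(internal, product(external, exthosts)), add(clickdepth, 1)).

    All four arguments are hypothetical stand-ins for the values of
    references_internal_i, references_external_i, references_exthosts_i
    and clickdepth_i of a single document.
    """
    # add(references_internal_i, product(references_external_i, references_exthosts_i))
    numerator = references_internal + references_external * references_exthosts
    # add(clickdepth_i, 1): the +1 avoids a division by zero for clickdepth 0
    denominator = clickdepth + 1
    # div(numerator, denominator)
    return numerator / denominator


if __name__ == "__main__":
    # 2 internal references, 3 external references from 4 external hosts,
    # document is one click away from the start page: (2 + 3*4) / (1 + 1)
    print(experimental_rank(2, 3, 4, 1))  # prints 7.0
```

In this form it is visible that external references are weighted by the number of distinct external hosts, and that documents deeper in the click path are ranked lower.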
Michael Peter Christen
082e3274d6
- setting the same default ranking in the solr interface as for YaCy
search interfaces if no other ranking attributes are given
- using the YaCy ranking in the GSA interface only if no GSA-style sort
attribute was given
- to avoid confusion about correct ranking attributes, only the default
'0'-ranking profile is used and not a scenario-adapted one (site, date)
because that should be configurable in the web interface before it is
actually used for ranking.
12 years ago
Michael Peter Christen
a20941c067
resume paused crawls on startup; user expects that restarts 'heal'
everything
12 years ago
Michael Peter Christen
edc0b33f6d
- showing references count and clickdepth in host browser
- fixed generation and presentation of both values
12 years ago
orbiter
2c3b024196
if the crawl was paused (automatically), show the reason for pausing in
the Crawler_p servlet.
12 years ago
reger
566a3b0294
fix: Index Administration > Reverse Word Index (IndexControlRWIs_p) corrected use of word search to word-hash search
- removed duplicate QueryParams.hashes2Handles , redundant with .hashes2Set
12 years ago
reger
989575b447
Merge branch 'master' of git://gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
27907c9739
added missing library after solr upgrade
12 years ago
reger
f37b4c984c
adjust Netbeans IDE project.xml classpath for Solr 4.2.1 jars
12 years ago
Michael Peter Christen
c6c01a3ca2
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
cf0acd2cb4
upgrade to solr 4.2.1
12 years ago
reger
40b3f2c5fe
comment out dead menu link
12 years ago
reger
bf1e1ddca1
fix typo in prev commit
12 years ago
reger
d4d93be779
uncomment "used time" calculation for remote search log
12 years ago
reger
36202f27b0
improve remote search log, set "Returned Results" to transmitcount (instead of no value)
12 years ago
reger
e89491271f
- fix opensearch discover err msg - webgraph not enabled - if no opensearchdescription link found in index
- remove search2.net from sample config (is down)
12 years ago
reger
6a9d0b60a3
make sure configured port is reported on recreated mySeed.txt
12 years ago
reger
254074b11d
Merge branch 'master' of git://gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
870aedf3c6
fixes for better search interface integration in yaml templates
12 years ago
Michael Peter Christen
735eb70525
better search timing; prevents '0 results' for very large local
indexes >> 10 mio documents
12 years ago
Michael Peter Christen
5512be6673
fix in GSA result writer which evaluates result context fields as
String. After the migration to Solr 4.1.0 'some' of these fields
suddenly are stored as String[]; this patch compensates for this confusion.
12 years ago
Michael Peter Christen
342ba1049b
- callback fix
- memory allocation problem in RowCollection: if memory is too low, do
not try to increase by 1 because this leads to very long execution
time and at the end to the same OOM as if we allocate the memory at the
moment we need it even if the resource observer states that this memory
is not there. To compensate for this, the increase size is reduced.
12 years ago
orbiter
65d73e5652
renamed callback function to 'callback' because that is a standard for
jsonp which is also used in backbone.js/jquery
12 years ago
reger
31d16f20d7
fix invisible icon not found
12 years ago
orbiter
17ae51e741
increased number of links limitation from 1000 to 10000 for rss feeds
and html documents
12 years ago
orbiter
243b66ae6d
Merge branch 'master' of git://gitorious.org/~frankensteen91/yacy/frankensteen91s-yacy
12 years ago
Frank
7763f2554f
add the new PPMbar in Crawler_p for a better style and better use.
12 years ago
orbiter
e4d26d1cb4
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter
940c6849ee
enhanced did-you-mean (a bit): can now remember previously searched
words (plus small enhancements)
12 years ago
reger
d57b221921
add: reset Solr schema field selection to default button in IndexSchema_p
12 years ago
Michael Peter Christen
a725a4242f
main release 1.4
12 years ago
Michael Peter Christen
9406a2e438
fixed NPE during index abstract computation
12 years ago
Michael Peter Christen
16e9d4d1dd
added a restart hint
12 years ago
Michael Peter Christen
d725782440
turned severe message to warning message about network failure events
12 years ago
Michael Peter Christen
b3a54d5b1c
fix for wrong class name in log
12 years ago
Michael Peter Christen
2d36a7eaf5
- do not create a new query for all remote peers
- no document search this time
- adjusted banner and network to not show 'WORDS' but DHT Chunks. This
is to avoid confusion for robinson peers which do not create Word
Entries
12 years ago
Michael Peter Christen
4af0839be2
use appropriate ranking for each search situation:
- when using the /date modifier, a date ranking profile is used
- when using a site: modifier, a ranking profile supporting longer urls
is used
12 years ago
Michael Peter Christen
b8ed66a55d
added all clickdepth computations for source and target paths in
webstructure core
12 years ago
Michael Peter Christen
6300730d7f
refactoring of clickdepth computation as preparation for clickdepth
computation of webgraph links
12 years ago
Michael Peter Christen
2080fc7406
removed unused tag fields
12 years ago
reger
7804c12976
fix error msg in ConfigHeuristics_p
12 years ago
reger
230a12bfe2
adjust Opensearch discover function to new webgraph Solr schema
12 years ago