reger
7d863d6254
fix empty text facet entry
...
(noticed on Author facet)
10 years ago
Michael Peter Christen
a39419f2ef
more stacks shall be considered for on-demand loading, not only
...
deep-depth stacks to prevent "too many open files" problem
10 years ago
Michael Peter Christen
5bb52f79be
reduce number of calls to queue.size() because that may be a bottleneck
...
during crawling
10 years ago
Michael Peter Christen
4920ab7b76
optimize usage of size() cache
10 years ago
reger
ee277b9b3e
allow for local yacy.stopwords and yacy.badwords list (in DATA/SETTINGS/)
...
if file in DATA/SETTINGS it is loaded otherwise file in ./defaults is loaded
(if locale ./defaults/stopwords.xx doesn't exist take solr/lang/stopwords_xx.txt as default)
move yacy.stopwords, yacy.stopwords.de and yacy.badwords.example out of root directory to ./defaults directory
10 years ago
reger
de56266bcb
remove redundant toLower for topwords
10 years ago
Michael Peter Christen
a34f837592
better delete all files in path when removing host crawl stack
10 years ago
Michael Peter Christen
10b1db430a
if we have many hosts, use on-demand earlier
10 years ago
Michael Peter Christen
1324927e66
prevent division by zero
10 years ago
Michael Peter Christen
2beb6abeb6
disabled crazy sleep loop
10 years ago
Michael Peter Christen
092d97d7ac
when importing vocabulary csv files, accept also files without semicolon
...
and truncate quotes from literals
10 years ago
Michael Peter Christen
ee9ec40048
added hints to ranking to make ranking boosts using vocabularies easier
10 years ago
Michael Peter Christen
70f03f7c8e
do not cache search requests to Solr if the result is used for
...
doublechecking. If a double-check comes from cached results the
doublecheck fails.
10 years ago
Michael Peter Christen
a0b84e4def
use a LinkedHashMap for factes to maintain facet order as given by solr
10 years ago
reger
ef5dc68313
include domtype to searcheventcache id
...
to differenciate between local / global events for reuse of cached events
fix for http://mantis.tokeek.de/view.php?id=493
10 years ago
Michael Peter Christen
0dc6e0a5f2
added option to enrich vocabularies with synonyms from synonym database
10 years ago
Michael Peter Christen
6a2a669db4
added loading of the synonyms file from addon/synonyms into the
...
knowledge loader
10 years ago
Michael Peter Christen
c67c5c0709
added new solr schema fields which record the occurences of vocabulary
...
matchings. These matches can be used for result boosting, i.e. if a
document contains words from a specific vocabulary, boost it.
10 years ago
Michael Peter Christen
a67a465415
fix field counter for multi-fields in html writer for the solr servlet
10 years ago
Michael Peter Christen
fdba8e2fa0
fix for 2-day network stats table: showing 48 instead of 24 hours from
...
peer history
10 years ago
Michael Peter Christen
ec9d021568
added option in vocabulary editor to import CSV files with different
...
encodings (preselected windows-type character encoding which is typical
for CSV files). Fixed also other problems with character encoding in
dictionary files. Automatically generated vocabularies are now also
noted in the API steering.
10 years ago
reger
b558433211
adjust tag cloud font size calculation
...
to limit max font size to ~ TOPWORDS_MAXSIZE
10 years ago
reger
3c818fc912
add a check of java version string >=1.7 to startup class
...
stopping start with error msg on version < 1.7
10 years ago
Michael Peter Christen
0550b54d56
added fix to postprocessing: avoid caching of postprocessing collection
...
to always get fresh lists of documents. This is necessary since the
postprocessing changes the same documents which the
postprocessing-collection query selects.
10 years ago
Michael Peter Christen
68e8039fd1
added high-precision scheduler for API processes. This allows also to
...
make the execution in dependency of available RAM or CPU load. The
default value for CPU load is 4.0 and the check runs once a minute.
10 years ago
Michael Peter Christen
8aee7f940e
added missing class for latest changes
10 years ago
Michael Peter Christen
97039049e4
fix in key enumeration methods for cases where the enumeration is done
...
in reverse order.
10 years ago
Michael Peter Christen
7e1b0b6712
fix for wildcard patch in search queries
10 years ago
Michael Peter Christen
0a879c98e7
added new 'firstSeen' database table and necessary data structures which
...
hold a date for each URL to record when a url was first seen. This is
then used to overwrite the modification date for urls upon recrawl in
case that the first-seen date is before the latest document date. This
behaviour is necessary due to the common behaviour of content management
systems which attach always the current date to all documents. Using the
firstSeen database it is possible to approximate a real first document
creation date in case that the crawler starts frequently for the same
domain. As a result the search results ordered by date have a much
better quality and the usage of YaCy as search agent for latest news has
a better quality.
10 years ago
Michael Peter Christen
487a733c99
fix for catchall handling in search
10 years ago
sixcooler
33b0234454
added a input-field for setting 'fileHost'
...
Set this to avoid error-messages like 'proxy use not allowed / granted'
on accessing your Peer by its hostname.
10 years ago
Michael Peter Christen
421ee64f33
another fix to ordering of table indexes; fixes also network stats
...
graphics
10 years ago
Michael Peter Christen
1db476c67e
fix for bad table iteration
10 years ago
Michael Peter Christen
e05b7332b9
html fix
10 years ago
sixcooler
bad5dd7836
update to httpclient-4.3.6
10 years ago
reger
c1ad265efd
remove not used accordion javascript call for facet navs
10 years ago
reger
e4316e2d74
skip creation of local var in proxyhandler.storetocache
10 years ago
reger
8a4a2929ae
upd NB project.xml to codec-1.9
10 years ago
sixcooler
9c6e3a6b1c
fix assertation-failure in version-string for Solr-4.10.2 by changing
...
the assert - hope that is ok
+ add forgotten NB-Projekt-changes
10 years ago
sixcooler
725b206fb4
update to solr-/lucene-4.10.2
10 years ago
Michael Peter Christen
ecdfb35f09
added long variables to debug output in index browser
10 years ago
Michael Peter Christen
5c97ecb30f
fix of bad query generation for search facets
10 years ago
Michael Peter Christen
95d87f00b3
fix for bad query generation in doublecheck in postprocessing
10 years ago
orbiter
72c2bc5189
fix for search in case where local peer has no local seed address in
...
portal mode
10 years ago
orbiter
5be352da99
Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
orbiter
a2b5cfb3cf
added reverse button to tables, by default on now (to see latest entries
...
first)
10 years ago
orbiter
fceac5d2d4
added (missing) Tables_p.xml for table xml api
10 years ago
orbiter
0fcd8097a3
removed unused options from BusyThreads
10 years ago
Michael Peter Christen
fe8b1d137d
emergency bugfix for 100% CPU in image drawing
10 years ago
Michael Peter Christen
92007e5d2d
more enhancements to posprocessing speed
10 years ago