Michael Peter Christen
092d97d7ac
when importing vocabulary csv files, accept also files without semicolon
...
and truncate quotes from literals
10 years ago
Michael Peter Christen
ee9ec40048
added hints to ranking to make ranking boosts using vocabularies easier
10 years ago
Michael Peter Christen
70f03f7c8e
do not cache search requests to Solr if the result is used for
...
doublechecking. If a double-check comes from cached results the
doublecheck fails.
10 years ago
Michael Peter Christen
a0b84e4def
use a LinkedHashMap for factes to maintain facet order as given by solr
10 years ago
Michael Peter Christen
0dc6e0a5f2
added option to enrich vocabularies with synonyms from synonym database
10 years ago
Michael Peter Christen
6a2a669db4
added loading of the synonyms file from addon/synonyms into the
...
knowledge loader
10 years ago
Michael Peter Christen
fdba8e2fa0
fix for 2-day network stats table: showing 48 instead of 24 hours from
...
peer history
10 years ago
Michael Peter Christen
ec9d021568
added option in vocabulary editor to import CSV files with different
...
encodings (preselected windows-type character encoding which is typical
for CSV files). Fixed also other problems with character encoding in
dictionary files. Automatically generated vocabularies are now also
noted in the API steering.
10 years ago
reger
b558433211
adjust tag cloud font size calculation
...
to limit max font size to ~ TOPWORDS_MAXSIZE
10 years ago
Michael Peter Christen
0550b54d56
added fix to postprocessing: avoid caching of postprocessing collection
...
to always get fresh lists of documents. This is necessary since the
postprocessing changes the same documents which the
postprocessing-collection query selects.
10 years ago
Michael Peter Christen
68e8039fd1
added high-precision scheduler for API processes. This allows also to
...
make the execution in dependency of available RAM or CPU load. The
default value for CPU load is 4.0 and the check runs once a minute.
10 years ago
Michael Peter Christen
0a879c98e7
added new 'firstSeen' database table and necessary data structures which
...
hold a date for each URL to record when a url was first seen. This is
then used to overwrite the modification date for urls upon recrawl in
case that the first-seen date is before the latest document date. This
behaviour is necessary due to the common behaviour of content management
systems which attach always the current date to all documents. Using the
firstSeen database it is possible to approximate a real first document
creation date in case that the crawler starts frequently for the same
domain. As a result the search results ordered by date have a much
better quality and the usage of YaCy as search agent for latest news has
a better quality.
10 years ago
Michael Peter Christen
487a733c99
fix for catchall handling in search
10 years ago
sixcooler
33b0234454
added a input-field for setting 'fileHost'
...
Set this to avoid error-messages like 'proxy use not allowed / granted'
on accessing your Peer by its hostname.
10 years ago
Michael Peter Christen
1db476c67e
fix for bad table iteration
10 years ago
Michael Peter Christen
e05b7332b9
html fix
10 years ago
reger
c1ad265efd
remove not used accordion javascript call for facet navs
10 years ago
Michael Peter Christen
ecdfb35f09
added long variables to debug output in index browser
10 years ago
Michael Peter Christen
95d87f00b3
fix for bad query generation in doublecheck in postprocessing
10 years ago
orbiter
a2b5cfb3cf
added reverse button to tables, by default on now (to see latest entries
...
first)
10 years ago
orbiter
fceac5d2d4
added (missing) Tables_p.xml for table xml api
10 years ago
orbiter
dbafd4865e
enhanced debug code in host browser
10 years ago
Michael Peter Christen
8f6587e87b
fix for broken protocol navigation
11 years ago
Michael Peter Christen
5c962dd009
better scaling of network statistic graphs
11 years ago
orbiter
3ffe19b85c
replaced old /api/table_p.xml servlet with /Tables_p.xml to avoid double
...
code
11 years ago
Michael Peter Christen
b4585e9546
added new index size history image in /Status.html page
11 years ago
Michael Peter Christen
9aebbbebc0
added network history in /Network.html?page=5
11 years ago
Michael Peter Christen
26279b0993
added debug code for statistics about document attributes related to
...
domains
11 years ago
reger
d65e3f2b53
RankingSolr: display only available or configured boost fields
11 years ago
Michael Peter Christen
4e56d79fc8
replaced input text field with text field for index deletion with query
...
and replaced GET with POST method. This should make it possible to
tubmit here very large queries for deletion.
11 years ago
orbiter
6f707b4305
removed spaces in seedlist.xml to reduce data
11 years ago
orbiter
78c9d31388
fix for bad json
11 years ago
Michael Peter Christen
8098a86f1d
ipv6 fix for api /yacy/seedlist.[json|xml], multiple IPs are now
...
attached to the seed info. API clients must be adopted. Documentation
will be fixed in
http://www.yacy-websuche.de/wiki/index.php/Dev:APIseedlist
Also added a new retrieval option for seeds, they can now be retrieved
by their name with the get parameter name=<name>
11 years ago
Michael Peter Christen
07c5b57953
removed warnings
11 years ago
Michael Peter Christen
509eba2484
automatically zoom to location/POI
11 years ago
orbiter
fa2ad101ec
enhanced graphics computation (avoiding long string parsing for colours)
11 years ago
orbiter
ef813cec91
added proper copyright notice to OSM tiles presented at the search
...
result page
11 years ago
Michael Peter Christen
1269e77dfa
enhanced location search
11 years ago
Michael Peter Christen
75b5f24be4
make browsing of file://z: - paths in index browser easier - this will
...
now show the root paths on a shared drive
11 years ago
Michael Peter Christen
8ac3e9f890
fix for api icon in yacysearch_location.html
11 years ago
Michael Peter Christen
a1dd0ae62c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger
f5967dfedf
add filter to citation page and a on/off button
...
to display only sentences with citations,
while maintaining the sentence number.
Make the filtered list the default in search result citation link
11 years ago
Michael Peter Christen
f818f84adb
more ipv6 fixes
11 years ago
Michael Peter Christen
2c2b50e65d
refactoring (class name should start with uppercase letter)
11 years ago
Michael Peter Christen
14385057c2
added also the NetworkHistory servlet...
11 years ago
Michael Peter Christen
d8beafba3a
fix for values in CrawlProfileEditor table and xml; now the full profile
...
is available in the xml.
11 years ago
Michael Peter Christen
ec95dfa2e6
fixed crawl profile xml result which did not show the correct crawl
...
status.
11 years ago
Michael Peter Christen
8c1a89cb34
added another decoration flag to switch off network graphics in crawler
...
monitor and index browser: decoration.grafics.linkstructure
Please set this to false to remove the graphics from the interface.
11 years ago
Michael Peter Christen
764e4ed673
fixed appearance of RSS icon on search result page
11 years ago
Michael Peter Christen
9b1958e8ca
more ipv6 bugfixes
11 years ago