Michael Peter Christen
f8cbaeef93
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
4 years ago
Michael Peter Christen
a857e3d3d5
fix for json importer
4 years ago
sgaebel
f16cd154f7
removes unused imports and variables
4 years ago
sgaebel
a5488ac8f5
uses edismax queries on query counts > 1 only
4 years ago
sgaebel
26223dc25a
replaces getLoadTime() by exists() with a simpler query
...
since solr-8.8.1 getLoadTime() causes a high cpu usage
4 years ago
Michael Peter Christen
e18d0ef544
trying to set a higher priority to the process that is involved in index
...
export
4 years ago
Michael Peter Christen
8b4394a6c5
fixes for solr 8.8.1 migration
...
- replace new guava 30 with older 25 because that is the correct
dependency for solr 8.8.1. The newer one did actually not work!
- index will be crated in a DATA/INDEX/freeworld/SEGMENTS/solr_8_8_1
subfolder. The older solr_6_6 index is not touched but also not
migrated. The index starts with fresh (empty) content.
- Older indexes must be migrated by hand (export/import) so far until a
better solution is found.
- Large schema adoptions for lucene 8.8.1
4 years ago
Al Sutton
8ade8b8775
Remove forced clear to match new behaviour in 2da71c2a40
4 years ago
Al Sutton
09695fc6d3
Update exceptions to match updated API
4 years ago
Al Sutton
69014a701e
Update API Usage
4 years ago
Michael Peter Christen
198826c362
added network scanner process to discover all YaCy peers in the intranet
...
this will be used to wire YaCy peers in a kubernetes cluster
4 years ago
Michael Peter Christen
5a7f12a9c1
allow network scans for non-standard http/https ports
4 years ago
Michael Peter Christen
d0abb0cedb
enabling all crawl profiles in all network modes
...
also: increased default internet crawl speed to
4 urls/s/host
4 years ago
Michael Peter Christen
43a9f4f574
updated solr 6.6.6 -> 7.7.3
...
dropped GSA support (GSA API is still in YaCy Grid)
The 6.6.6 solr index works without migration also with 7.7.3
4 years ago
Michael Peter Christen
eea2d71851
prevent creation of auth schema factories every time a servlet is called
4 years ago
Michael Peter Christen
787fec0658
reduced complexity - removed concurrency in sort
4 years ago
Michael Peter Christen
36e616271b
do better documentation on how to set a default password
4 years ago
Michael Peter Christen
df2bf9ef28
try to fix maven build error
4 years ago
Michael Peter Christen
7947baeb49
removed all remaining deprecation warnings
4 years ago
sgaebel
4a495df63a
removes some deprecation-warnings
5 years ago
sgaebel
df9ea0a42a
removes some warnings: unused imports, params
5 years ago
Michael Peter Christen
e0ad8ca9da
replaced json library from JSON.org with libandroid-json-java
...
This fixes https://github.com/yacy/yacy_search_server/issues/347
5 years ago
Michael Christen
25227676ae
removed some warnings
6 years ago
luccioman
d16bc99835
Added "Show Metadata" links to the ViewFile.html links mode
...
To conveniently follow parsed links in the file viewer
6 years ago
luccioman
a5771b1f14
Made SNI extension user configurable without the need for server restart
...
TLS Server Name Indication (SNI) extension activation can now be
configured with the new Settings_p.html?page=httpClient administration
page.
SNI extension is also now enabled by default, as in 2019 the
unrecognized_name(112) alert is more properly handled by major web
servers TLS implementations, following the RFC 6066 standard.
Related YaCy issues : #153 #189 and #272
JDK 1.7 bug :
https://bugs.java.com/bugdatabase/view_bug.do?bug_id=7127374
Apache httpd issue :
https://bz.apache.org/bugzilla/show_bug.cgi?id=56241
RFC 6066 : https://tools.ietf.org/html/rfc6066#section-3
6 years ago
luccioman
5b7e41202a
Added Solr GSA writer support for responses from remote instances
6 years ago
luccioman
4d8a948455
Properly close PDF snapshots loaded with pdfbox library
6 years ago
luccioman
74e6d6e984
Added Solr GrepHTML writer support for responses from remote instances
6 years ago
luccioman
5e6501974d
Added Solr snapshots writer support for responses from remote instances
6 years ago
luccioman
5e9a08355a
Improved logging for federated search
...
- Do not use spaces in logger identifier name so the log level can be
configured in yacy.logging
- Hold the logger instance to avoid the logging system to look for it
from its name at each appended log message
6 years ago
luccioman
9782a98a9c
Added the possibility to customize facets sort type and direction
...
Previously search navigators/facets elements were sorted only by counts.
Now from the ConfigSearchPage_p.html admin page, sort direction
(ascending/descending) and type (on counts or labels) can be customized
independently for each navigator.
6 years ago
sgaebel
c2398fd890
remove warnings: 'Statement unnecessarily nested within else clause'
6 years ago
luccioman
08ea0b0397
Added a configurable timeout to wkhtmltopdf calls for pdf snapshots
...
Necessary to prevent blocking the indexing workflow when some
wkhtmltopdf renderings fail without terminating
6 years ago
luccioman
73a6e45524
Extended detection of external tools used for Snapshots generation
...
This enable detecting wkhtmltopdf and Imagemagick convert executables
when they are at system Path in addition to common installation paths.
6 years ago
Michael Peter Christen
c347e7d3f8
Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
6 years ago
Michael Peter Christen
848e9304d9
evil bots may crawl harder
6 years ago
luccioman
e85f231bdf
Fixed termination of Host browser and link structure Solr query threads
...
On some conditions (especially when reaching timeout), concurrent Solr
query tasks used by the /HostBrowser.html and /api/linkstructure.json
never terminated, thus leaking resources, as reported by @Vort in issue
#246
6 years ago
luccioman
a83a56473e
Added suport for PDF snapshots generation when running on MS Windows
6 years ago
luccioman
8852c97cee
Added basic styling for cleaner rendering of missing image snapshots
...
For the output of the Solr snapshots writer
6 years ago
luccioman
50b6edfcf5
Updated Solr snapshots writer for a cleaner html head
7 years ago
luccioman
f366f43d6b
Made snapshots size customizable in Solr snapshots response writer
7 years ago
luccioman
61c337f29a
Decode blacklist entries for easier edition of non ascii chars
...
Not using the JDK URLDecoder.decode() function, as it strips '+'
characters when they occur after '?' (both characters having regular
expression semantics when used in blacklist path patterns)
7 years ago
luccioman
ed93221fa1
Improved normalization of blacklist path patterns having non ascii chars
...
Normalize blacklist path patterns using percent-encoding, at pattern
edition in web interface and at loading from configuration files.
Fixes issue #237
7 years ago
luccioman
db7ad76366
Improved support for Java logs file pattern options
...
- support of "%h" and "%t" pattern components
- more proper initialization of file handler when the data folder is not
the default one, notably to prevent a non blocking but ugly error stack
trace reported by the log manager at startup with that kind of setup
7 years ago
luccioman
9b1c87033b
Fixed logs folder checking and creation
...
Previously, if YaCy log folder was for example at
`/home/user/yacy/DATA/LOG`, because of improper truncation of log path,
an unnecessary directory creation was atempted at `/home/us`.
7 years ago
luccioman
d03c098b54
Removed deprecated warning comments about imports and Debian installer
...
Deprecated by commit be5d3a1066
, as
classpath is now defined in yacycore.jar Manifest file.
7 years ago
luccioman
4ee14ff3c5
Fixed NullPointerException case on malformed crawl queue folder name
7 years ago
luccioman
373edf9eac
Adjusted yjson Solr writer to support responses from an external Solr
...
Worked previously only with responses from YaCy embedded Solr, now able
to render the response when YaCy is configured to use an external Solr
index.
7 years ago
luccioman
87bd17b1cf
Simplified a little bit the RSS OpenSearch Solr writer
7 years ago
luccioman
dc49ca9c27
Fixed a NPE case on the Solr OpenSearch response writer
...
Occurred when omitHeader parameter is set to true
7 years ago