admin
fbf8ddd32d
upgrade of jsoup 1.12.1 -> 1.14.2
3 years ago
Ian Smirlis
53518a91ab
In case of reload404, load only failed documents
3 years ago
Michael Peter Christen
4c889b7ff9
fixed build paths
3 years ago
Michael Peter Christen
683cac125f
updated bouncy castle 1.60 -> 1.69
3 years ago
Michael Peter Christen
e6a87e0426
enhanced crawler
...
a main problem when crawling is long waiting time cuased by crawl-delay
values from robots.txt entries. that attribute is not supported by
google and interpreted by yandex and bing in different ways. In large
crawls there is always one host which blocks the whole crawl with
extreme large values. YaCy now still obeys crawl-delay but limits them
to 10 seconds.
Additionally the blocking logic when loading new robots.txt was analyzed
and a deadlock was removed. Furthermore the construction of new queue
lists was redesigned and it was ensured that always a large list of
different hosts for host-balancing is provided for the loader.
3 years ago
Michael Peter Christen
e9c5e78868
replaced new Number(Number) with Number.instanceOf
...
to remove deprecation warnings for Java 9
3 years ago
Michael Peter Christen
9e13d77de4
removed call to class.finalize() because of deprecation in java 9
...
next: removal of finalize() implementation
after testing with assert false
3 years ago
Michael Peter Christen
9ef4503672
fixed some newInstance() warnings
...
.. by adding .getDeclaredConstructor()
3 years ago
Michael Peter Christen
82df012442
removed old lib
3 years ago
Michael Peter Christen
8a2adb2b15
upgraded commons-compress lib
...
cause: alert in
https://github.com/yacy/yacy_search_server/security/dependabot/pom.xml/org.apache.commons:commons-compress/open
3 years ago
Michael Peter Christen
9182b3dfca
enhanced default value
3 years ago
Michael Peter Christen
294d56d4a2
addressing better GC behavior after removing Xms with earlier heap increase strategy
3 years ago
Michael Peter Christen
3959d43a5c
fixed doku link
3 years ago
Michael Peter Christen
c4659f0fb0
removed Debian and Red Hat build process
...
as announced in
https://twitter.com/yacy_search/status/1414608643241152516
because of lack of community support for these kind of
distributions. We will still support
tarball, Windows, Mac and Docker releases.
3 years ago
Michael Peter Christen
73360ed52b
add gradle to gitignore
3 years ago
Michael Peter Christen
15b7461bc7
removed Xms java memory startup parameter
...
We will use the default value for now on.
This is much better for resource economy and fits better into a
container/docker/kubernetes strategy.
Furthermore, a small memory footprint is essential for the usage on
small devices like RaspberryPi.
3 years ago
admin
c3b3087077
gradle cleanup
3 years ago
admin
a13986d659
replaced maven with gradle
3 years ago
Michael Peter Christen
1d41380f0a
better support for mac-specific tray functions in java 9
3 years ago
Michael Peter Christen
4377bd2b70
fix for wrong crawlName construction
3 years ago
Michael Peter Christen
e81b770f79
enabled crawl starts with very large sets of start urls
...
i.e. 10MB large url list with approx 0.5 million start points
3 years ago
frankenstein91
4b73b3f9f2
docker has no latest-alpine
...
There is no yacy/yacy_search_server:latest-alpine on docker hub
3 years ago
Michael Peter Christen
c623a3252e
fix for jdk 14 bug
4 years ago
Michael Peter Christen
dbd211a1ad
removed/replaced reflection in memory tool
4 years ago
Michael Peter Christen
160f00e59e
removed reconfigure script which is seven years old any may not up to
...
standards of current password implementation.
See https://github.com/yacy/yacy_search_server/issues/409 as hint
4 years ago
Michael Peter Christen
1cdb21592b
added hazelcast and some modifications to align legacy YaCy with
...
YaCyGrid
4 years ago
Michael Christen
42ea2a1c6f
Merge pull request #405 from jfhs/jfhs/support-all-html-entities
...
Improve HTML entities support
4 years ago
Michael Christen
b2af745dd6
Merge pull request #404 from lnceballosz/master
...
NGI0 - Updating licensing aspects according REUSE
4 years ago
jfhs
10bddc2c2d
Decode HTML entities in all property values by default
4 years ago
jfhs
2135d259e3
Replace hardcoded html/xml entities with a file, support decoding all defined HTML entities
4 years ago
Michael Peter Christen
8f876a8c72
added concurrency to enhance indexing speed during json surrogate import
4 years ago
Michael Peter Christen
f8cbaeef93
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
4 years ago
Michael Peter Christen
a857e3d3d5
fix for json importer
4 years ago
sgaebel
7fecd859e5
fixes showing metadata from Searchresult, by removing defType=edismax
...
also removes defType=edismax from IndexBrowser, but still does not show
dates
4 years ago
sgaebel
1546232c94
adds ranking for multi document queries only
4 years ago
sgaebel
93b353d22d
does not boost or add fields for zero-row-queries (exists())
4 years ago
sgaebel
f16cd154f7
removes unused imports and variables
4 years ago
sgaebel
c69c462a15
replaces a expensive getLoadTimeURL() by exists()
...
refactors urlExists to getHarvestProcess as that is what it does
4 years ago
sgaebel
a5488ac8f5
uses edismax queries on query counts > 1 only
4 years ago
sgaebel
26223dc25a
replaces getLoadTime() by exists() with a simpler query
...
since solr-8.8.1 getLoadTime() causes a high cpu usage
4 years ago
sgaebel
8e4d014c06
removes useless SolrRequestInfo.clearRequestInfo(), avoids spamming the
...
log
4 years ago
sgaebel
88c6bc8cd7
adds missing solr lib: opentracing 0.33.0
4 years ago
Lina Ceballos
139b5a4033
improving license info in README
4 years ago
Lina Ceballos
a96752f5ab
adding SPDX license and copyright headers
4 years ago
Lina Ceballos
221038f16d
creating LICENSES directory
4 years ago
Michael Peter Christen
e18d0ef544
trying to set a higher priority to the process that is involved in index
...
export
4 years ago
Michael Peter Christen
c552a2845f
added new commons library (missed in latest commit)
4 years ago
Michael Peter Christen
8b4394a6c5
fixes for solr 8.8.1 migration
...
- replace new guava 30 with older 25 because that is the correct
dependency for solr 8.8.1. The newer one did actually not work!
- index will be crated in a DATA/INDEX/freeworld/SEGMENTS/solr_8_8_1
subfolder. The older solr_6_6 index is not touched but also not
migrated. The index starts with fresh (empty) content.
- Older indexes must be migrated by hand (export/import) so far until a
better solution is found.
- Large schema adoptions for lucene 8.8.1
4 years ago
Michael Peter Christen
3befaaf4f1
reformatting pom.xml to make it easier to update it with recent library versions
4 years ago
Michael Christen
dffe9e1c23
Merge pull request #402 from SebastianoPistore/junitUpdate
...
Workaround for CVE-2020-15250
4 years ago