1500 Commits (4967b2e0e828e0ffcfc271f1d63bb06384d54e82)

Author SHA1 Message Date
Michael Peter Christen 8f876a8c72 added concurrency to enhance indexing speed during json surrogate import
4 years ago
Michael Peter Christen f8cbaeef93 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
4 years ago
Michael Peter Christen a857e3d3d5 fix for json importer
4 years ago
sgaebel c69c462a15 replaces an expensive getLoadTimeURL() by exists()
4 years ago
sgaebel 26223dc25a replaces getLoadTime() by exists() with a simpler query
4 years ago
Michael Peter Christen 8b4394a6c5 fixes for solr 8.8.1 migration
4 years ago
Al Sutton 69014a701e Update API Usage
4 years ago
Michael Peter Christen 13a2e6dc6e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
4 years ago
Michael Peter Christen 0ae8ccf657 Make it possible to set an empty password disabling the authentication
4 years ago
Michael Peter Christen 96592a10cf added option to set yacy configuration values using environment
4 years ago
Michael Peter Christen 198826c362 added network scanner process to discover all YaCy peers in the intranet
4 years ago
Michael Peter Christen 907f121d0c do not overwrite PW with random PW
4 years ago
Michael Peter Christen 3e6a1e0a49 fixed surrogate process counter
4 years ago
Michael Peter Christen baad56d83d beautified default peer names
4 years ago
Michael Peter Christen 43a9f4f574 updated solr 6.6.6 -> 7.7.3
4 years ago
Michael Peter Christen c0d9a3e9a7 turned HostBrowser into a admin-only page, now called IndexBrowser
4 years ago
Michael Peter Christen 6271e9122c javadoc fix
4 years ago
Michael Peter Christen 52228cb6be added a gc to cleanup process (once every 10 minutes)
4 years ago
Michael Peter Christen 22841ffbf1 creating a threaddump during every cleanup process
4 years ago
sgaebel 3431f91db9 removes unused 'unused' tokens
5 years ago
sgaebel fc03c4b4fe removes some warning and unused objects
5 years ago
sgaebel 4a495df63a removes some deprecation-warnings
5 years ago
sgaebel dd9d4b1188 replace org.junit.Assert.assertThat by
5 years ago
Michael Peter Christen e0ad8ca9da replaced json library from JSON.org with libandroid-json-java
5 years ago
Michael Christen cfa27d2fd5 fixed links
6 years ago
luccioman 6b45cd5799 New optional crawl filter on the URL a doc must match to crawl its links
6 years ago
luccioman a5771b1f14 Made SNI extension user configurable without the need for server restart
6 years ago
luccioman e90405b6f0 Support parsing audio URLs without file extension
6 years ago
luccioman a8316c79da Allow JS resorting of search results by unauthenticated users
6 years ago
luccioman 0ab2b49c31 Made /yacysearch access rate limitations user configurable
6 years ago
luccioman 9782a98a9c Added the possibility to customize facets sort type and direction
6 years ago
sgaebel c2398fd890 remove warnings: 'Statement unnecessarily nested within else clause'
6 years ago
luccioman 08ea0b0397 Added a configurable timeout to wkhtmltopdf calls for pdf snapshots
6 years ago
luccioman e85f231bdf Fixed termination of Host browser and link structure Solr query threads
7 years ago
luccioman fcf6b16db4 Added new crawler attribute for finer control over Media Type detection
7 years ago
luccioman 54fbe166ba Updated pdf cache clear steps consistently with current pdfbox version
7 years ago
luccioman bdafb14336 Removed redundant synchronization lock on network switch function
7 years ago
luccioman dcad393fe5 Fixed exceeding max size of failreason_s Solr field on large link list
7 years ago
luccioman f467601561 Properly lock solrInstances for reboot and restoration of embedded Solr
7 years ago
luccioman 2bdd71de60 Added server side columns sorting on the Process Scheduler table
7 years ago
luccioman e97580dfc7 Fixed unsafe concurrent access to generic SimpleDateFormat instances
7 years ago
luccioman cced94298a Added a new crawler document filter type using Solr syntax
7 years ago
luccioman 40e8c7b89b Use the heavy ConcurrentUpdateSolrClient only when necessary
7 years ago
luccioman b5dc1f376f Made outgoing pools max total connections user configurable
7 years ago
luccioman 387d646c0e Added gzip compression of responses returned to user-agents accepting it
7 years ago
luccioman ee6670fb8f Use a common pooled http connection manager for remote solr instances
7 years ago
luccioman 35826a3091 Added a search page customization setting to display or not favicons
7 years ago
luccioman fa4399d5d2 Small perf improvement : initialize threads names early when possible
7 years ago
luccioman f511e16d50 Prevent duplication of Solr query highlight fields parameters
7 years ago
luccioman e357ade47d Reduced memory footprint of text snippet extraction
7 years ago
luccioman e115e57cc7 Reduced text snippet extraction processing time.
7 years ago
sgaebel 4b79851e12 corrected icons_sizes_sxt to SolrType.string
7 years ago
luccioman 3b89c232db Easier tracking of longest text snippets initializations
7 years ago
luccioman 3c4344cb12 Fixed text snippet max init time statistic rendering
7 years ago
reger a8234b7ea7 Make sure for image resource url enabled index image pixel size fields are filled
7 years ago
luccioman c35d0568b6 Support for preferred https in peers communication on more operations
7 years ago
luccioman a3ec7a7a5f Added analysis optional setting to compute statistics on text snippets
7 years ago
luccioman 0d34034f17 Ensure an embedded Solr is available for Solr dump/restore operations
7 years ago
luccioman d92b191942 Ensure no remote Solr is attached before "Shut Down and Re-Start Solr"
7 years ago
luccioman 26d8ad591c Adjusted Solr select servlet output when using an external Solr only
7 years ago
luccioman 69690c13a0 Optionally allow external Solr server with self-signed certificate
7 years ago
luccioman 2fd4d05e2f Added a shared Java constant for setting key server.servlets.called
7 years ago
luccioman ba9cd14516 Removed hard-coded patch for Solr 5.0 on ranking boost function
7 years ago
luccioman fb3032c530 Added a crawl filtering possibility on documents Media Type (MIME)
7 years ago
luccioman c3ff50c17a Updated the list of audio file formats supported by the audioTagParser
7 years ago
luccioman 1b90479a76 Added missing vocabulary navigator increment on results from RWI
7 years ago
luccioman 3a973dbb23 Removed unused import
7 years ago
reger 87077b8fb6 Adjust and move Language Navigator to be member of the navigator plugin
7 years ago
luccioman 0cdee4e26a Fixed loss of "meanCount" search param when using facets or page buttons
7 years ago
luccioman 117a859879 Do not clear all search modifiers when unselecting one modifier.
7 years ago
luccioman 33593c22e9 Fixed loss of other modifiers on keywords/tags search navigation links
7 years ago
luccioman a9dc0874c0 Remove old query terms from search results suggestions links.
7 years ago
luccioman 9412881230 Added basic support for autotagging microdata annotated item types.
7 years ago
luccioman 929e0d6eae Replaced improper ByteBuffer.equals() implementation by Arrays.equals()
7 years ago
luccioman 9ddf92d143 Removed unnecessary reflection usage for workflow tasks.
7 years ago
luccioman 9624516bf8 Refresh recrawl job profile threshold date like other default profiles
7 years ago
luccioman d47afe6fab Use a constant for crawler reject reason prefix with specific processing
7 years ago
luccioman 8a4ea1c11e Added UI switch to control content domain constraint per search request
7 years ago
reger f8071ac8ae Make TokenizedStringNavigator (used for keyword search facet) active
7 years ago
luccioman e6907fdab3 Added optional search parameter/setting to control content domain filter
7 years ago
luccioman 09c4ee56a7 Added optional https support for remote crawl and profile operations
7 years ago
luccioman 5db1c9155a Do locale independent case conversion on hosts, schemes, and file exts.
7 years ago
luccioman 1c4803e40a Enable optional https support for /yacy/transferURL API calls.
7 years ago
luccioman 17e004599d Started implementing optional https preference for protocol operations
7 years ago
Michael Peter Christen 25573bd5ab added a crawl filter based on <div> tag class names
7 years ago
luccioman a4494d6e01 Improved support for internationalized domain names on "site:" modifier
7 years ago
luccioman d07006bac4 Do locale independent case conversion on "filetype:" query modifier.
7 years ago
luccioman 8fbf25d1ed Made "site:" query modifier case insensitive.
7 years ago
luccioman 867388e05b Refactored 'site:' query modifier parsing into a dedicated function.
7 years ago
luccioman c9d80b5b77 Prefer fine URL match over approximate URL mask regex on final filtering
7 years ago
luccioman 0a120787e3 Improved accuracy of URLs search filters : protocol, tld, host, file ext
7 years ago
luccioman e07ef1b610 Apply tld query modifier on Solr host_s mandatory field.
7 years ago
luccioman 478e92deff Fixed url mask filter generated when protocol modifier is not null
7 years ago
luccioman 29de4a65d7 Refactored url mask filter build from query modifiers
7 years ago
luccioman f01aac31fd Made possible to use https for remote search on peers with SSL enabled.
7 years ago
luccioman 46f37e38dc Customized Threads with generic name for easier monitoring.
8 years ago
luccioman 8e732d437c Enable HTTP Digest authentication for non admin users.
8 years ago
luccioman af198b990b Added an optional login link/status to the search public top nav bar.
8 years ago
luccioman 8303e15419 Reduced number of search navigators refresh requests in JS resort mode
8 years ago
luccioman dbff7b14fc Add a configurable limit to tags initially displayed in search results
8 years ago