Commit Graph

4293 Commits (59181e8009f98371f3294ea4f83cb345d40dc60a)

Author SHA1 Message Date
luccioman 54fbe166ba Updated pdf cache clear steps consistently with current pdfbox version
7 years ago
luccioman 685122363d Added a parser for XZ compressed archives.
7 years ago
luccioman 4ee14ff3c5 Fixed NullPointerException case on malformed crawl queue folder name
7 years ago
luccioman 21ad9435ec Fixed crawl queue folder naming for IPv6 hosts on MS Windows filesystems
7 years ago
luccioman 8a29551c54 Upgraded the OpenGeoDB dump URL
7 years ago
luccioman 373edf9eac Adjusted yjson Solr writer to support responses from an external Solr
7 years ago
luccioman 87bd17b1cf Simplified a little bit the RSS OpenSearch Solr writer
7 years ago
luccioman dc49ca9c27 Fixed a NPE case on the Solr OpenSearch response writer
7 years ago
luccioman f4267ed247 Made Solr OpenSearch RSS writer compatible with external Solr index
7 years ago
luccioman b1410f593a Fixed stylesheet relative URLs rendering in Solr html writer
7 years ago
luccioman 89c59814da Improved rendering of the Solr api relative url in the html writer
7 years ago
luccioman bf4f320b16 Optionally render the response header when using the Solr html writer
7 years ago
luccioman 313204ae2c Override qf and df Solr params with defaults only when they are not set
7 years ago
luccioman bdafb14336 Removed redundant synchronization lock on network switch function
7 years ago
luccioman d5f44ea216 Removed unnecessary synchronization lock from serverSwitch constructor
7 years ago
luccioman dcad393fe5 Fixed exceeding max size of failreason_s Solr field on large link list
7 years ago
luccioman f467601561 Properly lock solrInstances for reboot and restoration of embedded Solr
7 years ago
luccioman 9630f81306 Fixed small unnecessary lines of code
7 years ago
luccioman 876bcd2f54 Fixed useless comparison between int parameter and Long.MAX_VALUE
7 years ago
luccioman c726154a59 Fixed removal of URLs from the delegatedURL remote crawl stack
7 years ago
luccioman 2bdd71de60 Added server side columns sorting on the Process Scheduler table
7 years ago
luccioman bb51555830 Removed remaining unsafe accesses to SimpleDateFormat instances.
7 years ago
luccioman f895745e1c Removed more unsafe concurrent accesses to SimpleDateFormat instances.
7 years ago
luccioman e97580dfc7 Fixed unsafe conccurent access to generic SimpleDateFormat instances
7 years ago
luccioman 8811700e2e Upgraded Jetty dependency from 9.4.9 to 9.4.11
7 years ago
luccioman d53c33e4ef Fixed potential infinite loop case (does not occur in current code base)
7 years ago
luccioman a15ac8e0ca Made CrawlProfile loading tolerant to malformed json string attribute
7 years ago
luccioman a715bb7876 Fixed rendering of solr mustNoMatch value on CrawlProfileEditor_p.xml
7 years ago
luccioman 0b302c5004 Do not block whole server startup on persisted crawl profile load error
7 years ago
luccioman 4d9aa4ed1e Fixed default crawl profile solr mustnotmatch query from previous commit
7 years ago
luccioman cced94298a Added a new crawler document filter type using Solr syntax
7 years ago
Michael Christen e0dc632020 removed transformer
7 years ago
luccioman 9bc7b6c39d Allow edtion of scheduled next execution dates for finer control
7 years ago
luccioman 40e8c7b89b Use the heavy ConcurrentUpdateSolrClient only when necessary
7 years ago
luccioman bd4cfeda3f Add a max acceptable limit to the size of Solr responses on p2p search
7 years ago
luccioman de4ea95687 Consistently allow gzip compression of remote Solr responses
7 years ago
luccioman cea8187161 Reuse expired connections evictors threads provided by apache and solr
7 years ago
luccioman b5dc1f376f Made outgoing pools max total connections user configurable
7 years ago
luccioman 387d646c0e Added gzip compression of responses returned to user-agents accepting it
7 years ago
luccioman a7a4ba3287 Apply remote solr configured timeout on getting connection from pool
7 years ago
luccioman ee6670fb8f Use a common pooled http connection manager for remote solr instances
7 years ago
luccioman d28f9ba0f6 Removed use of deprecated ConcurrentUpdateSolrClient constructor
7 years ago
luccioman 8a749aa5ad Trace level log message for monitoring remote solr response times
7 years ago
luccioman 35826a3091 Added a search page customization setting to display or not favicons
7 years ago
luccioman 0082b5ab2a Added missing default Solr http client connection timeout initialization
7 years ago
luccioman fa4399d5d2 Small perf improvement : initialize threads names early when possible
7 years ago
luccioman 84d82bfdd7 Adjusted suggestions timeout management
7 years ago
luccioman 65854bcb22 Fixed NullPointerException when omitHeader=true on external Solr server
7 years ago
luccioman c4d984cec8 Fixed Solr response header duplication when requesting external Solr
7 years ago
luccioman 124cc24aa3 Properly handle embedded Solr partial results
7 years ago
luccioman 3ce44cf250 Fixed largest snippet get : don't reject ones starting with a space char
7 years ago
luccioman f511e16d50 Prevent duplication of Solr query highlight fields parameters
7 years ago
luccioman e357ade47d Reduced memory footprint of text snippet extraction
7 years ago
luccioman e115e57cc7 Reduced text snippet extraction processing time.
7 years ago
sgaebel 4b79851e12 corrected icons_sizes_sxt to SolrType.string
7 years ago
luccioman 3b89c232db Easier tracking of longest text snippets initializations
7 years ago
luccioman 3c4344cb12 Fixed text snippet max init time statistic rendering
7 years ago
reger a8234b7ea7 Make sure for image resource url enabled index image pixel size fields are filled
7 years ago
luccioman e67df103b5 Removed more remaining uses of deprecated Seed.getIP() function.
7 years ago
luccioman addd18c993 Removed some remaining uses of deprecated Seed.getIP()
7 years ago
luccioman c35d0568b6 Support for preferred https in peers communication on more operations
7 years ago
luccioman e914d17aca Updated call to function deprecated since commons-codec version 1.11
7 years ago
luccioman a3ec7a7a5f Added analysis optional setting to compute statistics on text snippets
7 years ago
luccioman 1889d484de Added Solr HTML writer support for responses from remote instances
7 years ago
luccioman 2af3bf79c7 Improve rendering of remote Solr admin URLs
7 years ago
luccioman bb74de7d59 Removed unnecessary "/admin" suffix from remote Solr instance admin URL
7 years ago
luccioman 0d34034f17 Ensure an embedded Solr is available for Solr dump/restore operations
7 years ago
luccioman d92b191942 Ensure no remote Solr is attached before "Shut Down and Re-Start Solr"
7 years ago
luccioman 26d8ad591c Adjusted Solr select servlet output when using an external Solr only
7 years ago
luccioman 69690c13a0 Optionally allow external Solr server with self-signed certificate
7 years ago
luccioman b882f85900 Fixed NPE case in Solr select servlet on external Solr only setup
7 years ago
luccioman 2fd4d05e2f Added a shared Java constant for setting key server.servlets.called
7 years ago
luccioman ba9cd14516 Removed hard-coded patch for Solr 5.0 on ranking boost function
7 years ago
luccioman fb3032c530 Added a crawl filtering possibility on documents Media Type (MIME)
7 years ago
luccioman e45afedee4 Added support for enclosures (media links) to the RSS loader
7 years ago
luccioman aaefd5219c Reduce log verbosity of RSS loader on feed items with no link
7 years ago
luccioman cf62b571bd Added RSS reader support for `enclosure` feed item sub element.
7 years ago
luccioman e5f5de0fc7 Added some JavaDoc to the RSSMessage class.
7 years ago
luccioman 0d7625ecfb Handle Solr fields restrict and alias in YaCy html and exml writers
7 years ago
luccioman 3da2739bbd Parse and index more common audio metadata text tag fields.
7 years ago
luccioman 846aba00fa Added parsing of URLs eventually present in audio metadata tags
7 years ago
Michael Peter Christen 187075b878 added nav filter
7 years ago
luccioman bcbd0ae1a4 Enabled partial parsing of audio resources.
7 years ago
luccioman fda0189613 Updated audio file extensions with ones recently added to audioTagParser
7 years ago
luccioman 978e2be95b Let a chance for other parsers on audioTagParser error
7 years ago
luccioman 9e5846a26e Small fix on svg parser error message
7 years ago
luccioman 11611dbdcf Reuse existing File copy function to handle audio parser tmp files
7 years ago
luccioman f77f8f40f9 Factored audio parser tag processing
7 years ago
luccioman 9a7a353d0e Removed some unnecessary intermediate list creation on array copy.
7 years ago
luccioman fb6457f5bc Fixed NPE case when on audio resource parsed with null tag
7 years ago
luccioman c3ff50c17a Updated the list of audio file formats supported by the audioTagParser
7 years ago
luccioman 1b90479a76 Added missing vocabulary navigator increment on results from RWI
7 years ago
luccioman 46c9da6428 Allow creation of vocabularies from remote CSV file URLs.
7 years ago
luccioman 17c7a85f18 Make StreamResponse usable in Java try-with-resources statements
7 years ago
luccioman b67742336e Provide user interface messages on vocabulary creation read/write errors
7 years ago
luccioman 3e8dd90211 Use https rather than http in links and queries to openstreetmap.org
7 years ago
luccioman 3a973dbb23 Removed unused import
7 years ago
luccioman e9527cd0e5 Reuse the same Pattern instance when matching multiple key/values
7 years ago
luccioman dbf4c1cd76 Improved blacklist entries editing operations :
7 years ago
reger 87077b8fb6 Adjust and move Language Navigator to be member of the navigatior plugin
7 years ago