4000 Commits (5216c681a9971b921aea8c041106df8b2102055b)

Author SHA1 Message Date
luccioman 5216c681a9 Added HTML parser support for maximum content bytes parsing limit
7 years ago
luccioman 4aafebc014 Merge pull request #122 from Scarfmonster/patch-1
7 years ago
luccioman 651fad6da5 Added RSS parser support for maximum content bytes parsing limit
7 years ago
luccioman 452a17a8d5 Finer control on bounded input streams with custom stream implementation
7 years ago
luccioman f8f1959ebb Added parsing within bounds implementation to the generic parser.
7 years ago
luccioman e0f400a0bd Support trying multiple parsers even when streaming on large resources.
7 years ago
luccioman 1e84956721 Support loading local files with a per request specified maximum size.
7 years ago
luccioman f369679d1c Fixed read/copy on input streams reading sometimes less than expected.
7 years ago
luccioman bf55f1d6e5 Started support of partial parsing on large streamed resources.
7 years ago
luccioman 90a7c1affa HTML parser : removed unnecessary remaining recursive processing
7 years ago
reger e6e20dab52 upd to Jetty 9.4.6.v20170531
7 years ago
luccioman dcc56318bb Made remote search max system load limits configurable from UI.
7 years ago
reger ddd13b776d Add keyword constraint to rwi query result filter
7 years ago
luccioman e82eaee4b6 Apply consistent behavior on HTTP resource size exceeding limit.
7 years ago
luccioman 0b75e92ac2 Do not wrap unnecessarily loader IOExceptions in IOExceptions
7 years ago
luccioman 433bdb7c0d Respect maxFileSize limit also when streaming HTTP and when relevant.
7 years ago
luccioman 9b1bb2545e Refactored plain-text URLs detection implementation.
7 years ago
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale.
7 years ago
luccioman 286f3018bd Made mime type and extension normalization locale independent.
7 years ago
luccioman 319231a458 Added a generic XML parser, able to parse elements text and URLs.
7 years ago
Ryszard Goń 3cedbbd4ed Wrong password was removed after the SSL certificate import
8 years ago
luccioman 64cec2790d Improved character encoding detection from Content-Type header
8 years ago
luccioman 0487336ec3 Prevent integer overflow in table statistics and use strong typing
8 years ago
luccioman d2a4a27f52 Improved stream-oriented parsing entering conditions.
8 years ago
luccioman 9dd790087d Added HT Cache basic statistics (hit rate)
8 years ago
luccioman 5fdd5d16b1 Use volatile to ensure concurrent threads use up to date property value
8 years ago
luccioman 28b451a0b3 Made Cache compression level and lock timeout user configurable
8 years ago
luccioman a7394b479b Limit the synchronization blocking time on some Cache operations.
8 years ago
Michael Peter Christen c94a8c76bd re-added solr synchronization hack
8 years ago
Michael Peter Christen 6fe735945d migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8
8 years ago
luccioman ce89492319 Ensure system resource release by closing document stream.
8 years ago
luccioman 8399275142 Properly close file output streams even on exceptions scenarios.
8 years ago
luccioman 4e4dc6c4e5 Removed unnecessary finalize implementation.
8 years ago
luccioman a04feac064 Ensure file input streams proper closing in both success and failures
8 years ago
luccioman d98c04853d Ensure proper closing of file input streams.
8 years ago
luccioman c53c58fa85 Unsure closing ChunkIterator stream in every possible use case.
8 years ago
luccioman 29e52bda39 Merge branch 'master' of https://github.com/yacy/yacy_search_server
8 years ago
luccioman a9cb083fa1 Improved consistency between loader openInputStream and load functions
8 years ago
reger a814f3d885 Introduce keyword query parameter
8 years ago
luccioman c226ded799 Fix unescape of URLs having some '%' chars but not percent-encoded
8 years ago
luccioman 306a82dd71 Fixed scraper NullPointerException cases on malformed URLs.
8 years ago
luccioman aa55d71cf5 Fixed a NullPointerException case on Digest authentication.
8 years ago
luccioman 02ec0ed13c Quoted param value in Solr query to avoid unwanted traces in logs
8 years ago
reger 1737af37cf Set request originator to own peer in warc importer
8 years ago
reger 039162fbf0 Change warc importer to use defaultsurrogate-crawl profile, as reported
8 years ago
Michael Peter Christen 3b1d640a3c enhanced debugging
8 years ago
Michael Peter Christen 7de7879f13 added a cache to prevent too many seed enumerations
8 years ago
luccioman bd7411a53a Enable p2p and cluster communication when "Protection of all pages" on
8 years ago
luccioman 31ad043bb9 Added user interface feedback on results feeding termination status.
8 years ago
sgaebel ff6392215e added closing of lst-Tag in solr-Export
8 years ago