Commit Graph

13569 Commits (bcbd0ae1a4ea4550089b82d7ae47b93a8524e787)
 

Author SHA1 Message Date
luccioman 5a646540cc Support parsing gzip files from servers with redundant headers.
7 years ago
luccioman 11a7f923d4 Distinguish response parsing failures from unexpected exceptions.
7 years ago
luccioman 8100c033a2 URL Viewer : apply crawler size limits when adding to local index.
7 years ago
luccioman eda7b0aeb6 Merge branch 'master' of https://github.com/yacy/yacy_search_server
7 years ago
reger 3005be7349 Clean up unmaintained and unused AugmentParser trail.
7 years ago
reger e5cff062b5 Clean up redundant but obsolete jquery.rdfquery-core-1.0.js script lib
7 years ago
luccioman cb4f1358e1 Added gzip parser support for max content bytes limit
7 years ago
luccioman 5216c681a9 Added HTML parser support for maximum content bytes parsing limit
7 years ago
luccioman 4aafebc014 Merge pull request #122 from Scarfmonster/patch-1
7 years ago
luccioman 651fad6da5 Added RSS parser support for maximum content bytes parsing limit
7 years ago
luccioman 452a17a8d5 Finer control on bounded input streams with custom stream implementation
7 years ago
luccioman f8f1959ebb Added parsing within bounds implementation to the generic parser.
7 years ago
luccioman e0f400a0bd Support trying multiple parsers even when streaming on large resources.
7 years ago
luccioman 1e84956721 Support loading local files with a per request specified maximum size.
7 years ago
luccioman f369679d1c Fixed read/copy on input streams reading sometimes less than expected.
7 years ago
reger 23bda133d2 Fix css conflict of YMarks.html to make it viewable.
7 years ago
reger af32d291c2 upd to commons-fileupload-1.3.3.jar
7 years ago
reger a21789d4e7 Fix unresolved pattern in api/share.html by init some display var's
7 years ago
luccioman bf55f1d6e5 Started support of partial parsing on large streamed resources.
7 years ago
luccioman 2a87b08cea Removed temporary html parser test code
7 years ago
luccioman 1b3c169a9c URL Viewer : decode raw text using the eventual response charset.
7 years ago
luccioman 90a7c1affa HTML parser : removed unnecessary remaining recursive processing
7 years ago
reger e6e20dab52 upd to Jetty 9.4.6.v20170531
7 years ago
luccioman e4c730b99f Updated PerformanceQueues_p.xml API with last related servlet changes
7 years ago
luccioman dcc56318bb Made remote search max system load limits configurable from UI.
7 years ago
reger ddd13b776d Add keyword constraint to rwi query result filter
7 years ago
luccioman e82eaee4b6 Apply consistent behavior on HTTP resource size exceeding limit.
7 years ago
luccioman 0b75e92ac2 Do not wrap unnecessarily loader IOExceptions in IOExceptions
7 years ago
luccioman 433bdb7c0d Respect maxFileSize limit also when streaming HTTP and when relevant.
7 years ago
luccioman 4b72b29ea2 Added an informative title on the crawl start robots.txt status icon
7 years ago
luccioman d08f31c3a8 Crawl start Ajax request : properly handle eventual XML parsing errors
7 years ago
luccioman 9b1bb2545e Refactored plain-text URLs detection implementation.
7 years ago
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale.
7 years ago
luccioman 286f3018bd Made mime type and extension normalization locale independent.
7 years ago
luccioman 319231a458 Added a generic XML parser, able to parse elements text and URLs.
7 years ago
reger aeeb8a7dd5 upd to jwat-warc-1.0.6.jar
7 years ago
reger f0ba828627 remove unused Solr optional extra handler lib solr-dataimporthandler-6.6.0.jar
8 years ago
reger 1773b61b3e upd to jsoup-1.10.3.jar
8 years ago
Ryszard Goń 3cedbbd4ed Wrong password was removed after the SSL certificate import
8 years ago
luccioman 64cec2790d Improved character encoding detection from Content-Type header
8 years ago
luccioman 1acb7005d0 Added a basic JUnit test with test gz files for the gzip parser
8 years ago
luccioman 1e2fb76720 Properly close test files in htmlParser unit test
8 years ago
luccioman c41b31dcb3 Cleaned up memory usage page HTML
8 years ago
luccioman 0487336ec3 Prevent integer overflow in table statistics and use strong typing
8 years ago
luccioman 0f80c978d6 Limit the number of initially previewed links in crawl start pages.
8 years ago
luccioman d2a4a27f52 Improved stream-oriented parsing entering conditions.
8 years ago
luccioman 32288a8999 Merge branch 'master' of https://github.com/yacy/yacy_search_server
8 years ago
luccioman e9b4b29f90 Limit scope of some local JavaScript variables.
8 years ago
Michael Peter Christen 369b8e0e0b added json(p) endpoint for crawl start
8 years ago
reger 83ba45ebae make nsis build script require java 8
8 years ago