Commit Graph

240 Commits (fe75f326d8db7db083313a560a7464b8016249d7)

Author SHA1 Message Date
luccioman fe75f326d8 Fixed ProfilingGraph calculation integer overflows and added test class. 7 years ago
luccioman 5bf76f058a Adjusted ResponseHeaderTest to succeed on slow or highly loaded CPU 8 years ago
luccioman 32c9dfa768 Added partial bzip2 stream parsing support and bzipParser Junit test 8 years ago
luccioman dd9cb06d25 Fixed RWI distance calculation on multi words search queries. 8 years ago
luccioman c6ae87168a Added unit tests on the gzip parser. 8 years ago
luccioman 169ffdd1c7 Finer control on max links to parse in the html parser. 8 years ago
luccioman 4743a104b5 Added some unit tests on FileUtils. 8 years ago
luccioman e41d046a9d Improved parsing support for OOXML spreadsheets (.xlsx) 8 years ago
luccioman 780173008e Implemented partial stream parsing of tar archives. 8 years ago
luccioman acab6a6def Also handle text content when parsing XML within limits. 8 years ago
reger f38fb7f02c Add junit test for AbstractOperations.addOperand() 8 years ago
luccioman ed678186a8 Updated xml parser limited parsing test for use latest jdk. 8 years ago
luccioman f369679d1c Fixed read/copy on input streams reading sometimes less than expected. 8 years ago
luccioman bf55f1d6e5 Started support of partial parsing on large streamed resources. 8 years ago
luccioman 2a87b08cea Removed temporary html parser test code 8 years ago
luccioman 90a7c1affa HTML parser : removed unnecessary remaining recursive processing 8 years ago
luccioman 9b1bb2545e Refactored plain-text URLs detection implementation. 8 years ago
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale. 8 years ago
luccioman 286f3018bd Made mime type and extension normalization locale independent. 8 years ago
luccioman 319231a458 Added a generic XML parser, able to parse elements text and URLs. 8 years ago
luccioman 64cec2790d Improved character encoding detection from Content-Type header 8 years ago
luccioman 1acb7005d0 Added a basic JUnit test with test gz files for the gzip parser 8 years ago
luccioman 1e2fb76720 Properly close test files in htmlParser unit test 8 years ago
luccioman 9dd790087d Added HT Cache basic statistics (hit rate) 8 years ago
luccioman 28b451a0b3 Made Cache compression level and lock timeout user configurable 8 years ago
luccioman a7394b479b Limit the synchronization blocking time on some Cache operations. 8 years ago
Michael Peter Christen 6fe735945d migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8 8 years ago
luccioman a04feac064 Ensure file input streams proper closing in both success and failures 8 years ago
luccioman d98c04853d Ensure proper closing of file input streams. 8 years ago
luccioman c226ded799 Fix unescape of URLs having some '%' chars but not percent-encoded 8 years ago
reger 077d062be3 Adjust mergeDocuments to keep youngest last-modified date of document 8 years ago
luccioman 522a268305 Improved new blacklist entries URL scheme detection. 8 years ago
luccioman 31fff2c986 Extended WikiCode template inclusion syntax support. 8 years ago
reger 7a7da698d4 fix unit test MultiProtocolURL(file) assertion for Windows path with 8 years ago
luccioman 23775e76e2 Fixed endless loop case in wikicode processing. 8 years ago
luccioman 0bc868a819 Improved support for non ASCII chars in local file system URLs 8 years ago
reger 777cb5b812 remove test case for Standard_MemoryControl which will always fail 8 years ago
reger 1ccc44e681 fix default/httpd.mime Z file extension to lower case 8 years ago
reger 18c7563dbe Extend DCEntry.getLanguage convert to ISO639-1 codes for more languages 8 years ago
reger 275c0cddd1 Adjust DefaultServlet test case to recent change, 8 years ago
reger 41e2ee0eca Fix call parameter for ConnectionInfo in MonitorHandler 8 years ago
reger f254fcfc67 fix htmlParser <script> text extraction on code containing expression 8 years ago
luccioman 2f191e0e1c Improved MultiprocotolURL non ASCII characters support. 8 years ago
luccioman 5c8958bcea Updated Javadoc and Junit tests for the WebStructureGraph class. 8 years ago
luccioman d9766ca981 Fixed WatchWebStructure_p.html render to include https URLs. 8 years ago
luccioman ed3dd5e31a Fixed webstructure.xml API used with a domain name 'about' parameter. 8 years ago
luccioman 0da1e6ba16 Factored code re-implementing DigestURL.hosthash() method. 8 years ago
luccioman 86adfef30f Added automated unit tests and perfs test for WebStructureGraph class. 8 years ago
luccioman c9889991b9 Fixed 2 failing JUNit tests. 8 years ago
reger 083df255e4 fix html tag attribute parsing containing attribute w/o value 8 years ago