Commit Graph

11 Commits (117a85987989210f3b3295778e12bbaf2f5cd733)

Author SHA1 Message Date
Michael Peter Christen 25573bd5ab added a crawl filter based on <div> tag class names
7 years ago
luccioman 73977ec0fe Added a html parser charset detection unit test
7 years ago
luccioman 169ffdd1c7 Finer control on max links to parse in the html parser.
8 years ago
luccioman 2a87b08cea Removed temporary html parser test code
8 years ago
luccioman 90a7c1affa HTML parser : removed unnecessary remaining recursive processing
8 years ago
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale.
8 years ago
luccioman 1e2fb76720 Properly close test files in htmlParser unit test
8 years ago
luccioman a04feac064 Ensure file input streams proper closing in both success and failures
8 years ago
reger 41e2ee0eca Fix call parameter for ConnectionInfo in MonitorHandler
8 years ago
reger f254fcfc67 fix htmlParser <script> text extraction on code containing expression
8 years ago
reger 84c970eaec move test classes to test/java (subdirectory as in Maven standard subdir layout)
9 years ago