Commit Graph

45 Commits (b92d81b07355b4040206d689c232dee0e8fb89ca)

Author SHA1 Message Date
reger bad34804fe optimize parseInt for <img> tag attribute parsing
9 years ago
reger d2cc11ea8f fix html parser taking <style> content as text.
9 years ago
reger e594130aec add test case for partial update - to discover effect on YaCy for update of documents with multivalued date fields (like dates_in_content_dts)
9 years ago
reger d5da9e5a38 fix test method (add throw for URIMetadataNode)
9 years ago
reger 4cf875336c complete TODO: getFileExtension handle dot in query part
9 years ago
reger c37dda8849 fix NPE on MultiProtocolURL on url with parameter value and '='
10 years ago
reger 71bf95af8a upd parser calls in test cases
10 years ago
reger f63fff9008 fix snippet containing number with comma as decimal point http://mantis.tokeek.de/view.php?id=344
10 years ago
reger 2ef8ffdb60 apply UTF-8 encoding
10 years ago
reger 7120ea42f1 fix for path with char code > 255
10 years ago
reger 1d81bd0687 fix url encoding for path see http://mantis.tokeek.de/view.php?id=559
10 years ago
reger f94e34058c fix url (path) %-decoding http://mantis.tokeek.de/view.php?id=519
10 years ago
reger 16bc267a32 add test case for snippet html encoding check
10 years ago
reger 77851fa53c fix parser test cases
10 years ago
reger df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
10 years ago
Michael Peter Christen 68c605d637 replace with CommonPattern.SPACE for split
10 years ago
reger 9edc7308aa update to metadata-extractor-2.7.0.jar
10 years ago
reger 5d67e165d9 remove redundant null check in ResponseHeader.lastModified
10 years ago
reger ea633a794c including small junit test case for WordTokenizer
10 years ago
reger aa2e15d846 allow url parameter in worktable apicall
10 years ago
reger e88537522d allow single quote " ' " in query
10 years ago
reger e50b2b4d04 fix test case MultiProtocolURL.toString()
10 years ago
reger b510b182d8 - update Maven pom
10 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
10 years ago
reger 1f2eba977d add test case for Records (used in HostBalancer)
11 years ago
reger e94efd4d7c update to JUnit 4.11
11 years ago
reger 3b77e41f1a adding test for HostQueue crawl stack
11 years ago
reger 431a5f9c4e added test case for TextSnippet,
11 years ago
reger 7847a93558 fix AbstractParser.singleList not adding null strings
11 years ago
reger 0b6db04e40 fix contentscraper img height/width parsing
11 years ago
reger bb8181b2be fix: resolve url without path but searchpart
11 years ago
reger 86f6975edc exclude html tags in in/outboundlinks_anchortext_txt parsed text
11 years ago
reger 71649bf22d add test case htmlParser.parse - getCharset
11 years ago
reger 6878c90f99 fix: IPv6 INTRANET_PATTERNS for local ip (see http://bugs.yacy.net/view.php?id=378)
11 years ago
reger c8d437b69a clean up test sources
11 years ago
reger 18a56446ce reorg URL test classes add isLocal test with some IPv6 examples
11 years ago
reger 10a6346056 clean-up test cases
11 years ago
reger b4fdb8c887 cleanup test directory from Jetty 9 implementation samples
11 years ago
reger 71d2655c02 downgrade to Jetty 8 to assure support of JRE 1.6
11 years ago
reger f7f86d8a5d update to Jetty 9 jars
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
reger 4fec35a665 adjust Test case EmbeddedSolrConnector
12 years ago
reger 160ce568b3 move testing SolrServlet.main to test, making include of jetty*.jar in distribution and classpath obsolete
12 years ago
orbiter cd19d0517e added dns resolve to HTTPClient POST using a dns cache to prevent the not-thread-safe built-in dns cache inside apache http client from being used
14 years ago
f1ori fd74bc388c * fix small bug in sessionid-removal
14 years ago