Commit Graph

16 Commits (c4b90eae98eee64cac590e71c48d05781bdaa086)

| Author | SHA1 | Message | Date |
|---|---|---|---|
| reger | 18c7563dbe | Extend DCEntry.getLanguage convert to ISO639-1 codes for more languages | 8 years ago |
| reger | 41e2ee0eca | Fix call parameter for ConnectionInfo in MonitorHandler | 8 years ago |
| reger | f254fcfc67 | fix htmlParser <script> text extraction on code containing expression | 8 years ago |
| luccioman | c9889991b9 | Fixed 2 failing JUNit tests. | 8 years ago |
| reger | cb95b7339a | include html5 <time> tag in content scraper, | 8 years ago |
| luccioman | 7717a3d43d | Fixed license headers on files created to improve favicon management. | 9 years ago |
| luccioman | 6e1959f469 | Merge branch 'master' of https://github.com/yacy/yacy_search_server.git | 9 years ago |
| reger | b752bcfecb | adjust date in text detection to ignore some program version strings | 9 years ago |
| reger | b017e97421 | optimize condenser language detection a little. | 9 years ago |
| reger | ae3717d087 | adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! ) | 9 years ago |
| reger | 1a79c64495 | generalize DateDetection with holiday date rules readily available in icu | 9 years ago |
| reger | 272cdd496a | reactivate sentence counter in WordTokenizer for phrasepos ranking, | 9 years ago |
| reger | e310ec5f70 | fix posInText ranking calculation to score 0 on no position info | 9 years ago |
| reger | ebde21079a | refactor xlsParser to include Excel file attribute (like author) in parser result doc. | 9 years ago |
| luc | 3cc5619d93 | Improved HTML icons indexing and rendering in search results. | 9 years ago |
| reger | 84c970eaec | move test classes to test/java (subdirectory as in Maven standard subdir layout) | 9 years ago |