12 Commits (c78e2f3b4b2020720f173f0fe9ebfea0bd29e975)

Author | SHA1 | Message | Date
reger | cb95b7339a | include html5 <time> tag in content scraper, | 8 years ago
luccioman | 7717a3d43d | Fixed license headers on files created to improve favicon management. | 9 years ago
luccioman | 6e1959f469 | Merge branch 'master' of https://github.com/yacy/yacy_search_server.git | 9 years ago
reger | b752bcfecb | adjust date in text detection to ignore some program version strings | 9 years ago
reger | b017e97421 | optimize condenser language detection a little. | 9 years ago
reger | ae3717d087 | adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! ) | 9 years ago
reger | 1a79c64495 | generalize DateDetection with holiday date rules readily available in icu | 9 years ago
reger | 272cdd496a | reactivate sentence counter in WordTokenizer for phrasepos ranking, | 9 years ago
reger | e310ec5f70 | fix posInText ranking calculation to score 0 on no position info | 9 years ago
reger | ebde21079a | refactor xlsParser to include Excel file attribute (like author) in parser result doc. | 9 years ago
luc | 3cc5619d93 | Improved HTML icons indexing and rendering in search results. | 9 years ago
reger | 84c970eaec | move test classes to test/java (subdirectory as in Maven standard subdir layout) | 9 years ago