You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document
reger 9e94989237
upd to PDFBox 2.0.1
9 years ago
..
content Refactoring : use StandardCharsets constants instead of hard-coded 9 years ago
importer Refactoring : use StandardCharsets constants instead of hard-coded 9 years ago
language override detected language (statistic langdetect) only with TLD determided 9 years ago
parser upd to PDFBox 2.0.1 9 years ago
AbstractParser.java
Condenser.java override detected language (statistic langdetect) only with TLD determided 9 years ago
DateDetection.java add Portuguese month names to date recognition 9 years ago
Document.java Refactoring : use StandardCharsets constants instead of hard-coded 9 years ago
ImageParser.java BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys 9 years ago
LargeNumberCache.java
LibraryProvider.java Refactoring : use StandardCharsets constants instead of hard-coded 9 years ago
Parser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
Phrase.java
ProbabilisticClassifier.java Refactoring : use StandardCharsets constants instead of hard-coded 9 years ago
SentenceReader.java
SnippetExtractor.java
TextParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
Tokenizer.java added enrichment of synonyms and vocabularies for imported documents 10 years ago
VocabularyScraper.java added enrichment of synonyms and vocabularies for imported documents 10 years ago
WordTokenizer.java added enrichment of synonyms and vocabularies for imported documents 10 years ago