yacy_search_server/source/net/yacy/document
luccioman 5a646540cc
Support parsing gzip files from servers with redundant headers.
8 years ago
content Ensure lower case conversion consistency with any default locale. 8 years ago
importer Set request originator to own peer in warc importer 8 years ago
language Fixed language detector initialization and NullPointerException cases. 8 years ago
parser Support parsing gzip files from servers with redundant headers. 8 years ago
AbstractParser.java Started support of partial parsing on large streamed resources. 8 years ago
Condenser.java Ensure proper closing of file input streams. 8 years ago
DateDetection.java adjust date in text detection to ignore some program version strings 9 years ago
Document.java Added RSS parser support for maximum content bytes parsing limit 8 years ago
ImageParser.java BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys 9 years ago
LargeNumberCache.java Cleaned up some Javadoc warnings. 8 years ago
LibraryProvider.java Cleaned up some Javadoc warnings. 8 years ago
Parser.java Started support of partial parsing on large streamed resources. 8 years ago
Phrase.java
ProbabilisticClassifier.java Fixed a NullPointerException case. 8 years ago
SentenceReader.java
SnippetExtractor.java
TextParser.java Support parsing gzip files from servers with redundant headers. 8 years ago
Tokenizer.java optimize condenser language detection a little. 9 years ago
VocabularyScraper.java added enrichment of synonyms and vocabularies for imported documents 10 years ago
WordTokenizer.java reactivate sentence counter in WordTokenizer for phrasepos ranking, 9 years ago