yacy_search_server/source/net/yacy/document
Michael Peter Christen 25573bd5ab
added a crawl filter based on <div> tag class names
7 years ago
content Ensure lower case conversion consistency with any default locale. 7 years ago
importer added a crawl filter based on <div> tag class names 7 years ago
language Fixed language detector initialization and NullPointerException cases. 8 years ago
parser added a crawl filter based on <div> tag class names 7 years ago
AbstractParser.java added a crawl filter based on <div> tag class names 7 years ago
Condenser.java Ensure proper closing of file input streams. 8 years ago
DateDetection.java Remove old hard-coded holiday dates from DateDetection class. 7 years ago
Document.java Added RSS parser support for maximum content bytes parsing limit 7 years ago
ImageParser.java BMP and ICO image formats support: integrated haraldk/TwelveMonkeys 9 years ago
LargeNumberCache.java Cleaned up some Javadoc warnings. 8 years ago
LibraryProvider.java Cleaned up some Javadoc warnings. 8 years ago
Parser.java added a crawl filter based on <div> tag class names 7 years ago
Phrase.java
ProbabilisticClassifier.java Fixed a NullPointerException case. 8 years ago
SentenceReader.java
SnippetExtractor.java
TextParser.java added a crawl filter based on <div> tag class names 7 years ago
Tokenizer.java optimize condenser language detection a little. 8 years ago
VocabularyScraper.java
WordTokenizer.java reactivate sentence counter in WordTokenizer for phrasepos ranking 8 years ago