yacy_search_server/source/net/yacy/document
luccioman fb3032c530
Added a crawl filtering possibility on documents Media Type (MIME)
7 years ago
content Ensure lower case conversion consistency with any default locale. 7 years ago
importer added a crawl filter based on <div> tag class names 7 years ago
language Fixed language detector initialization and NullPointerException cases. 8 years ago
parser Added RSS reader support for `enclosure` feed item sub element. 7 years ago
AbstractParser.java added a crawl filter based on <div> tag class names 7 years ago
Condenser.java Added basic support for autotagging microdata annotated item types. 7 years ago
DateDetection.java Remove old hard-coded holiday dates from DateDetection class. 7 years ago
Document.java Added a crawl filtering possibility on documents Media Type (MIME) 7 years ago
ImageParser.java BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys 9 years ago
LargeNumberCache.java Cleaned up some Javadoc warnings. 8 years ago
LibraryProvider.java Cleaned up some Javadoc warnings. 8 years ago
Parser.java added a crawl filter based on <div> tag class names 7 years ago
Phrase.java
ProbabilisticClassifier.java Fixed a NullPointerException case. 8 years ago
SentenceReader.java
SnippetExtractor.java skip unused call parameter for hashSentence() 10 years ago
TextParser.java added a crawl filter based on <div> tag class names 7 years ago
Tokenizer.java Refactoring : documented and extracted autotagging processing functions. 7 years ago
VocabularyScraper.java added enrichment of synonyms and vocabularies for imported documents 10 years ago
WordTokenizer.java reactivate sentence counter in WordTokenizer for phrasepos ranking, 8 years ago