You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document
luccioman e90405b6f0
Support parsing audio URLs without file extension
6 years ago
..
content Small perf improvement : initialize threads names early when possible 7 years ago
importer remove warnings: 'Statement unnecessarily nested within else clause' 6 years ago
language Fixed language detector initialization and NullPointerException cases. 8 years ago
parser Support parsing audio URLs without file extension 6 years ago
AbstractParser.java added a crawl filter based on <div> tag class names 7 years ago
Condenser.java Added basic support for autotagging microdata annotated item types. 7 years ago
DateDetection.java Removed remaining unsafe accesses to SimpleDateFormat instances. 7 years ago
Document.java Added a crawl filtering possibility on documents Media Type (MIME) 7 years ago
ImageParser.java
LargeNumberCache.java Cleaned up some Javadoc warnings. 8 years ago
LibraryProvider.java Upgraded the OpenGeoDB dump URL 7 years ago
Parser.java added a crawl filter based on <div> tag class names 7 years ago
Phrase.java
ProbabilisticClassifier.java
SentenceReader.java Reduced memory footprint of text snippet extraction 7 years ago
SnippetExtractor.java Reduced memory footprint of text snippet extraction 7 years ago
TextParser.java Added new crawler attribute for finer control over Media Type detection 6 years ago
Tokenizer.java Refactoring : documented and extracted autotagging processing functions. 7 years ago
VocabularyScraper.java
WordTokenizer.java Reduced text snippet extraction processing time. 7 years ago