You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document/parser
Michael Peter Christen 833d720989
upgraded ppt parser by migration of org.apache,poi from 3.17 to 5.3.0
7 months ago
..
html fixed documentation and some details of handling of keywords 2 years ago
images patched a 'java.lang.NoSuchMethodError: com.twelvemonkeys.imageio.util.IIOUtil.lookupProviderByName' problem which occurred only on ARM 2 years ago
rdfa Revised the RDFaParser main launcher for minimal proper operation. 7 years ago
xml always use HTTPClient by 'try with resources' pattern to free up 3 years ago
AbstractCompressorParser.java crawl profile adoption to new tag valency attribute 2 years ago
GenericXMLParser.java Also handle text content when parsing XML within limits. 8 years ago
XZParser.java Added a parser for XZ compressed archives. 7 years ago
apkParser.java removed warnings 2 years ago
audioTagParser.java Support parsing audio URLs without file extension 6 years ago
bzipParser.java crawl profile adoption to new tag valency attribute 2 years ago
csvParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
docParser.java removed warnings 2 years ago
dwgParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
genericParser.java Added parsing within bounds implementation to the generic parser. 8 years ago
gzipParser.java crawl profile adoption to new tag valency attribute 2 years ago
htmlParser.java crawl profile adoption to new tag valency attribute 2 years ago
linkScraperParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
mmParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
odtParser.java fix delete of temp file after odt % ooxml parser 9 years ago
ooxmlParser.java Improved parsing support for OOXML spreadsheets (.xlsx) 8 years ago
pdfParser.java Added a zim parser to the surrogate import option. 1 year ago
pptParser.java upgraded ppt parser by migration of org.apache,poi from 3.17 to 5.3.0 7 months ago
psParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
rdfParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
rssParser.java Added RSS reader support for `enclosure` feed item sub element. 7 years ago
rtfParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
sidAudioParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
sitemapParser.java another possible fix for 1 year ago
tarParser.java crawl profile adoption to new tag valency attribute 2 years ago
torrentParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
vcfParser.java result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. 9 years ago
vsdParser.java removed warnings 2 years ago
xlsParser.java removed warnings 2 years ago
zipParser.java crawl profile adoption to new tag valency attribute 2 years ago