yacy_search_server/source/net/yacy/document/parser
Michael Peter Christen 910a496c9f replaced http links with https 9 months ago
Name | Last commit | Age
html | replaced http links with https | 9 months ago
images | replaced http links with https | 9 months ago
rdfa | Revised the RDFaParser main launcher for minimal proper operation. | 7 years ago
xml | always use HTTPClient by 'try with resources' pattern to free up | 3 years ago
AbstractCompressorParser.java | crawl profile adoption to new tag valency attribute | 2 years ago
GenericXMLParser.java | Also handle text content when parsing XML within limits. | 8 years ago
XZParser.java | Added a parser for XZ compressed archives. | 7 years ago
apkParser.java | replaced http links with https | 9 months ago
audioTagParser.java | replaced http links with https | 9 months ago
bzipParser.java | crawl profile adoption to new tag valency attribute | 2 years ago
csvParser.java | replaced http links with https | 9 months ago
docParser.java | removed warnings | 3 years ago
dwgParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. | 9 years ago
genericParser.java | replaced http links with https | 9 months ago
gzipParser.java | crawl profile adoption to new tag valency attribute | 2 years ago
htmlParser.java | replaced http links with https | 9 months ago
linkScraperParser.java | replaced http links with https | 9 months ago
mmParser.java | replaced http links with https | 9 months ago
odtParser.java | fix delete of temp file after odt % ooxml parser | 9 years ago
ooxmlParser.java | Improved parsing support for OOXML spreadsheets (.xlsx) | 8 years ago
pdfParser.java | Added a zim parser to the surrogate import option. | 1 year ago
pptParser.java | upgraded ppt parser by migration of org.apache,poi from 3.17 to 5.3.0 | 9 months ago
psParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. | 9 years ago
rdfParser.java | replaced http links with https | 9 months ago
rssParser.java | replaced http links with https | 9 months ago
rtfParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. | 9 years ago
sidAudioParser.java | replaced http links with https | 9 months ago
sitemapParser.java | replaced http links with https | 9 months ago
tarParser.java | replaced http links with https | 9 months ago
torrentParser.java | replaced http links with https | 9 months ago
vcfParser.java | result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode. | 9 years ago
vsdParser.java | removed warnings | 3 years ago
xlsParser.java | removed warnings | 3 years ago
zipParser.java | replaced http links with https | 9 months ago