You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document/parser
Michael Peter Christen 6a2a669db4
added loading of the synonyms file from addon/synonyms into the
10 years ago
..
augment removed warnings 10 years ago
html fix for image alt attachment to AnchorURLs in html parser. 11 years ago
images fix image search expand box, cut-off of 2nd capture line height 10 years ago
rdfa added an option to set 'obey nofollow' for links with rel="nofollow" 11 years ago
xml do YaCy p2p connections using a timeout-request which covers the http 11 years ago
apkParser.java activated the new apk parser which was already ready but not included in 11 years ago
audioTagParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
bzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
csvParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
docParser.java extract author and keywords in .doc and .ppt parser 11 years ago
dwgParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
genericParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
gzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
htmlParser.java fix for image alt attachment to AnchorURLs in html parser. 11 years ago
linkScraperParser.java added linkScraperParser, a parser which ignores the text like the 11 years ago
mmParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
odtParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
ooxmlParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
pdfParser.java add link extraction to pdfParser 10 years ago
pptParser.java extract author and keywords in .doc and .ppt parser 11 years ago
psParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
rdfParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
rssParser.java simplify rssreader and improve atom feed link extraction 11 years ago
rtfParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
sevenzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
sidAudioParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
sitemapParser.java fix for image alt attachment to AnchorURLs in html parser. 11 years ago
swfParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
tarParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
torrentParser.java added loading of the synonyms file from addon/synonyms into the 10 years ago
vcfParser.java added an option to set 'obey nofollow' for links with rel="nofollow" 11 years ago
vsdParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
xlsParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
zipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago