You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document/parser
Michael Peter Christen b44626e55b
fixed target_alt_t in webgraph
10 years ago
..
augment
html fixed target_alt_t in webgraph 10 years ago
images strong redesign of html parser: object recursion is now made using a 11 years ago
rdfa added an option to set 'obey nofollow' for links with rel="nofollow" 10 years ago
xml
apkParser.java small bugfixes 10 years ago
audioTagParser.java
bzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
csvParser.java
docParser.java extract author and keywords in .doc and .ppt parser 11 years ago
dwgParser.java
genericParser.java
gzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
htmlParser.java added linkScraperParser, a parser which ignores the text like the 11 years ago
linkScraperParser.java added linkScraperParser, a parser which ignores the text like the 11 years ago
mmParser.java
odtParser.java
ooxmlParser.java
pdfParser.java optimize pdfParser 11 years ago
pptParser.java extract author and keywords in .doc and .ppt parser 11 years ago
psParser.java
rdfParser.java
rssParser.java
rtfParser.java
sevenzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
sidAudioParser.java
sitemapParser.java added an option to set 'obey nofollow' for links with rel="nofollow" 10 years ago
swfParser.java
tarParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
torrentParser.java
vcfParser.java added an option to set 'obey nofollow' for links with rel="nofollow" 10 years ago
vsdParser.java
xlsParser.java
zipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago