You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document/parser
Michael Peter Christen de3e373913
using precompiled CommonPattern.TAB for split
10 years ago
..
augment removed warnings 10 years ago
html using precompiled pattern CommonPattern.SEMICOLON for splits 10 years ago
images applying precompiled CommonPattern.COMMA.split to all places where 10 years ago
rdfa added an option to set 'obey nofollow' for links with rel="nofollow" 11 years ago
xml do YaCy p2p connections using a timeout-request which covers the http 11 years ago
apkParser.java activated the new apk parser which was already ready but not included in 11 years ago
audioTagParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
bzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
csvParser.java using precompiled CommonPattern.TAB for split 10 years ago
docParser.java applying precompiled CommonPattern.COMMA.split to all places where 10 years ago
dwgParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
genericParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
gzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
htmlParser.java recognize more html file extensions 10 years ago
linkScraperParser.java added linkScraperParser, a parser which ignores the text like the 11 years ago
mmParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
odtParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
ooxmlParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
pdfParser.java removed debug code 10 years ago
pptParser.java applying precompiled CommonPattern.COMMA.split to all places where 10 years ago
psParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
rdfParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
rssParser.java simplify rssreader and improve atom feed link extraction 11 years ago
rtfParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
sevenzipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
sidAudioParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
sitemapParser.java fix for image alt attachment to AnchorURLs in html parser. 11 years ago
swfParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
tarParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago
torrentParser.java Added and integrated new date detection class which can identify date 10 years ago
vcfParser.java using precompiled pattern CommonPattern.SEMICOLON for splits 10 years ago
vsdParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
xlsParser.java - replaced the properties object in AnchorURL with distinct variables 12 years ago
zipParser.java - added a new Crawler Balancer: HostBalancer and HostQueues: 11 years ago