yacy_search_server/source/net/yacy/document/parser
orbiter 77fe69395d
added jempbox-1.5.0.jar which is required by pdfbox-1.5 as stated in http://pdfbox.apache.org/dependencies.html
14 years ago
..
html - applied many small performance hacks 14 years ago
images hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources: 14 years ago
xml more UTF8 getBytes() performance hacks 14 years ago
bzipParser.java *) minor changes 14 years ago
csvParser.java - enhanced html parser: recognized much more details in the content 14 years ago
docParser.java - enhanced html parser: recognized much more details in the content 14 years ago
genericParser.java - enhanced html parser: recognized much more details in the content 14 years ago
gzipParser.java fixed bugs in parser and ftp client 15 years ago
htmlParser.java reduce the effect of 'Bildersuche findet generierte HTML-Seiten als Bilder' (image search finds generated HTML pages as images) 14 years ago
mmParser.java - enhanced html parser: recognized much more details in the content 14 years ago
odtParser.java - enhanced html parser: recognized much more details in the content 14 years ago
ooxmlParser.java - enhanced html parser: recognized much more details in the content 14 years ago
pdfParser.java added jempbox-1.5.0.jar which is required by pdfbox-1.5 as stated in http://pdfbox.apache.org/dependencies.html 14 years ago
pptParser.java - enhanced html parser: recognized much more details in the content 14 years ago
psParser.java - enhanced html parser: recognized much more details in the content 14 years ago
rssParser.java - enhanced html parser: recognized much more details in the content 14 years ago
rtfParser.java - enhanced html parser: recognized much more details in the content 14 years ago
sevenzipParser.java - enhanced html parser: recognized much more details in the content 14 years ago
sidAudioParser.java - enhanced html parser: recognized much more details in the content 14 years ago
sitemapParser.java better abstraction of http client identification 14 years ago
swfParser.java - enhanced html parser: recognized much more details in the content 14 years ago
tarParser.java *) minor changes 14 years ago
torrentParser.java - enhanced html parser: recognized much more details in the content 14 years ago
vcfParser.java - enhanced html parser: recognized much more details in the content 14 years ago
vsdParser.java - enhanced html parser: recognized much more details in the content 14 years ago
xlsParser.java - enhanced html parser: recognized much more details in the content 14 years ago
zipParser.java - applied many small performance hacks 14 years ago