You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/net/yacy/document/parser
Michael Peter Christen 4d3cc02168
replaced old bzip2 library against better documented commons-compress
13 years ago
..
html add-on to latest commit 13 years ago
images smaller bug fixes for search behavior; should produce less unnecessary removals and an exact number of results as shown in counter 13 years ago
xml some last-minute performance hacks 13 years ago
bzipParser.java replaced old bzip2 library against better documented commons-compress 13 years ago
csvParser.java - enhanced html parser: recognized much more details in the content 14 years ago
docParser.java - enhanced html parser: recognized much more details in the content 14 years ago
dwgParser.java removed (not all) warnings 13 years ago
genericParser.java - enhanced html parser: recognized much more details in the content 14 years ago
gzipParser.java - Redesigned crawler and parser to accept embedded links from the NOLOAD 13 years ago
htmlParser.java bugfixes 13 years ago
mmParser.java - enhanced html parser: recognized much more details in the content 14 years ago
odtParser.java set a limit to CharBuffer object size to fight against bad/too large 13 years ago
ooxmlParser.java set a limit to CharBuffer object size to fight against bad/too large 13 years ago
pdfParser.java memory hacks 13 years ago
pptParser.java - enhanced html parser: recognized much more details in the content 14 years ago
psParser.java - enhanced html parser: recognized much more details in the content 14 years ago
rssParser.java - enhanced html parser: recognized much more details in the content 14 years ago
rtfParser.java - enhanced html parser: recognized much more details in the content 14 years ago
sevenzipParser.java - Redesigned crawler and parser to accept embedded links from the NOLOAD 13 years ago
sidAudioParser.java - enhanced html parser: recognized much more details in the content 14 years ago
sitemapParser.java better abstraction of http client identification 14 years ago
swfParser.java removed stack trace from swf parser since we cant do anything there 13 years ago
tarParser.java - Redesigned crawler and parser to accept embedded links from the NOLOAD 13 years ago
torrentParser.java - enhanced html parser: recognized much more details in the content 14 years ago
vcfParser.java some last-minute performance hacks 13 years ago
vsdParser.java - enhanced html parser: recognized much more details in the content 14 years ago
xlsParser.java - enhanced html parser: recognized much more details in the content 14 years ago
zipParser.java - Redesigned crawler and parser to accept embedded links from the NOLOAD 13 years ago