add link extraction to pdfParser

this extracts the clickable links in a pdf and adds them to the list of links

include a test case for this function

this is the corrected comment for commit:
aa2e15d846
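
The merge step described in the message can be sketched roughly as below. This is only an illustration under assumptions: `LinkAnnotation`, `PdfLinkSketch` and `mergeLinks` are hypothetical stand-ins, not the PDFBox or pdfParser API, and the real `extractPdfLinks` may filter or order links differently.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for a clickable link annotation in a PDF page;
// not a PDFBox or pdfParser type.
final class LinkAnnotation {
    final String target; // raw URI string of the annotation's action, may be null
    LinkAnnotation(String target) { this.target = target; }
}

final class PdfLinkSketch {
    // Adds every well-formed, absolute annotation target to the links already
    // known, keeping insertion order and skipping duplicates.
    static List<URI> mergeLinks(List<URI> known, List<LinkAnnotation> annotations) {
        Set<URI> merged = new LinkedHashSet<>(known);
        for (LinkAnnotation a : annotations) {
            if (a.target == null) continue; // annotation without a URI action
            try {
                URI uri = new URI(a.target.trim());
                if (uri.isAbsolute()) merged.add(uri); // ignore relative targets
            } catch (URISyntaxException e) {
                // malformed target: skip it rather than fail the whole parse
            }
        }
        return new ArrayList<>(merged);
    }
}
```

Using a `LinkedHashSet` here is a design choice of the sketch, so that a link found both in the text and in an annotation is listed once.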
pull/1/head
reger 10 years ago
parent aa2e15d846
commit 59c6532a65

@@ -173,7 +173,7 @@ public class pdfParser extends AbstractParser implements Parser {
 contentBytes = writer.getBytes(); // get final text before closing writer
 pdflinks = extractPdfLinks(pdfDoc);
 } catch (final Throwable e) {
-// close the writer (in finally)
+//close the writer (in finally)
 //throw new Parser.Failure(e.getMessage(), location);
 } finally {
 try {pdfDoc.close();} catch (final Throwable e) {}
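
The control flow in this hunk (capture the result, swallow any failure, always close the document) can be illustrated in isolation. The sketch below is an assumption-laden stand-in: `FakeDoc` and `extract` are hypothetical names, not the PDFBox document class or the parser's method, and catching `Throwable` is kept only to mirror the hunk.

```java
import java.io.Closeable;
import java.io.IOException;

final class CloseInFinallySketch {
    // Hypothetical stand-in for an open PDF document; not the PDFBox class.
    static final class FakeDoc implements Closeable {
        boolean closed = false;
        @Override public void close() throws IOException { closed = true; }
    }

    // Returns whatever text was captured before a failure. The document is
    // closed in the finally block whether or not extraction threw.
    static String extract(FakeDoc doc, boolean failMidway) {
        String text = null;
        try {
            text = "page text"; // result captured before the risky step
            if (failMidway) throw new IllegalStateException("link extraction failed");
        } catch (final Throwable e) {
            // swallowed on purpose, as in the hunk: keep the partial result
        } finally {
            try { doc.close(); } catch (final Throwable e) { /* ignore close errors */ }
        }
        return text;
    }
}
```

Calling `extract(doc, true)` in this sketch still yields the captured text and leaves `doc.closed` set, which is the behaviour the commented-out `throw` would otherwise prevent.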
