Commit Graph

14 Commits (aee35bff6f75d41f05419b8af44577ac005406a8)

Author SHA1 Message Date
orbiter 49bbb9bd45 replaced tar library with integrated apache ant tar lib 16 years ago
orbiter b2263bc720 enhanced document type recognition 16 years ago
orbiter 50cf80056f removed jmimemagic library 16 years ago
orbiter 3f113f38a8 removed unused imports 16 years ago
f1ori 076ae02c44 * added pl and py to extensions excepted by htmlParser 16 years ago
low012 fc1dc38b55 *) added spaces to make sure that no words are concatinated by accident 16 years ago
low012 f242e7d7bc *) using Apache POI library to parse Word documents now 16 years ago
orbiter caedd72400 - enhanced logging and exception details for parsers 16 years ago
orbiter 4b74ad0a46 fixed setting of parser configuration servlets 16 years ago
orbiter 57a88d435b redesign of parser mime type detection and parser steering 16 years ago
orbiter 21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted into Idiom, Parser and Classification classes 16 years ago
orbiter 8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished) 16 years ago
low012 1ee109761f *) added changes which were lost 16 years ago
orbiter dafffd0153 refactoring of parsers and document processing 16 years ago