3892 Commits (67da20647f1f3499066ab786d6092baa40c3d12b)

Author SHA1 Message Date
f1ori 67da20647f * add new odf parser based on sax-xml-parser 16 years ago
f1ori 6d0e6d591b * ops, fix compiler error :( 16 years ago
f1ori 3e5beb1654 * fix for empty version in seedlist 16 years ago
orbiter 5bb8074150 removed the indexing queue. This queue was superfluous since the introduction of the blocking queues last year, where documents are parsed, analysed and stored in the index with concurrency. 16 years ago
orbiter 597393db3b changed default visibility of classes/objects in upnp lib 16 years ago
orbiter eea4c17ef2 removed rpm parser 16 years ago
orbiter b332dfad67 - inserted request object into response object which carries this now instead of generating new objects 16 years ago
orbiter ca72ed7526 -removed superfluous crawl cache 16 years ago
orbiter 8103ccec4c removed compiler warnings in imported classes 16 years ago
lotus 52e371b8f7 suppress warnings for upnplib code 16 years ago
lotus 477807e0e6 * updated jxpath to latest v1.3 16 years ago
orbiter 13c63f4082 a set of small fixes to crawling behaviour 16 years ago
orbiter a564df3984 update to mime types in parsers and httpd.mime 16 years ago
orbiter 43c8defd79 enhanced parser with more extension + mime attributes 16 years ago
orbiter aee35bff6f replaced StringBuffer with StringBuilder in tar lib 16 years ago
orbiter 49bbb9bd45 replaced tar library with integrated apache ant tar lib 16 years ago
orbiter f987fc6b4a added tar classes from apache ant tools 16 years ago
orbiter b2263bc720 enhanced document type recognition 16 years ago
lotus aa38eb5a20 * maxfilesize -1 for infinite filesize 16 years ago
lotus 9cfe89c8fc * process content-length as soon as it is received 16 years ago
orbiter 50cf80056f removed jmimemagic library 16 years ago
orbiter 3f113f38a8 removed unused imports 16 years ago
lotus 9f083bb6b2 check filetype before loading (no more mp4 loading) 16 years ago
f1ori 076ae02c44 * added pl and py to extensions excepted by htmlParser 16 years ago
f1ori d5e51cfd09 * workaround for non-working build property replacements 16 years ago
f1ori f814e0fa81 enable warnings and fix most of it 16 years ago
f1ori 8931c8d6b4 improvements to debianpackage: 16 years ago
low012 fc1dc38b55 *) added spaces to make sure that no words are concatenated by accident 16 years ago
low012 f242e7d7bc *) using Apache POI library to parse Word documents now 16 years ago
orbiter caedd72400 - enhanced logging and exception details for parsers 16 years ago
orbiter 4b74ad0a46 fixed setting of parser configuration servlets 16 years ago
orbiter 57a88d435b redesign of parser mime type detection and parser steering 16 years ago
lotus e15d27bc63 avoiding double/wrong parser errors 16 years ago
orbiter 21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted in Idiom, Parser and Classification classes 16 years ago
orbiter 8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished) 16 years ago
low012 1ee109761f *) added changes which were lost 16 years ago
orbiter 499723891d removed all non-http daemons; they had not been used and may be a potential security risk. 16 years ago
orbiter 0e8647d62f refactoring of search classes 16 years ago
orbiter dafffd0153 refactoring of parsers and document processing 16 years ago
low012 8041e91f56 *) Ooops! 16 years ago
low012 69551ff3d9 *) added several MIME types (derived from http://filext.com/), some of them might be rather uncommon 16 years ago
low012 11dfb2d54f minor changes: 16 years ago
orbiter 77d2a3782c removed strange debugging strings 16 years ago
lotus 4320f69574 universal handling for crashed parsers 16 years ago
orbiter 024744245c small refactoring to prepare for new queues 16 years ago
orbiter 16efcd0366 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2252&hilit=&p=16389#p16389 16 years ago
f1ori 0f3246e90a * fix debian package 16 years ago
f1ori 8544cfd5a6 * remove separate build-files for parsers 16 years ago
orbiter 24cb6d68bc - renamed Stack to RecordStack to avoid name confusion with new classes 16 years ago
orbiter 995da28c73 all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path 16 years ago