Commit Graph

37 Commits (3c4064932cdee819179e210deb58ff1f3ad28995)

Author SHA1 Message Date
orbiter 031e6eefbd some updates to dublin core, metadata browsing, file indexing and parser stability
15 years ago
orbiter c3a4aee255 some redesign with a possible fix for the ReferenceContainerCache.
15 years ago
orbiter 23ab6fbca4 - navigation appear at correct position when opengeodb-results are also presented after a search
15 years ago
orbiter 902d16cf6c fixes to parser
15 years ago
lotus dce450e2e0 possible fix for "hung" doc-documents
15 years ago
orbiter 68465c37af added a convenience class to add files into a YaCy index
15 years ago
orbiter 18aa0609ca fix for caching of word hash computation
15 years ago
orbiter 44579fa06d - fixed a problem loading images through yacy's document loader,
15 years ago
orbiter 10d3e856b5 better concurrency, less blocking & performance hacks
15 years ago
orbiter 1a9cfd8718 some performance hacks (CPU only, not IO)
15 years ago
orbiter 72e5407115 refactoring of snippet cache
15 years ago
low012 93b2622503 *) repaired and added IM online status indicators
15 years ago
f1ori d515bc11e2 added ooxmlparser
15 years ago
orbiter c0e17de2fb - fixes for some problems with the new crawling/caching strategies
16 years ago
orbiter 634a01a9a4 replaced wget-requests with caching requests
16 years ago
orbiter 4da9042e8a code simplification
16 years ago
orbiter 1d8d51075c refactoring:
16 years ago
f1ori 67da20647f * add new odf parser based on sax-xml-parser
16 years ago
orbiter eea4c17ef2 removed rpm parser
16 years ago
orbiter ca72ed7526 -removed superfluous crawl cache
16 years ago
orbiter 13c63f4082 a set of small fixes to crawling behaviour
16 years ago
orbiter a564df3984 update to mime types in parsers and httpd.mime
16 years ago
orbiter 43c8defd79 enhanced parser with more extension + mime attributes
16 years ago
orbiter 49bbb9bd45 replaced tar library with integrated apache ant tar lib
16 years ago
orbiter b2263bc720 enhanced document type recognition
16 years ago
orbiter 50cf80056f removed jmimemagic library
16 years ago
orbiter 3f113f38a8 removed unused imports
16 years ago
f1ori 076ae02c44 * added pl and py to extensions excepted by htmlParser
16 years ago
low012 fc1dc38b55 *) added spaces to make sure that no words are concatinated by accident
16 years ago
low012 f242e7d7bc *) using Apache POI library to parse Word documents now
16 years ago
orbiter caedd72400 - enhanced logging and exception details for parsers
16 years ago
orbiter 4b74ad0a46 fixed setting of parser configuration servlets
16 years ago
orbiter 57a88d435b redesign of parser mime type detection and parser steering
16 years ago
orbiter 21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted into Idiom, Parser and Classification classes
16 years ago
orbiter 8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished)
16 years ago
low012 1ee109761f *) added changes which were lost
16 years ago
orbiter dafffd0153 refactoring of parsers and document processing
16 years ago