Commit Graph

3937 Commits (67eddaec4b845edc15ea45440e41baa71d936043)

Author SHA1 Message Date
orbiter eea4c17ef2 removed rpm parser
16 years ago
orbiter b332dfad67 - inserted request object into response object which carries this now instead generating new objects
16 years ago
orbiter ca72ed7526 -removed superfluous crawl cache
16 years ago
orbiter 8103ccec4c removed compiler warnings in imported classes
16 years ago
lotus 52e371b8f7 suppress warnings for upnplib code
16 years ago
lotus 477807e0e6 * updated jxpath to latest v1.3
16 years ago
orbiter 13c63f4082 a set of small fixes to crawling behaviour
16 years ago
orbiter a564df3984 update to mime types in parsers and httpd.mime
16 years ago
orbiter 43c8defd79 enhanced parser with more extension + mime attributes
16 years ago
orbiter aee35bff6f replaced StringBuffer with StringBuilder in tar lib
16 years ago
orbiter 49bbb9bd45 replaced tar library with integrated apache ant tar lib
16 years ago
orbiter f987fc6b4a added tar classes from apache ant tools
16 years ago
orbiter b2263bc720 enhanced document type recognition
16 years ago
lotus aa38eb5a20 * maxfilesize -1 for infinite filesize
16 years ago
lotus 9cfe89c8fc * process content-length as soon as it is received
16 years ago
orbiter 50cf80056f removed jmimemagic library
16 years ago
orbiter 3f113f38a8 removed unused imports
16 years ago
lotus 9f083bb6b2 check filetype before loading (no more mp4 loading)
16 years ago
f1ori 076ae02c44 * added pl and py to extensions excepted by htmlParser
16 years ago
f1ori d5e51cfd09 * workaround for non-working build property replacements
16 years ago
f1ori f814e0fa81 enable warnings and fix most of it
16 years ago
f1ori 8931c8d6b4 improvements to debianpackage:
16 years ago
low012 fc1dc38b55 *) added spaces to make sure that no words are concatenated by accident
16 years ago
low012 f242e7d7bc *) using Apache POI library to parse Word documents now
16 years ago
orbiter caedd72400 - enhanced logging and exception details for parsers
16 years ago
orbiter 4b74ad0a46 fixed setting of parser configuration servlets
16 years ago
orbiter 57a88d435b redesign of parser mime type detection and parser steering
16 years ago
lotus e15d27bc63 avoiding double/wrong parser errors
16 years ago
orbiter 21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted in Idiom, Parser and Classification classes
16 years ago
orbiter 8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished)
16 years ago
low012 1ee109761f *) added changes which were lost
16 years ago
orbiter 499723891d removed all non-http daemons; they had not been used and may be a potential security risk.
16 years ago
orbiter 0e8647d62f refactoring of search classes
16 years ago
orbiter dafffd0153 refactoring of parsers and document processing
16 years ago
low012 8041e91f56 *) Ooops!
16 years ago
low012 69551ff3d9 *) added several MIME types (derived from http://filext.com/), some of them might be rather uncommon
16 years ago
low012 11dfb2d54f minor changes:
16 years ago
orbiter 77d2a3782c removed strange debugging strings
16 years ago
lotus 4320f69574 universal handling for crashed parsers
16 years ago
orbiter 024744245c small refactoring to prepare for new queues
16 years ago
orbiter 16efcd0366 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2252&hilit=&p=16389#p16389
16 years ago
f1ori 0f3246e90a * fix debian package
16 years ago
f1ori 8544cfd5a6 * remove separate build-files for parsers
16 years ago
orbiter 24cb6d68bc - renamed Stack to RecordStack to avoid name confusion with new classes
16 years ago
orbiter 995da28c73 all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path
16 years ago
orbiter aac89bf8ca trying to avoid "exceeding limit" message of server
16 years ago
f1ori 48d78166ed * fix double copy of libraries
16 years ago
lotus 7f868ca3c2 resource observer: support for yacyroot\DATA on an NTFS hardlink (Windows)
16 years ago
orbiter 409538e17a code cleanup and code simplification
16 years ago
orbiter 160031758d fix for problem with initializer
16 years ago
orbiter 302a02cec8 moved all libraries from libx to lib
16 years ago
orbiter 1f1399e5c5 extending visibility of objects and methods to avoid synthetic accessor methods and increase performance
16 years ago
orbiter 154bbc3364 code cleanup: call of static methods directly to the class
16 years ago
orbiter 222850414e simplification of the code: removed unused classes, methods and variables
16 years ago
orbiter 93dfb51fd4 problems with code style
16 years ago
orbiter adf01c676e reduce lookup time when merging a large number of BLOBs
16 years ago
orbiter 9a674d8047 - After the removal of the Tree class some code simplifications are possible. This mostly affects the Records class, which can be refactored; the refactoring reduces the number of classes.
16 years ago
orbiter c5122d6836 completed migration of BLOBTree to BLOBHeaps:
16 years ago
orbiter d1083a6913 maybe we have less problems with open connections to the server if we don't do BF forced sleeps (just a test)
16 years ago
low012 ebe6c823ac *) changed svn properties again (hopefully doing it right this time)
16 years ago
low012 a80ac3a415 *) fixed wrong parser descriptions
16 years ago
low012 457b6c0d6d *) updated Apache POI library to be able to parse Visio files
16 years ago
apfelmaennchen a10c8022d1 DidYouMean:
16 years ago
f1ori 7eb3bff5b3 * workaround for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2220&hilit=#p16128
16 years ago
orbiter 99fa265e1d fix for search bug caused by tenant patch
16 years ago
orbiter 79875782af be a bit more lazy when removing domain navigation entries
16 years ago
orbiter 57af311627 fix for wrong urls in navigator when a tenant is used
16 years ago
lotus 76b96337e2 just some chatty code
16 years ago
low012 91785d895c *) minor changes in comments
16 years ago
orbiter bdda140c02 fix for json output (no doublequotes any more, doublequote quoting did not work)
16 years ago
orbiter 2f84736120 ignore signature files that cannot be downloaded because of failed encoding
16 years ago
orbiter 041d9c253e some refactoring and more error-awareness in LogalizeHandler
16 years ago
orbiter 6b307d6d59 more tolerance for corrupted index entries in exported row sets
16 years ago
orbiter 33aafa9b4b better logging when writing merged dumps
16 years ago
lotus db70badcf0 possibility to set remote host on upnp device
16 years ago
orbiter 4d29e90708 uaeh
16 years ago
orbiter 3c3e6499ae added more logging for merge operation
16 years ago
orbiter 15180fc95e - patch for future computation in SplitTable
16 years ago
orbiter 9a5ec20b3c avoid merge during startup
16 years ago
lotus bf6b92343c try to avoid stuck pdf parser
16 years ago
lotus c695c7f512 try to remove hung swf parser from queue
16 years ago
orbiter fc69a76197 update to web structure picture:
16 years ago
orbiter ae015e8e98 refactoring of blob package classes
16 years ago
orbiter 8b8877c233 moved image collector
16 years ago
orbiter be1c7ddc64 refactoring of search classes -- moved Ranking Profile to search package
16 years ago
orbiter fd31a3616a - more logging in server process
16 years ago
orbiter 5a7fd6b4c8 just some comment lines
16 years ago
orbiter 31f60a3b3e when doing searches, also apply an online caution to DHT transmission and stop transmissions during heavy load caused by searching. This omits the many requests to the URL database that are needed for DHT transfer and it avoids collisions with URL retrieval needed for search results.
16 years ago
orbiter 17dc6d4be5 small fix for new Logger
16 years ago
orbiter ce1adf9955 serialized all logging using concurrency:
16 years ago
lotus aec3e7995a autoconfig.pac can be used to browse .yacy-domains only
16 years ago
orbiter bc6dd8194b refactoring: moved search query class to new search package
16 years ago
orbiter a4805defdd added stub for new search process
16 years ago
orbiter b8e738a7be a collection of
16 years ago
apfelmaennchen 39779e4796 DidYouMean: as I moved to only 8 consumer and 4 producer threads, I removed poison pills as it does not make sense anymore - threads are interrupted directly. Having a consumer thread per test case just didn't make sense either (see svn 6070) due to the massive overhead.
16 years ago
apfelmaennchen c3c4dd0933 DidYouMean - changed to much simpler LinkedBlockingQueue
16 years ago
apfelmaennchen 01ac1b5d7e - blocking queue implementation of DidYouMean
16 years ago
orbiter b8bb1bb364 join with a timeout does not stop the corresponding thread after the time-out. It only stops the waiting. Here we additionally need to signal the thread to stop after we have finished waiting.
16 years ago
orbiter b69f22e9ca mistake in last commit: computation of loops in ReversingTwoConsecutiveLetters
16 years ago
orbiter 3130334932 - start first with threads that run more loops
16 years ago