Commit Graph

3997 Commits (3671c37989c0a4e085fe17a43e1d544e946e3aef)

Author SHA1 Message Date
low012 53bbdfd19a *) setting SVN keywords
16 years ago
low012 25f6145934 *) preventing null pointer exception in case empty search word or only one character is enterd or all search words are removed by filters
16 years ago
low012 248f3fd9b5 *) cleaned up code for better readability
16 years ago
orbiter eaddf2d464 - corrected layout of map preview
16 years ago
hermens 4b83875abd Small fixes for the heapCacheIterator in ReferenceContainerCache:
16 years ago
orbiter fd668f531b fixed map layout
16 years ago
orbiter 2740d9dd79 added integration of osm maps for search
16 years ago
orbiter af3a696fc4 added a fast-fail concept in search processes. The search now has better control if all the remote searches may bring any result. If all processes are finished, then all search tasks fail fast.
16 years ago
orbiter ce972ff4ef update to default ranking profile which has now some settings to deny some phpbb3 pages which are redundant in the index when crawling phpbb3.
16 years ago
orbiter 44579fa06d - fixed a problem loading images through yacy's document loader,
16 years ago
orbiter 67eddaec4b changed way to integrate dictionary files:
16 years ago
orbiter d656a94f55 fix for bad paths in dictionary processing
16 years ago
orbiter 3b9aaf9e9f - inserted new library tests inside DidYouMean
16 years ago
orbiter 8c35ffe34c fixes to the dymlib
16 years ago
orbiter bfa273bcc1 added a library provider which holds libraries in static objects,
16 years ago
orbiter 1762a7bcd6 - moved DidYouMean to the data package
16 years ago
orbiter bf8ed00e9e removed debugging code
16 years ago
orbiter ead48c4b25 fix for preparation of search result pages with offset > 10:
16 years ago
orbiter 39a311d608 better care to do not loose the merge/dump thread
16 years ago
orbiter 10d3e856b5 better concurrency, less blocking & performance hacks
16 years ago
orbiter 1a9cfd8718 some performance hacks (CPU only, not IO)
16 years ago
orbiter 92407009b2 cleanup
16 years ago
orbiter 0ba1beaf56 separated rwi constraint evaluation from rwi ranking and added concurrency
16 years ago
orbiter ce7924d712 better concurrency for rwi entry parsing during search processing
16 years ago
orbiter b0637600d5 enhanced url constraint computation: better position of constraint check during retrieval process
16 years ago
orbiter 61748285c3 more refactoring of search
16 years ago
orbiter 323a8e733d removed unused classes
16 years ago
orbiter 72e5407115 refactoring of snippet cache
16 years ago
orbiter 0e471ba33b - fixed a bug in fast digest computation
16 years ago
low012 93b2622503 *) repaired and added IM online status indicators
16 years ago
orbiter e7736d9c8d more refactoring: made all variables in SearchEvent private
16 years ago
orbiter 4b92d0b9b7 patch for possible problems with normalization of '/' in urls. This applies in rare cases when '/' appear in post-properties
16 years ago
orbiter d8ca6e6bf1 more refactoring for search
16 years ago
orbiter fe4a4e3f6b added missing class
16 years ago
orbiter 72ac5bd80f refactoring of search process.
16 years ago
hermens c4d0e22a77 Further speed upof concurrent DHT-receive
16 years ago
hermens 2fbc0696bf Fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2334
16 years ago
f1ori d515bc11e2 added ooxmlparser
16 years ago
orbiter d9744b1b5d replaced old caching strategy control class with lightweight simplearc
16 years ago
orbiter 8e56c2ace6 fix for fixes from this afternoon
16 years ago
orbiter cf739edc2e fix for possible deadlock, see
16 years ago
orbiter 6354b5e447 removed possible deadlock, see
16 years ago
orbiter 5cc17ccf8a a better caching with less overhead and more appropriate
16 years ago
orbiter 92edd24e70 fixed problem with switching of networks
16 years ago
orbiter 0575f12838 fix for deadlock
16 years ago
orbiter fbfdaf063d - patch to omit IndexOutOfBoundsException when a b64-encoded key appears not to be well-formed. In that case the key is still accepted but rated higher than other regular keys to create a virtual ordering between well-formed and ill-formed keys
16 years ago
orbiter c0e17de2fb - fixes for some problems with the new crawling/caching strategies
16 years ago
orbiter 634a01a9a4 replaced wget-requests with caching requests
16 years ago
orbiter c6c97f23ad - added cache usage properties to crawl start
16 years ago
orbiter c4ae2cd03f fixed bug that caused deletion of crawl profiles at every application startup
16 years ago
orbiter 161d2fd2ef redesign of access to the HTCache (now http.client.Cache):
16 years ago
f1ori ba2e6de538 fix empty version string again
16 years ago
orbiter 51534df0cb fix for possible synchronization problem
16 years ago
orbiter 4da9042e8a code simplification
16 years ago
orbiter 1d8d51075c refactoring:
16 years ago
f1ori 67da20647f * add new odf parser based on sax-xml-parser
16 years ago
f1ori 6d0e6d591b * ops, fix compiler error :(
16 years ago
f1ori 3e5beb1654 * fix for empty version in seedlist
16 years ago
orbiter 5bb8074150 removed the indexing queue. This queue was superfluous since the introduction of the blocking queues last year, where documents are parsed, analysed and stored in the index with concurrency.
16 years ago
orbiter 597393db3b changed default visibility of classes/objects in upnp lib
16 years ago
orbiter eea4c17ef2 removed rpm parser
16 years ago
orbiter b332dfad67 - inserted request object into response object which carries this now instead generating new objects
16 years ago
orbiter ca72ed7526 -removed superfluous crawl cache
16 years ago
orbiter 8103ccec4c removed compiler warnings in imported classes
16 years ago
lotus 52e371b8f7 suppress warnings for upnplib code
16 years ago
lotus 477807e0e6 * updated jxpath to latest v1.3
16 years ago
orbiter 13c63f4082 a set of small fixes to crawling behaviour
16 years ago
orbiter a564df3984 update to mime types in parsers and httpd.mime
16 years ago
orbiter 43c8defd79 enhanced parser with more extension + mime attributes
16 years ago
orbiter aee35bff6f replaced StringBuffer with StringBuilder in tar lib
16 years ago
orbiter 49bbb9bd45 replaced tar library with integrated apache ant tar lib
16 years ago
orbiter f987fc6b4a added tar classes from apache ant tools
16 years ago
orbiter b2263bc720 enhanced document type recognition
16 years ago
lotus aa38eb5a20 * maxfilesize -1 for infinite filesize
16 years ago
lotus 9cfe89c8fc * process content-length as soon as it is received
16 years ago
orbiter 50cf80056f removed jmimemagic library
16 years ago
orbiter 3f113f38a8 removed unused imports
16 years ago
lotus 9f083bb6b2 check filetype before loading (no more mp4 loading)
16 years ago
f1ori 076ae02c44 * added pl and py to extensions excepted by htmlParser
16 years ago
f1ori d5e51cfd09 * workaround for non-working build property replacements
16 years ago
f1ori f814e0fa81 enable warnings and fix most of it
16 years ago
f1ori 8931c8d6b4 improvments to debianpackage:
16 years ago
low012 fc1dc38b55 *) added spaces to make sure that no words are concatinated by accident
16 years ago
low012 f242e7d7bc *) using Apache POI library to parse Word documents now
16 years ago
orbiter caedd72400 - enhanced logging and exception details for parsers
16 years ago
orbiter 4b74ad0a46 fixed setting of parser configuration servlets
16 years ago
orbiter 57a88d435b redesign of parser mime type detection and parser steering
16 years ago
lotus e15d27bc63 avoiding double/wrong parser errors
16 years ago
orbiter 21b8704fb4 refactoring of the ParserDispatcher and ParserConfig: resulted into Idiom, Parser and Classification classes
16 years ago
orbiter 8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished)
16 years ago
low012 1ee109761f *) added changes which were lost
16 years ago
orbiter 499723891d removed all non-http daemons; they had not been used and may be a potential security risk.
16 years ago
orbiter 0e8647d62f refactoring of search classes
16 years ago
orbiter dafffd0153 refactoring of parsers and document processing
16 years ago
low012 8041e91f56 *) Ooops!
16 years ago
low012 69551ff3d9 *) added several MIME types (derived from http://filext.com/), some of them might be rather uncommon
16 years ago
low012 11dfb2d54f minor changes:
16 years ago
orbiter 77d2a3782c removed strange debugging strings
16 years ago
lotus 4320f69574 universal handling for crashed parsers
16 years ago
orbiter 024744245c small refactoring to prepare for new queues
16 years ago