Commit Graph

1114 Commits (0c8ff6729f69d3890be585ba21e0f0055bcc2f55)

Author SHA1 Message Date
borg-0300 6b5f28b746 answer for last commit: no
18 years ago
borg-0300 d98ba7bc33 fix for memory limit computation ?
18 years ago
orbiter c48374d14a new memory limit computation for indexing queue
18 years ago
orbiter 08ac4c5ed0 bugfix for http://www.yacy-forum.de/viewtopic.php?p=29045#29045
18 years ago
orbiter 8e3bd17554 adapted DetailedSearch page to new ranking options
18 years ago
orbiter 93a7e88245 more ranking parameter usage
18 years ago
orbiter 2dbea612c9 fixed display bug for image search preview
18 years ago
orbiter 0a050bc043 enhanced ranking
18 years ago
orbiter 61798f0ae6 added option to distinguish between text crawl and media crawl
18 years ago
orbiter febe6b114a design update of crawler monitor
18 years ago
allo 782db9099d version independent name for commons-pool lib
18 years ago
orbiter 7ff86d6ba6 - image search now shows thumbnails (in bad order, but it works)
18 years ago
orbiter ee3d91cb6b print-out of links that result from constraint-filtering
18 years ago
orbiter e4570bffaf -implemented a specialized snippet-fetch for media content
18 years ago
low012 694a6e4f44 *) better text snippets: any possible searchword (welt, linux, tag) in welt-linux-tag will be marked correctly now
18 years ago
orbiter bddc197453 restored change from low012/SVN 3068 that was removed by mistake
18 years ago
orbiter 1377c53aa3 extraction of media links from search results
18 years ago
low012 586add4c6c *) Better snippets: words like GNU/Linux will not prevent Linux or GNU from being marked if they are searchwords (see http://www.yacy-forum.de/viewtopic.php?t=2891)
18 years ago
borg-0300 8b7c543885 NullPointer fix
18 years ago
orbiter 937ccd4e76 fix for snippet-generation
18 years ago
auron_x c086c71f17 *) fixed ArrayIndexOutOfBoundsException
18 years ago
orbiter c93cfdc23a fix for http://www.yacy-forum.de/viewtopic.php?p=28564#28564
18 years ago
orbiter 93a5ace330 fix for http://www.yacy-forum.de/viewtopic.php?p=28544#28544
18 years ago
orbiter bf0d820659 - added correct flagging of word properties
18 years ago
orbiter 10d888e70c - added a media search for images, audio, video and applications
18 years ago
orbiter a603c4d5e8 more code simplifications
18 years ago
orbiter 9a85f5abc3 cleanup
18 years ago
borg-0300 fbe1ee402b plasmaCrawlLURL$kiter cleanup
18 years ago
orbiter 773ba1e91a - generalized object order handling
18 years ago
borg-0300 15381cbf73 other bugfix
18 years ago
borg-0300 ad65cc9d2f NullPointer fixes
18 years ago
borg-0300 d33745a7ea NullPointer
18 years ago
orbiter 3a4933b63c bugfix for
18 years ago
orbiter 109ed0a0bb - cleaned up code; removed methods to write the old data structures
18 years ago
orbiter 052f28312a removed assortments from indexing data structures
18 years ago
orbiter 2372b4fe0c release 0.49
18 years ago
orbiter f8efb3c948 fixed a null pointer exception problem reported in the forum.
18 years ago
orbiter ad1e4aa88e added selection of audio, video, image and application resources
18 years ago
orbiter 7cc4cec9c9 bugfix for assertion bugs documented in
18 years ago
orbiter 7dbcd358b4 fix for http://www.yacy-forum.de/viewtopic.php?p=28231#28231
18 years ago
orbiter 86394e7a56 fix for cache-delete problem:
18 years ago
orbiter ceb9e3aa17 - enhanced parser: collection of audio, video, image and application links
18 years ago
orbiter 0b9370a9dc fix for http://www.yacy-forum.de/viewtopic.php?p=28108#28108
18 years ago
orbiter b5a29e9651 - fix for snippets that are too short
18 years ago
orbiter f1528672b1 filtering of non-index pages during index-of search
18 years ago
orbiter 8e7215475b - extended ViewFile to use it as a debugging tool: you can now use the
18 years ago
orbiter 30888e7a2f implementation of search constraints
18 years ago
orbiter 49a83f99d9 - fix for wrong DHT ordering in DHT selection
18 years ago
orbiter f4b547dc13 limited index transfer to peer with version 0.486
18 years ago
orbiter 10a4ab5195 disabled some (more) write caches
18 years ago
orbiter 09bcc10344 bugfix for some problems of last change with assortments
18 years ago
orbiter e3d75f42bd final version of collection entry type definition
18 years ago
orbiter c9364246cc introduced new RWI-Object.
18 years ago
orbiter e628d34e16 patches for bad data
18 years ago
orbiter 497428c8ec refactoring
18 years ago
orbiter 76fceb9997 refactoring
18 years ago
orbiter eeda881553 bugfix for last commit
18 years ago
orbiter bb7d4b5d5e refactoring to prepare new RWI entry object
18 years ago
orbiter bdc9216366 - more asserts
18 years ago
orbiter 1751a799ac - deactivated all write buffers
18 years ago
orbiter ba967c4875 - bugfixes and debug code
18 years ago
orbiter ee4715a21c - more asserts
18 years ago
orbiter 114a76a86e - added flag to urlhash that shows that domain is a local domain
18 years ago
orbiter b2d51be33c bugfix for latest changes to entry generalization
18 years ago
hermens 8385557672 Small fix for the Cache Monitor when using proxyCacheLayout=hash
18 years ago
orbiter f1ed55a5fc bugfix for last commit
18 years ago
orbiter 8fdefd5c68 generalization of payload definition of index storage
18 years ago
theli ad248d61ca *) more verbose exception
18 years ago
hydrox 7e8669b15c *) added possibility to "recycle" a DHTChunk that failed to transfer.
18 years ago
low012 4feaa91890 *) Added additional MIME-Type.
18 years ago
low012 89af433879 *) Deleted parts of WebCat that were not needed for parsing SWFs.
18 years ago
orbiter 46a712e195 - more asserts
18 years ago
low012 8c9bc7e341 *) extracting urls works now
18 years ago
low012 493391e42d *) new flash parser, still experimental
18 years ago
orbiter 215c4e65f1 code cleanup
18 years ago
orbiter bd4f43cd66 - fixed a null pointer exception bug
18 years ago
auron_x 194d42b6a7 *) changed PPM-calculation to be more accurate
18 years ago
orbiter fe8afaf426 switched off usage of write cache for important databases
18 years ago
orbiter d3431433b0 more anonymization in logging
18 years ago
orbiter e6044e5198 bugfix for
18 years ago
orbiter 78b7f6f7fd bugfix for index remove bug,
18 years ago
orbiter 147d88cf23 re-design of database caching
18 years ago
orbiter 4e363108e1 - removed bad debug code that caused a large and unnecessary delay during global search
18 years ago
orbiter 2a9d868f6d - removed object cache from kelondroTree
18 years ago
orbiter 3ffc5b8793 fixed problem with serverCharBuffer.append(char)
18 years ago
orbiter 06854988da - full integration of new LURL database in INDEX
18 years ago
octoate e4a3574b77 StringBuffer now resets every time the parser is called
18 years ago
karlchenofhell ce237aefad - assortment-sizes table from PerformanceQueues_p.html is not shown if not used
18 years ago
theli a5b9b514c1 *) retry crawling without content-encoding if the content-encoding header was not correct
18 years ago
theli 92f774edd1 *) Better charset encoding detection
18 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
18 years ago
octoate cc24dde5e0 First version of a MS Excel parser based on Apache POI
18 years ago
karlchenofhell 4c63129136 - stupid mistake...
18 years ago
karlchenofhell ebf0da2a45 - now the fix http://www.yacy-forum.de/viewtopic.php?t=2974 works
18 years ago
theli 3d152bfe43 *) Logging message added
18 years ago
karlchenofhell b5e40e2fa2 - fix for http://www.yacy-forum.de/viewtopic.php?t=2974 (no cache-sizes for new db)
18 years ago
orbiter 77a59a115d refactoring of indexing methods
18 years ago
theli cbb1e710b9 *) removing old class
18 years ago
orbiter c6d46f7ebd null pointer bugfix
18 years ago
theli decb09df6d *) Trying to be more tolerant against wrong charset names
18 years ago
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names
18 years ago
theli 7526c831a8 *) Suppressing stacktrace
18 years ago
orbiter 50f2578c55 - some bugfixing and code cleanup
18 years ago
orbiter bdf4c7c51e added missing files for last commit
18 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
18 years ago
octoate 1c4076da8a First version of the MS Powerpoint parser based on Apache POI
18 years ago
theli 5b75d64d7d *) bugfix for last commit
18 years ago
theli 71ed104bc7 *) adding additional rpm mimetype (used by packman)
18 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db
18 years ago
hermens 48f81acc0e reverse SVN 2744, it is not needed
18 years ago
hermens 1da9aece12 Repair DNS prefetch during cacheScan
18 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing
18 years ago
theli a9c7e3f061 *) Bugfix for NoSuchElementException
18 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
18 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now
18 years ago
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories
18 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
18 years ago
orbiter 25ae3d3161 generalized definition of hexhash
18 years ago
orbiter f0d747c723 removed deprecated method
18 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method
18 years ago
orbiter 0f10bdde22 more generic cache methods
18 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout
18 years ago
hermens 440c6ee657 Implement alternative htcache layout
18 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot.
18 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
18 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
18 years ago
orbiter db294687ea enhanced logging
18 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
18 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal to or less than 5MB
18 years ago
theli 1586d57187 *) odtParser: better handling of large files
18 years ago
theli f17ce28b6d *) plasmaHTCache:
18 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
18 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
18 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
18 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
18 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
18 years ago
theli cd5f349666 *) Better handling of large files during parsing
18 years ago
low012 f8ac694e51 *) fixed a bug where searchwords in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998)
18 years ago
orbiter df1629b05a - code cleanup
18 years ago
theli b73efd5565 *) missing changes needed because of last commit
18 years ago
orbiter 2463e5624a 'quick' release 0.47
18 years ago
theli 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache
18 years ago
theli 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1
18 years ago
hermens 3f5a4153a0 Make Peers more receptive to transferred indexes
18 years ago
theli b6c7b91582 *) Parser now throws a ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
18 years ago
theli 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread
18 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
18 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correctly on the gui
18 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
18 years ago
theli 5c6251bced *) some improvements for extended html document charset support
18 years ago