1499 Commits (7429687601d13972bedc5ea5d015af36ac9085a0)

Author SHA1 Message Date
orbiter f1ed55a5fc bugfix for last commit
19 years ago
orbiter 8fdefd5c68 generalization of payload definition of index storage
19 years ago
theli ad248d61ca *) more verbose exception
19 years ago
hydrox 7e8669b15c *) added possibility to "recycle" a DHTChunk that failed to transfer.
19 years ago
low012 4feaa91890 *) Added additional MIME-Type.
19 years ago
low012 89af433879 *) Deleted parts of WebCat that were not needed for parsing SWFs.
19 years ago
orbiter 46a712e195 - more asserts
19 years ago
low012 8c9bc7e341 *) extracting urls works now
19 years ago
low012 493391e42d *) new flash parser, still experimental
19 years ago
orbiter 215c4e65f1 code cleanup
19 years ago
orbiter bd4f43cd66 - fixed a null pointer exception bug
19 years ago
auron_x 194d42b6a7 *) changed PPM-calculation to be more accurate
19 years ago
orbiter fe8afaf426 switched off usage of write cache for important databases
19 years ago
orbiter d3431433b0 more anonymization in logging
19 years ago
orbiter e6044e5198 bugfix for
19 years ago
orbiter 78b7f6f7fd bugfix for index remove bug,
19 years ago
orbiter 147d88cf23 re-design of database caching
19 years ago
orbiter 4e363108e1 - removed bad debug code that caused a large and unnecessary delay during global search
19 years ago
orbiter 2a9d868f6d - removed object cache from kelondroTree
19 years ago
orbiter 3ffc5b8793 fixed problem with serverCharBuffer.append(char)
19 years ago
orbiter 06854988da - full integration of new LURL database in INDEX
19 years ago
octoate e4a3574b77 StringBuffer now resets every time the parser is called
19 years ago
karlchenofhell ce237aefad - assortment-sizes table from PerformanceQueues_p.html is not shown if not used
19 years ago
theli a5b9b514c1 *) retry crawling without content-encoding if the content-encoding header was not correct
19 years ago
theli 92f774edd1 *) Better charset encoding detection
19 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
19 years ago
octoate cc24dde5e0 First version of a MS Excel parser based on Apache POI
19 years ago
karlchenofhell 4c63129136 - stupid mistake...
19 years ago
karlchenofhell ebf0da2a45 - now the fix http://www.yacy-forum.de/viewtopic.php?t=2974 works
19 years ago
theli 3d152bfe43 *) Logging message added
19 years ago
karlchenofhell b5e40e2fa2 - fix for http://www.yacy-forum.de/viewtopic.php?t=2974 (no cache-sizes for new db)
19 years ago
orbiter 77a59a115d refactoring of indexing methods
19 years ago
theli cbb1e710b9 *) removing old class
19 years ago
orbiter c6d46f7ebd null pointer bugfix
19 years ago
theli decb09df6d *) Trying to be more tolerant against wrong charset names
19 years ago
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names
19 years ago
theli 7526c831a8 *) Suppressing stacktrace
19 years ago
orbiter 50f2578c55 - some bugfixing and code cleanup
19 years ago
orbiter bdf4c7c51e added missing files for last commit
19 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
19 years ago
octoate 1c4076da8a First version of the MS Powerpoint parser based on Apache POI
19 years ago
theli 5b75d64d7d *) bugfix for last commit
19 years ago
theli 71ed104bc7 *) adding additional rpm mimetype (used by packman)
19 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db
19 years ago
hermens 48f81acc0e reverse SVN 2744, it is not needed
19 years ago
hermens 1da9aece12 Repair DNS prefetch during cacheScan
19 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing
19 years ago
theli a9c7e3f061 *) Bugfix for NoSuchElementException
19 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
19 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now
19 years ago
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories
19 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
19 years ago
orbiter 25ae3d3161 generalized definition of hexhash
19 years ago
orbiter f0d747c723 removed deprecated method
19 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method
19 years ago
orbiter 0f10bdde22 more generic cache methods
19 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout
19 years ago
hermens 440c6ee657 Implement alternative htcache layout
19 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot.
19 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
19 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
19 years ago
orbiter db294687ea enhanced logging
19 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
19 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
19 years ago
theli 1586d57187 *) odtParser: better handling of large files
19 years ago
theli f17ce28b6d *) plasmaHTCache:
19 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
19 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
19 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
19 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
19 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
19 years ago
theli cd5f349666 *) Better handling of large files during parsing
19 years ago
low012 f8ac694e51 *) fixed a bug where searchword in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998)
19 years ago
orbiter df1629b05a - code cleanup
19 years ago
theli b73efd5565 *) missing changes needed because of last commit
19 years ago
orbiter 2463e5624a 'quick' release 0.47
19 years ago
theli 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache
19 years ago
theli 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1
19 years ago
hermens 3f5a4153a0 Make Peers more receptive to transferred indexes
19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
19 years ago
theli 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread
19 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
19 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui
19 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
19 years ago
theli 5c6251bced *) some improvements for extended html document charset support
19 years ago
orbiter f453c14b5d removed unreachable catch blocks and unused imports
19 years ago
theli ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class
19 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
19 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
19 years ago
orbiter f644a1c3a7 better evaluation of index abstracts
19 years ago
allo 2fd610b556 http://www.yacy-forum.de/viewtopic.php?p=25611#25611
19 years ago
theli 06fa891152 *) htmlFilterContentScraper.java: using proper charset for document title
19 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser)
19 years ago
theli c5d3020941 *) better errorhandling for last commit
19 years ago
theli d0a5a53789 *) changes needed for multi-language support
19 years ago
orbiter 26ab1fa885 fixed null pointer exception
19 years ago
theli b0e8ff6eda *) some TODO markers for UTF-8 problem
19 years ago
orbiter 41e27b85b7 fix for crawler condition
19 years ago
theli 9ecf7f0da2 *) some TODO markers for UTF-8 problem
19 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
19 years ago
orbiter 6e2907135a bugfixes for remote search server part
19 years ago
orbiter cf9884e22b first attempt to implement a secondary search
19 years ago
orbiter b251076e64 avoid ConcurrentModificationException
19 years ago
orbiter 75b198bc02 - updated references to indexContainer
19 years ago
orbiter b7e7808ea6 wordmigration now works also for new index database
19 years ago
theli a0ddf2ec11 *) AbstractCrawlWorker.java: delete already downloaded data on crawling error
19 years ago
orbiter 4f9e42d5ed more changes towards better join-search
19 years ago
orbiter a7281a9b4d fix for last commit
19 years ago
orbiter 82a6054275 - fixed bug with new indexAbstract generation
19 years ago
theli fded1f4a5d *) better handling of maximum file size limit in crawler
19 years ago
orbiter 74d1dea30b changes towards better join-search
19 years ago
orbiter ae4e8ce03e - cut for 'probably last html-interface version': version number update
19 years ago
orbiter 64bed59ee8 enhancements to ranking
19 years ago
theli 63893003be *) Adding settings page for the crawler which allows to specify a file size limit and the timeout to use.
19 years ago
orbiter 94d7ced900 fix for last ranking commit
19 years ago
orbiter 03835c2ee8 enhanced search result computation
19 years ago
orbiter ac3419b65f better debugging for indexOutOfBoundException bug
19 years ago
orbiter a8bc768206 enhancements to ranking evaluation
19 years ago
theli 33898ae7e9 *) ResourceInfoFactory.java: Bugfix for classNotFoundException
19 years ago
theli 406e170e25 *) more verbose error message
19 years ago
theli b298474e22 *) Bugfix needed because of changed plasmaCrawlLURL.load behavior
19 years ago
orbiter 96c6e4e322 - enhancements to detailed search page
19 years ago
orbiter 9340dbb501 fixed all possible problems with nullpointer exception for LURLs
19 years ago
theli a5ed86105b *) bugfix for handling of ResourceInfo object in proxy
19 years ago
hermens ff4362b02d some more fixes for new plasmaCrawlLURL.load behavior
19 years ago
hermens 7aeadbe7cc another NullPointerException in http.ResourceInfo
19 years ago
orbiter 141f9e5bb4 fix for new plasmaCrawlLURL.load behavior
19 years ago
hermens 087f7511f8 prevent NullPointerException in http.ResourceInfo
19 years ago
orbiter a2525072f2 bugfix for kelondroRow - property generation
19 years ago
theli b44514242a *) crawler/ftp/CrawlWorker.java: better errorhandling
19 years ago
theli 7d7f30139c *) crawler/ftp/CrawlWorker.java: delete old cache file
19 years ago
theli 4ae0f122f8 *) ResourceInfo.java: License header added
19 years ago
theli 043edfa4d8 *) ftp/ResourceInfo.java ResourceInfo object for ftp resources added
19 years ago
orbiter 4866868c0e added write cache for LURLs
19 years ago
orbiter 8a0e35618b enhancements to search result preparation
19 years ago
theli 5c1bb53d2a Missing description for last commit
19 years ago
theli dae763d8e3 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli 4825bfaaf3 *) Bugfix for PrintWriter Problem
19 years ago
theli 7930839594 *) URL.java: userinfo was not taken over when generating a new url from a base url and a rel. path
19 years ago
theli 7a35b8e237 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
19 years ago
theli ffbf416e76 *) direct access to requestheader of htCache.Entry removed to make it more http independent
19 years ago
theli 3870d615e3 *) setting htCache.Entry fields to private
19 years ago
theli 393a7d10be *) setting htCache.Entry fields to private
19 years ago
theli ab5a9bee66 *) adding some copyright headers
19 years ago
theli 5847492537 *) next step of restructuring for new crawlers
19 years ago
theli fce9e7741b *) next step of restructuring for new crawlers
19 years ago
theli e3f0136606 *) next step of restructuring for new crawlers
19 years ago
theli 9ded4e8d5a *) Bugfix for name resolution in proxy mode
19 years ago
theli 1c8300fcec *) Bugfix for name resolution in proxy mode
19 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers
19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers
19 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
19 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
19 years ago
theli b4acbdaa97 *) better handling of server shutdown
19 years ago
theli f3ac4dbbb9 *) better handling of server shutdown
19 years ago
theli 959b779aba *) avoid performance loss if log level is greater than 'fine'
19 years ago
orbiter 18b6876860 new cache flush configuration settings
19 years ago
hermens f0278b4092 Bugfix for / by zero when the AssortmentCluster is empty
19 years ago
orbiter 14e0bb0dcf allow more references per word for new db
19 years ago
orbiter 985dcbde7f changed some parameters that may cause better memory usage and more indexing speed
19 years ago
orbiter b7f4a1521b added options to switch on or off the kelondroFlexTable for NURL, EURL and PreNURL
19 years ago
orbiter c26da4893b turned back NURL usage of kelondroTree, kelondroFlexTable has still problems with deleted entries
19 years ago
orbiter db1eae0227 * simplified initialization of database objects
19 years ago
hermens 0b73f2b132 Repair DNS prefetch during cacheScan
19 years ago
orbiter 27a159b401 * documentation update
19 years ago
theli f80f776b89 *) Trying to solve NullpointerException problem in function addURLtoErrorDB
19 years ago
hydrox 1c99b5a484 *)fixed logging for urldbcleanup
19 years ago
orbiter 8f3f4ab0eb enhanced synchronisation in plasmaWordIndex
19 years ago
orbiter 23dd972608 fixed memory calculation in performanceMemory web page
19 years ago
orbiter 1ce3c22761 better memory control:
19 years ago
orbiter 39b4c26bdc more memory control:
19 years ago
orbiter 3e9d509c39 some small fixes
19 years ago
orbiter eb633c0a4f server threads must now supply a method that can be called in case
19 years ago
orbiter f5720cb2fa removed most synchronization in wordIndex (for testing)
19 years ago
orbiter 0187c60010 because of a bug in the JRE 1.4.2 there was no memory protection
19 years ago
orbiter cfb51fdef1 less synchronization in plasmaWordIndex
19 years ago
orbiter d6a928c2da quickfix for http://www.yacy-forum.de/viewtopic.php?t=2705
19 years ago
orbiter 6ad471ef96 * applied many compiler warning recommendations
19 years ago
hydrox 9da3aa74d3 silly me, fix for the fix as advised by theli
19 years ago
hydrox bb3d9a5582 *) e.getMessage().indexOf() can only be used if there is actually an ExceptionMessage.
19 years ago
hydrox 7a54010a9c *) Iterators can't be cast to IndexContainer
19 years ago
orbiter cd5f7e137c fixed problem with NURL-generation upon first startup
19 years ago
orbiter 8418af141a added several consistency checks and small changes
19 years ago
theli 9d13aeca13 *) removing class. does not work so far
19 years ago
theli 95a84ae469 *) adding missing classes
19 years ago
theli eee44be602 *) adding an interface for customized blacklist classes
19 years ago
orbiter 6d2f15971a there is a very strange error that causes that the kelondroRecords structure
19 years ago
theli d2e8e76218 *) now it's possible to configure the yacy blacklist separately for dht, search, proxy, crawler
19 years ago
orbiter 9ae9062bd3 * disabled new kelondroFlex table for NURLs
19 years ago
orbiter 689bbcf9cd replaced kelondroTree db for NURLs by new kelondroFlexTable
19 years ago
orbiter 7fbba41962 synchronization fixes
19 years ago
orbiter 328f9859a5 more synchronization in plasmaWordIndex
19 years ago
orbiter 130e6d4719 generalized index object for eurl, nurl and lurl to prepare move
19 years ago
orbiter acdf24877f more synchronization against outOfMemoryError in wordIndex
19 years ago
orbiter 95160d7f2c fixed size computation of index elements from the collection index
19 years ago
orbiter 26116cabde added missing rowdef assignment
19 years ago
orbiter abf22f6e60 removed url normalform computation from htmlFilterContentScraper.
19 years ago
orbiter 740d49751d * strict type and size check in kelondroRow handling
19 years ago
orbiter 314021453f * more logging
19 years ago
orbiter 61b151b083 * added another auto-fix for collection index inconsistency check
19 years ago
orbiter f58283def2 better control of index flush
19 years ago
orbiter 4be21a3cab ups
19 years ago
orbiter 80b6c90d54 enhancements to prevent blocking during dht transfer receive
19 years ago
theli 9f298083cd *) adding more urls to the error url
19 years ago
hermens d56f06401e - Cache known URLs during indexReceive to avoid getting blocked during loadedURL.exists() whenever possible
19 years ago
theli c09f734d06 *) offer router configuration on ConfigBasic.html
19 years ago
hermens dcbb4d0a6b Display the size of HashBlacklistedCache on PerformanceMemory page.
19 years ago
orbiter d799622da1 better flush limit for index collections
19 years ago
orbiter 279b1d969d Integrated new indexing data structure 'collections' into the main class
19 years ago
orbiter 4ff742e42d implemented indexCollectionRI
19 years ago
orbiter 01f95eccd3 re-write of kelondroCollectionIndex. This is the data structure that
19 years ago
orbiter ebc2233092 * implemented (finished) class indexRowSetContainer
19 years ago
orbiter 9183d21f25 renamed new index class to old name
19 years ago
orbiter c4e922885a replaced indexURLEntry by new class that uses a kelondroRow.Entry object
19 years ago
orbiter e357599f92 * fixed problem with indexContainer iteration from RAM:
19 years ago
orbiter 8b77afd72c some fixes to new container merger
19 years ago
orbiter 417ed5102e redesign of database iterators:
19 years ago
orbiter ad692fc6c7 implemented option to extract nurls from the database
19 years ago
orbiter 7fd90ca7c8 * strict handling of NURL entry element generation, storage and stacking
19 years ago
orbiter 5f72be2a95 some redesign of EURL storage
19 years ago
orbiter 1ed3e2daef added option to extract domains and/or urls from the eurl database
19 years ago
orbiter 58df8b7bbf a large collection of different changes
19 years ago
orbiter e4f1820b58 protection against too long authentication strings in switchboard
19 years ago
theli b3c569f706 *) renaming of function getTransferedEntitySpeed to getTransferedEntrySpeed to avoid confusion
19 years ago
orbiter 5214f571cd simplified method call in balancer
19 years ago
orbiter 7935f27038 enhanced synchronization in balancer
19 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL
19 years ago
orbiter 07900366ac deactivated cache-initialization for file-indexes (files in WORDS)
19 years ago
orbiter 40aa735520 fixed timing problem causing too long delay during initialization of kelondroTree objects
19 years ago
theli 24a02cbeef *) Bugfix for not parsable application/xhtml+xml resources if
19 years ago
orbiter b0ca5fa784 some correction algorithm for preload time computation during assortment open
19 years ago
orbiter e22cbaee97 - extended logging for preload
19 years ago
orbiter 671fd9a5c9 work towards new indexing database structure
19 years ago
orbiter 92f4cb4d73 added option to configure the start-up delay time for kelondro database files.
19 years ago
orbiter 6643da3fbd bugfix for http://www.yacy-forum.de/viewtopic.php?p=23463#23463
19 years ago
hydrox 8ba8e2b7d9 *) added cache for blacklisted urlhashes received by DHT. DHT does not request URLs listed in this cache.
19 years ago
hermens 53cbcc6d6e Implement emergency break in index receive when the limit of the ramCache is exceeded by more than cacheLimit
19 years ago
orbiter 66964dc015 removed high/med/low from kelondroRecords cache control.
19 years ago
borg-0300 4c6083b264 network picture;
19 years ago
borg-0300 955915385a network picture;
19 years ago
borg-0300 027fa8ab1c network picture;
19 years ago
theli b20496e42b *) make DHT DoS check configurable (requested by KoH)
19 years ago
orbiter 12af69dd86 cosmetics
19 years ago
allo 67a8c74be3 Fix for dynamic login with static password.
19 years ago
allo ef9eb50c3c fix for adminlogin
19 years ago
allo 6fe2fed87e cookieauth works with static Admin.
19 years ago
theli 45b39ee1be *) solving unpacking problems with too long filename by
19 years ago
theli fb090652df *) use a more compact for plasmaWordIndexAssortmentImporter.java because the long name
19 years ago
theli 4ca0857c0c *) Index transfer now considers the pause time sent by busy peers during
19 years ago
orbiter 75ed507d39 some debugging of new kelondroFlexTable class
19 years ago
orbiter 370c481fa7 bugfixes
19 years ago
orbiter c36e9fc8d3 full integration of kelondroRow
19 years ago
orbiter c75cacda95 added a flex-width-array: this is a table where it is
19 years ago
orbiter 4a907a570f 1st step to migrate kelondroTree to usage of kelondroRow instead of byte[][]
19 years ago
orbiter 09f780df27 more bugfixes for the new row/stack handling changes
19 years ago
orbiter 3c3c047d0a integrated kelondroRow into kelondroStack
19 years ago
orbiter 5bb565944f integration of new kelondroRow into some parts of kelondro,
19 years ago
orbiter eaa6f012f0 refactoring: better naming for classic DB (files in WORDS)
19 years ago
orbiter 5041d330ce refactoring
19 years ago
orbiter 7b3b12888c refactoring: integrated indexContainer abstraction layer
19 years ago
orbiter cb295fbbdc refactoring
19 years ago
rramthun bc94a714b2 Better explanation for the auto-dom-filter.
19 years ago
orbiter 196b8abb30 refactoring
19 years ago
hermens b48327904a Don't disconnect peers that report 'busy' during index transfer.
19 years ago
orbiter 4d8f8ba384 added cache-performance analysis for node caches
19 years ago
orbiter bd057b44dd - automatic setting of peer-does-not-accept-remote-crawl
19 years ago
orbiter 81e79f2caf fixed new cache behaviour changes
19 years ago
orbiter cda087f43b - integrated cache miss storage into object cache
19 years ago
orbiter 757ec28430 refactoring: better data encapsulation for indexURL
19 years ago
theli 61078b3885 *) adding support for delayed shutdown
19 years ago
orbiter 90d569d70f refactoring of index management:
19 years ago
orbiter a930be4ba3 refactoring of index management:
19 years ago
hermens df7e1d9df3 Changes to plasmaURL and subclasses:
19 years ago
orbiter a474669338 start with refactoring of index management
19 years ago
rramthun f08e33680c Added Blog-news-symbol as requested.
19 years ago
theli f331def5d8 *) Bugfix for distribution. Incorrect behavior if peerCount == selectedCount
19 years ago
auron_x 55ea4cbfe6 *)reverted patch for memory-display issue
19 years ago
theli 5048b05bc6 *) Index Transfer should only restart at the beginning if the delete
19 years ago
auron_x 53d9ab6db7 *)fixed bug in PerformanceMemory_p.java which caused negative memory-values on big peers
19 years ago
theli ddfe0f0e27 *) don't try to parse referer string if it's null
19 years ago
theli bcc950c533 *) Bugfix for Index Transfer
19 years ago
orbiter 015d044c25 tried to fix some problems with latest changes to httpc
19 years ago
orbiter 3e31820c3d - corrections to PerformanceMemory display of object cache
19 years ago
orbiter 461548698c configuration of index transfer chunk size
19 years ago
orbiter 29b1b0823c added monitoring of new object cache to performanceMemory page
19 years ago
theli 9104001e7c *) Better error handling for assortment import
19 years ago
hermens 51e3bb576f Don't increase dhtTransferIndexCount when the last transferred index was smaller
19 years ago
hermens a0ca4c5fb8 Remove a possible race condition between DHT transfer and deQueue
19 years ago
hermens 0cfba8950f Removing unnecessary and possibly dangerous synchronization of the wordIndex
19 years ago
orbiter d6213f8a85 quickfix for http://www.yacy-forum.de/viewtopic.php?p=19482#19482
19 years ago
orbiter b0036249c1 added some attributes to network picture
19 years ago
hermens cbcf7418ef Cleanup synchronization in plasmaWordIndex
19 years ago
orbiter 60e5aff9fc some enhancements to the remote crawl trigger
19 years ago
orbiter dbe96e6541 added hand-over of search filter and prefer ranking to yacy protocol
19 years ago
rramthun 0604203bce Updated and corrected German language file
19 years ago
orbiter 00a5d435e2 - fixed some bugs with domain filter
19 years ago
orbiter 14d6e476c9 tried to solve some problems with new picture viewer
19 years ago
orbiter 9324425165 fix for remote crawl reject
19 years ago
borg-0300 30e4fc39a5 HTCache extended
19 years ago
orbiter d0dd8b14d2 fixed picture tag and presentation
19 years ago
borg-0300 da6a8bafa2 rename currCacheSize -> curCacheSize;
19 years ago
borg-0300 92110aea32 nullpointer fix for profile(); other minor change;
19 years ago
orbiter f0833b0328 introduced simple search interface
19 years ago
orbiter 47b541b2d1 added better option handling in yacysearch
19 years ago
orbiter c9e16bfd48 first try to insert image search (does not work yet)
19 years ago
orbiter f77775220b fixed parser error
19 years ago
orbiter 22de954a57 added some log output to parser
19 years ago
orbiter 83e0e765ec redesigned some parts of the html scanner & parser
19 years ago
orbiter ac114d69c0 tried to fix some problems with time-outs during search
19 years ago
orbiter e2e8d0c188 some kind of refactoring of yacysearch:
19 years ago
orbiter 6b63e26cbb - removed search function from index.html/java, only input left
19 years ago
orbiter bc3e80fe42 quickfix
19 years ago
orbiter d8d0ac29c3 added image-viewer servlet that can do:
19 years ago
orbiter ddc6394d9b fixed bug about auto-depth 0
19 years ago
orbiter 60351fa3f7 small fix to previous commit
19 years ago
orbiter a469874e3f added and fixed time-out behaviour during search
19 years ago
orbiter 1d0b0d6e2a synchronized local searches to prevent several searches from being performed at the same time
19 years ago
hermens 22b9d03bbf Correcting remaining time issue in getContainers
19 years ago
orbiter d58788b753 added some synchronisation
19 years ago
orbiter e566d1d8d6 some bugfixes regarding new crawling options
19 years ago
orbiter c7f1300300 -fixes for last commit
19 years ago
orbiter f2421f6a47 some small attribute changes regarding cache flush
19 years ago
orbiter 7a650d0023 several bugfixes
19 years ago
orbiter 59d52fb4a9 fixed some problems with crawl profiles
19 years ago
orbiter 708cc6c8d9 fixed some bugs for auto-filter and added monitor in profile list
19 years ago
rramthun 250864406f ...
19 years ago
orbiter e82899ba57 fixed missing urls map initializer
19 years ago
orbiter 63f39ac7b5 added 3 new crawling steering options:
19 years ago
orbiter 1fc3b34be6 some pre-work (without function yet) to implement:
19 years ago
theli c9e6b5e391 *) check size of indexing-queue and crawler pool before processing remote triggered crawl jobs
19 years ago
orbiter 1509314ea6 set tighter control during DHT index and peer selection
19 years ago
hydrox fcc0683200 *) undoing last commit
19 years ago
hydrox 9411961eec *) another little fix for DHT-Transfer
19 years ago
hydrox 8b14a0c833 *) little fix for DHT-Transfer
19 years ago
orbiter 1f4412a146 adapted isListed to new behavior as discussed (url, getFile)
19 years ago
orbiter 063ef4660a bug?
19 years ago
orbiter 82358677a9 added another shiftK2W to flushCacheSome
19 years ago
orbiter 128e4ab199 - in serverSystem: maxPathLength is now a variable, not a method
19 years ago
orbiter 30e3e3a0fd adapted MAXPATHLENGTH to host system capabilities
19 years ago
borg-0300 85bb8e32a1 Bugfix for last commit
19 years ago
borg-0300 3fe402069f try to fix
19 years ago
orbiter f16f1f15cd bugfix for 100% CPU bug; thanks to Matthias for analysis
19 years ago
borg-0300 254a13efd9 MAXPATHLENGTH used
19 years ago
borg-0300 8865948e4e Cleanup;
19 years ago
orbiter 6c70f4a0cf renamed wordHashes for a word hash set generation to wordHashSet
19 years ago
orbiter d5f8f40c31 removed correcting iterator
19 years ago
orbiter 488a0ed580 replaced old keyIterator and rowIterator by buffered iterators
19 years ago
hermens 4e9a8f41fd rwiDBCleaner + dbImporter: Iterate over small excerpts of
19 years ago
hermens 474379ae63 remove TABs from plasmaDbImporter.java
19 years ago
orbiter dba02f399f starting of re-design of kelondroTree iterator
19 years ago
orbiter f02b426073 made kelondroTree.nodeIterator private
19 years ago
borg-0300 5f6fdf1786 Bugfix for getCachePath(URL url)
19 years ago
orbiter 303b6463a8 added debug line to URL storage for testing
19 years ago
orbiter 91dca2cd8d fixed a bug in last commit: LURL entries cannot be written,
19 years ago
orbiter 3286b1f498 re-organisation of lurl-creation and -stacking
19 years ago
orbiter 0b903c5317 removed usage of kelondroNaturalOrder from plasmaCondenser to experimental
19 years ago
orbiter 4239db0d1c fixed new ordering for backup iterator TreeSet
19 years ago
orbiter 33eba5ecb8 temporary disabling last change, does not work (cannot debug right now)
19 years ago
orbiter f0464042fc fix for latest iterator-replacement-fix:
19 years ago
borg-0300 ec21c585cb try to fix path too long
19 years ago
orbiter a6a3f4b694 fix for svn 1888
19 years ago
hydrox 8da13088e9 *)removed multiple DHT_Distribution_Threads
19 years ago
orbiter 283a7181c6 try to fix new 100% cpu bug, possibly caused by iterator method
19 years ago
orbiter f588c0724f removed cache flush in case of DHT receive
19 years ago
orbiter e94b374d56 update to cache flush method
19 years ago
orbiter bcd99fe83e introduced a second RAM cache for DHT transfer
19 years ago
hydrox 360a460da8 *)URL-Cleaner: moved logging-statement to correct position
19 years ago
orbiter 02f9765013 quickfix for time problem during cache restore
19 years ago
hermens ad119f06af *) Don't overwrite new entries with older ones
19 years ago
orbiter be88687d8c fixed some problems with new cache flush karenz
19 years ago
theli d3da7c9a08 *) Adding support for robots Allow directive
19 years ago
hydrox f046e1814a *) fix for last commit
19 years ago
hydrox c55c51e2a8 *)added keywords to IndexCleaner_p.java
19 years ago
orbiter ddbeda738e added minimum age of word in cache to performance menu
19 years ago
orbiter f188611fc6 apply blacklist on rwis during dht receive
19 years ago
orbiter 0ec28b8f8e added DBCleaner from Hydrox
19 years ago
theli fb4100d47b *) undoing last commit.
19 years ago
theli a84cc71218 *) removing getTotalRuntime
19 years ago
auron_x dce08771d1 *) Fix for wrong estimated and elapsed times when import was paused
19 years ago
hermens b34713324a DBImport: remove words from source index even if nothing has been added to home index
19 years ago
orbiter 520b60f15b fix for http://www.yacy-forum.de/viewtopic.php?p=18610#18610
19 years ago
orbiter bae3783d38 added a snippet marking
19 years ago
orbiter f0a38873eb * added yacysearch page with better view on search results
19 years ago
orbiter f0041d504d remove of several results from a single domain is stopped if the result set is smaller than the wanted number of results
19 years ago
theli 89286478e7 *) removing thread pool eviction for now. Not needed at the moment
19 years ago
theli 759800f543 *) Bugfix for storeHTCache problem
19 years ago
orbiter a8548c0484 * several bugfixes regarding basic configuration
19 years ago
orbiter 1b9b8922d9 * fixed problems with new basic 1-2-3 configuration (now authentication required)
19 years ago
auron_x 8c6f38fe70 *) added Blog to YaCy (atm not reachable through interface) -> Blog.html
19 years ago
orbiter ce5274c194 yacybot user agent
19 years ago
hermens 351bd0a678 *) dbImport: convert cacheSize to kb when creating plasma* objects
19 years ago
orbiter eaffcfefe2 * added more ranking attributes (without function; this will be added later)
19 years ago
orbiter 87e90b9d8c refinements in ram cache flush procedure and default timing
19 years ago
orbiter d31a4e0b4f some small enhancements with cache flushing parameters and data structures
19 years ago
orbiter 3703f76866 - fixed re-search bug: after a search with several words, a second search could not
19 years ago
theli fbbbf5f411 *) remote trigger for proxy-crawl
19 years ago
theli dc9174c809 *) Implementing snippet fetching via ajax
19 years ago
orbiter 1d8ca6e082 serialized dhtChunk deletion with indexing
19 years ago
theli 2336f0f013 *) allow pausing/resuming of crawlJob Threads separately
19 years ago
orbiter 60dac4325e serialized indexing with dht selection
19 years ago
orbiter a840755964 moved parts of index transfer logic back to switchboard
19 years ago
orbiter 134253a603 fixed bug with cache flush
19 years ago
orbiter c2d863855d different flush limit
19 years ago
borg-0300 64441b1f78 ADDED: yacy.badwords list to filter the topwords
19 years ago
orbiter f9063e2040 added some synchronization to avoid that several tasks can trigger a cache flush simultaneously
19 years ago
orbiter 2c4e4ae6a2 further refactoring of dht selection, transfer and flushing
19 years ago
orbiter 73dad68cf1 outsourced thelis DHT flush class into own file
19 years ago
allo aa4b04e3dd reverted last change
19 years ago
allo 4b0dae8fcf added a possibility to get the ranking values for an url.
19 years ago
orbiter 85ac7d8386 * moved DHT transfer thread to own class file, needed for further modularization
19 years ago
orbiter 7df2e6e571 bugfix for last commit
19 years ago
orbiter cd41e9a0eb moved DHT index selection to new object that holds indexes to be sent away to other peer.
19 years ago
theli 42a5f56723 *) Bugfix for broken dht thread configuration
19 years ago
theli f95d98142f *) displaying amount of items in the existsIndex caches
19 years ago
hydrox e2af2a3f45 *) it's now possible to run more than one indexDistribution-Thread
19 years ago
theli 40dd6ec4fd *) experimental restructuring of db import function
19 years ago
theli 2da18ab359 *) correcting logging output
19 years ago
theli 8ffc6e35ad *) correcting logging output
19 years ago
theli 980e986b64 *) Re enabling short cycle for already removed nurl entries
19 years ago
hermens 3b6328ad02 *) Consistent use of minCount for index transfer
19 years ago
hermens 0b60b9bf51 *) Remove entries from AssortmentCluster before reinserting the rest into the ramCache
19 years ago
hydrox 8ab1d6ff4b *) fixed NullPointerException in plasmaWordIndexEntity
19 years ago
allo a26574c894 Migration from tagName as key to wordhash(tagName) as key for bookmarkTags.db
19 years ago
orbiter 7eb10675b3 re-organization of index management
19 years ago
orbiter 1e4578aab6 VERY EXPERIMENTAL removal of index ram cache flushing thread.
19 years ago
hermens 954f02d22e *) Bugfix: Prevent wordIndex.getContainer() from returning and even manipulating
19 years ago
orbiter fe39493145 changed default ranking parameters
19 years ago
orbiter 365a3fff8e fixings for ranking attributes
19 years ago
orbiter 8e55098b74 fixed detailed search
19 years ago
orbiter 0cb940a8e5 added detailed search.
19 years ago
orbiter c695928f7c adapted search page to new detailed search (to be committed later)
19 years ago
orbiter 45323e7b76 fixed null pointer exception during search
19 years ago
orbiter fb7411d7bb re-structuring of ranking application:
19 years ago
orbiter d98418390b - introduced rankingProfile Class
19 years ago
orbiter eab1805bca refactoring: plasmaSearchProfile -> plasmaSearchTimingProfile
19 years ago
orbiter 6eef848954 re-design of post-ranking process
19 years ago
orbiter be77fe1a88 code clean-up
19 years ago
orbiter 0bc2aaeb42 added normalization to search attributes
19 years ago
theli 008bcb7fb8 *) simplifying code by moving closeTransferIndexes into final block
19 years ago
theli 50d85657b8 *) new import function for IndexImport_p.html
19 years ago
theli 214302284e *) undoing last commit because of problems with getUpdateTime
19 years ago
theli 408de3beee *) avoiding to search in the treemap two times for the same key
19 years ago
borg-0300 139ba4e0c8 Bugfix for getCachePath(URL url)
19 years ago
theli 442807cb29 *) Bugfix for last commit
19 years ago
theli 22fd1ca9aa *) minor changes
19 years ago
theli 6a99304b2b *) Redesign of db import functionality
19 years ago
orbiter 3834675084 fixed bug that caused wrong behavior of search result preparation
19 years ago
hermens 31c8476b5d plasmaWordIndexCache.getContainer:
19 years ago
orbiter 3419b3bcdd fix for bug that caused the peer-counter problem.
19 years ago
hermens 4f43816ec0 *) Fix wrong class cast in indexSize()
19 years ago
orbiter a7f0adf6fa bugfix in entity iterator
19 years ago
orbiter fa90c3ca7a - removed some usage of indexEntity
19 years ago
orbiter aea3e00864 cleanup: removed unused temporary index management in indexEntity.
19 years ago
orbiter 03c65742ba changes towards the new index storage scheme:
19 years ago
theli ab7a911bb3 *) Trying to solve pool not open problem
19 years ago
hydrox d665f3c39c *) fixed Threadnames for stackCrawl-Threads
19 years ago
theli 3d5347bc8e *) changing loglevel for some messages
19 years ago
theli 0fcd113c42 *) last bugfix part. Seems to work now for the stackCrawler
19 years ago
theli b9c9eaeb44 *) next try to do a bugfix :-((
19 years ago
theli 4b4b93c413 *) next try to do a bugfix :-(
19 years ago
theli d9fbad71b9 *) next try to do a bugfix
19 years ago
theli 6da97bd2e4 *) next bugfix for threadpool problem
19 years ago
theli bea2b9edee *) further redesign of threadpools to solve too many thread problem
19 years ago
theli 784fd50437 *) more verbose thread names
19 years ago
theli 56e4dbeb71 *) displaying current active + current idle threads in PerformanceQueues_p.html now
19 years ago
theli 859c6a88f5 *) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem
19 years ago
orbiter f2b18cede9 AND-bugfix
19 years ago
orbiter b946e28e61 some ranking enhancements
19 years ago
rramthun 6c02f889f7 Cosmetic changes.
19 years ago
theli b191f06d16 *) Adding additional logging message to locate problems with stackcrawl threads
19 years ago
theli d9bcd73d93 *) Bugfix for exception
19 years ago
theli f5abfe8d57 *) more failsafe threadpools
19 years ago
orbiter a56fefe0d3 added missing forced-flush for index cache
19 years ago
hermens 78bcb8014a *) Limit range for selection of indexes for distribution to a DHTDistance of 0.2
19 years ago
hermens 861aae678d *) cleanup cacheAge database when cleaning up the HTCache
19 years ago
theli b4e2efef10 *) first test of new iteration function
19 years ago
orbiter eabf4a0386 fix for null pointer exception during shut-down
19 years ago
orbiter 47843e69e2 auto-reset for switchboard queue stack
19 years ago
orbiter d6581c445b added content iterator for corrupted database files
19 years ago
theli ecdc1f7547 *) Bugfix for crawling URLs with query parameters
19 years ago
orbiter fc4ae899f7 added word-position to ranking (this is only a first step)
19 years ago
orbiter bb2095fe39 assortment files are now not deleted, but shifted to a backup directory.
19 years ago
orbiter 7366e39dd3 tried to fix 100% CPU bug.
19 years ago
orbiter f14d49fae9 enhancements, bugfixes and additions to word index attribute storage
19 years ago
allo 4d33020f56 Migration to WORK
19 years ago
rramthun 1e5feedf0e Fix for http://www.yacy-forum.de/viewtopic.php?p=15547#15547
19 years ago
orbiter f4ffa9aee5 - implemented more attributes to index entries
19 years ago
orbiter 90b940e90e fixed position storage problem.
19 years ago
orbiter 0371494010 tried to add word position to index
19 years ago
orbiter f1cfee7703 removed tabs from condenser
19 years ago
hermens 37791fd529 *) Close indexEntities when "found not enough peers for distribution"
19 years ago
borg-0300 c5b6154136 added CRDistOn = true/false
19 years ago
orbiter 71d5c2b2ca better control for target peer selection for RWI transfer
19 years ago
hermens ca7407b7e1 *) Don't change maxTime if zero or negative
19 years ago
orbiter 3d7c8aaeae removed confusing method
19 years ago
orbiter 4cd0c45a77 code cleanup
19 years ago
hermens 971247b78f - rotate merged indexes after merging
19 years ago
orbiter e2ff1767b5 fix for last DHT distribution bug-fix
19 years ago
orbiter 060e5a0df0 fixed problem with DHT target peer selection:
19 years ago
theli 7c22afe3de *) Bugfix for NullpointerException in deleteOldHTCache
19 years ago
orbiter b21b9df2d0 added section headlines generation to html parser
19 years ago
rramthun c4487deba9 Minor changes collected over some time.
19 years ago
allo 6822dce57b Using Orbiters function for auth
19 years ago
orbiter 2028403670 - consolidated different orderings to kelondroNaturalOrder
19 years ago
orbiter 9544c47684 added some UTF-8 handling.
19 years ago
borg-0300 9d8dca750e BUGFIX for my last commit
19 years ago
borg-0300 5449193167 bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706 (i hope)
19 years ago
borg-0300 2a23f5d419 F..., Sorry, no time, later
19 years ago
borg-0300 3a2d13786e bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
19 years ago
borg-0300 dc0999ec9c adapted to new HTCache structure
19 years ago
orbiter 9086261476 refactoring of base64 encoding:
19 years ago
borg-0300 b24fcc8ca4 oom
19 years ago
borg-0300 7da232b5b9 HTCache Reset if necessary
19 years ago
borg-0300 4f18f24d81 small change
19 years ago
borg-0300 c652527620 YaCy removes now the old HTCACHE data
19 years ago
borg-0300 69f65210e2 ".yacy" has its own directory;
19 years ago
allo 351fffc129 DATA/WORK for user-created content
19 years ago
allo a81cc9d969 no DATA/DATA to avoid confusion.
19 years ago
borg-0300 b95c5d5781 BUGFIX for URLs like "/../" ...;
19 years ago
allo 9cce3c5709 dates Table for bookmarksdb(needed for del.icio.us api)
19 years ago
hermens 11fe95832e avoid division by zero when index transfer is extremely fast
19 years ago
allo 4ac0fd328a First Version of the Bookmarksmanager
19 years ago
theli d7b6dcbe2e *) Bugfix for MalformedURL problem if Location header is empty.
19 years ago
hermens 5b3e01bd3c avoid division by zero when importing very small indexes (<100 entries)
20 years ago
borg-0300 b7f9adc2c9 new filters added
20 years ago
theli 79667a172e *) Bugfix for additional parser problem
20 years ago
theli 8c594841a8 *) Bugfix for incorrectly indexing of URLs that were requested with Cookies in the
20 years ago
orbiter b5d02d649a fixed bug that caused strange search result behaviour
20 years ago
orbiter 4500506735 fixed some bugs concerning url entry retrieval and intexControl interface
20 years ago
orbiter 83a34b838d * added Object allocation monitor on performanceMemory page
20 years ago
orbiter 4ff3d219e8 increased delay for cacheScan start and slowed down scan process
20 years ago
orbiter 3031903d50 re-design of RAM cache flush into assortment cluster
20 years ago
orbiter 0c762daf4b better startup failure handling
20 years ago
orbiter f27f9ecf15 * activated write buffer for databases.
20 years ago
orbiter c59d1b2f5e - Tests with write buffer (new class kelondroBufferedIOChunks, not yet active)
20 years ago
orbiter bb79fb5d91 - changed handling of error cases retrieving urls from database
20 years ago
theli e7d16ef831 *) Corrections in jMimeMagic MagicRule-file to detect some special rss feeds
20 years ago
theli 386d9e45d8 *) Bugfix for code cleanup
20 years ago
theli 5a1d45715d *) Bugfix for parser configuration bug
20 years ago
rramthun a1061495d4 Fixed some spelling mistakes and added some text which (should) make it easier to understand the options.
20 years ago
orbiter 0cdc58aaea fixed indexing of local domains.
20 years ago
theli e1c2d8ec5f *) Speedup "removed from queue"
20 years ago
hydrox 96930f0d2b *) added function to remove malformed URLs from urlHash.db
20 years ago
theli 8862b6ba4b *) Corrections for code cleanup 1175
20 years ago
orbiter 13fdebc50d added authentication for link deletion in search result
20 years ago
orbiter 37f88b4017 code cleanup
20 years ago
orbiter ec2b39c1ce code cleanup
20 years ago
orbiter 8f1f2daa5e implemented interactive link deletion of search results.
20 years ago
theli 6d0f7e6988 *) Adding missing file
20 years ago
theli 44fa94ac52 *) Modifications for dbImport functionality
20 years ago
orbiter dc778659fb fixed problem with time-out during result joint which caused OR behavior instead of AND behavior
20 years ago
orbiter 3d8a5ae652 code cleanup
20 years ago
theli 64478b1f02 *) Adding possibility to delete crawler queue entries using regular expressions
20 years ago
orbiter a04930f025 code cleanup
20 years ago
low012 90b0eb144e just a typo...
20 years ago
theli 129b15f3e1 *) Correcting logging output of db importer thread
20 years ago
orbiter 420d56ce79 extended db-testing
20 years ago
orbiter ecf765ec33 temporary fix to make jrpm extension compilable with my netbeans environment
20 years ago
theli 8ed0aaae8d *) Adding content Parser for RPM Files
20 years ago
theli 818d37ce44 *) Removing getSimpleName
20 years ago
theli b35c5a48bf *) First version of urlRedirector.pl script
20 years ago
theli bdf30117c1 *) Redesign of parser configuration
20 years ago
theli d4ac3e25b1 *) Bugfix for file system link bug during detection of invalid URLs
20 years ago
orbiter adf75bc9fa better logging for invalid file path detection
20 years ago
orbiter 40621a5663 enhancements in ranking preparation and fixed problem with parser/mime recognition
20 years ago
theli c650b112ea *) Bugfix for relative URL Bug in Crawler
20 years ago
theli 4e73035aef *) Bugfix for "too many open files" during index distribution
20 years ago
orbiter f57e2d67f5 shortened network overview (less columns fit easier on page)
20 years ago
orbiter 85282b1d98 enhanced YBR recognition and search result heuristics
20 years ago
orbiter b9cc9029e3 added ybr selection for remote search
20 years ago
orbiter 0e25020f51 added first generation and usage of YBR index-files. Enhanced overall ranking of search results.
20 years ago
theli 90d6c6223b *) Adding color codes to network graphic legend
20 years ago
orbiter bfe51c7228 added generation of domain-list
20 years ago
orbiter 0ec54d9c5f enhanced CR-file handling and added first RCI-evaluation tests
20 years ago
theli c2fe3a1670 *) Updating jMimeMagic Ruleset
20 years ago
orbiter 88e3234393 fine-tuning of rci-generation
20 years ago
orbiter a12759c1bf first try to implement a rci-computation from cr-files
20 years ago
orbiter 4a8e8f269e refactoring of cr-processing; new kelondro class to handle the attribute file format
20 years ago
orbiter 24dc0e0760 implemented cr-file processing and further transmission steps
20 years ago
orbiter 9d9a87f445 limited htcache storage length
20 years ago
theli d0dfccdb77 *) Making CrawlStacker pool configurable via GUI and config file
20 years ago
theli 3631cb1f6d *) deleting empty entities during index selection
20 years ago
theli ca26aab9b1 *) More debugging output for migrateWords
20 years ago
theli 9b35ae9027 *) Correcting wrong % values on IndexTransfer_p page
20 years ago
theli e6bf9d90a5 *) Fixing Problems with MalformedURLs during Word Selection
20 years ago
theli 86a9210264 *) indexing queue slots are now configurable via config file
20 years ago
theli 3c11d7b81c *) Bugfix for minimizeUrlDB
20 years ago
orbiter 9913049009 fixed outOfMemory bug caused by loops in kelondroTree during enumeration
20 years ago
theli bbb936b9ea *) Bugfix for not human readable content of PDFs while viewing the URL Content via GUI
20 years ago
theli 445e3a620f *) Avoid rejecting of html content by the crawler when the file extension is not set properly
20 years ago
theli 444a5a9368 *) Bugfix for Entries with null url in GlobalQueue
20 years ago
borg-0300 ebac51df52 restore defaultRemoteProfile
20 years ago
borg-0300 5778428455 move cutUrlText to nxTools,
20 years ago
borg-0300 9158845c3b bugfix for snippet text null bytes
20 years ago
orbiter f763923e0a added missing files for last commit
20 years ago
orbiter 79818a320f introduced citation-rank transmission protocol and activate transport for anonymisation
20 years ago
theli 7e0647f692 *) Bugfix for userDB usage during authentication
20 years ago
orbiter 02f8013013 auto-delete of corrupted word files during word-migration
20 years ago
orbiter d2731418bf added creation of global ranking files and changed url normal form usage
20 years ago
theli 6f9f8ed8f8 *) Automatic Reset of Stack Crawler DB on startup errors
20 years ago