Commit Graph

746 Commits (9f298083cd2154de7a0470e83c5321c81749e2de)

Author SHA1 Message Date
orbiter 47b541b2d1 added better option handling in yacysearch
19 years ago
orbiter c9e16bfd48 first try to insert image search (does not work yet)
19 years ago
orbiter f77775220b fixed parser error
19 years ago
orbiter 22de954a57 added some log output to parser
19 years ago
orbiter 83e0e765ec redesigned some parts of the html scanner & parser
19 years ago
orbiter ac114d69c0 tried to fix some problems with time-outs during search
19 years ago
orbiter e2e8d0c188 some kind of refactoring of yacysearch:
19 years ago
orbiter 6b63e26cbb - removed search function from index.html/java, only input left
19 years ago
orbiter bc3e80fe42 quickfix
19 years ago
orbiter d8d0ac29c3 added image-viewer servlet that can do:
19 years ago
orbiter ddc6394d9b fixed bug about auto-depth 0
19 years ago
orbiter 60351fa3f7 small fix to previous commit
19 years ago
orbiter a469874e3f added and fixed time-out behaviour during search
19 years ago
orbiter 1d0b0d6e2a synchronized local searches to prevent several searches from being performed at the same time
19 years ago
hermens 22b9d03bbf Correcting remaining time issue in getContainers
19 years ago
orbiter d58788b753 added some synchronisation
19 years ago
orbiter e566d1d8d6 some bugfixes regarding new crawling options
19 years ago
orbiter c7f1300300 -fixes for last commit
19 years ago
orbiter f2421f6a47 some small attribute changes regarding cache flush
19 years ago
orbiter 7a650d0023 several bugfixes
19 years ago
orbiter 59d52fb4a9 fixed some problems with crawl profiles
19 years ago
orbiter 708cc6c8d9 fixed some bugs for auto-filter and added monitor in profile list
19 years ago
rramthun 250864406f ...
19 years ago
orbiter e82899ba57 fixed missing urls map initializer
19 years ago
orbiter 63f39ac7b5 added 3 new crawling steering options:
19 years ago
orbiter 1fc3b34be6 some pre-work (without function yet) to implement:
19 years ago
theli c9e6b5e391 *) check size of indexing-queue and crawler pool before processing remote triggered crawl jobs
19 years ago
orbiter 1509314ea6 set tighter control during DHT index and peer selection
19 years ago
hydrox fcc0683200 *) undoing last commit
19 years ago
hydrox 9411961eec *) another little fix for DHT-Transfer
19 years ago
hydrox 8b14a0c833 *) little fix for DHT-Transfer
19 years ago
orbiter 1f4412a146 adopted isListed to discussed new behavior as discussed (url, getFile)
19 years ago
orbiter 063ef4660a bug?
19 years ago
orbiter 82358677a9 added another shiftK2W to flushCacheSome
19 years ago
orbiter 128e4ab199 - in serverSystem: maxPathLength is now a variable, not a method
19 years ago
orbiter 30e3e3a0fd adopted MAXPATHLENGTH to host system capabilities
19 years ago
borg-0300 85bb8e32a1 Bugfix for last commit
19 years ago
borg-0300 3fe402069f try to fix
19 years ago
orbiter f16f1f15cd bugfix for 100% CPU bug; thanks to Matthias for analysis
19 years ago
borg-0300 254a13efd9 MAXPATHLENGTH used
19 years ago
borg-0300 8865948e4e Cleanup;
19 years ago
orbiter 6c70f4a0cf renamed wordHashes for a word hash set generation to wordHashSet
19 years ago
orbiter d5f8f40c31 removed correcting iterator
19 years ago
orbiter 488a0ed580 replaced old keyIterator and rowIterator by buffered iterators
19 years ago
hermens 4e9a8f41fd rwiDBCleaner + dbImporter: Iterate over small excerpts of
19 years ago
hermens 474379ae63 remove TABs from plasmaDbImporter.java
19 years ago
orbiter dba02f399f starting of re-design of kelondroTree iterator
19 years ago
orbiter f02b426073 made kelondroTree.nodeIterator private
19 years ago
borg-0300 5f6fdf1786 Bugfix for getCachePath(URL url)
19 years ago
orbiter 303b6463a8 added debug line to URL storage for testing
19 years ago
orbiter 91dca2cd8d fixed a bug in last commit: LURL entries cannot be written,
19 years ago
orbiter 3286b1f498 re-organisation of lurl-creation and -stacking
19 years ago
orbiter 0b903c5317 removed usage of kelondroNaturalOrder from plasmaCondenser to experimental
19 years ago
orbiter 4239db0d1c fixed new ordering for backup iterator TreeSet
19 years ago
orbiter 33eba5ecb8 temporary disabling last change, does not work (cannot debug right now)
19 years ago
orbiter f0464042fc fix for latest iterator-replacement-fix:
19 years ago
borg-0300 ec21c585cb try to fix path too long
19 years ago
orbiter a6a3f4b694 fix for svn 1888
19 years ago
hydrox 8da13088e9 *)removed multiple DHT_Distribution_Threads
19 years ago
orbiter 283a7181c6 try to fix new 100% cpu bug, possibly caused by iterator method
19 years ago
orbiter f588c0724f removed cache flush in case of DHT receive
19 years ago
orbiter e94b374d56 update to cache flush method
19 years ago
orbiter bcd99fe83e introduced a second RAM cache for DHT transfer
19 years ago
hydrox 360a460da8 *)URL-Cleaner: moved logging-statement to correct position
19 years ago
orbiter 02f9765013 quickfix for time problem during cache restore
19 years ago
hermens ad119f06af *) Don't overwrite new entries with older ones
19 years ago
orbiter be88687d8c fixed some problems with new cache flush karenz
19 years ago
theli d3da7c9a08 *) Adding support for robots Allow directive
19 years ago
hydrox f046e1814a *fix for last commit
19 years ago
hydrox c55c51e2a8 *)added keywords to IndexCleaner_p.java
19 years ago
orbiter ddbeda738e added minimum age of word in cache to performance menu
19 years ago
orbiter f188611fc6 apply blacklist on rwis during dht receive
19 years ago
orbiter 0ec28b8f8e added DBCleaner from Hydrox
19 years ago
theli fb4100d47b *) undoing last commit.
19 years ago
theli a84cc71218 *) removing getTotalRuntime
19 years ago
auron_x dce08771d1 *) Fix for wrong estimated and elapsed times when import was paused
19 years ago
hermens b34713324a DBImport: remove words from source index even if nothing has been added to home index
19 years ago
orbiter 520b60f15b fix for http://www.yacy-forum.de/viewtopic.php?p=18610#18610
19 years ago
orbiter bae3783d38 added a snippet marking
19 years ago
orbiter f0a38873eb * added yacysearch page with better view on search results
19 years ago
orbiter f0041d504d remove of several results from a single domain is stopped if the result set is smaller than the wanted number of results
19 years ago
theli 89286478e7 *) removing thread pool eviction for now. Not needed at the moment
19 years ago
theli 759800f543 *) Bugfix for storeHTCache problem
19 years ago
orbiter a8548c0484 * several bugfixes regarding basic configuration
19 years ago
orbiter 1b9b8922d9 * fixed problems with new basic 1-2-3 configuration (now authentication required)
19 years ago
auron_x 8c6f38fe70 *) added Blog to YaCy (atm not reachable through interface) -> Blog.html
19 years ago
orbiter ce5274c194 yacybot user agent
19 years ago
hermens 351bd0a678 *) dbImport: convert cacheSize to kb when creating plasma* objects
19 years ago
orbiter eaffcfefe2 * added more ranking attributes (without function; this will be added later)
19 years ago
orbiter 87e90b9d8c refinements in ram cache flush procedure and default timing
19 years ago
orbiter d31a4e0b4f some small enhancements with cache flushing parameters and data structures
19 years ago
orbiter 3703f76866 - fixed re-search bug: after a search with several words, a second search could not
19 years ago
theli fbbbf5f411 *) remote trigger for proxy-crawl
19 years ago
theli dc9174c809 *) Implementing snippet fetching via ajax
19 years ago
orbiter 1d8ca6e082 serialized dhtChunk deletion with indexing
19 years ago
theli 2336f0f013 *) allow pausing/resuming of crawlJob Threads separately
19 years ago
orbiter 60dac4325e serialized indexing with dht selection
19 years ago
orbiter a840755964 moved parts of index transfer logic back to switchboard
19 years ago
orbiter 134253a603 fixed bug with cache flush
19 years ago
orbiter c2d863855d different flush limit
19 years ago
borg-0300 64441b1f78 ADDED: yacy.badwords list to filter the topwords
19 years ago
orbiter f9063e2040 added some synchronization to avoid that several tasks can trigger a cache flush simultaneously
19 years ago
orbiter 2c4e4ae6a2 further refactoring of dht selection, transfer and flushing
19 years ago
orbiter 73dad68cf1 outsourced thelis DHT flush class into own file
19 years ago
allo aa4b04e3dd reverted last change
19 years ago
allo 4b0dae8fcf added a possibility to get the ranking values for a URL.
19 years ago
orbiter 85ac7d8386 * moved DHT transfer thread to own class file, needed for further modularization
19 years ago
orbiter 7df2e6e571 bugfix for last commit
19 years ago
orbiter cd41e9a0eb moved DHT index selection to new object that holds indexes to be send away to other peer.
19 years ago
theli 42a5f56723 *) Bugfix for broken dht thread configuration
19 years ago
theli f95d98142f *) displaying amount of items in the existsIndex caches
19 years ago
hydrox e2af2a3f45 *) it's now possible to run more than one indexDistribution-Thread
19 years ago
theli 40dd6ec4fd *) experimental restructuring of db import function
19 years ago
theli 2da18ab359 *) correcting logging output
19 years ago
theli 8ffc6e35ad *) correcting logging output
19 years ago
theli 980e986b64 *) Re enabling short cycle for already removed nurl entries
19 years ago
hermens 3b6328ad02 *) Consistent use of minCount for index transfer
19 years ago
hermens 0b60b9bf51 *) Remove entries from AssortmentCluster before reinserting the rest into the ramCache
19 years ago
hydrox 8ab1d6ff4b *) fixed NullPointerException in plasmaWordIndexEntity
19 years ago
allo a26574c894 Migration from tagName as key to wordhash(tagName) as key for bookmarkTags.db
19 years ago
orbiter 7eb10675b3 re-organization of index management
19 years ago
orbiter 1e4578aab6 VERY EXPERIMENTAL removal of index ram cache flushing thread.
19 years ago
hermens 954f02d22e *) Bugfix: Prevent wordIndex.getContainer() from returning and even manipulating
19 years ago
orbiter fe39493145 changed default ranking parameters
19 years ago
orbiter 365a3fff8e fixings for ranking attributes
19 years ago
orbiter 8e55098b74 fixed detailed search
19 years ago
orbiter 0cb940a8e5 added detailed search.
19 years ago
orbiter c695928f7c adapted search page to new detailed search (to be committed later)
19 years ago
orbiter 45323e7b76 fixed null pointer exception during search
19 years ago
orbiter fb7411d7bb re-structuring of ranking application:
19 years ago
orbiter d98418390b - introduced rankingProfile Class
19 years ago
orbiter eab1805bca refactoring: plasmaSearchProfile -> plasmaSearchTimingProfile
19 years ago
orbiter 6eef848954 re-design of post-ranking process
19 years ago
orbiter be77fe1a88 code clean-up
19 years ago
orbiter 0bc2aaeb42 added normalization to search attributes
19 years ago
theli 008bcb7fb8 *) simplifying code by moving closeTransferIndexes into final block
19 years ago
theli 50d85657b8 *) new import function for IndexImport_p.html
19 years ago
theli 214302284e *) undoing last commit because of problems with getUpdateTime
19 years ago
theli 408de3beee *) avoiding to search in the treemap two times for the same key
19 years ago
borg-0300 139ba4e0c8 Bugfix for getCachePath(URL url)
19 years ago
theli 442807cb29 *) Bugfix for last commit
19 years ago
theli 22fd1ca9aa *) minor changes
19 years ago
theli 6a99304b2b *) Redesign of db import functionality
19 years ago
orbiter 3834675084 fixed bug that caused wrong behavior of search result preparation
19 years ago
hermens 31c8476b5d plasmaWordIndexCache.getContainer:
19 years ago
orbiter 3419b3bcdd fix for bug that caused the peer-counter problem.
19 years ago
hermens 4f43816ec0 *) Fix wrong class cast in indexSize()
19 years ago
orbiter a7f0adf6fa bugfix in entity iterator
19 years ago
orbiter fa90c3ca7a - removed some usage of indexEntity
19 years ago
orbiter aea3e00864 cleanup: removed unused temporary index management in indexEntity.
19 years ago