Commit Graph

188 Commits (115c4edfcc26d040de92bc4ec2586b1ccfa123f3)

Author SHA1 Message Date
rramthun 0f11399d16 Some corrections...
20 years ago
orbiter 9f505af7aa preparations for bulk remote crawls
20 years ago
orbiter 9c72b4cdec replaced index dump stack by a dump array and limited url number in assortment ram (prevents too much RAM occupation)
20 years ago
orbiter 51962d55bf added 'PPM', page-per-minute statistics
20 years ago
orbiter 159f795f65 bugfix (null pointer exception in assortments)
20 years ago
orbiter 1d2155675b changed assortment memory cache flush
20 years ago
orbiter 19dbed7cc8 code clean-up
20 years ago
orbiter 00f63ea00d fail-safe patch for pattern matching
20 years ago
orbiter 0a6be961ea added pattern organization
20 years ago
orbiter 40036ba69c fixed dht transmission; added url-blacklist blocking also for remote search
20 years ago
orbiter 311e627363 blocking of blacklisted urls in indexReceive and small changes
20 years ago
orbiter 2f0d7ea8d3 removed htcache stati (superfluous now)
20 years ago
orbiter 277048501e bugfix
20 years ago
orbiter 8b89c46afe fixed problem with cache write
20 years ago
orbiter 455ae9f55f fixed htcache-store problem and due-time for remote crawls
20 years ago
theli 55d10b864c *) further improvements in shutdown behaviour
20 years ago
orbiter 419f8fb398 fixed bugs/missing code regarding new crawl stack
20 years ago
orbiter 112c5d3332 the new file-based indexing queue
20 years ago
orbiter 858cd94299 replaced indexing ram-queue by file-based stack-queue
20 years ago
theli 57c30f1d78 *) bugfix for usage of httpc without gzip content encoding
20 years ago
theli 0e2c33ee55 *) Network.html/Network.java:
20 years ago
orbiter 5159a090b0 fixed parser bug with lowercase force (appeared in: http://spellbound.sourceforge.net/)
20 years ago
orbiter 7f7cbc5019 fixed bug with snippets
20 years ago
orbiter eb74fa0c82 fixed a bug with snippet-length
20 years ago
orbiter 86f2aa8478 fixed seed-load date bug (evaluating server date for age computation)
20 years ago
orbiter 664bceced5 removed debug-lines
20 years ago
orbiter 75ebdbc852 enhanced snippet-generation (case where snippet is too long)
20 years ago
orbiter 8a4f297324 fixed/enhanced snippet error-handling; suppression of results where no snippet exists
20 years ago
orbiter 712fe9ef18 bugfixed utf-8 decoding and parser
20 years ago
theli eee6322aaf *) Adding redirection support to plasmaCrawlWorker.java
20 years ago
theli cd279907c0 *) Adding redirection support to plasmaCrawlWorker.java
20 years ago
theli 6697d5e52e *) correcting fkt. mediaExtContains
20 years ago
orbiter 3addf58046 enhanced snippet-loading with threads
20 years ago
orbiter 56d28a16f0 bugfixes
20 years ago
orbiter d6c85228a6 enhanced snippet computation
20 years ago
theli fafda068f9 *) allowing crawler to process resources with statuscode 203
20 years ago
theli aae9a433a6 *) correcting usage of supportedFileExt-List
20 years ago
orbiter 1e7f062350 many bugfixes, memory leak fixes, performance enhancements; new kelondroHashtable; activated snippets
20 years ago
orbiter 68dc2b0c6b added kelondroArray, the basis for upcoming kelondroHash and some bug fixes
20 years ago
orbiter a19541e563 code-enhancements after analysis with AppPerfect
20 years ago
orbiter 85075269a6 extended fail-safe memory-management. prevents too much allocation, too often GC and should help for the 100%CPU-bug
20 years ago
orbiter e3c92818db avoiding OutOfMemoryError routines
20 years ago
orbiter 3e8ee5a46d enhanced caching in kelondroRecords and added better synchronization/finalizer
20 years ago
theli db3ed75728 *) closing stream correctly
20 years ago
orbiter 5d06ded005 enhanced html parser speed
20 years ago
orbiter 5a490aa065 fixed html parser
20 years ago
orbiter a25b5b4986 fixed possible memory leak in htmlScraper: be aware that now links can get lost; further work necessary
20 years ago
theli 9e47ba5ad6 *) adding missing calls for function close() to avoid "too many open file" bug
20 years ago
theli 9a98988c3c *) Bugfix for SSL/NIO Bug
20 years ago
orbiter a1ffc27041 preparations for image/movie/music indexing
20 years ago
orbiter a5b40923b6 added word migration to assortments (start with 'java -classpath classes yacy -migratewords')
20 years ago
theli 890e3f4d4a *) adding missing calls for function close() to avoid "too many open file" bug*) adding
20 years ago
theli 6dd3ec0dc4 *) Adding debug="true" debuglevel="lines,vars,source" to ant build files
20 years ago
orbiter 4f9c30ef49 using mime-type instead of file extension for doctype
20 years ago
theli ee9e110366 *) removing old logging configuration properties from yacy.init
20 years ago
theli c1a4e0dc28 *) changing reference to logger
20 years ago
theli d0083f845f *) changing reference to logger
20 years ago
theli 1b5ae054f8 *) changing reference to logger
20 years ago
theli 68f30811fa *) changing reference to logger
20 years ago
theli fbbea813c5 *) changing references to logger
20 years ago
orbiter 4574fa4ce7 bugfixes
20 years ago
theli 83b41ef2f7 *) Adding timeouts for shutdown
20 years ago
theli ef6851798b *) changing thread priority while parsing a pdf file to avoid 100% CPU usage.
20 years ago
orbiter 33f9315e58 implemented multithreading of indexing
20 years ago
orbiter ca3b4ccaf4 added snippet-routines (not yet finished)
20 years ago
orbiter ee0758fe4d bugfixes/empty-dir-deletion/snippet-test-activation
20 years ago
orbiter 594c591223 changes towards 0.38
20 years ago
orbiter d8fdc2526e added experimental snippet-generation (to be disabled for 0.38)
20 years ago
orbiter 3771b10b89 implemented automated migration indexCache 0.37 -> indexAssortmentCluster
20 years ago
orbiter e89ded9e41 bugfixes
20 years ago
orbiter 650ca3955a added flush-thread for index cache and added language-name mapping in Language_p
20 years ago
orbiter 3d8a2ff937 enhanced parallelization of local/global/remote crawling
20 years ago
orbiter a05d738ea4 enhanced caching, removed bug causing outOfMemory
20 years ago
orbiter 21110dcd5e fixed bugs with open files and caching
20 years ago
orbiter f8f8dd05db fixed "Too many open files" - bug
20 years ago
theli 74eb21f62e *) adding image tag into rss template
20 years ago
orbiter 5f90daa265 implemented localization environment
20 years ago
theli 84f9d8f7f0 *) migrating ant build files to generate a single extension tar per default
20 years ago
orbiter fdd606c8c8 fixed bugs
20 years ago
theli 8bd49ba535 *) setting root dir for all tar files properly
20 years ago
orbiter 0c35171c85 assortment fine-tuning
20 years ago
orbiter 76dc892017 refined assortment
20 years ago
theli 0484c41a84 *) replacing system.xxx.println with logging statements
20 years ago
theli 7994c485f1 *) Trying to set the document title properly
20 years ago
theli 285936d778 *) trying to set document title properly
20 years ago
theli 573a8e8047 *) setting document title properly
20 years ago
orbiter 4b01ff7548 activated assortments, removed write-queues
20 years ago
orbiter e26ac60c3e modified assortment data structures
20 years ago
orbiter 79be6f003d enhanced Assortment class
20 years ago
theli 9ee3e69021 *) Solving "Warning: You did not close the PDF Document" problem when an OutOfMemory Exception occurred ...
20 years ago
orbiter 5c6147a54c introduced assortment structure (generalization of singletons)
20 years ago
theli 73e297f30f *) adding proper default values for RealtimeParsableMimeTypes if something goes wrong with the configuration file
20 years ago
theli 893a662329 *) Adding missing cast statement
20 years ago
theli 361f05978d Multiple updates regarding the yacy seedUpload facility,
20 years ago
theli ddc5675781 *) Correcting typo
20 years ago
theli d2c4e9a55e *) Implementing yacy forum wishlist item: "Pause Crawling"
20 years ago
orbiter 287d2e6f10 further enhanced caching (new cache flush methods)
20 years ago
orbiter 376b917c91 fixed shut-down by stopYACY.sh
20 years ago
orbiter ea478f3975 enhanced indexing-caching
20 years ago
orbiter b4030e5023 implemented serverSwitchActions - action-hooks
20 years ago