Commit Graph

2710 Commits (f7c5ccedc7c5f2b86c0739a85f51083e451a6dd1)

Author SHA1 Message Date
orbiter 89b9b2b02a redesigned remote crawl process:
17 years ago
fuchsi 69521d92e5 Add another external dependency from PDFBox package ("Bouncy Castle"). This is necessary for parsing of some encrypted PDF files.
17 years ago
orbiter 90a02990d2 NPE fix, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3383#p3383
17 years ago
orbiter 2fcd18a972 - fixed bad behaviour of search event worker processes
17 years ago
orbiter 445c0b5333 added domain list extraction and html export format
17 years ago
orbiter d8d77fc4b2 fix for NPE, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3368#p3368
17 years ago
orbiter bf6952abe7 - added url export to http://localhost:8080/IndexControlURLs_p.html
17 years ago
orbiter af10f729df fixed image search and favicon loading
17 years ago
orbiter c48b73cda2 redesign of ranking data structure
17 years ago
orbiter 6f1308da2f - some enhancements to IndexControlURLs (shows more links, connects referrer to another query)
17 years ago
orbiter c527969185 - enhanced monitoring of ranking parameters
17 years ago
orbiter bd5673efbe added cleaning of search event before opening the index administration
17 years ago
orbiter 55da871211 preparations for better ranking: better debugging of index properties
17 years ago
low012 383dc815d2 *) fix for commit 4212
17 years ago
orbiter 3491531cea - fixed 'appears in url' flag in index generation
17 years ago
orbiter ec7ba0d3d0 - fixed problem with too small sort fields (sortbound was not set)
17 years ago
orbiter bc2368e907 fix for problem with remote crawl referrers
17 years ago
orbiter 875096552f fix for NPE in case that remote search results are empty
17 years ago
orbiter 64b3b79e44 - fix for termination problem with uniq()
17 years ago
orbiter 0abf33ed03 - tried to remove deadlock
17 years ago
low012 a4010f7dc8 *) fixed bug where dots were added after numbers < 1000: "123" was transformed to "123." which is undesirable
17 years ago
orbiter 2421127612 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=513&hilit=
17 years ago
orbiter d0d2771883 disabled multiprocessing of rowCollection.sort for testing purposes
17 years ago
orbiter edc4da5317 fix for division by zero in test routine
17 years ago
orbiter df38aaf7bd update to RowCollection sort speed-enhancements:
17 years ago
orbiter 0eb60cfe6f better handling of seed properties
17 years ago
orbiter ecba35de72 enhanced computing speed of kelondro core function: sorting
17 years ago
orbiter 6eaa5a0e64 enhanced local search speed. The ranking process is now 6 times faster than before.
17 years ago
fuchsi 425e4ead66 Allow absolute paths in configuration settings.
17 years ago
borg-0300 e8d32d9f62 other loglevel
17 years ago
borg-0300 a5d28785b1 less OOM (works for me)
17 years ago
orbiter ccbfb15b6b enhancement to crawl stacker enqueue order
17 years ago
hermens 5c5344ae97 Beautify log
17 years ago
hermens 35cf196204 transferRanking(): Do not flush more ranking files than requested by caller.
17 years ago
hermens d0aa8cf25d Only update handshaked peer's last seed date if it has not been updated recently.
17 years ago
hermens 8f9d65da67 Small corrections to dhtFlushControl()
17 years ago
orbiter 55c87b3b12 changed behavior of crawl stacker
17 years ago
hermens 18144043e6 Correct UTC Offset at beginning/end of daylight savings time
17 years ago
orbiter 4fefa53135 removed parser object pool, see also svn 4106
17 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
17 years ago
fuchsi a718858e8b seed.CCOUNT is interpreted as a double value not int
17 years ago
fuchsi 0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects:
17 years ago
orbiter f8318436a1 fix for last commit
17 years ago
orbiter 7d57b80598 distinct keepOrder strategy, more discrete implementation of enhancement introduced in SVN 4158
17 years ago
orbiter 9a7b093eed tried to avoid endless loop, see also:
17 years ago
orbiter b856e377a9 some additions and a small bugfix to SVN 4158
17 years ago
hermens 501a7aae90 Small correction
17 years ago
hermens caff520988 Removed unnecessary and unused code.
17 years ago
hermens d732840f8a Avoid ConcurrentModificationException when accessing the PerformanceQueues page while yacy is indexing.
17 years ago
fuchsi 35303f9504 add real size values (KBytes) of the DHT-In/Out-RAM-Caches to the PerformanceQueues page. A lot of users seem to tweak this value and it might help in finding the best size in relation to the peer's memory resources.
17 years ago
fuchsi 38bbd4a4b3 no code changes. just touched yacyClient.java to trigger a rebuild of the file in an uncleaned tree.
17 years ago
fuchsi f717beecb1 - Changed yFormatter handling to be more flexible and produce more readable code for server pages. There are serverObject.putNum() methods to allow adding of number type values in a formatted form, and put() methods for number types that add them without formatting. This reduces the need to transform them into Strings in server pages and removes the HTML encoding step which is unnecessary for numbers.
17 years ago
fuchsi ca83f5a8d9 Add external lib FontBox which is part of the PDFBox (they extracted the font handling code into this package in 0.7.3).
17 years ago
fuchsi 3352474dd8 Remove grouping separator in Network.xml (yacystats will work without it) and format a few more numbers.
17 years ago
fuchsi 06e6a1ff62 Add a generalized Formatter class yFormatter inspired by http://forum.yacy-websuche.de/viewtopic.php?f=5&t=437
17 years ago
fuchsi e77aec8c9d fix handling of encrypted PDF-Documents (with default user password "")
17 years ago
fuchsi b5f7df8d0a Speed up remove operations in rowCollections.
17 years ago
low012 fdb0b861f8 *) fixed wrong calculation of network words, network links, network PPM if peer is senior or principal peer
17 years ago
fuchsi 508de558f7 sbStackCrawlThread is null during first cleanProfiles() run at startup.
17 years ago
fuchsi 70614385ef Attempt to fix the "lost profile handle" bug.
17 years ago
low012 507ecd8afa *) added banner that can be displayed like this: http://localhost:8080/Banner.png
17 years ago
fuchsi 9b0948cb4c gnarf. mixed up the positions. finally fixed...
17 years ago
fuchsi c0f5fc51ef bugfix for last commit
17 years ago
orbiter 33fb2f756d added emergency fail case in remote crawls
17 years ago
fuchsi c5a8585ac6 fix more encoding problems in yacysearch.rss.
17 years ago
fuchsi 6b00fe0c4e fix ArrayIndexOutOfBoundsException
17 years ago
orbiter 3e60ae93b9 modified remote search snippet fetch behavior: do not fetch snippets for more than 300 milliseconds, even if the snippets can be found locally without online fetch
17 years ago
orbiter 97f1ca52bd fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=390
17 years ago
orbiter 143fa40d77 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=394&p=2382#p2382
17 years ago
orbiter 711641f167 extended client connection clean-up:
17 years ago
orbiter b19bb6e5b1 - reverted svn 4132; this did not solve the problem and removed the emergency method which caused production failure for sure within some hours
17 years ago
fuchsi 1eba408d2f Make sure that sockets which couldn't be opened aren't handled as active connections, in which case they wouldn't be closed.
17 years ago
fuchsi 03c5b4ad68 more fixes to the yacysearch.rss, it's now 100% valid according to http://feedvalidator.org
17 years ago
orbiter d69d386f7d added additional forced client connection closing
17 years ago
orbiter dea7bee049 - increased minimum time before an active connection is interrupted from 1 minute to 10 minutes
17 years ago
orbiter c1440d2241 fixed problem with redirection: redirected URLs had not been tested with the double-check
17 years ago
fuchsi 7404f2c35c Fix some of the issues with the RSS search interface, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=392
17 years ago
orbiter 98abe0804d another enhancement to crawl starts with link files
17 years ago
orbiter 1b42152a76 fixed and enhanced some details in crawl start with file
17 years ago
orbiter 4465db7399 removed debug information from network graphic
17 years ago
orbiter 01e0669264 re-designed some parts of DHT position calculation (effect is the same as before)
17 years ago
hermens d547c3b4bd Avoid NullPointerException in yacySeedDB.lookupByIP
17 years ago
orbiter 5b1a937ed8 fix for crawl stack database format change, introduced in SVN 4113
17 years ago
orbiter af25c98306 enhanced local search performance in case of a remote search:
17 years ago
orbiter 842308ea97 - redesigned crawl start menu, integrated monitoring pages
17 years ago
orbiter 341f7cb327 steps to enhance remote search performance:
17 years ago
orbiter 2f1ff048ba some fixes to socket connection time-out
17 years ago
orbiter 3c74014004 automatic deletion of dead client connections
17 years ago
orbiter 11b4f80bde - fixed non-closing client connections
17 years ago
orbiter d352853f2d fix for non-closing client sessions
17 years ago
orbiter 1488769e1f cleanup of unmaintained and outdated performance methods:
17 years ago
orbiter 3cb9cdc9be try to fix connection problem, possible cause for wrong junior status and non-passive passive peers:
17 years ago
borg-0300 ba59de773f again and again junior - test
17 years ago
orbiter 4275727d69 fix for peer ping problem (implemented a 3-time re-ping); cause for 'Connection reset' still unknown
17 years ago
orbiter 07d1e98909 fixed round-robin method of peer-ping order (the successfully pinged peer was not updated to current last-seed date)
17 years ago
orbiter 76e4c2d69e fix for peer-ping in case that remote peer does not respond with valid values
17 years ago
fuchsi e192f99134 fix small bug introduced in r4089 that appeared when we tried to remove "gzip" encoding from Accept-Encodings header
17 years ago
fuchsi ae4b9308ef Fix problems with some web servers which couldn't handle the way yacy was sending requests. Thx to celle for the patch.
17 years ago
fuchsi 6601e37512 clear caches after changing blacklists, closes http://forum.yacy-websuche.de/viewtopic.php?f=6&t=241&p=1964#p1964
17 years ago
fuchsi 5b0c1449e1 various fixes and cleanups for blacklist handling:
17 years ago