Commit Graph

2916 Commits (f01c50cf8dbfd66c0a58dea113ac2072b56b1602)

Author SHA1 Message Date
orbiter 7d875290b2 more generics
17 years ago
orbiter 9d693ee635 more generics
17 years ago
orbiter 0f5c4abaca more generics
17 years ago
orbiter 974fea7933 added term-frequency ranking
17 years ago
orbiter 1a296af6ff more generics
17 years ago
orbiter 4a80902081 - added ViewProfile as rdf in foaf syntax
17 years ago
orbiter da8c850a25 disabled IO path optimization (seems to block other methods)
17 years ago
hermens d177ceb3b3 Fix for growing responseHeader[12].db when using proxyCacheLayout = hash
17 years ago
apfelmaennchen b1fae9b5af fixed import Netscape Bookmarks
17 years ago
orbiter 2485681002 added termination control for RotateIterator
17 years ago
orbiter e2e7f065e9 minor fixes, some generics
17 years ago
orbiter 15397298dc - refactoring of indexControlRWIs: moved statics to own class; better Dublin Core naming
17 years ago
apfelmaennchen f3a9e9c542 added getFolderList() to bookmarksDB
17 years ago
orbiter db25425893 more generics
17 years ago
orbiter 9e7cd4fdbb more generics
17 years ago
orbiter 4e70dff8cf more generics
17 years ago
orbiter 6dc679785f - fixed bad sort behavior of kelondroRowSet, in this case: no sort at all!
17 years ago
orbiter 0b4205eb5a - fix double-deletion in eco tables
17 years ago
orbiter 4ce6fab428 added special handling for doubles in eco tables after initialization
17 years ago
orbiter 002a109c4d patch for http://forum.yacy-websuche.de/viewtopic.php?p=4597#p4597
17 years ago
orbiter 634430c48a - more logging
17 years ago
orbiter d372a78aef some fixes to bring back lulabads peer..
17 years ago
low012 f4799c2334 *) removed since I decided to turn this into a project of it's own using Perl to gather n-gram data which YaCy will be able to use
17 years ago
orbiter 4ffbcd54a4 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=754
17 years ago
apfelmaennchen e81bced2bd reorganized the code and adjusted getTagIterator() to suit folders
17 years ago
orbiter 85dc62c16f refactoring: more dublin core - compliant naming
17 years ago
orbiter efd0b8371a - added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser
17 years ago
low012 cfd4fecd12 *) blanks in paths for restart and update script are replaced by backslash+blank now (see http://forum.yacy-websuche.de/viewtopic.php?t=745)
17 years ago
orbiter f945ee21d2 some security additions, keep maximum byte[] size to 2^27
17 years ago
orbiter 2f3b2f3481 - extended dbtest for comparisment tests
17 years ago
orbiter 9eb746863d interface enhancements for eco records memory statistics
17 years ago
orbiter 9abc927645 to fix inconsistencies in collection index, a double reference reporting mechanism has been implemented
17 years ago
orbiter 58a1f518f8 fixed some problems with eco tables
17 years ago
orbiter d4d07802ac better RAM protection using eco tables
17 years ago
orbiter f4e9ff6ce9 more generics
17 years ago
orbiter cbefc651ac more generics
17 years ago
orbiter 45339c3db5 more generics
17 years ago
orbiter 94f21d9403 activated new kelondroEcoTable file structure.
17 years ago
orbiter a0f7f2faad some more generics
17 years ago
orbiter dc26d6262b - removed write buffer from kelondroCache (was never used because buggy; will now be replaced by new EcoBuffer)
17 years ago
orbiter dbdec0f4d3 another fix for the "too many processes in loader queue, dismissed" - problem:
17 years ago
orbiter b806a6af8b renamed kelondroEcoRecords to kelondroFullRecords (the "Eco"-name will be used for something else)
17 years ago
orbiter 065ba2d60f fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=719&hilit=
17 years ago
orbiter f3f02b08ec no distinction between standard and pro releases in auto-updater
17 years ago
borg-0300 3cab85158c update for last commit
17 years ago
borg-0300 53367d941a more information (BASE64)
17 years ago
orbiter b3636f5ba8 re-implemented file index in kelondroFlex
17 years ago
orbiter a6ca3b51be more generics
17 years ago
orbiter a5054c038d - added large number of generics
17 years ago
orbiter 71bcf02d3a - removed pro-version (is the same as standard version, use the standard instead)
17 years ago
orbiter ce7257483d fix for bad fix with random access files (no performace enhancement)
17 years ago
apfelmaennchen 704de4dee8 Neue Funktion angelegt - notwendig für Einschränkung der Tagwolke
17 years ago
orbiter 016fc594af more generics
17 years ago
orbiter ecd7f8ba4e - added NEAR operator (must be written in UPPERCASE in search query)
17 years ago
orbiter 3e3d2e39a4 - some refactoring and redesign of kelondroBytesIntMap (created new class kelondroRAMIndex)
17 years ago
orbiter 03e7782269 more generics
17 years ago
orbiter f7c5ccedc7 more generics
17 years ago
low012 7af60fb24d *) fixed bug in update script
17 years ago
low012 ae6d07bdb8 *) "Did you mean:" will only be displayed if the list of suggested URLs is not empty.
17 years ago
low012 408cb7a29b *) added check if archive for update is OK, install if OK, else just restart (http://forum.yacy-websuche.de/viewtopic.php?t=663)
17 years ago
orbiter df2a7a8ac8 more generics
17 years ago
orbiter 9d8b17188a more generics, bugfixes for wrong cast
17 years ago
low012 b08f877e97 *) tried to get rid of warnings when compiling parsers (http://forum.yacy-websuche.de/viewtopic.php?t=660)
17 years ago
orbiter 4dc438f7e7 moved to Java 1.5:
17 years ago
orbiter db0d3d5e54 release 0.56 (and some last fixes)
17 years ago
fuchsi d517e96714 last cleanup bits to serverDate before the release. only safe refactoring (method renaming) changes outside of serverDate.
17 years ago
hermens 4748d5c1ab Some enhancements to time management:
17 years ago
orbiter 52dd015218 new release strategy: the standard release is now built the same way as the pro release
17 years ago
fuchsi 1cb6e431a6 Replace the ISO8601 aka W3C datetime parser by one that supports every representation allowed by this standard, see http://www.w3.org/TR/NOTE-datetime
17 years ago
fuchsi 33ee6745f6 more cleanup in serverDate
17 years ago
fuchsi 3c30c2da75 more cleanup and API consistency changes, more to come...
17 years ago
fuchsi f41172f850 Merge httpDate into serverDate as suggested. Removed some unnecessary code and fixed a possible synchronization problem.
17 years ago
fuchsi a52681dd49 add buffering for the performance graph to avoid ConcurrentModificationException
17 years ago
orbiter 814aff60bd - (re-)activated ftp protocol. see discussion here: http://forum.yacy-websuche.de/viewtopic.php?f=6&t=623&hilit=&p=3875#p3875
17 years ago
low012 6fbda9ef4f *) cleaned up code
17 years ago
fuchsi 21f7e13fa1 fix stupid tiny bug introduced in rev 4276 that broke request URL parsing almost completely
17 years ago
fuchsi 5d406d0094 - fixed url "file extension" parsing when there is no extension (like http://yacy.net/ would have extracted .net/)
17 years ago
fuchsi 21b8d1b918 small cosmetic change for static fields in serverCore (special protocol ASCII entities) to improve readability
17 years ago
orbiter 270d016d89 fix for missing anonymization in search profiling
17 years ago
orbiter e3e4f06be4 enhanced search result preparation in the case that no result is found (fast abandon of search)
17 years ago
fuchsi 1bd02762de Improve HTTP/ICAP header processing.
17 years ago
orbiter 01554f4012 fixed bug with double-check in crawler
17 years ago
orbiter b1e08d354c repaired indexing after search snippet loading
17 years ago
orbiter 48138952ff added memory measurement for index recreation to avoid OOM during index RAM space extension
17 years ago
orbiter 9e23acf2d6 introduced new 'authority' ranking property
17 years ago
orbiter a1b80017e0 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=594&p=3630#p3630
17 years ago
orbiter a3bfd668aa opening of array files at startup time, not when first time the web index is accessed
17 years ago
orbiter ca488e03f5 fixed authorization case
17 years ago
orbiter 6a3a292015 - smoothed ymage font
17 years ago
low012 7397152e04 *) quick hack for antialiasing, works only on borders now => less blurry image
17 years ago
orbiter 2954f96fae - removed public peer info box on status page, this info can now be seen in the status banner
17 years ago
low012 4eb40c4f61 *) added 2 filters: blur and antialiasing (which in fact is nothing more than a mild blur) to ymageMatrix
17 years ago
orbiter aeb1cf83a6 - corrected banner link (relative now)
17 years ago
orbiter e22014dc83 some memory enhancements when generating and displaying ymage objects
17 years ago
orbiter f243e338cf implemented online caution also for local and remote search
17 years ago
orbiter c57eb76b13 removed CMY color model from ymage classes and re-introduced RGB color model
17 years ago
orbiter b46bcaa5d8 changed method of profiling
17 years ago
low012 76cd6ed6f6 *) New methods to insert bitmaps that feature transparencies.
17 years ago
orbiter be214e594f - generalized ymage initialization options
17 years ago
low012 ee8a177c26 *) Logo is in the middle of free space now.
17 years ago
low012 72698fcd36 *) Banner features a logo now. It does not look nice, but at least it works. Banner is not finished yet.
17 years ago
fuchsi 39d0f10ca1 Fix parsing oof dates in HTTP headers.
17 years ago
orbiter aefb3f7765 added memory graph picture to PerformanceMemory_p.html
17 years ago
orbiter 9b0ae4b989 added referrer to remote crawl url list
17 years ago
fuchsi 18e516317d Fix problem with buggy HTTP-Servers which send illegal control characters in HTTP-Headers, they are ignored now.
17 years ago
orbiter 7d5544e9b1 added some security checks to new remote crawl pull method to prevent that indexer is overloaded
17 years ago
orbiter 89b9b2b02a redesigned remote crawl process:
17 years ago
fuchsi 69521d92e5 Add another external dependency from PDFBox package ("Bouncy Castle"). This is necessary for parsing of some encrypted PDF files.
17 years ago
orbiter 90a02990d2 NPE fix, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3383#p3383
17 years ago
orbiter 2fcd18a972 - fixed bad behaviour of search event worker processes
17 years ago
orbiter 445c0b5333 added domain list extraction and html export format
17 years ago
orbiter d8d77fc4b2 fix for NPE, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3368#p3368
17 years ago
orbiter bf6952abe7 - added url export to http://localhost:8080/IndexControlURLs_p.html
17 years ago
orbiter af10f729df fixed image search and favicon loading
17 years ago
orbiter c48b73cda2 redesign of ranking data structure
17 years ago
orbiter 6f1308da2f - some enhancements to IndexControlURLs (shows more links, connects referrer to another query)
17 years ago
orbiter c527969185 - enhanced monitoring of ranking parameters
17 years ago
orbiter bd5673efbe added cleaning of search event before opening the index administration
17 years ago
orbiter 55da871211 preparations for better ranking: better debugging of index properties
17 years ago
low012 383dc815d2 *) fix for commit 4212
17 years ago
orbiter 3491531cea - fixed 'appears in url' flag in index generation
17 years ago
orbiter ec7ba0d3d0 - fixed problem with too small sort fields (sortbound was not set)
17 years ago
orbiter bc2368e907 fix for problem with remote crawl referrers
17 years ago
orbiter 875096552f fix for NPE in case that remote search results are empty
17 years ago
orbiter 64b3b79e44 - fix for termination problem with uniq()
17 years ago
orbiter 0abf33ed03 - tried to remove deadlock
17 years ago
low012 a4010f7dc8 *) fixed bug where dots were added after numbers < 1000: "123" was transformed to "123." which is undesirable
17 years ago
orbiter 2421127612 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=513&hilit=
17 years ago
orbiter d0d2771883 disabled multiprocessoring of rowCollection.sort for testing purpose
17 years ago
orbiter edc4da5317 fix for division by zero in test reoutine
17 years ago
orbiter df38aaf7bd update to RowCollection sort speed-enhancements:
17 years ago
orbiter 0eb60cfe6f better handling of seed properties
17 years ago
orbiter ecba35de72 enhanced computing speed of kelondro core function: sorting
17 years ago
orbiter 6eaa5a0e64 enhanced local search speed. The ranking process is now 6 times faster that before.
17 years ago
fuchsi 425e4ead66 Allow absolute paths in configuration settings.
17 years ago
borg-0300 e8d32d9f62 other loglevel
17 years ago
borg-0300 a5d28785b1 less OOM (works for me)
17 years ago
orbiter ccbfb15b6b enhancement to crawl stacker enqueue order
17 years ago
hermens 5c5344ae97 Beautify log
17 years ago
hermens 35cf196204 transferRanking(): Do not flush more ranking files than requested by caller.
17 years ago
hermens d0aa8cf25d Only update handshaked peer's last seed date if it has not been updated recently.
17 years ago
hermens 8f9d65da67 Small corrections to dhtFlushControl()
17 years ago
orbiter 55c87b3b12 changed behavior of crawl stacker
17 years ago
hermens 18144043e6 Correct UTC Offset at beginning/end of daylight savings time
17 years ago
orbiter 4fefa53135 removed parser object pool, see also svn 4106
17 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
17 years ago
fuchsi a718858e8b seed.CCOUNT is interpreted as a double value not int
17 years ago
fuchsi 0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects:
17 years ago
orbiter f8318436a1 fix for last commit
17 years ago
orbiter 7d57b80598 distinct keepOrder strategy, more discrete implementation of enhancement introduced in SVN 4158
17 years ago
orbiter 9a7b093eed tried to avoid endless loop, see also:
17 years ago
orbiter b856e377a9 some additions and a small bugfix to SVN 4158
17 years ago
hermens 501a7aae90 Small correction
17 years ago
hermens caff520988 Removed unnecessary and unused code.
17 years ago
hermens d732840f8a Avoid ConcurrentModificationException when accessing the PerformanceQueues page while yacy is indexing.
17 years ago
fuchsi 35303f9504 add real size values (KBytes) of the DHT-In/Out-RAM-Caches to the PerformanceQueues page. A lot of users seem to tweak this value and it might help in finding the best size in relation to the peer's memory ressources.
17 years ago
fuchsi 38bbd4a4b3 no code changes. just touched yacyClient.java to trigger a rebuild of the file in an uncleaned tree.
17 years ago
fuchsi f717beecb1 - Changed yFormatter handling to be more flexible and produce more readable code for server pages. There are serverObject.putNum() methods to allow adding of number type values in a formatted form, and put() methods for number types that add them without formatting. This reduces the need to transform them into Strings in server pages and removes the HTML encoding step which is unecessary for numbers.
17 years ago
fuchsi ca83f5a8d9 Add external lib FontBox which is part of the PDFBox (they extracted the font handling code into this package in 0.7.3).
17 years ago
fuchsi 3352474dd8 Remove grouping separator in Network.xml (yacystats will woork without it) and format a few more numbers.
17 years ago
fuchsi 06e6a1ff62 Add a generalized Formatter class yFormatter inspired by http://forum.yacy-websuche.de/viewtopic.php?f=5&t=437
17 years ago
fuchsi e77aec8c9d fix handling of encrypted PDF-Documents (with default user password "")
17 years ago
fuchsi b5f7df8d0a Speed up remove operations in rowCollections.
17 years ago
low012 fdb0b861f8 *) fixed wrong calculation of network words, network links, network PPM if peer is senior or principal peer
17 years ago
fuchsi 508de558f7 sbStackCrawlThread is null during first cleanProfiles() run at startup.
17 years ago
fuchsi 70614385ef Attempt to fix the "lost profile handle" bug.
17 years ago
low012 507ecd8afa *) added banner that can be displayed like this: http://localhost:8080/Banner.png
17 years ago
fuchsi 9b0948cb4c gnarf. mixed up the positions. finally fixed...
17 years ago
fuchsi c0f5fc51ef bugfix for last commit
17 years ago
orbiter 33fb2f756d added emergency fail case in remote crawls
17 years ago
fuchsi c5a8585ac6 fix more encooding problems in yacysearch.rss.
17 years ago
fuchsi 6b00fe0c4e fix ArrayIndexOutOfBoundsException
17 years ago
orbiter 3e60ae93b9 modified remote search snippet fetch behavior: do not fetch snippets for more than 300 milliseconds, even if the snippets can be found locally without online fetch
17 years ago
orbiter 97f1ca52bd fox for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=390
17 years ago
orbiter 143fa40d77 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=394&p=2382#p2382
17 years ago
orbiter 711641f167 extended client connection clean-up:
17 years ago
orbiter b19bb6e5b1 - reverted svn 4132; this did not solve the problem and removed the emergency mehtod which caused production failure for shure within some hours
17 years ago
fuchsi 1eba408d2f Make sure that sockets which couldn't be opened aren't handled as active connections, in which case they wouldn't be closed.
17 years ago
fuchsi 03c5b4ad68 more fixes to the yacysearch.rss, it's now 100% valid according to http://feedvalidator.org
17 years ago
orbiter d69d386f7d added additional forced client connection closing
17 years ago
orbiter dea7bee049 - increased minimum time before an active connection is interrupted from 1 minute to 10 minutes
17 years ago
orbiter c1440d2241 fixed problem with redirection: redirected URLs had not been tested with the double-check
17 years ago
fuchsi 7404f2c35c Fix some of the issues with the RSS search interface, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=392
17 years ago
orbiter 98abe0804d another enhancement to crawl starts with link files
17 years ago
orbiter 1b42152a76 fixed and enhanced some details in crawl start with file
17 years ago
orbiter 4465db7399 removed debug information from network grafic
17 years ago
orbiter 01e0669264 re-designed some parts of DHT position calculation (effect is the same as before)
17 years ago
hermens d547c3b4bd Avoid NullPointerException in yacySeedDB.lookupByIP
17 years ago
orbiter 5b1a937ed8 fix for crawl stack database format change, introduced in SVN 4113
17 years ago
orbiter af25c98306 enhanced local search performance in case of a remote search:
17 years ago
orbiter 842308ea97 - redesigned crawl start menu, integrated monitoring pages
17 years ago
orbiter 341f7cb327 steps to enhance remote search performance:
17 years ago
orbiter 2f1ff048ba some fixes to socket connection time-out
17 years ago
orbiter 3c74014004 automatic deletion of dead client connections
17 years ago
orbiter 11b4f80bde - fixed non-closing client connections
17 years ago
orbiter d352853f2d fix for non-closing client sessions
17 years ago
orbiter 1488769e1f cleanup of unmaintained and outdated performance methods:
17 years ago
orbiter 3cb9cdc9be try to fix connection problem, possible cause for wrong junior status and non-passive passive peers:
17 years ago
borg-0300 ba59de773f again and again junior - test
17 years ago
orbiter 4275727d69 fix for peer ping problem (implemented a 3-time re-ping); cause for 'Connection reset' still unknown
17 years ago