Commit Graph

2940 Commits (f597185026b2350954992f6523f1a21685d36a27)

Author SHA1 Message Date
orbiter ba622bb240 addendum to svn 4553
17 years ago
orbiter 5530b8e1ca reverted changes to yacy protocol classes: they caused the sciencenet to lose connections
17 years ago
orbiter b664a53553 fix for NPE during search
17 years ago
orbiter b4ed937f1e - modified zone navigation (still does not work correctly)
17 years ago
orbiter 8d0470a5c6 new method to compute search history IDs
17 years ago
orbiter 65785da8f2 new method for best hash computation
17 years ago
orbiter 9eddc1506b - one try to fix the httpd problem
17 years ago
orbiter 7cc4ff05c9 some code enhancements and bugfixes
17 years ago
danielr 6788f8f7c1 fixed error 'FTPC cannot change directory'
17 years ago
orbiter 7ce76c8ff8 added missing file
17 years ago
orbiter bfed9c2da6 - some refactoring in search process
17 years ago
borg-0300 3445b1e10b *better logging
17 years ago
borg-0300 4b0339fec0 *fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=927
17 years ago
orbiter 275a226cc5 refactoring
17 years ago
apfelmaennchen bc3d3b4c97 fixed rebuildTags() to correctly rebuild folders...
17 years ago
danielr fbe335db73 consistent use of de.anomic.server.serverMemory to get information about memory statistics
17 years ago
orbiter 8c06436c4a removing the error-db upon each time a start-up is made.
17 years ago
orbiter 4fdf695064 - fixed a bug in remote search that prevented any results from being generated (!)
17 years ago
borg-0300 0ddbed9451 Less memory consumption at start
17 years ago
orbiter 1dce2f1079 more multithreading support:
17 years ago
orbiter 6779b455d7 another fix for the punycode parser/generator (should work now!)
17 years ago
orbiter 1b127406d0 update to punycode encoding (still not working)
17 years ago
orbiter 83860507c9 - added punycode class from gnu idn library
17 years ago
orbiter 253a453413 removed possible synchronization deadlock
17 years ago
orbiter 3f321ece7d added a search history to the new search page
17 years ago
orbiter c48e25d784 - fixed selection box for topwords
17 years ago
orbiter 87a8747ce3 - enhanced recognition, parsing, management and double-occurrence-handling of image tags
17 years ago
low012 652086159a *) Replaced System.err.println() by logging function. Left System.err.println()s as comments to be able to quickly revert changes since gzip is an application with its own main method and Orbiter maybe wants to keep it this way.
17 years ago
orbiter 677ee2ea04 added remove operation to collection index (re-activation)
17 years ago
orbiter d477483373 stronger criteria to use RAM copy to use table copy
17 years ago
orbiter a7abee3578 - fixed some data types in new search stack
17 years ago
orbiter 81687b6bd5 added missing hashCode computation for previous feature
17 years ago
orbiter bedd8dfbe2 - added image sorting by image size. This is the default now.
17 years ago
orbiter 727feb4358 - fixed some bugs in ranking computation
17 years ago
orbiter f4c73d8c68 - fixed highslide usage
17 years ago
orbiter 2327451653 - changed order of database initialisation (index first)
17 years ago
orbiter 3441ec3928 - some small changes to highslide integration to get it working... (does not work yet)
17 years ago
orbiter 6c3cd2b4f2 - added new way to watch images from the image search:
17 years ago
orbiter 61a81820e3 - refactoring of search tracker
17 years ago
lulabad 9ecc17baef fixed double Blog entries
17 years ago
orbiter 36b898ca7a - tested successfully z-presentation of yacy seed encoding
17 years ago
orbiter 066c88140f quickfix for OOM, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=875&hilit=&p=5686#p5686
17 years ago
orbiter 4079c38ce0 - probably slightly better default ranking
17 years ago
orbiter 8fd5e52f04 added basket icons and experimental gif animation class
17 years ago
lulabad 94e256e13b * removed single Blogview, now links directly to BlogComments.html
17 years ago
orbiter ff5969901c modified dir servlet to cooperate with intranet indexing from the own HTDOCS repository:
17 years ago
lulabad 00f5f917de - more refactoring to blog
17 years ago
orbiter f890b039ee experiments with openstreetmaps
17 years ago
orbiter 7f445f34a6 please switch on the typical Java 5 warnings!
17 years ago
lulabad c1b9a03304 * some refactoring to Blog
17 years ago
lulabad 766a04bc06 fixed sort problem in Blog. see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=639
17 years ago
borg-0300 bfe171e693 Small change (generics)
17 years ago
borg-0300 2589290ded better ping
17 years ago
borg-0300 dae9053b21 BUGFIX
17 years ago
borg-0300 77ba446332 seedDB helpers update/cleanup
17 years ago
borg-0300 dd215e7f6b NPE fix
17 years ago
orbiter bd63999801 - faster search: using different data structures that avoid multiple calculations
17 years ago
lulabad 8358652fa9 some small changes to blog
17 years ago
orbiter 159aaf8889 re-introduced global search limitation when index receive is switched off
17 years ago
borg-0300 a9c4e9c309 Small change (ping)
17 years ago
borg-0300 9ab6ad8b73 more seedDB helpers
17 years ago
lulabad 6a85764e1a Second bugfix for numberbug in Blog.
17 years ago
orbiter efd5807a7c - some renaming of variables to support DC
17 years ago
lulabad 40a0591942 Fixed numberbug in Blog, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=639. This won't fix existing Blog entries (comes later).
17 years ago
orbiter 141db7ba48 there is less RAM needed for eco table (it's just a security-plus for RAM check)
17 years ago
orbiter 249d61759a fix for false RAM table activation in EcoTables
17 years ago
orbiter ff6b69b37e fix for NPE in access tracker
17 years ago
orbiter 3c7b94c119 - fix for online caution delay settings, see
17 years ago
orbiter f35a3794e0 auto-healing (deletion) of bad peer addresses during start-up
17 years ago
orbiter 42c1e11f2b added another link double-check
17 years ago
orbiter a5d388bfff fix for HTCache organisation that may have caused unlimited growth of the cache
17 years ago
orbiter 96c5e6acc7 added a double-check for search results
17 years ago
orbiter a1e9e6e2e6 fix for search result page navigation
17 years ago
orbiter 7404256997 - no more search time-out!
17 years ago
orbiter cd3e0d6f03 tried to fix another eco bug
17 years ago
orbiter 08a12e9bb5 - removed dashed line from default skin (looks much better!)
17 years ago
orbiter 89169d54fd fixed search result preparation
17 years ago
orbiter acf771d5e1 - fixed bug with too much RAM in crawler queue
17 years ago
orbiter a8a5df4a51 - more dublin core naming of page metadata
17 years ago
orbiter fa3b8f0ae1 fixed bug in remote search
17 years ago
orbiter 7d875290b2 more generics
17 years ago
orbiter 9d693ee635 more generics
17 years ago
orbiter 0f5c4abaca more generics
17 years ago
orbiter 974fea7933 added term-frequency ranking
17 years ago
orbiter 1a296af6ff more generics
17 years ago
orbiter 4a80902081 - added ViewProfile as rdf in foaf syntax
17 years ago
orbiter da8c850a25 disabled IO path optimization (seems to block other methods)
17 years ago
hermens d177ceb3b3 Fix for growing responseHeader[12].db when using proxyCacheLayout = hash
17 years ago
apfelmaennchen b1fae9b5af fixed import Netscape Bookmarks
17 years ago
orbiter 2485681002 added termination control for RotateIterator
17 years ago
orbiter e2e7f065e9 minor fixes, some generics
17 years ago
orbiter 15397298dc - refactoring of indexControlRWIs: moved statics to own class; better Dublin Core naming
17 years ago
apfelmaennchen f3a9e9c542 added getFolderList() to bookmarksDB
17 years ago
orbiter db25425893 more generics
17 years ago
orbiter 9e7cd4fdbb more generics
17 years ago
orbiter 4e70dff8cf more generics
17 years ago
orbiter 6dc679785f - fixed bad sort behavior of kelondroRowSet, in this case: no sort at all!
17 years ago
orbiter 0b4205eb5a - fix double-deletion in eco tables
17 years ago
orbiter 4ce6fab428 added special handling for doubles in eco tables after initialization
17 years ago
orbiter 002a109c4d patch for http://forum.yacy-websuche.de/viewtopic.php?p=4597#p4597
17 years ago
orbiter 634430c48a - more logging
17 years ago
orbiter d372a78aef some fixes to bring back lulabads peer..
17 years ago
low012 f4799c2334 *) removed since I decided to turn this into a project of its own using Perl to gather n-gram data which YaCy will be able to use
17 years ago
orbiter 4ffbcd54a4 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=754
17 years ago
apfelmaennchen e81bced2bd reorganized the code and adjusted getTagIterator() to suit folders
17 years ago
orbiter 85dc62c16f refactoring: more dublin core - compliant naming
17 years ago
orbiter efd0b8371a - added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser
17 years ago
low012 cfd4fecd12 *) blanks in paths for restart and update script are replaced by backslash+blank now (see http://forum.yacy-websuche.de/viewtopic.php?t=745)
17 years ago
orbiter f945ee21d2 some security additions, keep maximum byte[] size to 2^27
17 years ago
orbiter 2f3b2f3481 - extended dbtest for comparison tests
17 years ago
orbiter 9eb746863d interface enhancements for eco records memory statistics
17 years ago
orbiter 9abc927645 to fix inconsistencies in collection index, a double reference reporting mechanism has been implemented
17 years ago
orbiter 58a1f518f8 fixed some problems with eco tables
17 years ago
orbiter d4d07802ac better RAM protection using eco tables
17 years ago
orbiter f4e9ff6ce9 more generics
17 years ago
orbiter cbefc651ac more generics
17 years ago
orbiter 45339c3db5 more generics
17 years ago
orbiter 94f21d9403 activated new kelondroEcoTable file structure.
17 years ago
orbiter a0f7f2faad some more generics
17 years ago
orbiter dc26d6262b - removed write buffer from kelondroCache (was never used because buggy; will now be replaced by new EcoBuffer)
17 years ago
orbiter dbdec0f4d3 another fix for the "too many processes in loader queue, dismissed" - problem:
17 years ago
orbiter b806a6af8b renamed kelondroEcoRecords to kelondroFullRecords (the "Eco"-name will be used for something else)
17 years ago
orbiter 065ba2d60f fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=719&hilit=
17 years ago
orbiter f3f02b08ec no distinction between standard and pro releases in auto-updater
17 years ago
borg-0300 3cab85158c update for last commit
17 years ago
borg-0300 53367d941a more information (BASE64)
17 years ago
orbiter b3636f5ba8 re-implemented file index in kelondroFlex
17 years ago
orbiter a6ca3b51be more generics
17 years ago
orbiter a5054c038d - added large number of generics
17 years ago
orbiter 71bcf02d3a - removed pro-version (is the same as standard version, use the standard instead)
17 years ago
orbiter ce7257483d fix for bad fix with random access files (no performance enhancement)
17 years ago
apfelmaennchen 704de4dee8 Created new function - needed for restricting the tag cloud
17 years ago
orbiter 016fc594af more generics
17 years ago
orbiter ecd7f8ba4e - added NEAR operator (must be written in UPPERCASE in search query)
17 years ago
orbiter 3e3d2e39a4 - some refactoring and redesign of kelondroBytesIntMap (created new class kelondroRAMIndex)
17 years ago
orbiter 03e7782269 more generics
17 years ago
orbiter f7c5ccedc7 more generics
17 years ago
low012 7af60fb24d *) fixed bug in update script
17 years ago
low012 ae6d07bdb8 *) "Did you mean:" will only be displayed if the list of suggested URLs is not empty.
17 years ago
low012 408cb7a29b *) added check if archive for update is OK, install if OK, else just restart (http://forum.yacy-websuche.de/viewtopic.php?t=663)
17 years ago
orbiter df2a7a8ac8 more generics
17 years ago
orbiter 9d8b17188a more generics, bugfixes for wrong cast
17 years ago
low012 b08f877e97 *) tried to get rid of warnings when compiling parsers (http://forum.yacy-websuche.de/viewtopic.php?t=660)
17 years ago
orbiter 4dc438f7e7 moved to Java 1.5:
17 years ago
orbiter db0d3d5e54 release 0.56 (and some last fixes)
17 years ago
fuchsi d517e96714 last cleanup bits to serverDate before the release. only safe refactoring (method renaming) changes outside of serverDate.
17 years ago
hermens 4748d5c1ab Some enhancements to time management:
17 years ago
orbiter 52dd015218 new release strategy: the standard release is now built the same way as the pro release
17 years ago
fuchsi 1cb6e431a6 Replace the ISO8601 aka W3C datetime parser by one that supports every representation allowed by this standard, see http://www.w3.org/TR/NOTE-datetime
17 years ago
fuchsi 33ee6745f6 more cleanup in serverDate
17 years ago
fuchsi 3c30c2da75 more cleanup and API consistency changes, more to come...
17 years ago
fuchsi f41172f850 Merge httpDate into serverDate as suggested. Removed some unnecessary code and fixed a possible synchronization problem.
17 years ago
fuchsi a52681dd49 add buffering for the performance graph to avoid ConcurrentModificationException
17 years ago
orbiter 814aff60bd - (re-)activated ftp protocol. see discussion here: http://forum.yacy-websuche.de/viewtopic.php?f=6&t=623&hilit=&p=3875#p3875
17 years ago
low012 6fbda9ef4f *) cleaned up code
17 years ago
fuchsi 21f7e13fa1 fix stupid tiny bug introduced in rev 4276 that broke request URL parsing almost completely
17 years ago
fuchsi 5d406d0094 - fixed url "file extension" parsing when there is no extension (like http://yacy.net/ would have extracted .net/)
17 years ago
fuchsi 21b8d1b918 small cosmetic change for static fields in serverCore (special protocol ASCII entities) to improve readability
17 years ago
orbiter 270d016d89 fix for missing anonymization in search profiling
17 years ago
orbiter e3e4f06be4 enhanced search result preparation in the case that no result is found (fast abandon of search)
17 years ago
fuchsi 1bd02762de Improve HTTP/ICAP header processing.
17 years ago
orbiter 01554f4012 fixed bug with double-check in crawler
17 years ago
orbiter b1e08d354c repaired indexing after search snippet loading
17 years ago
orbiter 48138952ff added memory measurement for index recreation to avoid OOM during index RAM space extension
17 years ago
orbiter 9e23acf2d6 introduced new 'authority' ranking property
17 years ago
orbiter a3bfd668aa opening of array files at startup time, not when first time the web index is accessed
17 years ago
orbiter ca488e03f5 fixed authorization case
17 years ago
orbiter 6a3a292015 - smoothed ymage font
17 years ago
low012 7397152e04 *) quick hack for antialiasing, works only on borders now => less blurry image
17 years ago
orbiter 2954f96fae - removed public peer info box on status page, this info can now be seen in the status banner
17 years ago
low012 4eb40c4f61 *) added 2 filters: blur and antialiasing (which in fact is nothing more than a mild blur) to ymageMatrix
17 years ago
orbiter aeb1cf83a6 - corrected banner link (relative now)
17 years ago
orbiter e22014dc83 some memory enhancements when generating and displaying ymage objects
17 years ago
orbiter f243e338cf implemented online caution also for local and remote search
17 years ago
orbiter c57eb76b13 removed CMY color model from ymage classes and re-introduced RGB color model
17 years ago
orbiter b46bcaa5d8 changed method of profiling
17 years ago
low012 76cd6ed6f6 *) New methods to insert bitmaps that feature transparencies.
17 years ago
orbiter be214e594f - generalized ymage initialization options
17 years ago
low012 ee8a177c26 *) Logo is in the middle of free space now.
17 years ago
low012 72698fcd36 *) Banner features a logo now. It does not look nice, but at least it works. Banner is not finished yet.
17 years ago
fuchsi 39d0f10ca1 Fix parsing of dates in HTTP headers.
17 years ago
orbiter aefb3f7765 added memory graph picture to PerformanceMemory_p.html
17 years ago
orbiter 9b0ae4b989 added referrer to remote crawl url list
17 years ago
fuchsi 18e516317d Fix problem with buggy HTTP-Servers which send illegal control characters in HTTP-Headers, they are ignored now.
17 years ago
orbiter 7d5544e9b1 added some security checks to new remote crawl pull method to prevent the indexer from being overloaded
17 years ago
orbiter 89b9b2b02a redesigned remote crawl process:
17 years ago
fuchsi 69521d92e5 Add another external dependency from PDFBox package ("Bouncy Castle"). This is necessary for parsing of some encrypted PDF files.
17 years ago
orbiter 90a02990d2 NPE fix, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3383#p3383
17 years ago
orbiter 2fcd18a972 - fixed bad behaviour of search event worker processes
17 years ago
orbiter 445c0b5333 added domain list extraction and html export format
17 years ago
orbiter d8d77fc4b2 fix for NPE, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=549&hilit=&p=3368#p3368
17 years ago
orbiter bf6952abe7 - added url export to http://localhost:8080/IndexControlURLs_p.html
17 years ago
orbiter af10f729df fixed image search and favicon loading
17 years ago
orbiter c48b73cda2 redesign of ranking data structure
17 years ago
orbiter 6f1308da2f - some enhancements to IndexControlURLs (shows more links, connects referrer to another query)
17 years ago
orbiter c527969185 - enhanced monitoring of ranking parameters
17 years ago
orbiter bd5673efbe added cleaning of search event before opening the index administration
17 years ago
orbiter 55da871211 preparations for better ranking: better debugging of index properties
17 years ago
low012 383dc815d2 *) fix for commit 4212
17 years ago
orbiter 3491531cea - fixed 'appears in url' flag in index generation
17 years ago
orbiter ec7ba0d3d0 - fixed problem with too small sort fields (sortbound was not set)
17 years ago
orbiter bc2368e907 fix for problem with remote crawl referrers
17 years ago
orbiter 875096552f fix for NPE in case that remote search results are empty
17 years ago
orbiter 64b3b79e44 - fix for termination problem with uniq()
17 years ago
orbiter 0abf33ed03 - tried to remove deadlock
17 years ago
low012 a4010f7dc8 *) fixed bug where dots were added after numbers < 1000: "123" was transformed to "123." which is undesirable
17 years ago
orbiter 2421127612 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=513&hilit=
17 years ago
orbiter d0d2771883 disabled multiprocessing of rowCollection.sort for testing purpose
17 years ago
orbiter edc4da5317 fix for division by zero in test routine
17 years ago
orbiter df38aaf7bd update to RowCollection sort speed-enhancements:
17 years ago
orbiter 0eb60cfe6f better handling of seed properties
17 years ago
orbiter ecba35de72 enhanced computing speed of kelondro core function: sorting
17 years ago
orbiter 6eaa5a0e64 enhanced local search speed. The ranking process is now 6 times faster than before.
17 years ago
fuchsi 425e4ead66 Allow absolute paths in configuration settings.
17 years ago
borg-0300 e8d32d9f62 other loglevel
17 years ago
borg-0300 a5d28785b1 less OOM (works for me)
17 years ago
orbiter ccbfb15b6b enhancement to crawl stacker enqueue order
17 years ago
hermens 5c5344ae97 Beautify log
17 years ago
hermens 35cf196204 transferRanking(): Do not flush more ranking files than requested by caller.
17 years ago
hermens d0aa8cf25d Only update handshaked peer's last seed date if it has not been updated recently.
17 years ago
hermens 8f9d65da67 Small corrections to dhtFlushControl()
17 years ago
orbiter 55c87b3b12 changed behavior of crawl stacker
17 years ago
hermens 18144043e6 Correct UTC Offset at beginning/end of daylight savings time
17 years ago
orbiter 4fefa53135 removed parser object pool, see also svn 4106
17 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
17 years ago
fuchsi a718858e8b seed.CCOUNT is interpreted as a double value not int
17 years ago
fuchsi 0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects:
17 years ago
orbiter f8318436a1 fix for last commit
17 years ago
orbiter 7d57b80598 distinct keepOrder strategy, more discrete implementation of enhancement introduced in SVN 4158
17 years ago
orbiter 9a7b093eed tried to avoid endless loop, see also:
17 years ago
orbiter b856e377a9 some additions and a small bugfix to SVN 4158
17 years ago
hermens 501a7aae90 Small correction
17 years ago
hermens caff520988 Removed unnecessary and unused code.
17 years ago
hermens d732840f8a Avoid ConcurrentModificationException when accessing the PerformanceQueues page while yacy is indexing.
17 years ago
fuchsi 35303f9504 add real size values (KBytes) of the DHT-In/Out-RAM-Caches to the PerformanceQueues page. A lot of users seem to tweak this value and it might help in finding the best size in relation to the peer's memory resources.
17 years ago
fuchsi 38bbd4a4b3 no code changes. just touched yacyClient.java to trigger a rebuild of the file in an uncleaned tree.
17 years ago
fuchsi f717beecb1 - Changed yFormatter handling to be more flexible and produce more readable code for server pages. There are serverObject.putNum() methods to allow adding of number type values in a formatted form, and put() methods for number types that add them without formatting. This reduces the need to transform them into Strings in server pages and removes the HTML encoding step which is unnecessary for numbers.
17 years ago
fuchsi ca83f5a8d9 Add external lib FontBox which is part of the PDFBox (they extracted the font handling code into this package in 0.7.3).
17 years ago
fuchsi 3352474dd8 Remove grouping separator in Network.xml (yacystats will work without it) and format a few more numbers.
17 years ago
fuchsi 06e6a1ff62 Add a generalized Formatter class yFormatter inspired by http://forum.yacy-websuche.de/viewtopic.php?f=5&t=437
17 years ago
fuchsi e77aec8c9d fix handling of encrypted PDF-Documents (with default user password "")
17 years ago
fuchsi b5f7df8d0a Speed up remove operations in rowCollections.
17 years ago
low012 fdb0b861f8 *) fixed wrong calculation of network words, network links, network PPM if peer is senior or principal peer
17 years ago
fuchsi 508de558f7 sbStackCrawlThread is null during first cleanProfiles() run at startup.
17 years ago
fuchsi 70614385ef Attempt to fix the "lost profile handle" bug.
17 years ago
low012 507ecd8afa *) added banner that can be displayed like this: http://localhost:8080/Banner.png
17 years ago
fuchsi 9b0948cb4c gnarf. mixed up the positions. finally fixed...
17 years ago
fuchsi c0f5fc51ef bugfix for last commit
17 years ago
orbiter 33fb2f756d added emergency fail case in remote crawls
17 years ago
fuchsi c5a8585ac6 fix more encoding problems in yacysearch.rss.
17 years ago