Commit Graph

3427 Commits (d4281b78da96dffdefda4b265068781853304ee1)

Author SHA1 Message Date
orbiter 21dbb39afa switched two balancer cases
16 years ago
orbiter 1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction
16 years ago
orbiter ddcf285499 - fixed a bug in performance setting (did not work with german translation)
16 years ago
orbiter 0cd0fee546 fixed bug with wrong proxy result enqueueing. See:
16 years ago
orbiter 670244849d fix for http://forum.yacy-websuche.de/viewtopic.php?p=9835#p9835
16 years ago
lotus fd9233244e configurable free disk space via disk.free
16 years ago
orbiter 25a62cdc3f small fixes
16 years ago
lotus 73f233bb11 * set resource observer to 1000MB
16 years ago
orbiter 5fbccfd75e fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1366&p=9348#p9348
16 years ago
orbiter a28faabfd2 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1351&p=9242#p9242
16 years ago
apfelmaennchen 7b63c66a08 - bugfix in bookmarksDB.Tag.hasPublicItems()
16 years ago
orbiter 1fb1665e71 increased dht interval to avoid peer selection failure
16 years ago
orbiter 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced
16 years ago
f1ori ba76995d2c * fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1415
16 years ago
f1ori bea6c13139 * with r5137 robotParser didn't work at all -> fix
16 years ago
lotus 3ded1efe84 kelondroExceptionCounter didn't work
16 years ago
f1ori ae677e1738 * fix problem in robotparser, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1421&p=9742
16 years ago
lotus 383d89481e count errors before deleting collection.index
16 years ago
lotus 0bb4fbc403 delete corrupted collecion.index on exit for rebuild on next start
16 years ago
lotus b68d06a6e8 performance settings based on network's remote crawl speed
16 years ago
danielr d60b2b198d proxy fixed 'not modified' http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1419
16 years ago
f1ori bd0318ba81 * YaCy only supports gzip-encoding, so remove any other encoding from request
16 years ago
orbiter bb5c898441 enhancements to localsearch behavior
16 years ago
orbiter 42e2d195ac added hint from http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1294
16 years ago
orbiter 39964e88fa fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1329#p9121
16 years ago
orbiter 3f3673b6e5 extended balancer:
16 years ago
orbiter 3c6e8d2015 set default ppm when network is switched
16 years ago
orbiter 3288c19c1a reduce remote crawl PPM for fresh peers in freeworld to 6 PPM
16 years ago
lotus 5ce9a100bb fix(2) for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1416
16 years ago
danielr cf29ca19d4 possible fix for POST character encoding http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374
16 years ago
danielr a2eeb6138c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1416
16 years ago
orbiter d09ddabd09 corrected a design mistake (5-byte hashes not necessary)
16 years ago
orbiter c97d0fcee7 modified the domain list export function:
16 years ago
orbiter 77ee0765a4 - added domain statistic generation to IndexControlURLs_p.html servlet
16 years ago
orbiter 80a7bc93d6 - added statistical evaluation about domains that appear during crawling
16 years ago
orbiter 4fbee21cea - added fetch-ahead again (had been removed in last commit)
16 years ago
lotus 423a89ebe8 * fix if yacy was installed to a path with whitespace
16 years ago
orbiter fc03b0437a fixed a error case where a second search after a first search with a different search word failed
16 years ago
orbiter eca171ba2e fix for case where javascript was not filtered by the html parser
16 years ago
lotus e645bae29f display table in log
16 years ago
orbiter ead39064c5 fixed problem with wrong result number calculation
16 years ago
hermens 2437beb96c fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1360&p=9321#p9321
16 years ago
orbiter 7b12e77a63 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1393&hilit=&p=9655#p9655
16 years ago
orbiter 05dbba4bab added logging conditions to all fine and finest log line calls
16 years ago
orbiter d3d41e2ee4 - fixed problem with searching with quotes (still not complete, but not as bad as before)
16 years ago
lotus 3fbfd5a78b * fix for non-changing offset on new search term
16 years ago
danielr 219b93df6a - fixed internal error after receiving chunked POST
16 years ago
lotus c245c7a45e delete index.dhtin/out.heap if restore fails
16 years ago
danielr cd19d0aee6 - added warnings for failed transferRWI (dht-in)
16 years ago
orbiter df4ff423c4 added additional properties to query id's to distinguish search events better
16 years ago
danielr d6d9b0f14a fixed transferRWI.html 'Read timed out'
16 years ago
danielr e503158527 Proxy: fix for never ending loading after POST
16 years ago
danielr 1a1d57e449 Proxy: added binary passthrough for POST
16 years ago
apfelmaennchen aa6ae77e5e - autoReCrawl: fix for filter settings
16 years ago
apfelmaennchen 8ae29bad57 - fix to previous change of Crawl Profile Names
16 years ago
apfelmaennchen 434104e4a0 - change Crawl profile name for autoreCrawl
16 years ago
danielr 9ff4fc11da partial fix (images,audio,video) for proxy and content-type problem http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1374
16 years ago
lotus 0df2e47012 changed auto recrawl to comply with new date format
16 years ago
lotus d9d9c522a1 addendum to last commit
16 years ago
lotus 480497f7c9 changed recrawl
16 years ago
orbiter da1b0b2fc6 added two new classes that will be used for the new htcache
16 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
16 years ago
borg-0300 08cdf6db8a fix for wrong "VegaYacyB" peers
16 years ago
danielr 4d937f6b21 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1396
16 years ago
apfelmaennchen bd931a82f7 - added dynamic filters to autoReCrawl.conf
16 years ago
apfelmaennchen b3fc5e96a3 - removed unused import from bookmarksDB
16 years ago
apfelmaennchen bc048db7b6 - bugfix for bookmarksDB's rebuildDates()
16 years ago
danielr 3c68905540 remove redundant null checks
16 years ago
danielr 753a1ae430 - changed default browser from netscape to firefox
16 years ago
orbiter 7989335ed6 Preparations to replace the HTCache with a new storage data structure:
16 years ago
danielr be28af50f5 - fixed "yacy2yacy no proxy"-problem
16 years ago
f1ori f99c307eff * correct debian build dependencies
16 years ago
orbiter bdae051d9a - extended new performance graph (better timing)
16 years ago
danielr d9cea5ff23 removed annotations which broke the build with java 1.5
16 years ago
danielr a087090bbb fixed starting crawl results in "No parser available to parse mimetype 'application/octet-stream'"
17 years ago
danielr 7e7e6a099a undo 5044
17 years ago
danielr f2d0bd7790 fix for NPE in JakartaHttpClient.setProxy
17 years ago
danielr bb6a6fc233 fixed 'FileUploadException Stream ended unexpectedly'
17 years ago
danielr 8422ee5ec4 - fixed UnsupportedEncoding (in proxy) using defaultCharset if no characterEncoding can be determined
17 years ago
hermens 3ac1988059 Add some sanity checks for invalid seeds
17 years ago
hermens cff4393f0c Fix HTCache so oldest Files get deleted first
17 years ago
danielr 31d97f2b9f replaced httpd.parseMultipart() by a 'right' implementation
17 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
apfelmaennchen 0500b1179e added a 2 min start up delay to serverBusyThread autoReCrawl to avoid a Null Pointer Exception...
17 years ago
apfelmaennchen e1574fe02e - added autoReCrawl folders to bookmarks (DATA/SETTINGS/autoReCrawl.conf)
17 years ago
orbiter ebb40d324b enhanced memory chart: shows now also the size of the word cache as third vector.
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
lotus 7e92484400 fix for open browser on windows 2000
17 years ago
f1ori b0724e5ec0 * add config option to disable cookie monitoring (disabled by default)
17 years ago
lotus 0b2f67577e Index Transfer:
17 years ago
lotus 694084c570 fix for NPE on shutdown
17 years ago
lotus 5f77f55ed7 possible fix for negative speed values
17 years ago
orbiter 50ef5c406f - refactoring of robots parser (removed opaque Objects[] result vector)
17 years ago
danielr 7913bdb75b Flextable: filename in errormessage if inconsistent
17 years ago
lotus d42eae25f8 yacyTray:
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
orbiter 3ca98fee42 removed superfluous copyright statement
17 years ago
danielr c049d80fbd fixed login problem with yacy as proxy (POST and Cookies)
17 years ago
lotus 62afea0c9f some improvements for yacyTray
17 years ago
danielr 7c110e07f0 removed debug
17 years ago
danielr eadc204130 gzip POST wiederholbar gemacht (macht transferURL stabiler)
17 years ago
lotus 28c39e2aa4 fix for new starter files
17 years ago
lotus fa695c2d9f tray is now only shown on Windows and doesn't block on linux
17 years ago
lotus d77ed28e2f temporary disabled tray because of flaws on only-shell-linux
17 years ago
lotus f8a1e3175e new yacyTray
17 years ago
orbiter 05c26d58d9 fixed missing remove operation in balancer
17 years ago
orbiter 606b323a2d fixed bug that appeared when a new crawl ist started
17 years ago
orbiter 38eb5bd1ee fixed a bug in kelondroBLOBHeap. The following files are probably inconsistent and should be deleted:
17 years ago
orbiter 28d5703f8a - fixed a bug in Robots.txt loader which could have caused that robots.txt files had been loaded from the same domain more than once
17 years ago
orbiter 7b1c9e6aee discovered and removed a (possibly large) memory leak:
17 years ago
orbiter 0f5fe8cc53 refactoring of method calling for objects from kelondroMapDataMining
17 years ago
orbiter 01d1ae6676 patch for negative time in case that the time of the computer is changed
17 years ago
orbiter 4acf0a61cd refactoring of kelondroObjects (mainly renaming to kelondroMap)
17 years ago
orbiter 441e9c861e fix for npe in HTCache cleaning process
17 years ago
orbiter f7aaeb3fad created new main menu entry 'Customization and Integration'
17 years ago
lotus 5488543b8f disabled disk usage logpoints
17 years ago
orbiter 1e6d12f146 Major update to BLOB data structures:
17 years ago
orbiter 81f75f5056 - removed unnecessary classes (these objects are much easier to handle using generics)
17 years ago
orbiter b38f467e3c better SRU compliance
17 years ago
orbiter 7052f2f61f - added copyright header of ResourceObserver
17 years ago
orbiter 1400cdc91e - refactoring of resourceObserver (moved it to crawler)
17 years ago
f1ori b6301a54fa * added class ListDirs to provoid generic listing of directories in systemdirectories and jar-files
17 years ago
lotus f2e2d09916 - fix for index transfer
17 years ago
orbiter a6719dfd2b - refactoring of robots parser
17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections.
17 years ago
orbiter 474e29ce4a added options to configure the 'corporate identity'-icons, the home page link and the greeting line from
17 years ago
orbiter 474659a71f - modified and enhanced the crawl balancer: better list export, fixing of damaged crawl queue at start-up, re-sorting at start-up to enhance domain order
17 years ago
orbiter 080cda97ef added another peer selection rule:
17 years ago
orbiter d37fd064f9 changed peer selection for search targets:
17 years ago
orbiter 69aac0d74c modified the diskUsage class regarding the following two aspects:
17 years ago
danielr da917cf4b1 undo reduced menu
17 years ago
danielr 0c1dc703e4 - set staticIP at startUp
17 years ago
danielr f7f9ceb967 diskUsage: replaced blocking sleep with semaphore
17 years ago
lotus 4a53649ee7 fixed dht-urls and ranking distribution log statistics
17 years ago
lotus 8d83185cb4 fixed dht-chunks/protocol log statistics
17 years ago
danielr 63eadfdf84 fixed unlimited FileSizeLimit
17 years ago
lotus 2dc7c00c1c fixed indexing log statistics
17 years ago
danielr dba7ba079e fixed NPE seen with queues_p.xml (serverClassLoader finds already loaded classes)
17 years ago
det 273fb01142 revert last fix; was wrong
17 years ago
det b6f50851fa fix memory requirement calculation
17 years ago
lotus ac85c52bae better readability for MIN_FREE_DISK_SPACE
17 years ago
lotus 54a73b58cf fixed restart on Windows when directory had spaces in it's name
17 years ago
det 609aaf0df3 rework of the windows part
17 years ago
det 1a4f26ba30 exclude HTDOCS from recursiv scan
17 years ago
det 6c07e894d9 add needed sleep
17 years ago
hermens d742cc080c Fix for RAMCache not flushing
17 years ago
danielr 6b7e873962 resourceObserver refactoring and some synchronisation for console output
17 years ago
orbiter 6bdd99e065 - more asserts to solve the ooB-problem
17 years ago
orbiter b928ae492a some code-cleanup and possible speed enhancements in different core methods
17 years ago
danielr 6a9cc29cdd workaround for IndexOutOfBoundsException in ResultURLs.getExecutorHash() seen @ CrawlResults.html?process=4
17 years ago
orbiter c998dc6556 - added security functions to flush url and search caches in case that memory is full
17 years ago
orbiter f4ae8082c3 - better error analysis for ooRange Exception in kelondroBase64Ordering
17 years ago
orbiter 84cbe75005 more asserts
17 years ago
orbiter e269c12710 small changes in partition routine
17 years ago
orbiter 31efb8fbee - fix for LOG path generation when the DATA/LOG does not exists (fix for bug introduced in SVN 4923)
17 years ago
lotus 877299cc74 better installer on Windows Vista
17 years ago
danielr 726218dd4a fixed logging
17 years ago
orbiter 21c87c36e3 added a log line
17 years ago
danielr 68c38c2d34 - WatchCrawler shows status without JavaScript
17 years ago
lotus fc79f013c4 better solution to update shortcut
17 years ago
det c0dfe49743 also exclude collection.0028.commons and RANKING at startup check
17 years ago
det 11656741f1 exclude LOCALE and RELEASE at startup check
17 years ago
lotus 48edbef5c7 * fix: display proper port on 1st startup
17 years ago
det 0727bb1e63 rework of console message handling; add of debugging output
17 years ago
lotus 43c47218ef fox for open browser on Windows
17 years ago
orbiter f5ef7f222e - fixed a bug in parser (directory paths had not been recognized)
17 years ago
lotus ed24eab737 small fix for windows in resource observer
17 years ago
hermens 75b4a5ced4 reinstate old timeout values for transferRWI and transferURL
17 years ago
orbiter 3330181aa0 refactoring:
17 years ago
orbiter 9a9737a54e fix for "no more elements available" exception
17 years ago
lotus 260553c3a5 better messages
17 years ago
det 6afeb535cd another bugfix for the windows drive check
17 years ago
det b416af7568 bugfix for the windows drive check
17 years ago
danielr 4b71912e76 fixed wrong class name
17 years ago
danielr 7feae906aa - organize imports
17 years ago
det f597185026 Initial import of the resource observer framework
17 years ago
orbiter c10eaf9bdb - fix for pop-up page upon first start
17 years ago
danielr d3037c2950 Accept all SSL-certificates (not only valid and self-signed), but put a warning into log file
17 years ago
orbiter e0e7f86f82 some bugfixes for the peer-ping process
17 years ago
danielr cbe722c480 small code cleanUp
17 years ago
orbiter b21598bdd0 - enhanced handling of own IP address inside seed
17 years ago
orbiter 40d7f485f3 - fixed several NPE bugs
17 years ago
orbiter 2f381b8d7a - fixed at least two causes for a NPE after a use case switch.
17 years ago
orbiter 2a604b7402 added superfast search result computation which can be obtained for local search when snippet fetching is disabled. An example search for the rss interface would be:
17 years ago
orbiter 9bef20b537 - added cleanup for unused server loggings: they are removed after the client had not been seen since one hour
17 years ago
orbiter 5f35ba6f05 added a list for content of the brute-force registration hashtable in access tracker
17 years ago
orbiter 74e3a547db more logging for mySeed loading error cases
17 years ago
orbiter 6f1a3fce05 BF Bugfix
17 years ago
lotus b09af53643 proper links to directories in repository dirlisting
17 years ago
lotus ba4091c5b2 proxy sends status code now
17 years ago
orbiter 1a1841392c small fix
17 years ago
orbiter 698293ef32 fix for lonely peers in networks with only one peer, especially intranet peers
17 years ago
orbiter 8a0e401320 - fix for bad code in peer actions
17 years ago
orbiter 0b52ef3e4b - update of grafics
17 years ago
orbiter 18ad12eceb added another fix for localhost addresses in seeds
17 years ago
orbiter 11e00a0849 - refactoring of seedURL handling
17 years ago
orbiter e91bf4c8cc - fix for bad reset of index / bad index location after deletion
17 years ago
lotus f284386b63 update deploy improvements for windows - ready for release now :-)
17 years ago
f1ori fd8bd5d0d1 * fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1176 (encoding issue)
17 years ago
orbiter 25192e0d36 added a deletion button to indexControlRWIs that deletes the complete web index
17 years ago
orbiter 01b3e9431a - fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1140&p=7626#p7626
17 years ago
orbiter 8be462986e fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1174&p=7841#p7841
17 years ago
orbiter f8b015949c fix for bug in html scraper that appears if opening and closing tag are not both in same case
17 years ago
orbiter d8277e6af1 - added parsing of numeric html entities for crawler
17 years ago
low012 f06c046f6c *) refactoring (several instances of Indetificator can be created and used parallel now)
17 years ago
orbiter 0c173821fd more access security regarding database access and snippet retrieval: restrict number of results for not-authorized searchers
17 years ago
lotus eac62a6882 * ported restart on Windows to unix-style, works on _noconsole now
17 years ago
orbiter 03438ee977 added missing implementation of network-path reference
17 years ago
orbiter 2ba7914f0b fix for NPE exception while fetching remote crawl jobs
17 years ago
lotus 4a48717017 * automatic update for windows
17 years ago
orbiter faed00d75d added use cases to basic configuration
17 years ago
orbiter 4229cd275c fixed several details about network switching, default password, random password and localhost authentification
17 years ago
orbiter d9d1c8de70 more protection against remote shutdown attacks: prevent loading using the crawler
17 years ago
orbiter c1d721dd2d fix for attacks on localhost-authorized peers from web pages with links to localhost addresses:
17 years ago
orbiter 6aa9bf43f9 fixed server port change
17 years ago
low012 a7dadf7f2f *) first version of a way to determine the language a text is written in (not perfect, but it works)
17 years ago
orbiter 1127d62b64 some enhancements to the access tracker (less synchronization)
17 years ago
orbiter 56a300f92a bugfix / local-search predicate
17 years ago
orbiter 2f29ab8779 more target server access security
17 years ago
orbiter 3bd1db776a implemented switch for admin authorization from localhost:
17 years ago
orbiter cfe6790498 - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),
17 years ago
orbiter 78087da287 - changed seed file storage to clear text
17 years ago
orbiter 5fde679acb - fixed problem in performance configuration
17 years ago
orbiter db032fb6de - added RWI transmissions to the event terminal
17 years ago
orbiter 239cc4428d - better domain graph, faster when more links exist, looks better
17 years ago
orbiter 415b92bb07 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1148&hilit=&p=7711#p7711
17 years ago
apfelmaennchen 2113672bf2 small fix on tag comporator functions
17 years ago
orbiter dd75b3cabc - patch for bad profiles
17 years ago
danielr 2080ff72b7 ftpc fix for npe
17 years ago
lotus e021278bf0 unescape link display in search results
17 years ago
danielr 3f1721b827 informational comment
17 years ago
danielr 04a51b775a changed .org/.net back to America
17 years ago
danielr 74b1a60043 fixed "java.lang.NoClassDefFoundError: org/a"
17 years ago
danielr f44d5d302b updated TLDs
17 years ago
hermens 5bfc02ccfb Repair publishThread
17 years ago
orbiter f42c8cf69c updated terminal and dynamic webstructure applet: can now change when crawl is running
17 years ago
orbiter 906c144799 - design update to new terminal and rssTerminal
17 years ago
orbiter 7ec01d444a fix for npe
17 years ago
orbiter ad0f905124 fix for npe in crawler
17 years ago
danielr ae03a54d23 pdfParser: updated lib, fixed ClassNotFoundException: CMSError
17 years ago
danielr 0d3808bd9e minor refactoring
17 years ago
orbiter 719f5defb1 updated some grafics at new terminal_p
17 years ago
lotus 9bc56a9edc xss protection
17 years ago
orbiter b32736762c enhanced rssTerminal
17 years ago
orbiter fbb712c669 refactoring:
17 years ago
orbiter 1689030ee8 refactoring: moved all crawler classes into their own package
17 years ago
orbiter d2ba1fd2ab major step forward to network switching (target is easy switch to intranet or other networks .. and back)
17 years ago
danielr d70a472460 added file for previous commit
17 years ago
danielr d32fe84472 added default User-Agent
17 years ago