71 Commits (676e384b45ce07887f5afe321732c9aedc1d949a)

Author SHA1 Message Date
orbiter 37f88b4017 code cleanup
19 years ago
theli 44fa94ac52 *) Modifications for dbImport functionality
19 years ago
orbiter 3d8a5ae652 code cleanup
19 years ago
theli d4ac3e25b1 *) Bugfix for file system link bug during detection of invalid URLs
19 years ago
orbiter adf75bc9fa better logging for invalid file path detection
19 years ago
theli c650b112ea *) Bugfix for relative URL Bug in Crawler
19 years ago
orbiter d2731418bf added creation of global ranking files and changed url normal form usage
19 years ago
borg-0300 00ab4d8723 cleaned, small change, Properties
19 years ago
theli b8ceb1ffde *) Adding better https support for crawler
19 years ago
hydrox 56b9f34411 *)removed unused imports
19 years ago
theli 525c8dcbd4 *) Adding Traffic Statistic for Crawler
19 years ago
theli 02d9af1a70 *) Restructuring and extending of Remote Proxy Support
19 years ago
theli a2fa75e688 *) Asynchronous queuing of crawl job URLs (stackCrawl)
19 years ago
orbiter 0c3a20d44f more + changed log for better understanding of outOfMemory bug and others
19 years ago
theli 28c5687ff9 *) Bugfix for "download of non supported file content" via crawler
19 years ago
theli 35c6c5ead7 *) Bugfix for "Blacklist und Crawlen" Bug.
19 years ago
theli d1de71e9f6 *) Suppress stacktrace on proxy error for "No route to host Exception"
19 years ago
theli 56160cbd01 *) Bugfix for "YaCy verzählt sich ..." Bug.
19 years ago
theli 51b48a10e8 *) Suppress stacktrace on proxy error for "ValidatorException: No trusted certificate found"
19 years ago
theli 0aafb83edc *) Bugfix for robots.txt isDisallowed Check.
19 years ago
borg-0300 81cb8feb15 back to 649 :/
19 years ago
borg-0300 5194511e8e *) attempt to find bug
19 years ago
theli 6991b9e2b9 *) Suppress stacktrace on crawler error for "Connection reset"
19 years ago
theli a47f9238fe *) Blacklist is now also used by the crawler
19 years ago
borg-0300 cc493ef8c1 Added change from Hermes
19 years ago
theli 59b8a98c7e *) Bugfix for suppressing of stacktrace in log on crawler error "MalformedURLException"
19 years ago
theli 4fd5b95b1f *) Renaming Logger function names to reflect the proper Java Logging API Loglevels
19 years ago
theli 6adf8a4bde *) Renaming Logger function names to reflect the proper Java Logging API Loglevels
19 years ago
theli f19c09b227 *) Suppress stacktrace on crawler error for "MalformedURLException"
19 years ago
theli 9b818b1ce3 *) Pausing Crawlers if there is not enough space on disk
19 years ago
theli 34790acf02 *) Bugfix for suppressing of stacktrace in log on crawler error "unknown host"
19 years ago
theli af7b8f75bd *) Making proxyAccessLogging configurable via yacy.logging file
19 years ago
theli cb1f11c96b *) Suppress stacktrace on crawler error for "Unknown Host"
20 years ago
theli e338a13de3 *) Suppress stacktrace on crawler error for "Read timed out"
20 years ago
theli 2e43e744de *) Suppress stacktrace on crawler error for "connect timed out"
20 years ago
theli 36cbe04e3e *) Bugfix for Crawler Redirection Bug
20 years ago
theli 17be77a468 *) Bugfix for "Crawler data will not be removed from htcache if content parsing failed"
20 years ago
theli 330eae7cf3 *) Normalizing CrawlerStartURL now before crawling is started
20 years ago
theli ea9a992f05 *) Before the crawler retries to download a URL it checks if the server is already doing a shutdown
20 years ago
theli ea26b84eed *) Bugfix for http://www.yacy-forum.de/viewtopic.php?t=954
20 years ago
orbiter ba0a486328 moved printStackTrace() to logging
20 years ago
theli 89c9faa89e *) More graceful logging output in crawler
20 years ago
theli b32e7c516c git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@507 6c8d7289-2bf4-0310-a012-ef5d649a1542
20 years ago
theli 86305f051d *) Trying to solve "java.net.BindException: Address already in use: JVM_Bind" Problem
20 years ago
theli 1d83d7e4d7 *) httpdFileHandler.java:
20 years ago
orbiter 91163db52e fix for more time-related problems in proxy
20 years ago
rramthun eacff63eda Typos...
20 years ago
rramthun 6f2f54a312 Translation, spelling...
20 years ago
rramthun 0f11399d16 Some corrections...
20 years ago
orbiter 2f0d7ea8d3 removed htcache stati (superfluous now)
20 years ago