78 Commits (9b941fb77341bfbb95085bb915787ca43e0ffde1)

Author SHA1 Message Date
theli 3d5347bc8e *) changing loglevel for some messages
19 years ago
theli 0fcd113c42 *) last bugfix part. Seems to work now for the stackCrawler
19 years ago
theli bea2b9edee *) further redesign of threadpools to solve too many thread problem
19 years ago
theli f5abfe8d57 *) more failsafe threadpools
19 years ago
theli ecdc1f7547 *) Bugfix for crawling URLs with query parameters
19 years ago
rramthun c4487deba9 Minor changes collected over some time.
19 years ago
theli d7b6dcbe2e *) Bugfix for MalformedURL problem if Location header is empty.
19 years ago
orbiter 37f88b4017 code cleanup
19 years ago
theli 44fa94ac52 *) Modifications for dbImport functionality
19 years ago
orbiter 3d8a5ae652 code cleanup
19 years ago
theli d4ac3e25b1 *) Bugfix for file system link bug during detection of invalid URLs
19 years ago
orbiter adf75bc9fa better logging for invalid file path detection
19 years ago
theli c650b112ea *) Bugfix for relative URL Bug in Crawler
19 years ago
orbiter d2731418bf added creation of global ranking files and changed url normal form usage
19 years ago
borg-0300 00ab4d8723 cleaned, small change, Properties
19 years ago
theli b8ceb1ffde *) Adding better https support for crawler
19 years ago
hydrox 56b9f34411 *)removed unused imports
19 years ago
theli 525c8dcbd4 *) Adding Traffic Statistic for Crawler
19 years ago
theli 02d9af1a70 *) Restructuring and extending of Remote Proxy Support
19 years ago
theli a2fa75e688 *) Asynchronous queuing of crawl job URLs (stackCrawl)
19 years ago
orbiter 0c3a20d44f more + changed log for better understanding of outOfMemory bug and others
19 years ago
theli 28c5687ff9 *) Bugfix for "download of non supported file content" via crawler
19 years ago
theli 35c6c5ead7 *) Bugfix for "Blacklist und Crawlen" [Blacklist and Crawling] Bug.
19 years ago
theli d1de71e9f6 *) Suppress stacktrace on proxy error for "No route to host Exception"
19 years ago
theli 56160cbd01 *) Bugfix for "YaCy verzählt sich ..." [YaCy miscounts ...] Bug.
19 years ago
theli 51b48a10e8 *) Suppress stacktrace on proxy error for "ValidatorException: No trusted certificate found"
19 years ago
theli 0aafb83edc *) Bugfix for robots.txt isDisallowed Check.
19 years ago
borg-0300 81cb8feb15 back to 649 :/
19 years ago
borg-0300 5194511e8e *) attempt to find bug
19 years ago
theli 6991b9e2b9 *) Suppress stacktrace on crawler error for "Connection reset"
19 years ago
theli a47f9238fe *) Blacklist is now also used by the crawler
19 years ago
borg-0300 cc493ef8c1 Added change from Hermes
19 years ago
theli 59b8a98c7e *) Bugfix for suppressing of stacktrace in log on crawler error "MalformedURLException"
19 years ago
theli 4fd5b95b1f *) Renaming Logger function names to reflect the proper Java Logging API Loglevels
19 years ago
theli 6adf8a4bde *) Renaming Logger function names to reflect the proper Java Logging API Loglevels
19 years ago
theli f19c09b227 *) Suppress stacktrace on crawler error for "MalformedURLException"
19 years ago
theli 9b818b1ce3 *) Pausing Crawlers if there is not enough space on disk
19 years ago
theli 34790acf02 *) Bugfix for suppressing of stacktrace in log on crawler error "unknown host"
19 years ago
theli af7b8f75bd *) Making proxyAccessLogging configurable via yacy.logging file
19 years ago
theli cb1f11c96b *) Suppress stacktrace on crawler error for "Unknown Host"
20 years ago
theli e338a13de3 *) Suppress stacktrace on crawler error for "Read timed out"
20 years ago
theli 2e43e744de *) Suppress stacktrace on crawler error for "connect timed out"
20 years ago
theli 36cbe04e3e *) Bugfix for Crawler Redirection Bug
20 years ago
theli 17be77a468 *) Bugfix for "Crawler data will not be removed from htcache if content parsing failed"
20 years ago
theli 330eae7cf3 *) Normalizing CrawlerStartURL now before crawling is started
20 years ago
theli ea9a992f05 *) Before the crawler retries to download a URL it checks if the server is already doing a shutdown
20 years ago
theli ea26b84eed *) Bugfix for http://www.yacy-forum.de/viewtopic.php?t=954
20 years ago
orbiter ba0a486328 moved printStackTrace() to logging
20 years ago
theli 89c9faa89e *) More graceful logging output in crawler
20 years ago
theli b32e7c516c git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@507 6c8d7289-2bf4-0310-a012-ef5d649a1542
20 years ago