Commit Graph

40 Commits (78d65e128ef1b7db29559996381c8d45e43e46de)

Author SHA1 Message Date
theli f17ce28b6d *) plasmaHTCache: 19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher) 19 years ago
theli ab5a9bee66 *) adding some copyright headers 19 years ago
theli 5847492537 *) next step of restructuring for new crawlers 19 years ago
theli e3f0136606 *) next step of restructuring for new crawlers 19 years ago
theli 4e2a950ac9 *) next step of restructuring for new crawlers 19 years ago
theli 09b106eb04 *) next step of restructuring for new crawlers 19 years ago
theli eb9b138986 *) next step of restructuring for new crawlers 19 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols 19 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL 19 years ago
rramthun bc94a714b2 Better explanation for the auto-dom-filter. 19 years ago
theli 89286478e7 *) removing thread pool eviction for now. Not needed at the moment 19 years ago
theli 0fcd113c42 *) last bugfix part. Seems to work now for the stackCrawler 19 years ago
theli bea2b9edee *) further redesign of threadpools to solve too many thread problem 19 years ago
theli 56e4dbeb71 *) displaying current active + current idle threads in PerformanceQueues_p.html now 19 years ago
theli 859c6a88f5 *) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem 19 years ago
theli f5abfe8d57 *) more failsafe threadpools 19 years ago
orbiter 3d8a5ae652 code cleanup 20 years ago
borg-0300 00ab4d8723 cleaned, small change, Properties 20 years ago
theli 02d9af1a70 *) Restructuring and extending of Remote Proxy Support 20 years ago
theli 4fd5b95b1f *) Renaming Logger function names to reflect the proper Java Logging API Loglevels 20 years ago
theli 6adf8a4bde *) Renaming Logger function names to reflect the proper Java Logging API Loglevels 20 years ago
theli 17be77a468 *) Bugfix for "Crawler data will not be removed from htcache if content parsing failed" 20 years ago
theli 330eae7cf3 *) Normalizing CrawlerStartURL now before crawling is started 20 years ago
orbiter ba0a486328 moved printStackTrace() to logging 20 years ago
theli 470839a16a *) Crawler/Session pool settings will now be stored properly into configfile 20 years ago
theli 55d10b864c *) further improvements in shutdown behaviour 20 years ago
orbiter 419f8fb398 fixed bugs/missing code regarding new crawl stack 20 years ago
orbiter 858cd94299 replaced indexing ram-queue by file-based stack-queue 20 years ago
theli fbbea813c5 *) changing references to logger 20 years ago
theli 83b41ef2f7 *) Adding timeouts for shutdown 20 years ago
theli 361f05978d Multiple updates regarding the yacy seedUpload facility, 20 years ago
theli 2aa5fe8f50 *) Import statements reorganized 20 years ago
theli 65fc650109 *) plasmaCrawlLoader shutdown problem fixed (hopefully) 20 years ago
theli 58b1a0ba40 *) adding an new package for extra content parsers 20 years ago
theli c9c0a1f11c *) Trying to speedup local crawling 20 years ago
(no author) f39812da91 *) Some performance improvements 20 years ago
orbiter c0807abd33 new crawl/proxy/cache design + fixes 20 years ago
orbiter e7d055b98e very experimental integration of the new generic parser and optional disabling of bluelist filtering in proxy. Does not yet work properly. To disable the disable-feature, the presence of a non-empty bluelist is necessary 20 years ago
orbiter 248077d3f0 initial load with yacy 0.36 20 years ago