You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
yacy_search_server/source/de/anomic/crawler
orbiter 9b25d07295
- added geo information parsing to html parser
14 years ago
..
retrieval - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion 14 years ago
Balancer.java - enhancements to DNS IP caching and crawler speed 14 years ago
CrawlProfile.java enhancements to web cache and less strict caching rules 14 years ago
CrawlQueues.java fix for 'invisible entry', see http://forum.yacy-websuche.de/viewtopic.php?p=22133#p22133 14 years ago
CrawlStacker.java - removed file upload function in crawl start and replaced it with an input field for a file path where the crawl start file is loaded. This was necessary to support the API steering for file crawl starts, for two reasons: 14 years ago
CrawlSwitchboard.java enhancements to web cache and less strict caching rules 14 years ago
ImporterException.java added final where possible 17 years ago
Latency.java more memory protection: auto-flush of caches in case of memory shortage 14 years ago
NoticedURL.java - removed file upload function in crawl start and replaced it with an input field for a file path where the crawl start file is loaded. This was necessary to support the API steering for file crawl starts, for two reasons: 14 years ago
RSSLoader.java *) set SVN properties 14 years ago
ResourceObserver.java same units for memory observer configuration (MiB) 14 years ago
ResultImages.java more memory protection: auto-flush of caches in case of memory shortage 14 years ago
ResultURLs.java - added geo information parsing to html parser 14 years ago
RobotsEntry.java replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'. 14 years ago
RobotsTxt.java * show info in log, if robots.txt is rejected due to wrong mime-type 14 years ago
SitemapImporter.java enhanced crawler: 14 years ago
ZURL.java moved getBytes() to UTF8.getBytes() to use a default String encoding 14 years ago
robotsParser.java added a sitemap entry parser and loader for sitemaps 14 years ago