yacy_search_server/source/net/yacy/document/parser/html
luccioman eb20589e29
Fixed issue #158 : completed div CSS class ignore in crawl
7 years ago
AbstractScraper.java         Fixed issue #158 : completed div CSS class ignore in crawl               7 years ago
AbstractTransformer.java     skip creation of unused Bluelist contenttransformer                      10 years ago
CharacterCoding.java         fix for bad URL decoding                                                 11 years ago
ContentScraper.java          Fixed issue #158 : completed div CSS class ignore in crawl               7 years ago
ContentScraperListener.java  Advanced Crawl from local file : better processing of large files.       8 years ago
ContentTransformer.java      skip creation of unused Bluelist contenttransformer                      10 years ago
EmbedEntry.java              - the webgraph shall store all links which appear on a web page and not  12 years ago
Evaluation.java              Cleaned up some Javadoc warnings.                                        8 years ago
IconEntry.java               Fixed license headers on files created to improve favicon management.    8 years ago
IconLinkRelations.java       Fixed license headers on files created to improve favicon management.    8 years ago
ImageEntry.java              Cleaned up some Javadoc warnings.                                        8 years ago
Scraper.java                 Fixed issue #158 : completed div CSS class ignore in crawl               7 years ago
ScraperInputStream.java      added a crawl filter based on <div> tag class names                      7 years ago
ScraperListener.java         Advanced Crawl from local file : better processing of large files.       8 years ago
Transformer.java             strong redesign of html parser: object recursion is now made using a     11 years ago
TransformerWriter.java       Fixed issue #158 : completed div CSS class ignore in crawl               7 years ago