yacy_search_server/source/net/yacy/document/parser/html
reger 2048b7e057
support scraping start-/enddate from html tag with property "datetime"
9 years ago
AbstractScraper.java     | strong redesign of html parser: object recursion is now made using a | 11 years ago
AbstractTransformer.java | skip creation of unused Bluelist contenttransformer | 10 years ago
CharacterCoding.java     | fix for bad URL decoding | 11 years ago
ContentScraper.java      | support scraping start-/enddate from html tag with property "datetime" | 9 years ago
ContentTransformer.java  | skip creation of unused Bluelist contenttransformer | 10 years ago
EmbedEntry.java          | - the webgraph shall store all links which appear on a web page and not | 11 years ago
Evaluation.java          | added a new way of content browsing in search results: | 10 years ago
ImageEntry.java          | fix for image alt attachment to AnchorURLs in html parser. | 10 years ago
Scraper.java             | strong redesign of html parser: object recursion is now made using a | 11 years ago
ScraperInputStream.java  | Refactoring : use StandardCharsets constants instead of hard-coded | 9 years ago
ScraperListener.java     | refactoring of yacy documents and parsers: they depend now only on the kelondro classes | 15 years ago
Transformer.java         | strong redesign of html parser: object recursion is now made using a | 11 years ago
TransformerWriter.java   | add links with image extension not automatically to image links. | 9 years ago