Commit Graph

11 Commits (740cbfd875d1abd414c8ad7af4a40b84a27f93df)

Author SHA1 Message Date
Michael Peter Christen 9fcd8f1bda added canonical filter
2 years ago
luccioman 6b45cd5799 New optional crawl filter on the URL a doc must match to crawl its links
6 years ago
luccioman fcf6b16db4 Added new crawler attribute for finer control over Media Type detection
6 years ago
luccioman cced94298a Added a new crawler document filter type using Solr syntax
7 years ago
luccioman fb3032c530 Added a crawl filtering possibility on documents Media Type (MIME)
7 years ago
Michael Peter Christen d8beafba3a fix for values in CrawlProfileEditor table and xml; now the full profile
10 years ago
Michael Peter Christen ec95dfa2e6 fixed crawl profile xml result which did not show the correct crawl
10 years ago
Michael Peter Christen 329262231f unresolved pattern fix
10 years ago
orbiter f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option
14 years ago
orbiter dba7ef5144 extended crawling constraints:
16 years ago
f1ori b9602e891a * added CrawlProfileEditor_p.xml for monitoring in yacybar
17 years ago