Commit Graph

14 Commits (beed1c417eed5f05b0a3b510b6c3f622f5e3ea82)

Author SHA1 Message Date
Michael Peter Christen f810915717 added crawl start from a clone with very, very large url: they are now
10 years ago
Michael Peter Christen 97930a6aad added must-not-match filter to snapshot generation.
10 years ago
Michael Peter Christen 535f1ebe3b added a new way of content browsing in search results:
10 years ago
Michael Peter Christen 1309619a71 remove remote indexing option in crawl start if not in p2p mode
10 years ago
Michael Peter Christen 606d00c8f2 cloning a crawl now accepts the class name of vocabulary scrapers
10 years ago
Michael Peter Christen b5ac29c9a5 added an HTML field scraper which reads text from HTML entities of a
10 years ago
Michael Peter Christen 8df8ffbb6d enhanced the snapshot functionality:
10 years ago
Michael Peter Christen d83de9ecf5 added another path for the convert command because on older Macs
10 years ago
Michael Peter Christen 6f0167fac1 get cloned crawl start parameter for snapshots
10 years ago
Michael Peter Christen 97f6089a41 YaCy can now create web page snapshots as pdf documents which can later
10 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
10 years ago
Michael Peter Christen f23c4142e0 added option to configure a custom user agent within allip networks
11 years ago
Michael Peter Christen a2fba6584f use submitted default userAgent if cloning a crawl
11 years ago
orbiter d29b6db270 made crawl start pages public since they do not reveal individual
11 years ago