20 Commits (2f57327f2040d7b2a0ed105a54a7d840b82f0231)

Author | SHA1 | Message | Date
Michael Peter Christen | 57ffdfad4c | added a crawl option to obey html-meta-robots-noindex. This is on by | 12 years ago
Michael Peter Christen | c091000165 | added collection attribute also to the rss feed reader | 12 years ago
orbiter | aa3c26c62e | added recrawl/reload to CrawlStartSite for a timeout of 3 days | 12 years ago
orbiter | b55ea2197f | - redesign of crawl start servlet | 12 years ago
orbiter | 1c66de4bd4 | - removed scheduled crawling options in crawl start because it is | 12 years ago
orbiter | 354ef8000d | - added 'deleteold' option to crawler which causes that documents are | 12 years ago
Michael Peter Christen | ac9540dfb6 | removed options for stopwords which are not used | 12 years ago
orbiter | ae246c30c3 | fixed interpretation of directDocByURL attribute during crawl start | 12 years ago
Michael Peter Christen | b2b516cc3e | added a collection attribute to crawls and searches: | 12 years ago
Michael Peter Christen | 19efbf1b0f | - apply directDocByURL to NOLOAD Queue | 13 years ago
Michael Peter Christen | 8bfc987374 | enhanced hint how to enter file:// urls | 13 years ago
Michael Peter Christen | 9aa73a13a8 | stop words are on by default in site crawl. This causes normally nothing | 13 years ago
orbiter | ebd840ebf6 | - enhanced description on search front page | 13 years ago
orbiter | e4a82ddd8b | produce a bookmark entry from every crawl start. these bookmarks are always private. | 13 years ago
orbiter | ff32469272 | added a link to /api/util/getpageinfo_p.xml as API to crawl start info and to ViewFile.html | 13 years ago
orbiter | 11bebe356b | fixed crawl start: with SVN 7225 the name of the crawl start url was not given in input field and therefore all crawl starts had contained the empty string as crawl start url | 14 years ago
mikeworks | 70576e88d2 | de.lng: Added some more untranslated strings I found and uncommented old ones that were removed | 14 years ago
orbiter | 2c549ae341 | fixed a number of small bugs: | 14 years ago
orbiter | f6eebb6f99 | replaced auto-dom filter with easy-to-understand Site Link-List crawler option | 14 years ago
orbiter | daeea96aea | renamed servlet CrawlStart_p.html to CrawlStartSite_p.html to circumvent problem with translation which still showed old expert crawl start page | 14 years ago