yacy_search_server

Author	SHA1	Message	Date
Michael Peter Christen	7db0534d8a	Added a zim parser to the surrogate import option. You can now import zim files into YaCy by simply moving them to the DATA/SURROGATE/IN folder. They will be fetched and after parsing moved to DATA/SURROGATE/OUT. There are exceptions where the parser is not able to identify the original URL of the documents in the zim file. In that case the file is simply ignored. This commit also carries an important fix to the pdf parser and an increase of the maximum parsing speed to 60000 PPM which should make it possible to index up to 1000 files in one second.	1 year ago
Michael Peter Christen	03bf259601	fix for https://github.com/yacy/yacy_search_server/issues/363 We still need to set the load in the process because a demand for higher crawl speed may require to increase the maximum load limit. However, following the criticism in the bug, we do never reduce the load limit again.	1 year ago
Michael Peter Christen	9fcd8f1bda	added canonical filter attention: this is on by default! (it should do the right thing)	2 years ago
Michael Peter Christen	5a52b01c09	front-end integration of tag valency	2 years ago
Michael Christen	4304e07e6f	crawl profile adoption to new tag valency attribute	2 years ago
Michael Peter Christen	5acd98f4da	introduction of tag-to-indexing relation TagValency	2 years ago
Michael Christen	867f96a32b	removed warnings	2 years ago
Michael Peter Christen	adbda4c71b	moved all remaining servlet classes to new location	2 years ago