yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	34a9fc1a07	bugfixes to zim reader:	1 year ago
Michael Peter Christen	7db0534d8a	Added a zim parser to the surrogate import option. You can now import zim files into YaCy by simply moving them to the DATA/SURROGATE/IN folder. They will be fetched and after parsing moved to DATA/SURROGATE/OUT. There are exceptions where the parser is not able to identify the original URL of the documents in the zim file. In that case the file is simply ignored. This commit also carries an important fix to the pdf parser and an increase of the maximum parsing speed to 60000 PPM which should make it possible to index up to 1000 files in one second.	1 year ago
Michael Peter Christen	70e29937ef	added a check in zim importer which tests if import URLs actually exist	1 year ago
Michael Peter Christen	496f768c44	modified cache strategy for zim clusters	1 year ago
Michael Peter Christen	fdc6311dc7	added parsing rules for wikibooks and wikinews in zim reader	1 year ago
Michael Peter Christen	2ea54b3503	fixed blob iterator in zim cluster definition	1 year ago
Michael Peter Christen	54fa5d3c2e	added a cluster cache but it requires more testing	1 year ago
Michael Peter Christen	53b01dbf2e	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	1 year ago
Michael Peter Christen	41856e9f34	added an optimized zim file entry iterator	1 year ago
Michael Peter Christen	1c0df28bfb	added a zim importer that can be used for surrogate imports. Can not be used yet because it requires some security additions to verify that the given urls actually work.	1 year ago
Michael Peter Christen	b9912ff50d	repaired dockerfiles for aarch64 and armv7	1 year ago
Michael Peter Christen	33b6878ded	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	1 year ago
Michael Christen	68554cea07	Merge pull request #605 from okybaca/readme-docker-link added a link to docker build guide	1 year ago
Michael Christen	06bfd5802f	Merge pull request #603 from okybaca/dark-green-css fine tuned the dark-green color scheme	1 year ago
Michael Christen	43d5cd101e	Merge pull request #607 from okybaca/wikilinks replaced all the links to legacy legacy wiki to legacy wiki	1 year ago
okybaca	4add1f6bc7	replaced all the links to legacy legacy wiki to legacy wiki	1 year ago
Michael Peter Christen	e2c86a8eba	added a ZIM cluster pointer cache	1 year ago
Michael Peter Christen	4a54b24703	fix for "negative seek offset" error during extension of heap files. This would have always happend when a heap file exceeds 2GB. should fix https://github.com/yacy/yacy_search_server/issues/372	1 year ago
okybaca	69db75ce45	added a link to docker build guide	1 year ago
Michael Peter Christen	9c8fb97985	introduced url list and title list caching and enhanced input stream performance in ZIM reader	1 year ago
Michael Peter Christen	b0ae660790	added Zstandard compressed data decompression for ZIM files type 5 also: more generalization and performance enhancements	1 year ago
Michael Peter Christen	ad8ee3a0b6	fixed typo in class name	1 year ago
Michael Peter Christen	c4082c4ff2	refactoring of ZIM reader, simplification, removed unnecessary code	1 year ago
Michael Peter Christen	c2b6b6e7b9	Fixed a large number of problems in the ZIM reader. This library was not prepared for large data because it was missing long data types for pointers. I had to modify the code-base in a fundamental way: - Proof-Reading, - unclustering, - refactoring, - naming adoption to https://wiki.openzim.org/wiki/ZIM_file_format, - change of Exception handling, - extension to more attributes as defined in spec (bugfix for mime type loading) - bugfix to long parsing (prevented reading of large files) The code is furthermore very inefficient and requires more attention. However the format is very useful for YaCy as there are numerous data sources for ZIM-Files.	1 year ago
Michael Peter Christen	5ba5fb5d23	upgraded pdfbox to 3.0.0	1 year ago
Michael Peter Christen	c10944bd4a	updated bcmail-jdk15on 1.75 to bcmail-jdk18on 1.67	1 year ago
Michael Peter Christen	1fefae9baf	integrated the source code of a openzim file format reader. These are the raw format reader files with no integration in YaCy yet, which will maybe follow as a next step. The zim file format is documented in https://openzim.org and the reader code was taken from the archived, non-maintained repository at https://github.com/openzim/zimreader-java	1 year ago
okybaca	ec2d14e973	fine tuning the dark-green color scheme	1 year ago
Michael Peter Christen	4308aa5415	removed concept of empty passwords as "no passwords used", because we now start YaCy with a default password (yacy). This has impact of all function that check the current state of password-protection that included the empty password situation, including the warnings to set a password in case that none is set (which cannot be the case any more).	1 year ago
Michael Peter Christen	2c60ff14bb	fixed default pw comparison	1 year ago
Michael Peter Christen	4da320bebf	added a warning message in ConfigBasic in case that the default password was not changed.	1 year ago
Michael Peter Christen	7830268be1	fix `756c817b5a` must be applied to all code where a transaction token is generated.	1 year ago
Michael Peter Christen	dc6f218520	set the default password for the admin account to "yacy"	1 year ago
Michael Peter Christen	756c817b5a	fix for https://github.com/yacy/yacy_search_server/issues/544	1 year ago
Michael Christen	bab1cfc7ea	added required build tools installation	1 year ago
Michael Peter Christen	03bf259601	fix for https://github.com/yacy/yacy_search_server/issues/363 We still need to set the load in the process because a demand for higher crawl speed may require to increase the maximum load limit. However, following the criticism in the bug, we do never reduce the load limit again.	1 year ago
Michael Christen	5bc09af426	Merge pull request #600 from okybaca/scheduler-sort UI: modified link to Process Scheduler in left menu	1 year ago
okybaca	4c1eb34e85	modified link to Process Scheduler in left menu	1 year ago
Michael Peter Christen	aeb4c7a660	removed warnings during normal build	1 year ago
Michael Peter Christen	095a444aa7	removed wiki links and added more shields badges	1 year ago
Michael Peter Christen	ca2a21008a	added screenshots	1 year ago
Michael Christen	961d3cc8af	Merge pull request #597 from joestr/issue/574-fix-mac-script Fix macOS script	1 year ago
Michael Christen	a035b21f63	Merge pull request #598 from joestr/improvement/remove-travis-yml Remove .travis.yml	1 year ago
Joel Strasser	b29c0ef133	remove .travis.yml since YaCy is not build on Travis CI anymore	1 year ago
Joel Strasser	09783ae89e	apply patches from @HenryLoenwind	1 year ago
Michael Peter Christen	94db89a757	small remaining changes in readme	1 year ago
Michael Peter Christen	0c4478cd71	migrated jetty to 9.4.52.v20230823	1 year ago
Michael Peter Christen	938724caa8	new development on-boarding process in eclipse with changes for ivy	1 year ago
mchristen	8fc51f66c6	fixed a test class which prevented compilation on latest jvm	1 year ago
Michael Christen	bda118af5d	Merge pull request #594 from joestr/master Match more YaCy versions	1 year ago

1 2 3 4 5 ...

14456 Commits (34a9fc1a076e89a67b351bc19bd1c2a67e730c60) All Branches Search

14456 Commits (34a9fc1a076e89a67b351bc19bd1c2a67e730c60)

All Branches