Commit Graph

21 Commits (3189f9cd39f1dfadb8b9bc8ddcb2cb4cd5233b39)

Author SHA1 Message Date
orbiter 89aeb318d3 enhanced the wikimedia dump import process
16 years ago
orbiter 5fb77116c6 added a submenu to index administration to import a wikimedia dump (i.e. a dump from wikipedia) into the YaCy index: see
16 years ago
orbiter c097531e3d added a catch Exception to all thread to check if any of them silently dies without any other notification
16 years ago
orbiter 9c6ac43f66 fixes for wiki parser
16 years ago
orbiter d079d6dfdb small changes in surrogate reader, wiki code and portal test
16 years ago
orbiter 2e3186189b fix for mediawikiIndex surrogate producer + added concurrency
16 years ago
orbiter 1b9e532c87 some concurrency for wikipedia dump reader
16 years ago
orbiter 16baa7ad24 To translate a mediawiki dump into the YaCy surrogate format do the following:
16 years ago
orbiter 0b2c98edc9 some more work on the wikipedia-dump exporter (not finished yet)
16 years ago
orbiter fbcbcc5bdb export of yacy document objects as dublin core record in xml
16 years ago
orbiter 8a24350036 - fix for join method with new generalized RWI data structure (caused by latest commit)
16 years ago
orbiter d4d87d90c4 - extended experimental wikipedia dump parser
16 years ago
orbiter c08f9b36a4 refactoring of wiki parser.
16 years ago
orbiter 9da69d6b68 - better selection of files to be merged
16 years ago
orbiter d39a5b42ca more care about open file handles. Now files also close on windows and can be deleted afterwards.
16 years ago
orbiter 62505bb3cb more bugfixes as recommendet by findbugs
16 years ago
orbiter 83ce65707a (almost) completed partition of classes in kelondro
16 years ago
orbiter bf93767ec6 refactoring of kelondro database classes
16 years ago
orbiter fc27bf8c4c refactoring of kelondro classes:
16 years ago
orbiter b0f2003792 fast database initialization and fast start.up of yacy:
16 years ago
orbiter 0ca4bc7b79 - added reader and visualization for mediawiki-export files:
16 years ago