Commit Graph

280 Commits (6aabc4e5c88193d0850cbbcc7c62e0eb7175ecb6)

Author SHA1 Message Date
Michael Peter Christen 74d0256e93 enhanced postprocessing: fixed bugs, enable proper postprocessing also
11 years ago
Michael Peter Christen d328cc4a83 fix for didyoumean, added also more asian alphabets
11 years ago
Michael Peter Christen 21aa6a0321 migration to Solr 4.5.0
11 years ago
Michael Peter Christen 101a6e6e14 Patch the citation index for links with canonical tags.
11 years ago
Michael Peter Christen 4f83d5f18c added the new field harvestkey_s to the collection index and the
12 years ago
Michael Peter Christen 96ed0c980e - added hosthash to all documents (also fail documents which is needed
12 years ago
orbiter 828603e4f1 fix for 100%CPU problem in error cache cleaning process
12 years ago
orbiter f3be1930cb CPU problem when pusing to the error cache; wrong class,
12 years ago
Michael Peter Christen e40671ddb7 better and consistent deletions for error urls
12 years ago
Michael Peter Christen 2602be8d1e - removed ZURL data structure; removed also the ZURL data file
12 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
12 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
12 years ago
Michael Peter Christen 85456f46b2 added two new fields, exact_signature_copycount_i and
12 years ago
Michael Peter Christen 1a3e42eca4 index migration to lucene 4.4
12 years ago
Michael Peter Christen a88a62f7aa added a feature to set a collection for a crawl result based on a
12 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
12 years ago
Michael Peter Christen 47b1c81d08 - refactoring
12 years ago
reger 02fe8b43ba Field Re-Indexing: display list of fields in reindex queue
12 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
12 years ago
reger f2d99053ed Field Re-Indexing: prevent endless error loop in ReindexSolrBusyThread on Solr exception (by skipping query causing the exception)
12 years ago
Michael Peter Christen c3b2301b2f fix for http://bugs.yacy.net/view.php?id=268
12 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
12 years ago
orbiter 6fb2811e68 fixes for problems with remote solr and non-activated webgraph index
12 years ago
orbiter c124037f19 removed forced non-soft commits to prevent index fragmentation
12 years ago
Roland Haeder be0ff6018f Removed trailing spaces + some more final
12 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
12 years ago
Michael Peter Christen 1fd006cc56 fixes using the embedded connector
12 years ago
orbiter 5533fc8e01 fix for bug 260
12 years ago
Michael Peter Christen bcc623a843 refactoring of load_delay: this is a matter of client identification
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
Michael Peter Christen 203921006a redesign of citation index storage
12 years ago
sixcooler e5abccdfe4 added optimize-option
12 years ago
Michael Peter Christen 570511f3c8 removed fields references_internal_id_sxt and
12 years ago
Michael Peter Christen ffc570f95f removed forced soft commit since this may be the cause for a performance
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen 9fc0c4df98 fix for bad exists 'enhancement'; see bug:
12 years ago
reger 8a7fcb391d enable use of solrcore.properties for property substitution of solrconfig.xml
12 years ago
Michael Peter Christen 164603b946 cleanup
12 years ago
Michael Peter Christen a1644ca0fd new workflow processor in Segment to enqueue indexing documents to solr
12 years ago
Michael Peter Christen 0c1a018bbd removed 'later' tactic because it used too much RAM, reduced number of
12 years ago
Michael Peter Christen 709e9b8ce7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 281959a2d7 added option to re-boot the embedded solr during run-time. Added also
12 years ago
orbiter da621e827e prevent NPE in case RWI is disabled
12 years ago
Michael Peter Christen 2b563debbf javadoc of new multiple-exist test
12 years ago
Michael Peter Christen 8f2d3ce2f9 reduced locking situation in crawler: shifted synchronized location and
12 years ago
Michael Peter Christen b68fbe7d21 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
reger 7f63d3747d more generic field selection for reindex option of documents with disabled fields
12 years ago
reger 79401cb938 added reindex option for documents with disabled or obsolete fields to Solr Schema Editor page (IndexSchema_p.html)
12 years ago