Commit Graph

50 Commits (33bc255e85b726fc46ba6c371267228f1c0b2b93)

Author SHA1 Message Date
Michael Peter Christen b6de1f42dc Full redesign of solr connection architecture. This was done to support
12 years ago
Michael Peter Christen 4111606654 removed the commitWithin attribute because that is not the way how the
12 years ago
Michael Peter Christen 4323621a76 update to Solr 4.1.0
12 years ago
Michael Peter Christen 0b6566a389 optimizations when starting large crawl requests with many start urls in
12 years ago
Michael Peter Christen 0fe7b6fd3b migrated the index export methods from the old metadata to solr. Now
12 years ago
Michael Peter Christen 4735bd47f4 - changed solr commit call and added an optimize option. Since Solr
12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administration -> Federated Solr Index)
12 years ago
reger 3b6e08b49f prevent checking of urldb if empty
12 years ago
Michael Peter Christen 38d3feae65 added separate delete commands for the local+remote solr index, the old
12 years ago
Michael Peter Christen fb0fa9a102 - fixed 'delete from subpath' during crawl start which deleted nothing;
12 years ago
Michael Peter Christen b7004043ea - added a field cache for solr queries which call only for a single
12 years ago
Michael Peter Christen 4eab3aae60 removed overhead by preventing generation of full search results when
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
Michael Peter Christen f5ca5cea44 - added field options to all solr queries. This can be used to restrict
12 years ago
Michael Peter Christen 5fd3b93661 added deletion of hosts during crawl start if deleteold option was given
12 years ago
Michael Peter Christen 52df6ee369 more logging
12 years ago
Michael Peter Christen 2371ef031c added solr faceted search support to YaCy search results
12 years ago
Michael Peter Christen d481abd087 added the visualization of error-urls to host browser
12 years ago
orbiter 354ef8000d - added 'deleteold' option to crawler which causes that documents are
12 years ago
Michael Peter Christen 75dd706e1b update to HostBrowser:
12 years ago
Michael Peter Christen e2c4c3c7d3 migration to solr 4.0.0
12 years ago
Michael Peter Christen 9330ad4838 - fixed the delete option in host browser
12 years ago
Michael Peter Christen f8f05ecba7 - added a delete button in host browser to delete a complete subpath
12 years ago
Michael Peter Christen 799d71bc67 enhanced solr caching:
12 years ago
Michael Peter Christen 8e1248ffe3 force a commit in advance of a search for the administrator to get most
12 years ago
Michael Peter Christen 3b48c78190 added an option to force a commit to solr.
12 years ago
Michael Peter Christen ce0e5b1e17 - more refactoring / private methods
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 1b02408936 use less cache
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the
12 years ago
Michael Peter Christen 3d33a5bdf6 turned the synonyms_t Text field into a multi-valued String field
13 years ago
Michael Peter Christen ca313e404f - if a "/date" modifier is used, the solr remote query applies an
13 years ago
Michael Peter Christen a4214694df We assert that no other metadata storage than solr is used now.
13 years ago
Michael Peter Christen 562183932b - removed ip_s from default profile since that needs a DNS lookup to
13 years ago
Michael Peter Christen 1533bfd63b refactoring
13 years ago
Michael Peter Christen 872f83ebe0 refactoring
13 years ago
Michael Peter Christen 15ea053c3a - added xml output in IndexControlURLs to get the storage page of index
13 years ago
Michael Peter Christen 1b474139dd used the new zip writer/reader to add a solr dump process: the whole
13 years ago
Michael Peter Christen 8219a445f3 refactoring
13 years ago
orbiter 563d584420 removed more dependencies in cora from kelondro
13 years ago
sixcooler e78fe3f477 also do a clearcache on the solr-connector-caches
13 years ago
orbiter 479bfca571 refactoring
13 years ago
Michael Peter Christen 4716546ef5 - reduced memory usage in index transmission using a transformation of
13 years ago
orbiter d7ea45f698 - get nice text_t values from metadata conversions that are stored into
13 years ago
orbiter 780f8974e7 added remaining iteration methods for solr in fulltext class
13 years ago
orbiter ee01c12e56 fixes for putDocument and putMetadata
13 years ago
orbiter cc47a0876e reverted bf55f69176
13 years ago
Michael Peter Christen bf55f69176 removed write methods to old metadata file type; all metadata now goes
13 years ago
Michael Peter Christen 40c0856489 refactoring
13 years ago