Commit Graph

558 Commits (f7e887bf4960dd7713047faee8a1f9273ae81be8)

Author SHA1 Message Date
Michael Peter Christen 27d6222880 added new field host_extent_i which, after a crawl and postprocessing,
12 years ago
reger 518b20147c skip postprocessing during document.store if no citation index connected (prevent null pointer exception)
12 years ago
Michael Peter Christen ada3f27de7 added three new field for a better ranking: references_internal_i,
12 years ago
Michael Peter Christen 082e3274d6 - setting the same default ranking in the solr interface as for YaCy
12 years ago
Michael Peter Christen a20941c067 resume paused crawls on startup; user expects that restarts 'heal'
12 years ago
Michael Peter Christen edc0b33f6d - showing references count and clickdepth in host browser
12 years ago
reger 566a3b0294 fix: Index Administration > Reverse Word Index (IndexControlRWIs_p) corrected use of word search to word-hash search
12 years ago
Michael Peter Christen cf0acd2cb4 upgrade to solr 4.2.1
12 years ago
orbiter e4d26d1cb4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 940c6849ee enhanced did-you-mean (a bit): can now remember previously searched
12 years ago
reger d57b221921 add: reset Solr schema filed selection to default button in IndexSchema_p
12 years ago
Michael Peter Christen 9406a2e438 fixed NPE during index abstract computation
12 years ago
Michael Peter Christen 2d36a7eaf5 - do not create a new query for all remote peers
12 years ago
Michael Peter Christen 4af0839be2 use appropriate ranking for each search situation:
12 years ago
Michael Peter Christen b8ed66a55d added all clickdepth computations for source and target paths in
12 years ago
Michael Peter Christen 6300730d7f refactoring of clickdepth computation as preparation for clickdepth
12 years ago
Michael Peter Christen 2080fc7406 removed unused tag fields
12 years ago
orbiter 6b13dd0d3d added clickdepth field writing for webgraph core (unfinished)
12 years ago
orbiter 47114910d5 fix for possible memory leaks
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
orbiter ab74d559fb Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 4490133909 removed target_tag_s (superfluous)
12 years ago
orbiter cd197bb555 fix for NPE if surrogates do not exist
12 years ago
Michael Peter Christen 25300913fa fixes to search debugging after testing with the different search
12 years ago
Michael Peter Christen 81380ae5c8 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen c2fde018b5 concurrent snippet fetching from solr results which do not have snippets
12 years ago
orbiter b1140e3d82 added debug switches for detailed search testing
12 years ago
orbiter cdbfddf091 added filter queries for better image, audio and video results
12 years ago
Michael Peter Christen 587ef83eab added missing cleanup statements for short memory cases during search
12 years ago
Michael Peter Christen 2b6c79d347 in method exists() also use the new caching-stacks for
12 years ago
Michael Peter Christen ae734b3f8d enhanced the search result processing
12 years ago
Michael Peter Christen 0d7b4bc891 better protection against OOM during search flush and fixed missing
12 years ago
Michael Peter Christen 221ed7d764 - enhanced concurrency during search without IO blocking
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrently. This prevents
12 years ago
orbiter f13c0b2abd fix for search
12 years ago
orbiter 0f7ea7ad9f - enhanced solr.add procedure for mass adds
12 years ago
Michael Peter Christen f327ffedb4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 9c09fd7d0b better/less requests to local solr; the request is made in chunks which
12 years ago
Michael Peter Christen 840fa22135 disabled clickdepth computation during craling since that is repeated
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
Michael Peter Christen d957739441 removed size request
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 35fa718b77 testing to use solr for portalsearch caused some bugfixing but no full
12 years ago
Michael Peter Christen 008288719c fix for schema export to consider also automatically generated
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen c16de49f64 fix for webgraph delete query
12 years ago
Michael Peter Christen 56d5946a59 - added flags in IndexFederated_p.html to switch on or off the webgraph
12 years ago
Michael Peter Christen 14cceb6b17 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
reger f291d60c5f on remote Solr search take only locally enabled schema fields from remote solrdocument for the inputdocument added to local index
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Michael Peter Christen 33bc255e85 prevent that crawl starts with very large url lists cause a time-out in
12 years ago
Michael Peter Christen b6de1f42dc Full redesign of solr connection architecture. This was done to support
12 years ago
Michael Peter Christen 4111606654 removed the commitWithin attribute because that is not the way how the
12 years ago
Michael Peter Christen de58043205 Added image license generation for solr image search results when
12 years ago
Michael Peter Christen d3508fa8ff fixed json search, quotes, auto-facets, urls etc. for
12 years ago
Michael Peter Christen c34af7fe94 extended JSON Response Writer and Opensearch Response Writer for the
12 years ago
Michael Peter Christen 6f6ddaf7e7 A robinson peer does not need to write RWI data if such peers are only
12 years ago
Michael Peter Christen 7806680ab8 fixed a problem with re-feeding of already indexed documents whith
12 years ago
Michael Peter Christen eb80405a16 added a disable function in RemoteCrawl_p servlet which prevents setting
12 years ago
Michael Peter Christen e8f7b85b98 fixes to internal RWI usage if RWI is switched off (NPE etc)
12 years ago
Michael Peter Christen 3834829b37 bugfixes and more logging for solr connector
12 years ago
Michael Peter Christen 4323621a76 update to Solr 4.1.0
12 years ago
Michael Peter Christen 592adf7ccb fix for domain navigation
12 years ago
Michael Peter Christen 7dfcc92b71 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 0b6566a389 optimizations when starting large crawl requests with many start urls in
12 years ago
orbiter a2160054d7 ability to create vocabularies also without any objectspace: this
12 years ago
orbiter ecc10a752c fixes to index enumeration for vocabulary production
12 years ago
sixcooler 3a13906121 clear some more caches if running out of memory
12 years ago
Michael Peter Christen 8651ec35fe turned author_s into the multi-valued field author_sxt
12 years ago
Michael Peter Christen 0fe7b6fd3b migrated the index export methods from the old metadata to solr. Now
12 years ago
Michael Peter Christen 1768c82010 removed field selection because that created documents with that field
12 years ago
Michael Peter Christen 4735bd47f4 - changed solr commit call and added an optimize option. Since Solr
12 years ago
Michael Peter Christen cba038f97b one more NPE fix
12 years ago
Michael Peter Christen c3d50d91f8 relaxing site operator for www prefix:
12 years ago
Michael Peter Christen db49e91724 fixed a NPE which may appear for freeworld peers without any rwi index
12 years ago
Michael Peter Christen 4faa07c214 added a timeout for topic computation (solr is here much slower than the
12 years ago
Michael Peter Christen d2d5be032d added a 'inlink' search option according to the suggestion in the YaCy
12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administraton -> Federated Solr Index)
12 years ago
reger 3b6e08b49f prevent checking of urldb if empty
12 years ago
reger f143804382 fix configuration for search page navigators
12 years ago
Michael Peter Christen becd52a984 added also a re-calculation of reference counts during the
12 years ago
Michael Peter Christen 38d3feae65 added separate delete commands for the local+remote solr index, the old
12 years ago
Michael Peter Christen 6f0baaa309 added the clickdepth post-processing: some links may have 'shortcuts' to
12 years ago
Michael Peter Christen 0f5b6f38c1 enhanced root-url detection
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago
reger f301336adf fix: no results with configuration citation reference index switched off
12 years ago
orbiter fe50702eb0 added a filterscannerfail attribute to QueryParams which causes that a
12 years ago
Michael Peter Christen eb90d38cd7 added missing extension 'mkv' for navigation
12 years ago
Michael Peter Christen 4a9182ae16 use the search configuration to default the cacheStrategy to the value
12 years ago
Michael Peter Christen 98819ec3d9 use solr boost configuration to select search fields. At this time it is
12 years ago
Michael Peter Christen e1f89efd0d - made image search in interactive search using the ViewImage servlet -
12 years ago
reger 4987caf1c9 - apply fix for localhost handling (from yacy2solr) also to metadata2solr
12 years ago
reger 0148f1bb8c fix: exception if default work files don't exist
12 years ago
Michael Peter Christen 9e4033f229 fix for event starter: delete start time when event is removed
12 years ago
Michael Peter Christen 99271ffd13 copy work tables from defaults/data/work if exist there and not in
12 years ago
Michael Peter Christen 24c9bb35f7 extended the Scheduler: introduced scheduled events
12 years ago
Michael Peter Christen 433143ba40 removed protocol, tld, ext from the urlmask and created specific
12 years ago
Michael Peter Christen 84f82541e8 search process enhancements
12 years ago
Michael Peter Christen 02020b590b - removed all extension types from extension navigation which are not
12 years ago