Commit Graph

398 Commits (060fec15770911a993732276879a5e564c762942)

Author SHA1 Message Date
Michael Peter Christen 0d7b4bc891 better protection against OOM during search flush and fixed missing
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrently. This prevents
12 years ago
orbiter 0f7ea7ad9f - enhanced solr.add procedure for mass adds
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 56d5946a59 - added flags in IndexFederated_p.html to switch on or off the webgraph
12 years ago
Michael Peter Christen 461d46101d - Removed log4j from libraries. This can be removed because the package
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Michael Peter Christen 4111606654 removed the commitWithin attribute because that is not the way how the
12 years ago
Michael Peter Christen d70d99fab5 added more metadata fields and facets to OpensearchResponseWriter.
12 years ago
Michael Peter Christen 8651ec35fe turned author_s into the multi-valued field author_sxt
12 years ago
Michael Peter Christen 4735bd47f4 - changed solr commit call and added an optimize option. Since Solr
12 years ago
Michael Peter Christen db024a4e19 added new solr fields (unused yet; implementation will follow)
12 years ago
Michael Peter Christen 9b5bdae1b4 Reverted setting of MMapDirectoryFactory from solrconfig; see
12 years ago
orbiter eb68a30947 solr performance settings
12 years ago
Michael Peter Christen f53703df62 using MMapDirectoryFactory as solution for ClosedChannelException given
12 years ago
Michael Peter Christen 22c694f906 activated the clickdepth_i attribute for solr again because the
12 years ago
Michael Peter Christen 5a0eb1b268 clickpath should not be active by default because it needs extensive
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago
Michael Peter Christen 295884fd54 - Merge commit '168b1d130d9d67b5e8855a0b50c4ba7ad4a416f8'
12 years ago
reger 168b1d130d Adding heuristic to get search results from configured systems which support opensearch specification
12 years ago
reger 7761b60325 fix: Broken Link on Crawler_p.html - issue 218
12 years ago
reger e9e0d63897 Add config option to show HostBrowser link in search result
12 years ago
Michael Peter Christen 98819ec3d9 use solr boost configuration to select search fields. At this time it is
12 years ago
Michael Peter Christen 01200f06cc using the author field as solr-native facet. this makes it necessary to
12 years ago
Michael Peter Christen eac9650b31 added another solr field clickdepth_i which reflects the number of
12 years ago
Michael Peter Christen 1052263af3 - added a new solr field references_i which stores the number of
12 years ago
Michael Peter Christen 72f165d58b added a Boost class which stores solr query boost values. The class can
12 years ago
Michael Peter Christen ea033f8f8e added number of characters in url to default index to be able to use
12 years ago
Michael Peter Christen efd2c4622d added a new fail type attribute for the index to distinguish two
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
reger 328ce0b297 fix: remove fixed individual testing IP (85.25.151.30 = server4you.de) from default/yacy.network.freeworld.unit
12 years ago
Michael Peter Christen e2c4c3c7d3 migration to solr 4.0.0
12 years ago
sixcooler 2d972f289a rise commitWithinMs to default-value from SwitchBoard
12 years ago
Michael Peter Christen 1baf498d59 - show more lines in online log
12 years ago
sixcooler 206e7bcf94 whitelist yacyportalsearch aka search.yacy.net
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 7e3e45fd04 added Open Graph Metadata default fields, see http://ogp.me/ns#
12 years ago
Michael Peter Christen c3e5f667a7 added schema.org breadcrumb counter to parser and solr schema
12 years ago
Michael Peter Christen 42e525ca9a enhanced the host browser
12 years ago
sof 5cb244b79b Merge remote branch 'origin/master'
12 years ago
apfelmaennchen 88b062210c Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based
12 years ago
Michael Peter Christen 3d33a5bdf6 turned the synonyms_t Text field into a multi-valued String field
12 years ago
Michael Peter Christen 3b959ee002 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 3190347814 added a synonyms_t field to solr and a process to read synonym files.
12 years ago
Michael Peter Christen 411d0e839b added an underline text field to solr to record all underlined texts
12 years ago
Michael Peter Christen f45f7fc12e added new Host Browser to main menu:
12 years ago
Michael Peter Christen 80edd8ecd7 some more after-refactoring fixes
12 years ago
Michael Peter Christen 562183932b - removed ip_s from default profile since that needs a DNS lookup to
12 years ago
Michael Peter Christen 0504b01bdc Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 9413f77b65 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter a55e77a115 added twitter search heuristic
12 years ago
Michael Peter Christen 62add1d564 added the protocol and the file name extension to the solr fields since
12 years ago
Michael Peter Christen 9db032664e activate two solr fields which will be used by administration interface
12 years ago
Michael Peter Christen 10b911eed4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen be67c70a47 added Solr fields:
12 years ago
orbiter d73fff0e0e added solr field images_withalt_i
12 years ago
Michael Peter Christen ee23fc7a32 added h1..h6 counter fields
12 years ago
Michael Peter Christen b2b516cc3e added a collection attribute to crawls and searches:
12 years ago
Michael Peter Christen 528d6763fa - added new solr fields:
12 years ago
Michael Peter Christen 2ddc33646a added new field for solr:
12 years ago
Michael Peter Christen 75d5e3475d Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
cominch dc468dad01 add content control features for custom filter lists
12 years ago
Michael Peter Christen 316b5fe116 - added a solr type definition verifier
12 years ago
Michael Peter Christen 4c79ddb91e switched off some solr logging
12 years ago
Michael Peter Christen e8acd542b5 - added faceted drill-down for host and geolocation to solr queries
12 years ago
Michael Peter Christen af764c106c re-activated audio and video search because they obviously work (!)
12 years ago
orbiter 716ea0cfe2 sorted the solr schema into mandatory and optional fields; reduced
12 years ago
orbiter db6863db77 reduced solr cache sizes to check if that solves memory problems a bit
12 years ago
Michael Peter Christen 23226676c6 FOR THE BRAVE.. this is a forced migration to solr which is now ready
12 years ago
Michael Peter Christen a1b2c9a67d doctype2mime fix, influences metadata conversion between old metadata
12 years ago
Michael Peter Christen 703f427303 fixed some peer-ping connection details
12 years ago
Michael Peter Christen ea49a8aa8c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen aab0b680c3 - added xslt support for solr result formats.
12 years ago
cominch e2119f4e76 augmented browsing: replace htmlparser by jsoup, which is more stable
12 years ago
Michael Peter Christen b51df6c7e8 - added coordinate storage in solr schema
12 years ago
Michael Peter Christen f9c0e6e950 - Implemented and integrated the URIMetadataNode object which is a
12 years ago
Michael Peter Christen bca4a16603 replaced the multivalue generic string field name suffix _ss by _txt
12 years ago
orbiter 67edfd991c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All
12 years ago
Michael Peter Christen 3ce04cecf3 bad hack to prevent a bug appearing in solr
12 years ago
Michael Peter Christen 826967513b changed options in IndexFederated_p to switch on/off parts of the index
12 years ago
Michael Peter Christen 1517a3b7b9 added webm mime-type
13 years ago
Michael Peter Christen 0301aba1e9 removed unused method parameters
13 years ago
Michael Peter Christen 4de50fe808 adding more principal peers for bootstraping
13 years ago
reger 067728bccc add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages)
13 years ago
Michael Peter Christen 508a81b86c added solr field 'refresh_s' which stores the refresh url contained in
13 years ago
Michael Peter Christen 9116013c64 - allow lazy initialization of solr value (if using 'lazy', then no
13 years ago
Michael Peter Christen c03d306afa shorter autocommit time (now: 1 second) to prevent that user cannot see
13 years ago
Michael Peter Christen 3fd4a01286 added option to record urls that are forwarded to the solr index
13 years ago
Michael Peter Christen 8dd469b9dd added option to configure the autocommit delay time of solr on-the-fly
13 years ago
Michael Peter Christen b9dfca4b0a - fixed IndexFederated Servlet / a embedded Solr can now be selected
13 years ago
Michael Peter Christen 1be0025a9c - added test for EmbeddedSolrConnector
13 years ago
Michael Peter Christen dbdd697f4d moved RDFaParser.xsl configuration file to defaults
13 years ago
Michael Peter Christen 8738336408 set Xms lower than Xmx
13 years ago
Michael Peter Christen 96f6a5869f more robust OAI-PMH client (large time-out, three re-tries). OAI-PMH
13 years ago
Michael Peter Christen 6d17686258 made triplestore persistent by default
13 years ago
cominch 3c255c025b Show tags in search results (if activated in ConfigPortal_p.html)
13 years ago
Michael Peter Christen a5cdfb91de - fixed Cache link (below snippet)
13 years ago
Roland 'Quix0r' Haeder af5a597e47 Scroogle is not comming back, remove dead code
13 years ago