Commit Graph

680 Commits (b2af745dd6c8493d4d866b57e912fe90f0ee4895)

Author SHA1 Message Date
Ryszard Goń 1728cd30c6 Create autocrawl profiles
9 years ago
reger e8256bb3b1 remove blekko from opensearch config (not available)
9 years ago
reger a5faf73afa remove obsolete yacy.init entries interaction.*
9 years ago
sixcooler dce1cb65c4 Merge remote-tracking branch 'choose_remote_name/master'
9 years ago
reger e84d94f8ca fix mime table for ms office / open office documents
9 years ago
reger 15e46b2bad exclude in/outboundlinksnofollowcount_i from default schema fields
9 years ago
luc 8c4ab9c76b Added an option to eventually limit size of remote solr documents put to
9 years ago
luc 55a4d15775 Added a note on deprecated default search field and operator.
9 years ago
reger b2c8bc0ae6 remove md5_s from default index fields
9 years ago
sixcooler f5a9948860 do not store subfield *_coordinate
9 years ago
sixcooler fca353e5eb set startuptype of most solr handlers to lazy
9 years ago
reger c720b4c249 remove override of dynamicField coordinate_p in solr schema
9 years ago
reger f0b5bc93a3 remove obsolete yacy.init entry "secureHttps"
9 years ago
reger 5e45f1a460 enable Solr schema dynamicField _p (type=location) for YaCy coordinate_p field
9 years ago
sixcooler 87e4abe393 fight the fieldcache by usind DocValues: in Solr-5.x the fieldcache has
9 years ago
reger 250f6457f0 remove exired domain titan.deep-one.in from bootstrap.seedlist
9 years ago
Michael Peter Christen df3314ac1a added a new facet type based on a probabilistic classifier using
9 years ago
Michael Peter Christen e1cd9c0dba added another default network / commented out
9 years ago
reger 00d2062813 Rem depreciated AdminHandlers in solrconfig.xml
10 years ago
Michael Peter Christen 694b22f165 migration to Solr 5.2: huge benefits - this is a lot faster!
10 years ago
Michael Peter Christen 9c12555be5 added link to Snapshots in search results if the snapshot exists and
10 years ago
reger 6bc8a9b11e make Quality of Service Servlet available to prioritize requests from local host
10 years ago
Michael Peter Christen b060ba900d added parsing of contentprop attribute in html tags for
10 years ago
Michael Peter Christen 4cb4f67f38 added parsing of dd, dt and article html fields. The parsed result is
10 years ago
Michael Peter Christen 36e9cdb376 testing switching off cold searchers; maybe this brings performance
10 years ago
Michael Peter Christen 535f1ebe3b added a new way of content browsing in search results:
10 years ago
reger ba276d3e64 add description_txt to default query fields,
10 years ago
reger fe6f5a395d fix Umlaut handling in blekko heuristic search term
10 years ago
Michael Peter Christen 97ba5ddbb7 configuration option for maxload limit for remote search
10 years ago
Michael Peter Christen ac19690d30 refactoring with CommonPattern.COMMA
10 years ago
Michael Peter Christen cf9b22ca5c do not reindex based on vocabulary fields (there are meanwhile many of
10 years ago
reger 24f68a4eb7 refactor opensearch heuristic
10 years ago
reger 4eb89d7f15 revert clickservlet
10 years ago
Michael Peter Christen 61ae9d2d11 do not use the clickservlet by default. From my personal view, this
10 years ago
sixcooler 5594c43d2e bump to Solr-/Lucene-4.10.3
10 years ago
reger d44d8996d0 Added a “don't store remote search results” option
10 years ago
reger e177d69387 remove obsolete config footer option (ConfigPortal user.login)
10 years ago
reger 6a04563578 Init Jetty using setDefaultDescriptor (web.xml) to defaults/web.xml
10 years ago
Michael Peter Christen eb78388a98 changed prefer strategy for http unique in such a way that http is
10 years ago
Michael Peter Christen d14114697c the miss cache does not seem to work, it sometimes contains urlhashes
10 years ago
reger 446f374ba9 fix yacy.init comment
10 years ago
Michael Peter Christen 66b5a56976 Added and integrated new date detection class which can identify date
10 years ago
Michael Peter Christen 114f0afc1e enable sku as anchor in html response writer
10 years ago
Michael Peter Christen 60f27bdf49 added the property timeoutrequests to configuration to disable
10 years ago
Michael Peter Christen 1d45d9405a security bugfix
10 years ago
Michael Peter Christen c94c24638f disabled postprocessing by default. If you read this: please disable
10 years ago
Michael Peter Christen c0f9f6ac66 added option to change the navbar-default, i.e. usable for dark skins
10 years ago
Michael Peter Christen 84763126e0 added option to make the YaCy proxy act as the cache is never stale. If
10 years ago
reger ee277b9b3e allow for local yacy.stopwords and yacy.badwords list (in DATA/SETTINGS/)
10 years ago
Michael Peter Christen c67c5c0709 added new solr schema fields which record the occurences of vocabulary
10 years ago
Michael Peter Christen 68e8039fd1 added high-precision scheduler for API processes. This allows also to
10 years ago
sixcooler 725b206fb4 update to solr-/lucene-4.10.2
10 years ago
Michael Peter Christen 26279b0993 added debug code for statistics about document attributes related to
10 years ago
Michael Peter Christen 2e5214eb21 added field postprocessing.partialUpdate to settings which can be used
10 years ago
Michael Peter Christen b1cfbc4a04 added new solr field url_paths_count_i which can be used to enhance the
10 years ago
Michael Peter Christen 8c1a89cb34 added another decoration flag to switch off network graphics in crawler
10 years ago
Michael Peter Christen bc221a0f9c less load and more ram prerequisite for crawl steps
10 years ago
Michael Peter Christen 2a052f446a Added an experimental audio feedback system.
10 years ago
Michael Peter Christen f03dd0df24 updated seedlist
10 years ago
Michael Peter Christen 2b1cf26828 removed solr warning during startup
10 years ago
Michael Peter Christen 57ce7eeff3 fixed localhost authorization and replaced the adminRealm with an info
10 years ago
orbiter f318d7c285 enhanced date-ordered ranking
10 years ago
orbiter b3ebd38079 removed the HTDOCS repository concept because the concept to host files
10 years ago
reger ec5b1d9e33 let NETWORK_WHITELIST take precedence over NETWORK_BLACKLIST
10 years ago
orbiter 2371d6b8db target linktexts must be string to enable search facets on these fields
10 years ago
orbiter 161a11070c yacystats is gone :(
10 years ago
reger 7328c2883b fix type in .init description
10 years ago
reger 94819f0797 set .ini default boost fields to same as assigned by button "reset to default"
10 years ago
reger a2cb366b25 Combine /heuristic search modifier with opensearch configured targets
10 years ago
Michael Peter Christen 2de159719b added an option to set 'obey nofollow' for links with rel="nofollow"
10 years ago
Michael Peter Christen 1092e798a5 fixed double content postprocessing
11 years ago
Michael Peter Christen 09dcdb9b19 update to solr 4.9.0
11 years ago
orbiter 0bbb5040b8 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter 9d5d86cd03 Added filter query options to the ranking servlet /RankingSolr_p.html.
11 years ago
Michael Peter Christen d2151857f1 Added collection navigation:
11 years ago
Michael Peter Christen 922979aae1 added option to prefer http over https in unique-protocol ranking
11 years ago
reger d8d318233e fix logging settings
11 years ago
Michael Peter Christen 698f053658 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen f23c4142e0 added option to configure a custom user agent within allip networks
11 years ago
orbiter d7d38f9135 made number of open files in crawler configurable and increased default
11 years ago
Michael Peter Christen ff5b3ac84d added new fields http_unique_b and www_unique_b which can be used for
11 years ago
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
Michael Peter Christen d4157184ec migration to Solr 4.8.1
11 years ago
orbiter 2944822bb0 updated bootstrap seed list
11 years ago
reger e31493e139 "Use remote proxy for yacy" has no function, remove option and related config item
11 years ago
reger f02203fb2f fix xml validation error on defaults/web.xml
11 years ago
Michael Peter Christen 229f2248b8 added configuration option for maxmimum load and minimum ram for
11 years ago
Michael Peter Christen 3d5e354471 small changes to search headline colour
11 years ago
Michael Peter Christen 71efc76170 new default skin pdbootstrap which keeps the design shapes but slightly
11 years ago
reger d812f80784 add exit proxy link to UrlProxy
11 years ago
reger 2dabe2009d - remove unused manual http KeepAlive config
11 years ago
Michael Peter Christen 7a2f3e2353 increased resource.disk.used.max.steadystate and
11 years ago
Michael Peter Christen 9a5ab4e2c1 removed clickdepth_i field and related postprocessing. This information
11 years ago
Michael Peter Christen da86f150ab - added a new Crawler Balancer: HostBalancer and HostQueues:
11 years ago
reger 46016fa153 autoupdate fails to download latest release (1.71) due to default release blacklist
11 years ago
Michael Peter Christen ebd44a7080 replaced solr 4.6.1 with solr 4.7.1 and added index migration to
11 years ago
Michael Peter Christen ee92d748b5 test using compound file format, see UseCompoundFile in
11 years ago
Michael Peter Christen 0a95fd27f3 update of seed list
11 years ago
Michael Peter Christen cca851a417 introduced new solr field crawldepth_i which records the crawl depth of
11 years ago
Michael Peter Christen 39b641d6cd added tutorial mode - some menu items will only appear if you 'qualify'
11 years ago
reger b12200cafe alternative UrlProxyServlet (for /proxy.html) using different url rewrite rules
11 years ago
Michael Peter Christen e515dd460d added linkscount_i and linksnofollowcount_i to the default solr schema
11 years ago
Michael Peter Christen a7bc130e27 removed performance settings
11 years ago
Michael Peter Christen a28fefba2d activated language facet by default
11 years ago
Michael Peter Christen 617dd9c97b - added new input field in index.html
11 years ago
orbiter 7d24bcb98d added flag to require that all web pages, even such without a "_p"
11 years ago
reger 1fe26550a0 remove AugmentedBrowsing_p.html augmented browsing switch
11 years ago
reger e972b87a8a remove AugmentedBrowsingFilters_p.html as none of the settings are used currently
11 years ago
reger a373fb717d remove more unused from legacy server.http
11 years ago
orbiter f77afa9d1d add index on _val fields, this affects especially title length
11 years ago
Michael Peter Christen de8f7994ab as crawling has a low-cpu demand, we want it to run even if the CPU load
11 years ago
Michael Peter Christen 9eb668e951 enhanced the resource observer
11 years ago
Michael Peter Christen ca8b100f96 run the cleanup process even when load is high, do postprocessing even
11 years ago
Michael Peter Christen 6e59ca4ebf removed jena library and all code that depended on jena. When jena was
11 years ago
Michael Peter Christen 931541d198 re-inserted default value re-set button to performance queues and
11 years ago
Michael Peter Christen 4b7f2fcf38 updated bootstrap seedlist list
11 years ago
reger a71718a459 add config value for ssl/https port (default=8443)
11 years ago
reger cf553e5045 added hint to web.xml and for completeness the full set of hardcoded mappings
11 years ago
Michael Peter Christen a8fdaace31 changed the web.xml as well to migrate the solr servlet
11 years ago
Michael Peter Christen be5e808236 - removed hardcoded load-test which is now handled in BusyQueues
11 years ago
sixcooler 40a4030b55 configurable max-load values for YaCy-Threads:
11 years ago
Michael Peter Christen 77531850b5 reverted crawling strategy from latest commit.
11 years ago
reger 97e84439fb adjusted ConfigHeuristic and changed QueryGoal.getOriginalQueryString to .getQueryString
11 years ago
reger d24a0ec32c upd heuristic default list (heuristicopensearch.conf)
11 years ago
reger 0c754dd794 implemented DIGEST authentication, which is for remote login more secure
11 years ago
Michael Peter Christen f8ce7040ab remote search peer selection schema change:
11 years ago
reger f09dbbef96 make SecurityHandler webappcontext ready
11 years ago
reger 37f2a82a5d making root context (htroot) a WebAppContext
11 years ago
reger f6099b730d disabled unused fields in default Solr collection schema
11 years ago
orbiter 2ead4e44d9 introduced a new storage path ARCHIVE inside of DATA which will be used
11 years ago
reger fbdd89e198 Merge origin/master
11 years ago
reger 65a2f3d5e7 tweak Jetty credentials to work with YaCy UserDB
11 years ago
Michael Peter Christen ee17bd0b69 added option to attach remote solr servers in read-only mode
11 years ago
Michael Peter Christen 84167adb49 removed unused anomichttpd code after migration to jetty
11 years ago
Michael Peter Christen 7603e879dc Merge branch 'master' into HEAD
11 years ago
Michael Peter Christen 2f16770681 migrated to solr 4.6.0
11 years ago
reger 92d9c56f9f Merge origin/master into jetty
11 years ago
Michael Peter Christen e3c2f09de9 - reduce computation in case that specific postprocessing fields are not
11 years ago
reger effea4bca0 Merge origin/master into jetty
11 years ago
Michael Peter Christen a16534cb0a tried to fix timeout and connection-lost problems when using an outside
11 years ago
reger f111f30ace Merge origin/master into jetty
11 years ago
Michael Peter Christen 5ec5be5769 fixed logging for remote solr configuration
11 years ago
Michael Peter Christen 24a052ecb9 removed debug code for existsByIds
11 years ago
Michael Peter Christen 087df05e24 added option to Config_Network_p.html to enable remote search while
11 years ago
Michael Peter Christen 899e7e92b0 added debug code
11 years ago
Michael Peter Christen a5c1249ee2 reverted autowarming setting in solrconfig
11 years ago
reger 1437c45383 merge rc1/master
11 years ago
Michael Peter Christen 81bb50118e found and fixed a huge memory leak in solr caching (inside Solr). The
11 years ago
Michael Peter Christen 7f768b42d3 we do not need the load-image flag any more since this is now controlled
11 years ago
reger f017066197 Merge origin/master into jetty
11 years ago