Commit Graph

548 Commits (35c24608cc217913aee9dd2919f16b2b8610b079)

Author SHA1 Message Date
Michael Peter Christen d4157184ec migration to Solr 4.8.1
11 years ago
orbiter 2944822bb0 updated bootstrap seed list
11 years ago
reger e31493e139 "Use remote proxy for yacy" has no function, remove option and related config item
11 years ago
reger f02203fb2f fix xml validation error on defaults/web.xml
11 years ago
Michael Peter Christen 229f2248b8 added configuration option for maxmimum load and minimum ram for
11 years ago
Michael Peter Christen 3d5e354471 small changes to search headline colour
11 years ago
Michael Peter Christen 71efc76170 new default skin pdbootstrap which keeps the design shapes but slightly
11 years ago
reger d812f80784 add exit proxy link to UrlProxy
11 years ago
reger 2dabe2009d - remove unused manual http KeepAlive config
11 years ago
Michael Peter Christen 7a2f3e2353 increased resource.disk.used.max.steadystate and
11 years ago
Michael Peter Christen 9a5ab4e2c1 removed clickdepth_i field and related postprocessing. This information
11 years ago
Michael Peter Christen da86f150ab - added a new Crawler Balancer: HostBalancer and HostQueues:
11 years ago
reger 46016fa153 autoupdate fails to download latest release (1.71) due to default release blacklist
11 years ago
Michael Peter Christen ebd44a7080 replaced solr 4.6.1 with solr 4.7.1 and added index migration to
11 years ago
Michael Peter Christen ee92d748b5 test using compound file format, see UseCompoundFile in
11 years ago
Michael Peter Christen 0a95fd27f3 update of seed list
11 years ago
Michael Peter Christen cca851a417 introduced new solr field crawldepth_i which records the crawl depth of
11 years ago
Michael Peter Christen 39b641d6cd added tutorial mode - some menu items will only appear if you 'qualify'
11 years ago
reger b12200cafe alternative UrlProxyServlet (for /proxy.html) using different url rewrite rules
11 years ago
Michael Peter Christen e515dd460d added linkscount_i and linksnofollowcount_i to the default solr schema
11 years ago
Michael Peter Christen a7bc130e27 removed performance settings
11 years ago
Michael Peter Christen a28fefba2d activated language facet by default
11 years ago
Michael Peter Christen 617dd9c97b - added new input field in index.html
11 years ago
orbiter 7d24bcb98d added flag to require that all web pages, even such without a "_p"
11 years ago
reger 1fe26550a0 remove AugmentedBrowsing_p.html augmented browsing switch
11 years ago
reger e972b87a8a remove AugmentedBrowsingFilters_p.html as none of the settings are used currently
11 years ago
reger a373fb717d remove more unused from legacy server.http
11 years ago
orbiter f77afa9d1d add index on _val fields, this affects especially title length
11 years ago
Michael Peter Christen de8f7994ab as crawling has a low-cpu demand, we want it to run even if the CPU load
11 years ago
Michael Peter Christen 9eb668e951 enhanced the resource observer
11 years ago
Michael Peter Christen ca8b100f96 run the cleanup process even when load is high, do postprocessing even
11 years ago
Michael Peter Christen 6e59ca4ebf removed jena library and all code that depended on jena. When jena was
11 years ago
Michael Peter Christen 931541d198 re-inserted default value re-set button to performance queues and
11 years ago
Michael Peter Christen 4b7f2fcf38 updated bootstrap seedlist list
11 years ago
reger a71718a459 add config value for ssl/https port (default=8443)
11 years ago
reger cf553e5045 added hint to web.xml and for completeness the full set of hardcoded mappings
11 years ago
Michael Peter Christen a8fdaace31 changed the web.xml as well to migrate the solr servlet
11 years ago
Michael Peter Christen be5e808236 - removed hardcoded load-test which is now handled in BusyQueues
11 years ago
sixcooler 40a4030b55 configurable max-load values for YaCy-Threads:
11 years ago
Michael Peter Christen 77531850b5 reverted crawling strategy from latest commit.
11 years ago
reger 97e84439fb adjusted ConfigHeuristic and changed QueryGoal.getOriginalQueryString to .getQueryString
11 years ago
reger d24a0ec32c upd heuristic default list (heuristicopensearch.conf)
11 years ago
reger 0c754dd794 implemented DIGEST authentication, which is for remote login more secure
11 years ago
Michael Peter Christen f8ce7040ab remote search peer selection schema change:
11 years ago
reger f09dbbef96 make SecurityHandler webappcontext ready
11 years ago
reger 37f2a82a5d making root context (htroot) a WebAppContext
11 years ago
reger f6099b730d disabled unused fields in default Solr collection schema
11 years ago
orbiter 2ead4e44d9 introduced a new storage path ARCHIVE inside of DATA which will be used
11 years ago
reger fbdd89e198 Merge origin/master
11 years ago
reger 65a2f3d5e7 tweak Jetty credentials to work with YaCy UserDB
11 years ago
Michael Peter Christen ee17bd0b69 added option to attach remote solr servers in read-only mode
11 years ago
Michael Peter Christen 84167adb49 removed unused anomichttpd code after migration to jetty
11 years ago
Michael Peter Christen 7603e879dc Merge branch 'master' into HEAD
11 years ago
Michael Peter Christen 2f16770681 migrated to solr 4.6.0
11 years ago
reger 92d9c56f9f Merge origin/master into jetty
11 years ago
Michael Peter Christen e3c2f09de9 - reduce computation in case that specific postprocessing fields are not
11 years ago
reger effea4bca0 Merge origin/master into jetty
11 years ago
Michael Peter Christen a16534cb0a tried to fix timeout and connection-lost problems when using an outside
11 years ago
reger f111f30ace Merge origin/master into jetty
11 years ago
Michael Peter Christen 5ec5be5769 fixed logging for remote solr configuration
11 years ago
Michael Peter Christen 24a052ecb9 removed debug code for existsByIds
11 years ago
Michael Peter Christen 087df05e24 added option to Config_Network_p.html to enable remote search while
11 years ago
Michael Peter Christen 899e7e92b0 added debug code
11 years ago
Michael Peter Christen a5c1249ee2 reverted autowarming setting in solrconfig
11 years ago
reger 1437c45383 merge rc1/master
11 years ago
Michael Peter Christen 81bb50118e found and fixed a huge memory leak in solr caching (inside Solr). The
11 years ago
Michael Peter Christen 7f768b42d3 we do not need the load-image flag any more since this is now controlled
11 years ago
reger f017066197 Merge origin/master into jetty
11 years ago
Michael Peter Christen f1bfe64361 integrated startpage to compare_yacy
11 years ago
Michael Peter Christen 9bb7eab389 hacks to prevent storage of data longer than necessary during search and
11 years ago
orbiter 3c3cb78555 - removed a lot of garbage and bloated code from GuiHandler.
11 years ago
Michael Peter Christen 6aabc4e5c8 reduced logging line memory, 10000 lines had filled up 450MB! grrr.
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which ocurred when a document was not recognized with
11 years ago
reger f46c723398 allow to choose used http server, YaCy-Anomic or Jetty
11 years ago
Michael Peter Christen 820b896146 Replaced the inframe loading from yacy.net for donations with the
11 years ago
reger cf32a92629 - add size check to multipart form data handling of YaCyDefaultServlet (same as in HTTPDemon.parseMultipart)
11 years ago
reger a44eede8b8 merge rc1/master
11 years ago
Michael Peter Christen 90c8577840 enhanced ranking; patches to replace old ranking
11 years ago
Michael Peter Christen 1b61bd40ed - Added new solr field url_file_name_tokens_t which stores the file name
11 years ago
orbiter 5f5a97bafc added the anchor text within web pages to the searcheable entities of a
11 years ago
Michael Peter Christen 21aa6a0321 migration to Solr 4.5.0
11 years ago
reger c7c706fd9f merge with rc1/master
11 years ago
Michael Peter Christen b28d43decc added two more fields source_cr_host_norm_i,target_cr_host_norm_i in
11 years ago
Michael Peter Christen 4f83d5f18c added the new field harvestkey_s to the collection index and the
11 years ago
orbiter 8ac2e8c8c9 added location navigator which causes that the image to the map search
11 years ago
reger 5111841e5b - reduce Jetty debug logging
11 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
11 years ago
Michael Peter Christen 85456f46b2 added two new fields, exact_signature_copycount_i and
11 years ago
Michael Peter Christen a2511b5600 turned images_alt_txt back to images_alt_sxt because it is not necessary
11 years ago
Michael Peter Christen 69f85265e1 added an option to put image links to the crawl queue and handle these
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
orbiter deadeb406e image alt tag strings should be tokenized
11 years ago
Michael Peter Christen 1a3e42eca4 index migration to lucene 4.4
11 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
11 years ago
sixcooler 1bc6003057 rise autoCommit maxTime to 3 Minutes to reduce IO
11 years ago
orbiter 944ae5686c added donation plea to the about box as default (you can replace this in
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
orbiter e7fcb81cea we should not do too much greedylearning at this time as we don't have
11 years ago
orbiter bf0ad04e1b apply load limitation also to dht-in
11 years ago