Commit Graph

644 Commits (049c3b3f2ee2f7ec9467751cf42328f7b0738eb7)

Author SHA1 Message Date
Michael Peter Christen b79471ee67 grr 12 years ago
Michael Peter Christen a79f288ac1 automatically running optimize on solr if user/search is idle for some 12 years ago
orbiter a9c8046c87 do a light optimization at the end of a crawl postprocessing 12 years ago
orbiter a548354c71 replaced type of solr schema object sku of text_en_splitting_tight by 12 years ago
orbiter 2f1ec8d4a2 npe fix 12 years ago
Michael Peter Christen bcc623a843 refactoring of load_delay: this is a matter of client identification 12 years ago
orbiter 0d0b3a30f5 activate api actions after postprocessing of crawls 12 years ago
orbiter 2be456e7fb added a postprocessing field into api/status_p.xml to show if the 12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog: 12 years ago
Michael Peter Christen a2c8116a8f accept (but ignore) a '+' sign in front of search words 12 years ago
sixcooler d5d8936f9d For indexes that are changing rapidly in NRT situations, fcs (stands for 12 years ago
Michael Peter Christen 57ffdfad4c added a crawl option to obey html-meta-robots-noindex. This is on by 12 years ago
Michael Peter Christen 5a5d411ec0 new robots_i attribute fields 12 years ago
Michael Peter Christen f1c5338210 prepartion for greedy crawl profiles and refactoring 12 years ago
Michael Peter Christen e6f361f474 adding the canonical tag to crawl queues 12 years ago
Michael Peter Christen 203921006a redesign of citation index storage 12 years ago
Michael Peter Christen 32aa1d4569 removed unused option for queries 12 years ago
sixcooler e5abccdfe4 added optimize-option 12 years ago
Michael Peter Christen 8caaf6203a fixed false multiple-generation of remote facet search which 12 years ago
Michael Peter Christen 823ae4d6a7 added url_protocol_s to error documents 12 years ago
Michael Peter Christen 9a6fcdf597 npe fix 12 years ago
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name 12 years ago
Michael Peter Christen f9d859f5dc now writing image alt texts and (camelcase-)parsed urls into a text 12 years ago
orbiter 8792e6c6e9 stub for better image indexing 12 years ago
Michael Peter Christen bdf306e0a7 increased time-out for loading of seed-lists 12 years ago
Michael Peter Christen 570511f3c8 removed fields references_internal_id_sxt and 12 years ago
Michael Peter Christen 1762911f57 added synchronizations and timeouts in solr api; missing 12 years ago
Michael Peter Christen ffc570f95f removed forced soft commit since this may be the cause for a performance 12 years ago
Michael Peter Christen 6115bef335 added a 'greedy learning' mechanismn which will cause that a 'fresh' 12 years ago
Michael Peter Christen 8e965ffd16 fix for host compare in case that the host is null. This happens when 12 years ago
Michael Peter Christen f7a4377812 usage of the new normalized link polularity CRn as default ranking 12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures. 12 years ago
reger d367b1f4d9 add null pointer check to stopword fix 12 years ago
reger 7480e87386 - fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247 12 years ago
Michael Peter Christen 9fc0c4df98 fix for bad exists 'enhancement'; see bug: 12 years ago
reger 8a7fcb391d enable use of solrcore.properties for property substitution of solrconfig.xml 12 years ago
Michael Peter Christen f7e887bf49 added missing class 12 years ago
Michael Peter Christen 5f92c68f1f removed block rank ranking and all YBR files in /ranking 12 years ago
Michael Peter Christen 164603b946 cleanup 12 years ago
Michael Peter Christen 409d6edf53 Store node/solr search threads to be able to send them an interrupt 12 years ago
Michael Peter Christen 2a8b99ea82 remove text_t in search result after snippet has been computed to save 12 years ago
Michael Peter Christen a1644ca0fd new workflow processor in Segment to enqueue indexing documents to solr 12 years ago
Michael Peter Christen 0c1a018bbd removed 'later' tactic because it used too much RAM, reduced number of 12 years ago
Michael Peter Christen 5344a1c5f7 getting the trash out 12 years ago
Michael Peter Christen 709e9b8ce7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 12 years ago
Michael Peter Christen 281959a2d7 added option to re-boot the embedded solr during run-time. Added also 12 years ago
orbiter da621e827e prevent NPE in case RWI is disabled 12 years ago
Michael Peter Christen c2b1075dcf activating pollImmediately in case that DHT receive is off. This will 12 years ago
Michael Peter Christen 2b563debbf javadoc of new multiple-exist test 12 years ago
Michael Peter Christen 8f2d3ce2f9 reduced locking situation in crawler: shifted synchronized location and 12 years ago
Michael Peter Christen b68fbe7d21 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 12 years ago
Michael Peter Christen 06d3063dc9 - no downcase when using collection modifier 12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single 12 years ago
reger 7f63d3747d more generic field selection for reindex option of documents with disabled fields 12 years ago
Michael Peter Christen 44e363f37f refactoring of WorkflowProcessor, added process counter, update of 12 years ago
Michael Peter Christen 4058369288 fixed query expressions for collection selection (added quotes) 12 years ago
reger 79401cb938 added reindex option for documents with disabled or obsolete fields to Solr Schema Editor page (IndexSchema_p.html) 12 years ago
orbiter cf36c1614f prevent that concurrent deletion process causes wrong double-check in 12 years ago
Michael Peter Christen b24d1d18e4 removed synchronization and concurrency in Fulltext class, concurrent 12 years ago
Michael Peter Christen b9b446bca6 - added ssl configuration sign (a lock) to network statistic/table 12 years ago
reger 4fc6837690 - fix monitor url of crawl job in PerformanceQueues_p.html 12 years ago
Michael Peter Christen ad050ec88d - upgraded httpclient, httpcore and httpmime 12 years ago
orbiter a1c989002b fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4652 12 years ago
Michael Peter Christen e26bdd4a52 fixes to deletion methods (removed unnecessary concurrency and added 12 years ago
Michael Peter Christen f7f3e28c5e prevent that the size of the index is computed too many times. 12 years ago
Michael Peter Christen cca19d94d4 re-declared some fields to be of type string rather than text which 12 years ago
Michael Peter Christen 3841854c97 abstraction of catchall term 12 years ago
Michael Peter Christen ea85674be2 added the date to error documents 12 years ago
orbiter 7de5b9cfa0 fix for http://bugs.yacy.net/view.php?id=233 12 years ago
Michael Peter Christen bb4bf3d8fd infinity timeout bug protection patch 12 years ago
Michael Peter Christen d1be4127e7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 12 years ago
Michael Peter Christen f36a7da5f6 - re-introduced existById in solr connector. 12 years ago
reger 46fa800bc7 added httpstatus_i to automatically switched on fields (used in all search queries) 12 years ago
Michael Peter Christen 3502b4c697 refactoring (renaming) of yacy-solr api 12 years ago
Michael Peter Christen 3a0fcfbeda Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 12 years ago
Michael Peter Christen 25499eead5 - added a new field for the regular expression in crawl start 12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more 12 years ago
Michael Peter Christen c091000165 added collection attribute also to the rss feed reader 12 years ago
orbiter f7571386a3 added a 'collection' property attribute in yacysearch.html which can be 12 years ago
Michael Peter Christen d937c55204 extended limitation of dom export size from 100000 to 100000000 12 years ago
Michael Peter Christen 50421171c3 added new schema fields: 12 years ago
Michael Peter Christen 566d6c980c checking of document signature for a double-document check now refers 12 years ago
Michael Peter Christen d05dc07cff setting of new default values for ranking 12 years ago
Michael Peter Christen 97775fbebc fixed ranking for add-function queries: this did not work. The option 12 years ago
Michael Peter Christen 7ab5093321 added new solr title_exact_signature_l and 12 years ago
Michael Peter Christen f24ac518e6 redesign of exists()-query (can now be called with query) and the 12 years ago
Michael Peter Christen 27d6222880 added new field host_extent_i which, after a crawl and postprocessing, 12 years ago
reger 518b20147c skip postprocessing during document.store if no citation index connected (prevent null pointer exception) 12 years ago
Michael Peter Christen ada3f27de7 added three new field for a better ranking: references_internal_i, 12 years ago
Michael Peter Christen 082e3274d6 - setting the same default ranking in the solr interface as for YaCy 12 years ago
Michael Peter Christen a20941c067 resume paused crawls on startup; user expects that restarts 'heal' 12 years ago
Michael Peter Christen edc0b33f6d - showing references count and clickdepth in host browser 12 years ago
reger 566a3b0294 fix: Index Administration > Reverse Word Index (IndexControlRWIs_p) corrected use of word search to word-hash search 12 years ago
Michael Peter Christen cf0acd2cb4 upgrade to solr 4.2.1 12 years ago
orbiter e4d26d1cb4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 12 years ago
orbiter 940c6849ee enhanced did-you-mean (a bit): can now remember previously searched 12 years ago
reger d57b221921 add: reset Solr schema filed selection to default button in IndexSchema_p 12 years ago
Michael Peter Christen 9406a2e438 fixed NPE during index abstract computation 12 years ago
Michael Peter Christen 2d36a7eaf5 - do not create a new query for all remote peers 12 years ago
Michael Peter Christen 4af0839be2 use appropriate ranking for each search situation: 12 years ago