Commit Graph

1061 Commits (203df5a750f9ae5299a594276d4ddef59d088f70)

Author SHA1 Message Date
Michael Peter Christen 203df5a750 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
reger ee77f24e52 use some more declared HeaderFramework constants
10 years ago
Michael Peter Christen 11a848da5a Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen b94bd7f20a a collection of search query enhancements:
10 years ago
Michael Peter Christen 1ccbf739b1 added bayes filter from Philipp Nolte, originally taken from
10 years ago
Michael Peter Christen de8cfbe1d7 added export option to export the fulltext of the search index text only
10 years ago
Michael Peter Christen 03ea723889 added log lines for query performance profiling
10 years ago
Michael Peter Christen 0e87a99ab8 more fixes for special windows paths
10 years ago
Michael Peter Christen e5b6424eed patch for bad windows file paths
10 years ago
Michael Peter Christen 0aa6fcf259 remove old vocabularies and synonyms before adding new
10 years ago
reger 821262a179 add CommonPattern for multiple spaces
10 years ago
Michael Peter Christen 694b22f165 migration to Solr 5.2: huge benefits - this is a lot faster!
10 years ago
Michael Peter Christen 34de1e8cbc gzip compression will perform more efficient and with better compression
10 years ago
Michael Peter Christen b43811d38c added surrogate import process for exported solr dumps.
10 years ago
Michael Peter Christen c7576d6028 added a full solr export to the IndexControlURLs_p.html servlet. The
10 years ago
reger cd31633369 improve MultiprotocolURL.getFileExtension()
10 years ago
Michael Peter Christen f5f88272e4 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen 5c67c4d460 fix for latest commit, see
10 years ago
reger c37dda8849 fix NPE on MultiProtocolURL on url with parameter value and '='
10 years ago
Michael Peter Christen f810915717 added crawl start from a clone with very, very large url: they are now
10 years ago
Michael Peter Christen 51de86c992 disabled debug thread dumps
10 years ago
Michael Peter Christen 0710648c31 enable api calls with very long urls
10 years ago
reger 1481a8ab56 add opensearch rss results to dht collection (due to text = snippet)
10 years ago
Michael Peter Christen fbf85a1561 added temporary debug output in http client
10 years ago
Michael Peter Christen ff29b0e503 added option to re-index exported xml snapshot dumps to
10 years ago
Michael Peter Christen fed26f33a8 enhanced timezone managament for indexed data:
10 years ago
Michael Peter Christen b060ba900d added parsing of contentprop attribute in html tags for
10 years ago
Michael Peter Christen ae02c92fd0 logging fix
10 years ago
Michael Peter Christen 5651713134 better debugging of fq
10 years ago
reger b1ec0644e5 fix NPE in location search on missing/empty PubDate in underlaying rss data
10 years ago
reger 839b962c20 correct percent encoding for '%' char
10 years ago
reger 2ef8ffdb60 apply UTF-8 encoding
10 years ago
reger 7120ea42f1 fix for path with char code > 255
10 years ago
reger 1d81bd0687 fix url encoding for path see http://mantis.tokeek.de/view.php?id=559
10 years ago
reger 62087fb8b2 fix MultiProtocolURL mailto protocol detection
10 years ago
reger f94e34058c fix url (path) %-decoding http://mantis.tokeek.de/view.php?id=519
10 years ago
Michael Peter Christen 710a0efa1b generalized time period computations
10 years ago
Michael Peter Christen 535f1ebe3b added a new way of content browsing in search results:
10 years ago
reger 9b0de2de64 introduce getQueryFields to return default query fields (queryparamter QF)
10 years ago
reger 8ec1db76ee url unescape add check for inconsistent utf8 multibyte parsing
10 years ago
reger f0a5188e11 replace depreciated HTTPClient setStaleConnectionCheckEnabled with setValidateAfterInactivity()
10 years ago
reger 7b569d2dbe replace depriciated HTTPClient ALLOW_ALL_HOSTNAME_VERIFIER with NoopHostnameVerifier()
10 years ago
reger eda0aeaf26 allow/recognize host in file: protocol crawl target
10 years ago
Michael Peter Christen 8ff76f8682 the cleanup process experienced a 100% CPU load situation and the loop
10 years ago
Michael Peter Christen 6578ff3ddb enhanced suggest function
10 years ago
reger fe6f5a395d fix Umlaut handling in blekko heuristic search term
10 years ago
reger c454ef69c6 add shortMemory check to heuristic search
10 years ago
reger 9e1ec5fec4 refactor: just some more useages of constant for term ":[* TO *]"
10 years ago
Michael Peter Christen b5ac29c9a5 added a html field scraper which reads text from html entities of a
10 years ago
Michael Peter Christen 1cb290170e refactoring of autotagging code (combined same code pieces)
10 years ago