Commit Graph

3183 Commits (c3aadcf8999e8c5792160364dc6b69c8ca14119b)

Author SHA1 Message Date
reger de56d934b2 apply query parameter getQueryFields() to GSA servlet
10 years ago
reger 2d2299f484 fix mimetype of rss items in rss parser
10 years ago
Michael Peter Christen b432049d59 enhanced date parsing time
10 years ago
reger 9b0de2de64 introduce getQueryFields to return default query fields (queryparamter QF)
10 years ago
reger a0f04db9ea add extracted description/subject to pptParser
10 years ago
reger 8ec1db76ee url unescape add check for inconsistent utf8 multibyte parsing
10 years ago
reger 4b97ddb9ec stop sending crawl receipts if receiver got offline
10 years ago
reger 7e35518787 add extracted description/subject to docParser
10 years ago
reger f0a5188e11 replace depreciated HTTPClient setStaleConnectionCheckEnabled with setValidateAfterInactivity()
10 years ago
reger 7b569d2dbe replace depriciated HTTPClient ALLOW_ALL_HOSTNAME_VERIFIER with NoopHostnameVerifier()
10 years ago
reger fba34e12ef fix formatting issue if snippet contains html code
10 years ago
reger e48720a58c fix NPE in snippet computation
10 years ago
reger eda0aeaf26 allow/recognize host in file: protocol crawl target
10 years ago
reger df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
10 years ago
Michael Peter Christen 8ff76f8682 the cleanup process experienced a 100% CPU load situation and the loop
10 years ago
Michael Peter Christen 1f5b5c0111 npe fix for latest scraper feature
10 years ago
Michael Peter Christen ee97302a23 hack to make date detection faster (while it becomes a bit incomplete
10 years ago
Michael Peter Christen 6578ff3ddb enhanced suggest function
10 years ago
reger fe6f5a395d fix Umlaut handling in blekko heuristic search term
10 years ago
reger 23924348e2 url with semicolon or comma handling in proxy request
10 years ago
reger 9025fe3518 upd error message for proxy
10 years ago
Michael Peter Christen 97ba5ddbb7 configuration option for maxload limit for remote search
10 years ago
reger c454ef69c6 add shortMemory check to heuristic search
10 years ago
reger 9e1ec5fec4 refactor: just some more useages of constant for term ":[* TO *]"
10 years ago
reger 8c491f51a5 remove hardcoded initialization of language nav if not used
10 years ago
Michael Peter Christen b5ac29c9a5 added a html field scraper which reads text from html entities of a
10 years ago
Michael Peter Christen 1cb290170e refactoring of autotagging code (combined same code pieces)
10 years ago
Michael Peter Christen c3b55455fc enhanced initialization speed of vocabularies by using better
10 years ago
Michael Peter Christen 68c605d637 replace with CommonPattern.SPACE for split
10 years ago
Michael Peter Christen de3e373913 using precompiled CommonPattern.TAB for split
10 years ago
Michael Peter Christen 1f5047b15f using precompiled pattern CommonPattern.SEMICOLON for splits
10 years ago
Michael Peter Christen a8a2b7a803 persistency for vocabulary facet switch
10 years ago
Michael Peter Christen efbc9a3561 introducting a new getConfig method which parses comma-separated llists
10 years ago
Michael Peter Christen 69eacdf4eb applying precompiled CommonPattern.COMMA.split to all places where
10 years ago
Michael Peter Christen ac19690d30 refactoring with CommonPattern.COMMA
10 years ago
Michael Peter Christen cf9b22ca5c do not reindex based on vocabulary fields (there are meanwhile many of
10 years ago
Michael Peter Christen 5a060c9f26 refactoring of reindexSolr (just replaced constant string)
10 years ago
Michael Peter Christen b5a55c8b3d fix for wkhtmltopdf (custom header does not work)
10 years ago
Michael Peter Christen 3d717b749a fix for urlmaskfilter
10 years ago
Michael Peter Christen bee5ee7cce removed some warnings
10 years ago
Michael Peter Christen 783cf6fbc7 the LinkedBlockingQueue is much faster than the ArrayBlockingQueue
10 years ago
Michael Peter Christen 6390454652 fix for vocabulary on/off setting
10 years ago
Michael Peter Christen a3c5995bde Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 5ca0762179 fix: eom on parsing ico file by genericImageParser
10 years ago
Michael Peter Christen 4cd2d68e03 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen dc5700148f update to latest code changes from json.org
10 years ago
reger 42b0672be3 Let auto-disabled crawls recover if low resource condition vanished.
10 years ago
Michael Peter Christen 287c528f46 replaced old JavaApplicationStub for Mac Application framework with new
10 years ago
Michael Peter Christen 4c9d2a7c64 reverted 'do not show all options' strategy. This is actually confusing
10 years ago
Michael Peter Christen 7db2888336 fixed font size and print page generation in pdf snapshots
10 years ago