Commit Graph

8012 Commits (fbbfeeb31397588b0b37b482c28a2b3da5cf2f22)

Author SHA1 Message Date
Michael Peter Christen 3288489fd2 more logging during start-up
10 years ago
Michael Peter Christen abaaaef5f1 fix for filter queries
10 years ago
Michael Peter Christen 4d00175157 <experimental> added parsing of <article> html element.
10 years ago
Michael Peter Christen 1df6492019 enhanced suggestions
10 years ago
Michael Peter Christen ae02c92fd0 logging fix
10 years ago
Michael Peter Christen 5651713134 better debugging of fq
10 years ago
Michael Peter Christen f5a032f293 split query into filter query and text query to get better ranking
10 years ago
Michael Peter Christen 2e88028c1a when selecting collections in navigation, do show the un-selected
10 years ago
Michael Peter Christen 1de9b21c65 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 5f4cd8d6f5 replace deprecated getIP with getIPs in AbstractRemoteHandler
10 years ago
Michael Peter Christen fa7edc9f7a refactoring of filter queries (several queries instead only one)
10 years ago
Michael Peter Christen 40389987ec Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen f9ba50379d added an expansion option to search facets on result page:
10 years ago
reger 1f0f77bb77 make location facet return results
10 years ago
reger b1ec0644e5 fix NPE in location search on missing/empty PubDate in underlying rss data
10 years ago
reger c1dcc8c456 fix display and limit of max server connections after startup
10 years ago
reger 839b962c20 correct percent encoding for '%' char
10 years ago
Michael Peter Christen 9bf0d7ecb9 added a new collection type 'dht' to all documents from the peer-to-peer
10 years ago
reger 796770e070 prevent overwrite of crawled or received full documents by (newer) metadata
10 years ago
Michael Peter Christen ee2490ab98 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 431311df42 fix get fresh_date_dt to allow returned value to be date in future
10 years ago
otter 74c7e8b686 Fixes hanging FlushThread (see
10 years ago
reger f63fff9008 fix snippet containing number with comma as decimal point http://mantis.tokeek.de/view.php?id=344
10 years ago
reger b241264632 fix error on *abc query input
10 years ago
reger 2ef8ffdb60 apply UTF-8 encoding
10 years ago
reger 7120ea42f1 fix for path with char code > 255
10 years ago
reger 1d81bd0687 fix url encoding for path see http://mantis.tokeek.de/view.php?id=559
10 years ago
reger 62087fb8b2 fix MultiProtocolURL mailto protocol detection
10 years ago
reger 2e8c24e02a fix link to DeReWo download file
10 years ago
reger 706f75ddc2 try to fix hang on index blob merge on shutdown
10 years ago
reger f94e34058c fix url (path) %-decoding http://mantis.tokeek.de/view.php?id=519
10 years ago
reger 7e09bff4a1 exclude default search fields from text copy to text_t
10 years ago
reger 86073a5ba3 For remote crawlReceipt add document abstract/description
10 years ago
reger 8af70950d9 harmonize snippet computation
10 years ago
Michael Peter Christen fd4e2c809a Show dates in the content of a document in the search result:
10 years ago
Michael Peter Christen 893889bc7b added special terms for on: - Date modifier: tomorrow, today; i.e.:
10 years ago
Michael Peter Christen 710a0efa1b generalized time period computations
10 years ago
Michael Peter Christen d9d3111d10 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 535f1ebe3b added a new way of content browsing in search results:
10 years ago
reger d7259419f3 postpone raw snippet html encoding upon use
10 years ago
reger de56d934b2 apply query parameter getQueryFields() to GSA servlet
10 years ago
reger 2d2299f484 fix mimetype of rss items in rss parser
10 years ago
Michael Peter Christen b432049d59 enhanced date parsing time
10 years ago
reger 9b0de2de64 introduce getQueryFields to return default query fields (query parameter QF)
10 years ago
reger a0f04db9ea add extracted description/subject to pptParser
10 years ago
reger 8ec1db76ee url unescape add check for inconsistent utf8 multibyte parsing
10 years ago
reger 4b97ddb9ec stop sending crawl receipts if receiver got offline
10 years ago
reger 7e35518787 add extracted description/subject to docParser
10 years ago
reger f0a5188e11 replace deprecated HTTPClient setStaleConnectionCheckEnabled with setValidateAfterInactivity()
10 years ago
reger 7b569d2dbe replace deprecated HTTPClient ALLOW_ALL_HOSTNAME_VERIFIER with NoopHostnameVerifier()
10 years ago
reger fba34e12ef fix formatting issue if snippet contains html code
10 years ago
reger e48720a58c fix NPE in snippet computation
10 years ago
reger eda0aeaf26 allow/recognize host in file: protocol crawl target
10 years ago
reger df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
10 years ago
Michael Peter Christen 8ff76f8682 the cleanup process experienced a 100% CPU load situation and the loop
10 years ago
Michael Peter Christen 1f5b5c0111 npe fix for latest scraper feature
10 years ago
Michael Peter Christen ee97302a23 hack to make date detection faster (while it becomes a bit incomplete
10 years ago
Michael Peter Christen 6578ff3ddb enhanced suggest function
10 years ago
reger fe6f5a395d fix Umlaut handling in blekko heuristic search term
10 years ago
reger 23924348e2 url with semicolon or comma handling in proxy request
10 years ago
reger 9025fe3518 upd error message for proxy
10 years ago
Michael Peter Christen 97ba5ddbb7 configuration option for maxload limit for remote search
10 years ago
reger c454ef69c6 add shortMemory check to heuristic search
10 years ago
reger 9e1ec5fec4 refactor: just some more usages of constant for term ":[* TO *]"
10 years ago
reger 8c491f51a5 remove hardcoded initialization of language nav if not used
10 years ago
Michael Peter Christen b5ac29c9a5 added a html field scraper which reads text from html entities of a
10 years ago
Michael Peter Christen 1cb290170e refactoring of autotagging code (combined same code pieces)
10 years ago
Michael Peter Christen c3b55455fc enhanced initialization speed of vocabularies by using better
10 years ago
Michael Peter Christen 68c605d637 replace with CommonPattern.SPACE for split
10 years ago
Michael Peter Christen de3e373913 using precompiled CommonPattern.TAB for split
10 years ago
Michael Peter Christen 1f5047b15f using precompiled pattern CommonPattern.SEMICOLON for splits
10 years ago
Michael Peter Christen a8a2b7a803 persistency for vocabulary facet switch
10 years ago
Michael Peter Christen efbc9a3561 introducing a new getConfig method which parses comma-separated lists
10 years ago
Michael Peter Christen 69eacdf4eb applying precompiled CommonPattern.COMMA.split to all places where
10 years ago
Michael Peter Christen ac19690d30 refactoring with CommonPattern.COMMA
10 years ago
Michael Peter Christen cf9b22ca5c do not reindex based on vocabulary fields (there are meanwhile many of
10 years ago
Michael Peter Christen 5a060c9f26 refactoring of reindexSolr (just replaced constant string)
10 years ago
Michael Peter Christen b5a55c8b3d fix for wkhtmltopdf (custom header does not work)
10 years ago
Michael Peter Christen 3d717b749a fix for urlmaskfilter
10 years ago
Michael Peter Christen bee5ee7cce removed some warnings
10 years ago
Michael Peter Christen 783cf6fbc7 the LinkedBlockingQueue is much faster than the ArrayBlockingQueue
10 years ago
Michael Peter Christen 6390454652 fix for vocabulary on/off setting
10 years ago
Michael Peter Christen a3c5995bde Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 5ca0762179 fix: eom on parsing ico file by genericImageParser
10 years ago
Michael Peter Christen 4cd2d68e03 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen dc5700148f update to latest code changes from json.org
10 years ago
reger 42b0672be3 Let auto-disabled crawls recover if low resource condition vanished.
10 years ago
Michael Peter Christen 287c528f46 replaced old JavaApplicationStub for Mac Application framework with new
10 years ago
Michael Peter Christen 4c9d2a7c64 reverted 'do not show all options' strategy. This is actually confusing
10 years ago
Michael Peter Christen 7db2888336 fixed font size and print page generation in pdf snapshots
10 years ago
reger 24f68a4eb7 refactor opensearch heuristic
10 years ago
Michael Peter Christen 3b51636ecb fix for mediawiki import
10 years ago
Michael Peter Christen b07afbc115 a test with http://validator.w3.org/feed/#validate_by_input shows that
10 years ago
Michael Peter Christen 8cafdb989a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 66839f73fa remove debug limit from commit before
10 years ago
reger 4214f250d0 Add option for extended search (Autosearch) to Bookmark.html asking all connected peers for the searchterm added as description to the bookmark created by the bookmark icon.
10 years ago
reger 8e751d754a - add javadoc to busythread with hint about the init parameter usage
10 years ago
Michael Peter Christen 3e6c3e2237 documents pushed over the api/push_p.html interface will have their
10 years ago
Michael Peter Christen 35c24608cc fix for division by zero (rare cases)
10 years ago
Michael Peter Christen 4144c7cc52 do not write frame links to webgraph
10 years ago
reger 4eb89d7f15 revert clickservlet
10 years ago
Michael Peter Christen c9e2128260 please commit new files under your own name, this file was not created
10 years ago
reger d44d8996d0 Added a “don't store remote search results” option
10 years ago
reger c156548efe add info text to metadata page (htmlresponsewriter) on no documents found
10 years ago
reger 3ac1d14a21 improve TexParser.mimeOf( fileextension ) by returning 1st defined in supported list.
10 years ago
Michael Peter Christen d2792a43fd do not write iframe and embed links into webgraph, but use them anyway
10 years ago
Michael Peter Christen 3cd7deb3b8 do not flush non-errors to stdout because this is a concurrency issue.
10 years ago
Michael Peter Christen 4e3e2acc69 Merge branch 'master' of gitorious.org:yacy/rc1-fixed_percent-encoding
10 years ago
Michael Peter Christen ecb6a59e9e do not translate gif images into png images for thumbnails. Instead,
10 years ago
arucard21 3e9871291f Applied URL-decoding prior to HTML-encoding.
10 years ago
reger 6a04563578 Init Jetty using setDefaultDescriptor (web.xml) to defaults/web.xml
10 years ago
reger 51ec9c1f44 fix "null" title in response writer for documents with multivalued title
10 years ago
reger 73ba5d8ef7 adjust fieldtype and description of field httpstatus_redirect_s in CollectionSchema
10 years ago
reger 1f9389396a fix NPE related 500 (Bad Request) response of UrlProxy on blacklisted urls,
10 years ago
reger f856edecb6 fix proxy redirect (http status 302) response
10 years ago
Michael Peter Christen cc090bcb01 enhanced initialization of autotagging
10 years ago
Michael Peter Christen a0576ec737 fix for pdf sub-page result preparation
10 years ago
Michael Peter Christen 6ad43c4a8b removed debug code
10 years ago
Michael Peter Christen 407cfff010 fix to wkhtmltopdf usage
10 years ago
Michael Peter Christen 5d321d3dc5 fixes to wkhtmltopdf call
10 years ago
Michael Peter Christen eb78388a98 changed prefer strategy for http unique in such a way that http is
10 years ago
Michael Peter Christen 9e588944fa prevent NPE during initialization of very large vocabularies
10 years ago
Michael Peter Christen aaf7d4775a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 8c3e5b7b6d added experimental pdf splitting which enables YaCy to split pdfs during
10 years ago
Michael Peter Christen d14114697c the miss cache does not seem to work, it sometimes contains urlhashes
10 years ago
reger deb75a1dbe fix refactored size() -> filesize() in YMarkMetadata
10 years ago
reger 198102304b refactor size() -> filesize() of URIMetadataNode
10 years ago
reger c6f634a4f2 remove redundant caching of urlhash in URIMetadataNode
10 years ago
Michael Peter Christen 5516819354 preventing the use of no-cache and expires in case that images are
10 years ago
Michael Peter Christen d3e71ed070 fixes for searches when initialization of large autotagging libraries
10 years ago
Michael Peter Christen 28683530cd fixes to usage of no-cache: use and recognize also the no-store
10 years ago
Michael Peter Christen c9c700b510 reduction of http requests to YaCy using the correct cache-control,
10 years ago
reger 13cca2b114 fix missing AppPath
10 years ago
Michael Peter Christen 65125439fe added query modifier 'on'. This makes it possible to search for date
10 years ago
Michael Peter Christen 1cfddea578 added (very experimental) Solr response writer for snapshot image
10 years ago
Michael Peter Christen 7287dd764e added url, date, time and page number on pdf snapshot footer
10 years ago
Michael Peter Christen 8b5d074715 fix for image parser (there is a class missing!)
10 years ago
Michael Peter Christen 932faafffe reactivated on-demand snapshot loading
10 years ago
Michael Peter Christen 2362ad7c34 fix for a count issue in snapshot api
10 years ago
Michael Peter Christen 3354cd63be Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 9971e197e0 Added a transaction interface to the snapshots: all documents in the
10 years ago
reger 63846ddb89 add final SolrQueryRequest.close to SolrServlet
10 years ago
reger 9edc7308aa update to metadata-extractor-2.7.0.jar
10 years ago
Michael Peter Christen 578ae29f1e added a note that the servlet is linked using web.xml
10 years ago
reger 6c3f36def1 - fix path to default heuristic.cfg
10 years ago
Michael Peter Christen bbf0ac40c3 add the actual DateDetection class... (missed in latest commit)
10 years ago
Michael Peter Christen 66b5a56976 Added and integrated new date detection class which can identify date
10 years ago
Michael Peter Christen c3c2b6999b fixes on wkhtmltopdf
10 years ago
Michael Peter Christen 114f0afc1e enable sku as anchor in html response writer
10 years ago
Michael Peter Christen aa80cb1159 enhanced tagging preparation speed which reduces initialization time for
10 years ago
Michael Peter Christen 6a1865f507 refactoring date -> lastModified
10 years ago
Michael Peter Christen ab6cc3c88c added concurrent generation of snapshot pdfs
10 years ago
Michael Peter Christen 413eeefed4 added character set detection library from
10 years ago
Michael Peter Christen 7bfc5b80cb added new options to vocabulary editor:
10 years ago
Michael Peter Christen 87b53b3572 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 8df8ffbb6d enhanced the snapshot functionality:
10 years ago
reger 5d67e165d9 remove redundant null check in ResponseHeader.lastModified
10 years ago
reger 5f0bb1214f modified FieldReIndex to reindex queries with low number of documents first
10 years ago
reger e52370728a fix startup stop on missing HTCACHE/SNAPSHOT directory
10 years ago
reger e5236aa7ca Merge origin/master
10 years ago
reger 70cf7060a4 coding fixes suggested in
10 years ago
Michael Peter Christen 4fe4bf29ad added rss feed output to snapshot servlet which can be used to get a
10 years ago
Michael Peter Christen 8b522687e0 added toString() methods to feed classes which makes it possible to
10 years ago
reger 568c991405 remove the unused Request variable
10 years ago
reger d6539ba597 Merge origin/master
10 years ago
reger ff18129def ViewFile servlet: update index if newer,
10 years ago
Michael Peter Christen a304058840 added Image Events as another option to generate images with a mac if no
10 years ago
Michael Peter Christen d83de9ecf5 added another path for the convert command because on older Macs
10 years ago
Michael Peter Christen 226aea5914 added a servlet which can create preview images, preview thumbnails and
10 years ago
reger 28456dfc09 skip creation of unused Bluelist contenttransformer
10 years ago
Michael Peter Christen 321840fde3 Replaced all fixed thread pools with cached thread pools. The cached
10 years ago
Michael Peter Christen 7bfab5eb9d set Busy- and Blocking-Threads to daemon mode (they will now not prevent
10 years ago
Michael Peter Christen e586e423aa in case that loading from the cache fails, load from wkhtmltopdf without
10 years ago
Michael Peter Christen d5bac64421 recognize more html file types for snapshots
10 years ago
Michael Peter Christen a1ee101079 recognize more html file extensions
10 years ago
Michael Peter Christen 8480641f2d fix to xvfb-run usage (quotes did not parse in xvfb-run, default values
10 years ago
Michael Peter Christen 68b040e31e added fail-over missing http proxy service (i.e. overload) and quiet
10 years ago
Michael Peter Christen 25a64c51b3 moved snapshot generation out of the html handler to prevent that
10 years ago
Michael Peter Christen c35170a305 more logging
10 years ago
Michael Peter Christen e8be07ec78 grr
10 years ago
Michael Peter Christen 6f81bb756c wrap wkhtmltopdf with xvfb if necessary
10 years ago
Michael Peter Christen 0119f8665d more logging when failing to create pdf snapshot
10 years ago
Michael Peter Christen 416fe886e3 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 60f27bdf49 added the property timeoutrequests to configuration to disable
10 years ago
Michael Peter Christen 97f6089a41 YaCy can now create web page snapshots as pdf documents which can later
10 years ago
reger ff80700aff replace deprecated Solr DateField.formatExternal with recommended TrieDateField.formatExternal
10 years ago
Michael Peter Christen 9ea120dbe5 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 0c97cc2440 skip unused call parameter for hashSentence()
10 years ago
reger 5790c7242e skip to tokenize punctuation as word in WordTokenizer
10 years ago
reger f07392ff17 add. use host port parameter in YaCyApp
10 years ago
Michael Peter Christen 09d2867050 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen ad0da5f246 added new web page snapshot infrastructure which will lead to the
10 years ago
Michael Peter Christen 5f5c7d69d1 added image screenshot generator
10 years ago
Michael Peter Christen 1d45d9405a security bugfix
10 years ago
Michael Peter Christen ff728b4aa5 ignore url errors during search
10 years ago
Michael Peter Christen 8317914ce3 changed vocabulary navigator object type to TreeMap to get a specific
10 years ago
Michael Peter Christen d5c1b07768 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen c0f9f6ac66 added option to change the navbar-default, i.e. usable for dark skins
10 years ago
Michael Peter Christen 10794e8efd trying facet.method fc instead of fcs to handle large facets
10 years ago
Michael Peter Christen 041b605cfe Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
Michael Peter Christen f1f74e8626 toString fix
10 years ago
Michael Peter Christen 30276a2b48 prevent that a local Solr search and a local RWI search are running
10 years ago
Michael Peter Christen 84763126e0 added option to make the YaCy proxy act as the cache is never stale. If
10 years ago
reger 1e7ee72240 fix path lookup to ./defaults/yacy.badwords
10 years ago
reger 7d863d6254 fix empty text facet entry
10 years ago
Michael Peter Christen a39419f2ef more stacks shall be considered for on-demand loading, not only
10 years ago
Michael Peter Christen 5bb52f79be reduce number of calls to queue.size() because that may be a bottleneck
10 years ago
Michael Peter Christen 4920ab7b76 optimize usage of size() cache
10 years ago
reger ee277b9b3e allow for local yacy.stopwords and yacy.badwords list (in DATA/SETTINGS/)
10 years ago
reger de56266bcb remove redundant toLower for topwords
10 years ago
Michael Peter Christen a34f837592 better delete all files in path when removing host crawl stack
10 years ago
Michael Peter Christen 10b1db430a if we have many hosts, use on-demand earlier
10 years ago
Michael Peter Christen 1324927e66 prevent division by zero
10 years ago
Michael Peter Christen 2beb6abeb6 disabled crazy sleep loop
10 years ago
Michael Peter Christen 70f03f7c8e do not cache search requests to Solr if the result is used for
10 years ago
Michael Peter Christen a0b84e4def use a LinkedHashMap for facets to maintain facet order as given by solr
10 years ago
reger ef5dc68313 include domtype to searcheventcache id
10 years ago
Michael Peter Christen 0dc6e0a5f2 added option to enrich vocabularies with synonyms from synonym database
10 years ago
Michael Peter Christen 6a2a669db4 added loading of the synonyms file from addon/synonyms into the
10 years ago
Michael Peter Christen c67c5c0709 added new solr schema fields which record the occurrences of vocabulary
10 years ago
Michael Peter Christen a67a465415 fix field counter for multi-fields in html writer for the solr servlet
10 years ago
Michael Peter Christen ec9d021568 added option in vocabulary editor to import CSV files with different
10 years ago
reger 3c818fc912 add a check of java version string >=1.7 to startup class
10 years ago
Michael Peter Christen 0550b54d56 added fix to postprocessing: avoid caching of postprocessing collection
10 years ago
Michael Peter Christen 68e8039fd1 added high-precision scheduler for API processes. This allows also to
10 years ago
Michael Peter Christen 8aee7f940e added missing class for latest changes
10 years ago
Michael Peter Christen 97039049e4 fix in key enumeration methods for cases where the enumeration is done
10 years ago
Michael Peter Christen 7e1b0b6712 fix for wildcard patch in search queries
10 years ago
Michael Peter Christen 0a879c98e7 added new 'firstSeen' database table and necessary data structures which
10 years ago
Michael Peter Christen 421ee64f33 another fix to ordering of table indexes; fixes also network stats
10 years ago
Michael Peter Christen 1db476c67e fix for bad table iteration
10 years ago
reger e4316e2d74 skip creation of local var in proxyhandler.storetocache
10 years ago
sixcooler 9c6e3a6b1c fix assertion failure in version-string for Solr-4.10.2 by changing
10 years ago
sixcooler 725b206fb4 update to solr-/lucene-4.10.2
10 years ago
Michael Peter Christen 5c97ecb30f fix of bad query generation for search facets
10 years ago
Michael Peter Christen 95d87f00b3 fix for bad query generation in doublecheck in postprocessing
10 years ago
orbiter 72c2bc5189 fix for search in case where local peer has no local seed address in
10 years ago
orbiter 5be352da99 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
orbiter 0fcd8097a3 removed unused options from BusyThreads
10 years ago
Michael Peter Christen fe8b1d137d emergency bugfix for 100% CPU in image drawing
10 years ago
Michael Peter Christen 92007e5d2d more enhancements to postprocessing speed
10 years ago
Michael Peter Christen 9a7fe9e0d1 fix for bad timing computation in postprocessing
10 years ago
Michael Peter Christen bd16119a00 another fix for postprocessing (the query for "" on numeric field did
10 years ago
Michael Peter Christen 327e83bfe7 more fixes in postprocessing: partitioning of the complete queue to
10 years ago
orbiter 2bc6199408 more concurrency for postprocessing
10 years ago
orbiter a83cf26c38 more fixes and enhancements to postprocessing
10 years ago
orbiter 71758f0d62 enhanced postprocessing by usage of a field-list generation to prevent
10 years ago
orbiter 7856fbdbe8 fix for npe (in rare cases)
10 years ago
orbiter 8a2b569d7c fix for literal computation
10 years ago
orbiter 856da2712b Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
orbiter ca9cd7b58a more IPv6 fixes
10 years ago
Michael Peter Christen b4585e9546 added new index size history image in /Status.html page
10 years ago
Michael Peter Christen 167c5a51f0 IPv6 fix
10 years ago
Michael Peter Christen fe537679de fix for exact_signature_unique_b, exact_signature_copycount_i,
10 years ago
sixcooler eb9d2705d2 fix for ConnectionInfo.cleanup of server-connections
10 years ago
Michael Peter Christen 2e5214eb21 added field postprocessing.partialUpdate to settings which can be used
10 years ago
Michael Peter Christen 11074d8d24 fix for an SSL bug that appears only in Java 7.
10 years ago
Michael Peter Christen e96490e3a1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 77662e08e1 concurrently initialize the error cache; extended also the cache by
10 years ago
sixcooler d8fcc4a2f5 added a timeout on Jetty connectors
10 years ago
Michael Peter Christen 0f0b60404b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
sixcooler 72561926aa do not overwrite yacy.conf in case of an exception
10 years ago
Michael Peter Christen 07c5b57953 removed warnings
10 years ago
orbiter fa2ad101ec enhanced graphics computation (avoiding long string parsing for colours)
10 years ago
orbiter ef813cec91 added proper copyright notice to OSM tiles presented at the search
10 years ago
Michael Peter Christen fca11701f0 better profiling of solr queries
10 years ago
Michael Peter Christen 2e09da9832 npe fix
10 years ago
Michael Peter Christen d80418f1b1 added partial updates to solr during postprocessing: during
10 years ago
Michael Peter Christen b1cfbc4a04 added new solr field url_paths_count_i which can be used to enhance the
10 years ago
Michael Peter Christen e69883d5ab fix-fix for
10 years ago
Michael Peter Christen 30d4402cd1 fixed location search
10 years ago
Michael Peter Christen 6983dff334 explain crawl denial when not switched to intranet mode
10 years ago
Michael Peter Christen f818f84adb more ipv6 fixes
10 years ago
Michael Peter Christen afd5bd5f5f slightly enhanced Network table computation by using a lazy initialized
10 years ago
Michael Peter Christen 2c2b50e65d refactoring (class name should start with uppercase letter)
10 years ago
Michael Peter Christen bc275dca07 added network history graph image /NetworkHistory.png which can show
10 years ago
Marc Nause ce9368246b Merge branch 'master' of gitorious.org:yacy/rc1
10 years ago
Marc Nause 5603809deb Minor changes:
10 years ago
Michael Peter Christen d8beafba3a fix for values in CrawlProfileEditor table and xml; now the full profile
10 years ago
Michael Peter Christen ec95dfa2e6 fixed crawl profile xml result which did not show the correct crawl
10 years ago
Michael Peter Christen 8c1a89cb34 added another decoration flag to switch off network graphics in crawler
10 years ago
Michael Peter Christen ee27be3399 misc bugfixes (concurrency, memory protection)
10 years ago
Michael Peter Christen 9b1958e8ca more ipv6 bugfixes
10 years ago
Michael Peter Christen 7817fc50c9 added a high cpu cycle monitor to PerformanceQueues
10 years ago
Michael Peter Christen 5082feb103 less volume for effect sounds
10 years ago
Michael Peter Christen e8392e2ff2 fix for local search
10 years ago
Michael Peter Christen 0bfc69b29b more ipv6 bugfixes
10 years ago
Michael Peter Christen a27563e5c3 removed the atmo sound clips because they had been too large
10 years ago
Michael Peter Christen 883622306e Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 97995a1dd9 fix for remote search process
10 years ago
Michael Peter Christen 0843b12ef3 ipv6 fix: avoid that shrinked own ip set is overwritten with (non-valid)
10 years ago
Michael Peter Christen 92c5d97486 fix for bad node flag setting with IPv6
10 years ago
orbiter c27bad9326 more ipv6 fixes
10 years ago
orbiter cddf884bc4 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
Michael Peter Christen 460858fb22 more ipv6 fixes
10 years ago
Michael Peter Christen 5cef88a315 argh.. adding missing java class for latest audio feature
10 years ago
Michael Peter Christen 74957f3760 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 2a052f446a Added an experimental audio feedback system.
10 years ago
Marc Nause 1e6e69bc40 Finished implementation of UPNP:
10 years ago
Michael Peter Christen d0358e568b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen e1bc768f9d more IPv6 bugfixes
10 years ago
reger 59c6532a65 add link extraction to pdfParser
10 years ago
reger aa2e15d846 allow url parameter in worktable apicall
10 years ago
orbiter f3a12801f0 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
orbiter d93325a578 lazy handling of process_sxt field (part of postprocessing)
10 years ago
Michael Peter Christen b31db00010 toString fixes
10 years ago
Michael Peter Christen 961f06c0b6 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 209e0f2fe8 allow url parameter in worktable apicall
10 years ago
reger b5ca20de15 preserve content_type (mime) if supplied in preference of construct in from file type.
10 years ago
reger fe9f1c594e fix char encoding parameter in UrlProxy
10 years ago
reger b0c87d8240 fix image search expand box, cut-off of 2nd capture line height
10 years ago
Michael Peter Christen 2c2ed8bf4e typo in javadoc
10 years ago
Michael Peter Christen 528f583d72 ipv6 fixes
10 years ago
Michael Peter Christen 6ee5b4352d Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 247e626083 IPv6 host parsing bugfixes
10 years ago
reger fb1fcc2b03 handle noarchive tag, skip writing page to cache
10 years ago
Michael Peter Christen fe917deb2d when pinging other peers, be able to select the right IP option
10 years ago
Michael Peter Christen 65e6ae52fb IPv6-enhanced Network monitoring page
10 years ago
Michael Peter Christen 3073c69aee Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 6491270b3a large IPv6 redesign of peer ping methods!
10 years ago
reger eaccce3467 added metadataImageParser for tif and psd (Photoshop) images.
10 years ago
reger a69f5358ff use javax ImageIO getReader to add supported image extension/mime
10 years ago
reger 8b1ce49ee6 remove unused variable timeout
10 years ago
reger 48aed15c48 skip loader wait cycle on concurrent access in nocache configuration.
10 years ago
Michael Peter Christen 67cd4c37bd activated the new apk parser which was already ready but not included in
11 years ago
orbiter a922b122a3 added a hack to forward solr search results from an external attached
11 years ago
Michael Peter Christen 025516f682 fix for crawl limit for number of pages fail
11 years ago
Michael Peter Christen 2645dc816a added warning for not well-formed postprocessing queries
11 years ago
Michael Peter Christen 437ce3b8a0 added internal api for partial updates to Solr
11 years ago
orbiter 3ac31614a3 added option to reverse-sort YaCy tables (internal API change only)
11 years ago
Michael Peter Christen 6d3d4c4ea6 changed the concurrent enumeration of query results in such a way that
11 years ago
Michael Peter Christen ad35d9294f added a 'stats' table which records some peer statistics twice every
11 years ago
reger 8284ea751a catch TimeoutException during ping and do not delete yacy.conf during prereadconfigfile
11 years ago
reger ffa7c7116f better fix for NPE in image search
11 years ago
Michael Peter Christen 759e7d9538 fix for http://forum.yacy-websuche.de/viewtopic.php?p=30720#p30720
11 years ago
Michael Peter Christen bf18a39d0e replaced warning with info
11 years ago
Michael Peter Christen f1032fb8fe more enhancements to image search in case that a restriction to a single
11 years ago
Michael Peter Christen 475125f9d7 hack to get more results when doing a remote site search
11 years ago
Michael Peter Christen 81f9b34da7 increased ability to search for all images on a single server within
11 years ago
Michael Peter Christen 2c26013c50 better contentdom abstraction
11 years ago
Michael Peter Christen 6a8fb8190b changed default value for maximum number of connections to 50
11 years ago
Michael Peter Christen ca8b2bf099 removed www and welcome servlet, these had been demo servlets and are
11 years ago
reger 03a7a29db3 limit OAI import urn resolver try for Deutsche National Library
11 years ago
Michael Peter Christen 0838326a76 changed error message, see http://mantis.tokeek.de/view.php?id=439
11 years ago
reger b5e0f70197 - remove repositoryPath post from ConfigBasic (obsolete)
11 years ago
reger 8931e14514 fix NPE in image search
11 years ago
Michael Peter Christen 1735dbc9d9 enhanced image search: bugfixes and performance enhancements
11 years ago
Michael Peter Christen ebd0be2cea fixes and speed updates for search process
11 years ago
Michael Peter Christen 7611bf79bd Merge branch 'master' of gitorious.org:yacy/icewindxs-rc1
11 years ago
Michael Peter Christen 524bedc00a fixed text in startup tray icon and added shutdown icon during shutdown
11 years ago
Michael Peter Christen 4709d8417c npe fix for non-tray users
11 years ago
orbiter 5b5635e187 replaced font for boot tray icon with image and added some more images
11 years ago
orbiter aa6cdc4ab5 speed-up of start process if remote DNS waits for timeout
11 years ago
orbiter 40b3977c21 added an animation of the tray icon during the boot phase of YaCy.
11 years ago
Michael Peter Christen ec6082c872 very bad language detection hack fix hack
11 years ago
Michael Peter Christen 39615de3f9 adding the buffer size is not wrong but may cause confusing information
11 years ago
Michael Peter Christen 395edec6f1 changed strategy to count the number of documents: get the max of
11 years ago
Michael Peter Christen e87dc08c0d set the correct fail time in error docs
11 years ago
Michael Peter Christen cfb20bc0ce removing the [] for ipv6 addresses may be a bad idea..
11 years ago
orbiter b6d57f06eb enhanced the apk parser (up to being production-ready).
11 years ago
Michael Peter Christen a7dd89c4de changed method to write the citation index: do not catch up references
11 years ago
Michael Peter Christen 57ce7eeff3 fixed localhost authorization and replaced the adminRealm with an info
11 years ago
orbiter f318d7c285 enhanced date-ordered ranking
11 years ago
reger a6891ff7f8 fix Querygoal.parse exception on +/-null-term
11 years ago
reger c7335318eb remove unused legacy procedure from httpserver
11 years ago
Michael Peter Christen eab0d3e1a9 bugfix for wrong lock display, see
11 years ago
orbiter 49d4f95faf bugfix to latest commit
11 years ago
orbiter 68211f8244 enable Crawler_p servlet if a rss feed or a wiki dump import was
11 years ago
orbiter a65df4ce7e do not push noindex errors into log if in intranet mode. noindex
11 years ago
orbiter 688c6d8954 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
orbiter 4ae7aead28 addon to latest fix
11 years ago
Marc Nause 2af56fa37d Improved UPnP. (still not perfect)
11 years ago
orbiter b3ebd38079 removed the HTDOCS repository concept because the concept to host files
11 years ago
reger 1fdcc2d67b change seedfile upload ip check to allow intranet ip in intranet mode
11 years ago
reger e31b0e6d67 - update javadoc Seed.getIP
11 years ago
reger 350c6b8250 in IntranetMode allow intranet hosted seedlist with Network_Domain "any"
11 years ago
orbiter d68438c3d9 make sure that the postprocessing background thread never dies by any
11 years ago
orbiter b4f2a1db6e added an unlock icon for all protected pages that are unlocked because
11 years ago
reger ea6c9e9b07 reduce mem buffer overhead for gap files during r/w
11 years ago
reger e88537522d allow single quote " ' " in query
11 years ago
orbiter 487021fb0a snippet computation update
11 years ago
orbiter 1c2f1f233a Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger 5a4995ded3 fill solr rss writer dc:subject tag with keyword content
11 years ago
orbiter 927aaa95a6 concurrency bugfix
11 years ago
orbiter c9e593cf78 removed warnings
11 years ago
reger 7584352e7b use more predefined Solr query parameter constants
11 years ago
reger f9db5dd6c5 reduce doublecontent check document (prevent out of memory)
11 years ago
reger e9eae45b55 simplify rssreader and improve atom feed link extraction
11 years ago
reger a8508417d1 catch NPE during crawl (OAI import)
11 years ago
reger 3dde94422f center searchevent lines on network graph
11 years ago
Michael Peter Christen 3860711aef fix for possible interruption of concurrent queries
11 years ago
Michael Peter Christen 6344718f8b reducing the concurrent query stack size and reduced concurrency of
11 years ago
Michael Peter Christen eca9380e3d bugfix for crawler double-check: if an url is redirected, the
11 years ago
Michael Peter Christen 9ac0c93f17 fix for subpath crawl filter
11 years ago
Michael Peter Christen 66106bdaf0 fix for crawler attribute maxdompages
11 years ago
Michael Peter Christen 49d91b94c3 npe fix in crawler
11 years ago
Michael Peter Christen b7183a7321 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger ea2e627662 fix ConfigAccounts del user with uppercase letter in name
11 years ago
Michael Peter Christen c465b791af typo
11 years ago
Michael Peter Christen 191ec8c82a added concurrency to postprocess rewrite process
11 years ago