Commit Graph

7669 Commits (af2d66e3d84eb68497bf4b57c6b27f310a078620)

Author SHA1 Message Date
Michael Peter Christen 8cafdb989a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 66839f73fa remove debug limit from commit before
10 years ago
reger 4214f250d0 Add option for extended search (Autosearch) to Bookmark.html asking all connected peers for the searchterm added as description to the bookmark created by the bookmark icon.
10 years ago
reger 8e751d754a - add javadoc to busythread with hint about the init parameter useage
10 years ago
Michael Peter Christen 3e6c3e2237 documents pushed over the api/push_p.html interface will have their
10 years ago
Michael Peter Christen 35c24608cc fix for division by zero (rare cases)
10 years ago
Michael Peter Christen 4144c7cc52 do not write frame links to webgraph
10 years ago
reger 4eb89d7f15 revert clickservlet
10 years ago
Michael Peter Christen c9e2128260 please commit new files under your own name, this file was not created
10 years ago
reger d44d8996d0 Added a “don't store remote search results” option
10 years ago
reger c156548efe add info text to metadata page (htmlresponsewriter) on no documents found
10 years ago
reger 3ac1d14a21 improve TexParser.mimeOf( fileextension ) by returning 1st defined in supported list.
10 years ago
Michael Peter Christen d2792a43fd do not write iframe and embed links into webgraph, but use them anyway
10 years ago
Michael Peter Christen 3cd7deb3b8 do not flush non-errors to stdout because this is a concurrency issue.
10 years ago
Michael Peter Christen 4e3e2acc69 Merge branch 'master' of gitorious.org:yacy/rc1-fixed_percent-encoding
10 years ago
Michael Peter Christen ecb6a59e9e do not translate gif images into png images for thumbnails. Instead,
10 years ago
arucard21 3e9871291f Applied URL-decoding prior to HTML-encoding.
10 years ago
reger 6a04563578 Init Jetty using setDefaultDescriptor (web.xml) to defaults/web.xml
10 years ago
reger 51ec9c1f44 fix "null" title in response writer for documents with multivalued title
10 years ago
reger 73ba5d8ef7 adjust fieldtype and description of field httpstatus_redirect_s in CollectionSchema
10 years ago
reger 1f9389396a fix NPE related 500 (Bad Request) response of UrlProxy on blacklisted urls,
10 years ago
reger f856edecb6 fix proxy redirect (http status 302) response
10 years ago
Michael Peter Christen cc090bcb01 enhanced initialization of autotagging
10 years ago
Michael Peter Christen a0576ec737 fix for pdf sub-page result preparation
10 years ago
Michael Peter Christen 6ad43c4a8b removed debug code
10 years ago
Michael Peter Christen 407cfff010 fix to wkhtmltopdf usage
10 years ago
Michael Peter Christen 5d321d3dc5 fixes to wkhtmltopdf call
10 years ago
Michael Peter Christen eb78388a98 changed prefer strategy for http unique in such a way that http is
10 years ago
Michael Peter Christen 9e588944fa prevent NPE during initialization of very large vocabularies
10 years ago
Michael Peter Christen aaf7d4775a Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 8c3e5b7b6d added experimental pdf splitting which enables YaCy to split pdfs during
10 years ago
Michael Peter Christen d14114697c the miss cache does not seem to work, it sometimes contains urlhashes
10 years ago
reger deb75a1dbe fix refactored size() -> filesize() in YMarkMetadata
10 years ago
reger 198102304b refactor size() -> filesize() of URIMetadataNode
10 years ago
reger c6f634a4f2 remove redundant caching of urlhash in URIMetadataNode
10 years ago
Michael Peter Christen 5516819354 preventing the use of no-cache and expires in case that images are
10 years ago
Michael Peter Christen d3e71ed070 fixes for searches when initialization of large autotagging libraries
10 years ago
Michael Peter Christen 28683530cd fixes to usage of no-cache: use and recognize also the no-store
10 years ago
Michael Peter Christen c9c700b510 reduction of http requests to YaCy using the correct cache-control,
10 years ago
reger 13cca2b114 fix missing AppPath
10 years ago
Michael Peter Christen 65125439fe added query modifier 'on'. This makes it possible to search for date
10 years ago
Michael Peter Christen 1cfddea578 added (very experimental) Solr response writer for snapshot image
10 years ago
Michael Peter Christen 7287dd764e added url, date, time and page number on pdf snapshot footer
10 years ago
Michael Peter Christen 8b5d074715 fix for image parser (there is a class missing!)
10 years ago
Michael Peter Christen 932faafffe reactivated on-demand snapshot loading
10 years ago
Michael Peter Christen 2362ad7c34 fix for a count issue in snapshot api
10 years ago
Michael Peter Christen 3354cd63be Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 9971e197e0 Added a transaction interface to the snapshots: all documents in the
10 years ago
reger 63846ddb89 add final SolrQueryRequest.close to SolrServlet
10 years ago
reger 9edc7308aa update to metadata-extractor-2.7.0.jar
10 years ago
Michael Peter Christen 578ae29f1e added a note that the servlet is linked using web.xml
10 years ago
reger 6c3f36def1 - fix path to default heuristic.cfg
10 years ago
Michael Peter Christen bbf0ac40c3 add the actual DateDetection class... (missed in latest commit)
10 years ago
Michael Peter Christen 66b5a56976 Added and integrated new date detection class which can identify date
10 years ago
Michael Peter Christen c3c2b6999b fixes on wkhtmltopdf
10 years ago
Michael Peter Christen 114f0afc1e enable sku as anchor in html response writer
10 years ago
Michael Peter Christen aa80cb1159 enhanced tagging preparation speed which reduces initialization time for
10 years ago
Michael Peter Christen 6a1865f507 refactoring date -> lastModified
10 years ago
Michael Peter Christen ab6cc3c88c added concurrent generation of snapshot pdfs
10 years ago
Michael Peter Christen 413eeefed4 added character set detection library from
10 years ago
Michael Peter Christen 7bfc5b80cb added new options to vocabulary editor:
10 years ago
Michael Peter Christen 87b53b3572 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 8df8ffbb6d enhanced the snapshot functionality:
10 years ago
reger 5d67e165d9 remove redundant null check in ResponseHeader.lastModified
10 years ago
reger 5f0bb1214f modified FieldReIndex to reindex queries with low number of documents first
10 years ago
reger e52370728a fix startup stop on missing HTCACHE/SNAPSHOT directory
10 years ago
reger e5236aa7ca Merge origin/master
10 years ago
reger 70cf7060a4 coding fixes suggested in
10 years ago
Michael Peter Christen 4fe4bf29ad added rss feed output to snapshot servlet which can be used to get a
10 years ago
Michael Peter Christen 8b522687e0 added toString() methods to feed classes which makes it possible to
10 years ago
reger 568c991405 remove the unused Request variable
10 years ago
reger d6539ba597 Merge origin/master
10 years ago
reger ff18129def ViewFile servlet: update index if newer,
10 years ago
Michael Peter Christen a304058840 added Image Events as another option to generate images with a mac if no
10 years ago
Michael Peter Christen d83de9ecf5 added another path for the convert command because on older Macs
10 years ago
Michael Peter Christen 226aea5914 added a servlet which can create preview images, preview tumbnails and
10 years ago
reger 28456dfc09 skip creation of unused Bluelist contenttransformer
10 years ago
Michael Peter Christen 321840fde3 Replaced all fixed thread pools with cached thread pools. The cached
10 years ago
Michael Peter Christen 7bfab5eb9d set Busy- and Blocking-Threads to daemon mode (they will now not prevent
10 years ago
Michael Peter Christen e586e423aa in case that loading from the cache fails, load from wkhtmltopdf without
10 years ago
Michael Peter Christen d5bac64421 recognize more html file types for snapshots
10 years ago
Michael Peter Christen a1ee101079 recognize more html file extensions
10 years ago
Michael Peter Christen 8480641f2d fix to xvfb-run usage (quotes did not parse in xvfb-run, default values
10 years ago
Michael Peter Christen 68b040e31e added fail-over missing http proxy service (i.e. overload) and quiet
10 years ago
Michael Peter Christen 25a64c51b3 moved snapshot generation out of the html handler to prevent that
10 years ago
Michael Peter Christen c35170a305 more logging
10 years ago
Michael Peter Christen e8be07ec78 grr
10 years ago
Michael Peter Christen 6f81bb756c wrap wkhtmltopdf with xvfb if necessary
10 years ago
Michael Peter Christen 0119f8665d more logging when failing to create pdf snapshot
10 years ago
Michael Peter Christen 416fe886e3 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen 60f27bdf49 added the property timeoutrequests to configuration to disable
10 years ago
Michael Peter Christen 97f6089a41 YaCy can now create web page snapshots as pdf documents which can later
10 years ago
reger ff80700aff replace depreciated Solr DateField.formatExternal with recommended TrieDateField.formatExternal
10 years ago
Michael Peter Christen 9ea120dbe5 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 0c97cc2440 skip unused call parameter for hashSentence()
10 years ago
reger 5790c7242e skip to tokenize punktuation as word in WordTokenizer
10 years ago
reger f07392ff17 add. use host port parameter in YaCyApp
10 years ago
Michael Peter Christen 09d2867050 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen ad0da5f246 added new web page snapshot infrastructure which will lead to the
10 years ago
Michael Peter Christen 5f5c7d69d1 added image screenshot generator
10 years ago
Michael Peter Christen 1d45d9405a security bugfix
10 years ago
Michael Peter Christen ff728b4aa5 ignore url errors during search
10 years ago
Michael Peter Christen 8317914ce3 changed vocabulary navigator object type to TreeMap to get a specific
10 years ago
Michael Peter Christen d5c1b07768 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen c0f9f6ac66 added option to change the navbar-default, i.e. usable for dark skins
10 years ago
Michael Peter Christen 10794e8efd trying facet.method fc instead of fcs to handle large facets
10 years ago
Michael Peter Christen 041b605cfe Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
Michael Peter Christen f1f74e8626 toString fix
10 years ago
Michael Peter Christen 30276a2b48 prevent that a local Solr search and a local RWI search are running
10 years ago
Michael Peter Christen 84763126e0 added option to make the YaCy proxy act as the cache is never stale. If
10 years ago
reger 1e7ee72240 fix path lookup to ./defaults/yacy.badwords
10 years ago
reger 7d863d6254 fix empty text facet entry
10 years ago
Michael Peter Christen a39419f2ef more stacks shall be considered for on-demand loading, not only
10 years ago
Michael Peter Christen 5bb52f79be reduce number of calls to queue.size() because that may be a bottleneck
10 years ago
Michael Peter Christen 4920ab7b76 optimize usage of size() cache
10 years ago
reger ee277b9b3e allow for local yacy.stopwords and yacy.badwords list (in DATA/SETTINGS/)
10 years ago
reger de56266bcb remove redundant toLower for topwords
10 years ago
Michael Peter Christen a34f837592 better delete all files in path when removing host crawl stack
10 years ago
Michael Peter Christen 10b1db430a if we have many hosts, use on-demand earlier
10 years ago
Michael Peter Christen 1324927e66 prevent division by zero
10 years ago
Michael Peter Christen 2beb6abeb6 disabled crazy sleep loop
10 years ago
Michael Peter Christen 70f03f7c8e do not cache search requests to Solr if the result is used for
10 years ago
Michael Peter Christen a0b84e4def use a LinkedHashMap for factes to maintain facet order as given by solr
10 years ago
reger ef5dc68313 include domtype to searcheventcache id
10 years ago
Michael Peter Christen 0dc6e0a5f2 added option to enrich vocabularies with synonyms from synonym database
10 years ago
Michael Peter Christen 6a2a669db4 added loading of the synonyms file from addon/synonyms into the
10 years ago
Michael Peter Christen c67c5c0709 added new solr schema fields which record the occurences of vocabulary
10 years ago
Michael Peter Christen a67a465415 fix field counter for multi-fields in html writer for the solr servlet
10 years ago
Michael Peter Christen ec9d021568 added option in vocabulary editor to import CSV files with different
10 years ago
reger 3c818fc912 add a check of java version string >=1.7 to startup class
10 years ago
Michael Peter Christen 0550b54d56 added fix to postprocessing: avoid caching of postprocessing collection
10 years ago
Michael Peter Christen 68e8039fd1 added high-precision scheduler for API processes. This allows also to
10 years ago
Michael Peter Christen 8aee7f940e added missing class for latest changes
10 years ago
Michael Peter Christen 97039049e4 fix in key enumeration methods for cases where the enumeration is done
10 years ago
Michael Peter Christen 7e1b0b6712 fix for wildcard patch in search queries
10 years ago
Michael Peter Christen 0a879c98e7 added new 'firstSeen' database table and necessary data structures which
10 years ago
Michael Peter Christen 421ee64f33 another fix to ordering of table indexes; fixes also network stats
10 years ago
Michael Peter Christen 1db476c67e fix for bad table iteration
10 years ago
reger e4316e2d74 skip creation of local var in proxyhandler.storetocache
10 years ago
sixcooler 9c6e3a6b1c fix assertation-failure in version-string for Solr-4.10.2 by changing
10 years ago
sixcooler 725b206fb4 update to solr-/lucene-4.10.2
10 years ago
Michael Peter Christen 5c97ecb30f fix of bad query generation for search facets
10 years ago
Michael Peter Christen 95d87f00b3 fix for bad query generation in doublecheck in postprocessing
10 years ago
orbiter 72c2bc5189 fix for search in case where local peer has no local seed address in
10 years ago
orbiter 5be352da99 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
orbiter 0fcd8097a3 removed unused options from BusyThreads
10 years ago
Michael Peter Christen fe8b1d137d emergency bugfix for 100% CPU in image drawing
10 years ago
Michael Peter Christen 92007e5d2d more enhancements to posprocessing speed
10 years ago
Michael Peter Christen 9a7fe9e0d1 fix for bad timing computation in postprocessing
10 years ago
Michael Peter Christen bd16119a00 another fix for postprocessing (the query for "" on numeric field did
10 years ago