11953 Commits (d54c5d310a1f80ad80b42d72db6fd738f39c30d3)
 

Author SHA1 Message Date
reger d54c5d310a add links with image extension not automatically to image links.
9 years ago
reger 5744342fec handle image preview for url w empty file extension
9 years ago
reger 851e8f6c8a check jpeg file signature in genericImageParser
9 years ago
reger fb75fea446 use recrawljob w/o sort results by date
9 years ago
reger 43c27aa550 upd to solr/lucene 5.3.1
9 years ago
reger fd5a1dc297 upd to poi-3.13
9 years ago
reger 688f7b2a5c allow/display svg images in image results previews
9 years ago
reger d5330391de remove some unused var allocation in parser
9 years ago
Michael Peter Christen 3d7dd9d3aa follow-up to latest commit: also flush the search cache if all crawls
9 years ago
Michael Peter Christen 225200194a every time a crawl is started, the user expects a different search
9 years ago
Michael Peter Christen c737ff235d in case that the include_string contains several entries including
9 years ago
Michael Peter Christen 8e555d79a3 add also 1-character tokens to the token list because that could be also
9 years ago
reger 7c82cd4415 add an end condition to svgParser for wrong content
9 years ago
reger b92d81b073 remove double caching of inputstream in ViewImage
9 years ago
reger c7c5e2dff9 fix old/obsolete solr dependency to stax
9 years ago
reger beed1c417e Add report profile with OWASP Dependency-Check to maven pom
9 years ago
reger 356d4d1301 remove rdfParser from init (current function identical with genericParser)
9 years ago
reger c647d899e3 add svgParser to parse metadata from svg images
9 years ago
reger bad34804fe optimize parseInt for <img> tag attribute parsing
9 years ago
Michael Peter Christen 3c31bf845f fix for latest merge
9 years ago
Michael Peter Christen 6ebc2451a9 Merge pull request #14 from luccioman/master
9 years ago
reger 2f51baff4f check for loading error (includes unsupported formats)
10 years ago
luc 5578886f6f Merge branch 'master' of https://github.com/luccioman/yacy_search_server.git
10 years ago
luc c38d6c1f37 Correction for mantis 535: inurl: parameter doesn't work on URLs with
10 years ago
reger 52e3eb4ce8 harmonize/correct assignment to Ymarkmeta.mime
10 years ago
Michael Peter Christen 87f358058e Fix for index entries which have id's not computed as hash from the url.
10 years ago
reger 2951c9fc40 remove unused check for known fileextension in searchtrailer
10 years ago
reger 3f2b8ab5e5 optionally include mime in p2p url exchange string
10 years ago
reger a3195d78ae add Portuguese month names to date recognition
10 years ago
reger d2cc11ea8f fix html parser taking <style> content as text.
10 years ago
Michael Peter Christen 5f706797cb patch for a bug inside of solr since solr 5.0 when using a boost
10 years ago
reger 733d725dec limit css scrolling to result/content window x
10 years ago
Burkhard 4c38083a11 Merge pull request #10 from Raegdan/raegdan-css-layout-fix
10 years ago
reger 7889fc2389 Hack to prevent Solr issue on partial update on a document containing multivalued date field
10 years ago
reger b4cbdea1e7 adapt SolrServerConnector.add to handle error on partial update input document.
10 years ago
reger e594130aec add test case for partial update - to discover effect on YaCy for update of documents with multivalued date fields (like dates_in_content_dts)
10 years ago
reger 98ab655917 on reindex delete index document with invalid url
10 years ago
reger 1e8369e18b use a parsed date in Document.toString
10 years ago
reger d5da9e5a38 fix test method (add throw for URIMetadataNode)
10 years ago
luccioman a7179138ce Returned again to main repository location : does anyone want to
10 years ago
luccioman 199b2ce52d Translator refactoring : to simplify locale files writing, process keys
10 years ago
luccioman 711183bd72 Merge branch 'master' of ssh://git@github.com/yacy/yacy_search_server
10 years ago
luccioman 4dd9c0d5d9 Merge from main repository
10 years ago
reger 3428b6f13b improve filtering by filetype navigator.
10 years ago
reger e37a4f0b3d prevent metadata records in index w/o valid url
10 years ago
reger 41c4eade51 extract modification date from vCard (vcfParser)
10 years ago
reger 8768896975 extract lastmodified from openoffice doc
10 years ago
Michael Peter Christen c40c302748 when many crawl queues are generated, this NPE can occur; probably
10 years ago
Michael Peter Christen 94cfa63c46 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen 0a37d8af89 in case that a site crawl is started for urls with file:// path, the
10 years ago