Commit Graph

7902 Commits (4d2b934487fc5a35465ad0286cb2b20302167e10)

Author SHA1 Message Date
reger 4d2b934487 prevent mailto links getting into parser result document's in/outbound link collection
9 years ago
sixcooler 1be67d9ab6 CachedSolrConnector was replaced by ConcurrentUpdateSolrConnector years
9 years ago
reger 28b8bc290a fix use of NETWORK_SEARCHVERIFY for rwi verification
9 years ago
reger 020630efd8 remove unused network scanner parameter from queryparameter
9 years ago
luc ad5586f8f6 Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc 8ebefa4233 Fixed MediaWiki import : DCEntry conversion to SolrInputDocument was
9 years ago
luc 7736ee5a42 Updated MediaWimporter main() : display usage in console and stop
9 years ago
reger cdb8f3b10d make current ranking score value avail. to search interface / api
9 years ago
luc 27d11f8671 Fixed isSolrDump function : PushBackInputStream was not unread when
9 years ago
Michael Peter Christen 135a123a77 less logging in new language detection
9 years ago
Michael Peter Christen ef8cd80593 fix for npe
9 years ago
reger 0347bfa71f Apply collection query constraint/modifiert to rwi result stack.
9 years ago
luc 2a67d2ba6f Corrected error management for unsupported image formats, parsing
9 years ago
Michael Peter Christen d6e9834040 Merge branch 'master' of
9 years ago
Michael Peter Christen d82d311995 Merge branch 'master' of https://github.com/luccioman/yacy_search_server
9 years ago
reger b5371ea8c1 read/init crawl queue in a thread
9 years ago
reger 1160b13172 remove unused md5 from ViewFile servlet params
9 years ago
reger e163ea88f6 fix vsdParser (Visio) parser return statement
9 years ago
reger b2c8bc0ae6 remove md5_s from default index fields
9 years ago
luc e40ae0943b - No max dimensions specified : render raw image data when source and
9 years ago
reger 90686a75a2 fix flux factor (additional crawl delay by access count) calculation
9 years ago
luc 4af27289e5 Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger 297fdb60d3 throw exception if crawler hostqueue can't create hostpath directory.
9 years ago
luc 755efac17d Use same max file size when loading all resource bytes or opening stream
9 years ago
luc bc6c79fc12 Corrected scaling function for non RGB images.
9 years ago
luc 1565559df8 Refactoring : extracted write InputStream method.
9 years ago
luc f0478bb14d BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys
9 years ago
luc 07437986e7 Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger 97cc03ef6a start using a template for urlproxy header
9 years ago
luc f01d49c37a Process large or local file images dealing directly with content
10 years ago
luc 3c4c77099d If available, check content length before downloading. Check also
10 years ago
luc 5bbb2e1730 Ensure resource is closed when reading a full file InputStream
10 years ago
luc 6291a57300 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 0d3c5b223e have psParser cleanup temp file
10 years ago
reger 7d0d19cb8e avoid File.deleteOnExit() on temp files
10 years ago
luc bfe51001e3 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 02e4489a23 set tmpfile.deleteOnExit by default,
10 years ago
reger 2985baaa01 Exclude repetitive protocol part in tokenized url
10 years ago
reger ca3d26a401 harmonize wordsintitle & CollectionSchema.title_words_val calculation,
10 years ago
reger 52a9040ae6 Sort out double keywords (dc_subject) early in parsed documents
10 years ago
luc 49331dc523 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 47d70732f6 improve locale translator
10 years ago
sixcooler 646afe9183 do not store subfield *_coordinate + make all num-fields being docvalues
10 years ago
sixcooler 194df613de not using 'location' as defaultfacetfield - since we removed it being
10 years ago
sixcooler d3b9349b6f simplification / speedup of GenerationMemoryStrategy
10 years ago
sixcooler 4a905ec134 fix to not let the AccessTracker-Log grow to much, but have enough data
10 years ago
reger 20e18d79f8 harmonize document title for archive parsers
10 years ago
luc f11b5e8309 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 112ae013f4 update bzip and bzip parser process,
10 years ago
reger e76a90837b update zip and tar parser process,
10 years ago