3436 Commits (5bbb2e173057b2f9ac3ef4c1fbd53dc53f899adc)

Author SHA1 Message Date
luc 5bbb2e1730 Ensure resource is closed when reading a full file InputStream
10 years ago
luc 6291a57300 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 0d3c5b223e have psParser cleanup temp file
10 years ago
reger 7d0d19cb8e avoid File.deleteOnExit() on temp files
10 years ago
luc bfe51001e3 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 02e4489a23 set tmpfile.deleteOnExit by default,
10 years ago
reger 2985baaa01 Exclude repetitive protocol part in tokenized url
10 years ago
reger ca3d26a401 harmonize wordsintitle & CollectionSchema.title_words_val calculation,
10 years ago
reger 52a9040ae6 Sort out double keywords (dc_subject) early in parsed documents
10 years ago
luc 49331dc523 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 47d70732f6 improve locale translator
10 years ago
sixcooler 646afe9183 do not store subfield *_coordinate + make all num-fields being docvalues
10 years ago
sixcooler 194df613de not using 'location' as defaultfacetfield - since we removed it being
10 years ago
sixcooler d3b9349b6f simplification / speedup of GenerationMemoryStrategy
10 years ago
sixcooler 4a905ec134 fix to not let the AccessTracker-Log grow too much, but have enough data
10 years ago
reger 20e18d79f8 harmonize document title for archive parsers
10 years ago
luc f11b5e8309 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 112ae013f4 update bzip and bzip parser process,
10 years ago
reger e76a90837b update zip and tar parser process,
10 years ago
luc 4e673ffc9a Ensure closing of InputStream even when an exception occurs.
10 years ago
luc 10696b53f7 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 8532565c7d optimize order of parsers to try
10 years ago
reger 681889ae64 use current tar library for untar files
10 years ago
reger 5d71fc70e3 fix tarParser early exit on looping content
10 years ago
luc bcc2e7cb5b Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger 2fcf6f104c fix bzipParser recognition
10 years ago
luc 745e97a575 Merge branch 'master' of https://github.com/yacy/yacy_search_server
10 years ago
reger a60b1fb6c2 differentiate api call getLocalPort() from getConfigInt()
10 years ago
reger 11f3666660 increase use of predefined CATCHALL_QUERY string
10 years ago
reger a58ee49307 Optimize internal imagequery focus on using content_type to select images
10 years ago
luc fc3294382e Updated javadocs for warning on target encoding format potential errors.
10 years ago
luc aa70ff4ff6 Corrected images alpha channel rendering
10 years ago
reger d223cf0ae4 adjust MediaWiki importer geo coordinate calculation
10 years ago
reger 2b775d5be6 fix typo in WikiCode coordinate calculation
10 years ago
reger bbe9df2bb3 fix MediawikiImporter for bz2 dump
10 years ago
reger c6687dd560 fix a system.out to log.fine
10 years ago
reger e53c6bbd51 fix init of peer flags
10 years ago
Michael Peter Christen ac034db8bc Merge branch 'master' of https://github.com/luccioman/yacy_search_server
10 years ago
reger 826f14f37f fix unnecessary set null of peer flags, causing reread
10 years ago
luc 5902ce032e Corrected NullPointerException case when ImageIO reader is not found for
10 years ago
reger c6495a5b62 add a log entry on parsing ajax crawling scheme snapshot
10 years ago
reger 9252e36aeb implement ajax crawling scheme for ajax sites which adhere to the proposed use of hash-bangs to provide html content
10 years ago
Michael Peter Christen d1ae999ef9 replaced HashMap with LinkedHashMap to preserve the object order
10 years ago
Michael Peter Christen 7d075a1d76 added log lines
10 years ago
Michael Peter Christen 092dac086e Merge branch 'master' of https://github.com/luccioman/yacy_search_server
10 years ago
reger 7a64bebb86 init Recrawl job chunk size to max crawl loader during job start, to use some system preferences
10 years ago
luc d6522fa4a2 Integrated haraldk/TwelveMonkeys library to first add TIF image format
10 years ago
Michael Peter Christen 9244694e64 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen 151ccd50a9 fix for image size field values (must be multi-valued)
10 years ago
reger c9937973e3 unescape MultiProtocolURL getAttributes() return values.
10 years ago