reger
e163ea88f6
fix vsdParser (Visio) parser return statement
...
(final block un-necessary throw)
9 years ago
reger
b2c8bc0ae6
remove md5_s from default index fields
...
it is not assigned a value / not used
Due to above also excluded from transfer protocol.
9 years ago
luc
e40ae0943b
- No max dimensions specified : render raw image data when source and
...
target image format are the same.
- Corrected scaling condition.
9 years ago
luc
4c36b7bd14
Merge branch 'master' of https://github.com/yacy/yacy_search_server
...
Conflicts:
.classpath
9 years ago
reger
90686a75a2
fix flux factor (additional crawl delay by access count) calculation
9 years ago
reger
d79fa7fbeb
upd to Jetty v9.2.14.v20151106
9 years ago
luc
4af27289e5
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
297fdb60d3
throw exception if crawler hostqueue can't create hostpath directory.
...
In rare cases hostname may not be a valid filesystem directory name,
which can't be created (e.g. containing '*' char). To prevent crawl queue
looping on this invalid entry by throwing a malformedurlexception.
9 years ago
luc
755efac17d
Use same max file size when loading all resource bytes or opening stream
...
content
9 years ago
luc
5eafce5577
Rendering performance improvement : use EncodedImage constructor with
...
BufferedImage parameter to avoid re-rerendering BufferedImage.
9 years ago
luc
bc6c79fc12
Corrected scaling function for non RGB images.
9 years ago
luc
042b0e9658
Corrected IcedTea version. See http://mantis.tokeek.de/view.php?id=615
9 years ago
luc
1565559df8
Refactoring : extracted write InputStream method.
9 years ago
luc
f0478bb14d
BMP and ICO image formats support : integrated /haraldk/TwelveMonkeys
...
imageio-bmp-3.2 library.
- better BMP format flavours support
- handle PNG encoded icons
- handle transparency
Added some javadoc url references to .classpath
9 years ago
luc
b6ba941d33
Configuration projet eclipse : ajout nature et validation javascript
9 years ago
luc
7f27683831
Correction erreur de compilation.
9 years ago
luc
07437986e7
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
97cc03ef6a
start using a template for urlproxy header
...
It is included as iframe /proxmsg/urlproxyheader.html
to allow full servlet functionallity and flexibility to display some
index/meta data in future.
9 years ago
reger
d08e421809
fix link to logo (yacysearch.xsl)
9 years ago
luc
f01d49c37a
Process large or local file images dealing directly with content
...
InputStream.
9 years ago
luc
3c4c77099d
If available, check content length before downloading. Check also
...
content length is not over Integer.MAX_VALUE.
9 years ago
luc
5bbb2e1730
Ensure resource is closed when reading a full file InputStream
9 years ago
luc
6291a57300
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
0d3c5b223e
have psParser cleanup temp file
9 years ago
reger
7d0d19cb8e
avoid File.deleteOnExit() on temp files
...
JVM registers each file in a list regardless of already deleted and never
cleans up the list during runtime.
This accumulates to a considerable amount of mem during large crawls and/or
long uptime.
To tackle this, all temp files are now created in a subdir of java.io.tmpdir
and the jvm tmpdir property is set to this subdir, which is deleted by
code on shutdown.
Additionally let pdfParser use this tmp subdir too.
9 years ago
luc
bfe51001e3
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
02e4489a23
set tmpfile.deleteOnExit by default,
...
to make sure files are removed on shutdown.
9 years ago
reger
2985baaa01
Exclude repetitive protocol part in tokenized url
...
used as description if none is avail. from parser.
9 years ago
reger
ca3d26a401
harmonize wordsintitle & CollectionSchema.title_words_val calculation,
...
remove obsolete partial init of wordreference from urimetadata
9 years ago
reger
7bf03856d1
add link to quick select blacklist
...
from title list
9 years ago
reger
440ce6d198
add German translation to re-crawl job
9 years ago
reger
5362a80f1c
upd to httpcore 4.4.4
9 years ago
reger
e90593450c
upd to TwelveMonkeys ImageIO 3.2
9 years ago
reger
b4dbff6a6a
fix yacysearch.json "totalResults"
...
element "totalResults" is included twice (at begin & end),
only the element after performing the search holds number > 0
see http://mantis.tokeek.de/view.php?id=608
9 years ago
reger
52a9040ae6
Sort out double keywords (dc_subject) early in parsed documents
...
- by direct using Set vs. List
- remove not neede String[] getter
9 years ago
luc
49331dc523
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc
0de6988604
Added links to more image test suites.
9 years ago
reger
47d70732f6
improve locale translator
...
- skip empty line
- robustness file section detection (space independant)
9 years ago
sixcooler
646afe9183
do not store subfield *_coordinate + make all num-fields being docvalues
9 years ago
sixcooler
194df613de
not using 'location' as defaultfacetfield - since we removed it being
...
default.
9 years ago
sixcooler
d3b9349b6f
simplification / speedup of GenerationMemoryStrategy
9 years ago
sixcooler
f5a9948860
do not store subfield *_coordinate
9 years ago
sixcooler
fca353e5eb
set startuptype of most solr handlers to lazy
9 years ago
sixcooler
4a905ec134
fix to not let the AccessTracker-Log grow to much, but have enough data
...
to monitor.
(+gitignore-correction)
9 years ago
sixcooler
209f502f09
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
20e18d79f8
harmonize document title for archive parsers
9 years ago
sixcooler
d481653202
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
Linker Lin
658d9e74d2
Create .travis.yml
9 years ago
luc
f11b5e8309
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
112ae013f4
update bzip and bzip parser process,
...
to return one document for the file with combined parser results of the
containing file and registers it with supplied url and mime of the archive.
9 years ago