reger
96b8d9b09e
moving the J7Zip-modified source and Maven build to libbuild
...
from main pom.
Using source included in j7zip-modified.jar.
This combines all external lib preparation in the libbuild main pom.
9 years ago
reger
764f5100f0
fix delete of temp file after odt % ooxml parser
...
Close zipfile after parsing
9 years ago
reger
379e9b330d
use supplied url port to get robots.txt in crawlers hostqueue
9 years ago
reger
ed765de29b
adjust start/stop classpath in build script
...
(with servlet classloader no need for htroot in system classpath)
9 years ago
reger
9a7efa7814
harmonize classpath with startYaCy.bat
...
(with servlet classloader no need for htroot in system classpath)
9 years ago
reger
0dcda3809e
harmonize classpath with startYaCy.bat
9 years ago
reger
58a959403d
fix mixed logfactory in UrlProxyServlet,
...
Class doesn't use functions of declared ancestor, change to extend on httpservlet
9 years ago
reger
dc112d0e32
upd to slf4j-1.7.16
9 years ago
Michael Peter Christen
2494a820c7
0N - added recording of dump exports if given time frame is not negative
9 years ago
Michael Peter Christen
ef2cc4f690
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
reger
7b02cacb12
upd to Jetty 9.2.15.v20160210
9 years ago
Michael Peter Christen
a6bf0b1649
0N - added option to generate index export files for a specific number
...
of minutes in the past and reverted latest change. The export file dump
will now contain four data elements: f - first date of index entry write
date, l - last date of index write date, n - now-date of index dump
time, c - count of numbers inside the dump. '0N' denotes a series of
changes which will lead to the opportunity to exchange index data dumps
in a way that is needed to integrate ZeroNet index data. This will be
based on index dump sharing; that causes this commit.
9 years ago
reger
9312fbe563
making WebStructurePicture_p less vulnerable to faulty host input parameter (like host1,,host3)
...
by continue host loop on exception
inspired by http://mantis.tokeek.de/view.php?id=637
9 years ago
reger
6d56beaed8
fix assertion exception in toString of MultiProtocolURL
...
toString of AnchorURL and MultiProtocolURL are identical code
(no need to override or to protect call to parent)
as reported in https://github.com/yacy/yacy_search_server/issues/43
9 years ago
reger
b12b8fb1c2
include initial japaneese translation to language selection
9 years ago
Burkhard
6a3d27ca5b
Merge pull request #44 from ImpactCrater/master
...
Created a translation file ja.lng
9 years ago
reger
42a7bdb2af
fix SolrSelectServlet authentication to default to true
9 years ago
ImpactCrater
567c292302
Created a translation file ja.lng
...
I wrote a bit of translation to Japanese.
9 years ago
Michael Peter Christen
5b9030180c
added peer hash to export dump name.
9 years ago
Michael Peter Christen
287b918bd7
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
reger
20e3c25ae3
upd to weupnp-0.1.4.jar
9 years ago
reger
dbb28bb4f3
del unused statistic parameter (from status servlet)
9 years ago
Michael Peter Christen
b851308ee6
enhanced robustnes of image computation
9 years ago
reger
06d0e2aeb9
result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
...
- Above brought up that parser start url parameter, declared as AnchorURL uses only methodes of parent object DigestURL (changed parameter declaration accordingly).
9 years ago
reger
caf9e98f09
put metadata dc_publisher in corresponding schema field
9 years ago
reger
38e2b054d4
remove servlet classloder internal cache map (to save the resources, cache hits marginal)
...
- DefaultServlet includes already a class cache "templateMethodCache" which is emptied
on low mem status
- avoid classloader cache gets has no hits but over time holds all (used) servlet classes
9 years ago
luc
3f338777f7
Also check and index eventual icon url information from metadata.
9 years ago
luc
9f712146df
Display icons in ViewFile "links" mode.
9 years ago
luc
8682dfbd5e
Updated getpageinfo outputs to return page icons list.
9 years ago
luc
26f1ead57c
Created ViewFavicon class specialized in favicon viewing.
...
Main image processing is now in ImageViewer, used by both ViewImage and
ViewFavicon.
Fixed URIMetadataNode.getFavicon to use non-standard icons with no size
ass fallback.
9 years ago
otter
f2e5b3adb7
format2
9 years ago
otter
000ec16bf8
format
9 years ago
reger
6f0b073bf3
override detected language (statistic langdetect) only with TLD determided
...
language if langdetect probability is not high.
+ additionally truncate zh-cn / zh-tw returned by langdetect to 2 char ISO639-1 zh
used by YaCy
9 years ago
reger
b65e2b527d
include use of condenser's content text for language detection.
...
Language identification may show poor performance on documents with short or no
title but clear lang indication in text content. Using content text too
improves lang detection.
+ remove double caching of text in Identificator
9 years ago
reger
756c55e6d1
upd to Solr 5.4.1
9 years ago
otter
c3c5e7928b
Correctly handle POSTed parameter also with HTTPS activated
9 years ago
luc
07222b3e1a
Added favicon url transmission in RWI chunks.
9 years ago
luc
480772c070
Fixed json search results from commit "Improved URLLicence reliability"
9 years ago
reger
937fbb0b9f
correct isHidden() for smb from last commit
9 years ago
reger
535d4bf75f
respect hidden attribute for file and smb directory listing
...
(hidden directories are not listed, effects crawling of local file system)
9 years ago
luc
53781299d8
Extracted intranet and filtype related rules from getFaviconURL func
9 years ago
reger
cc79ad8de6
compare search page, remove diminished search target
...
(romso.de, dbpedia.neofonie.de )
9 years ago
reger
375d49d536
upd classpath in batches (remove not necessary htroot)
...
see prev commit
9 years ago
luc
3cc5619d93
Improved HTML icons indexing and rendering in search results.
...
See http://mantis.tokeek.de/view.php?id=629
9 years ago
luc
edef6cd0dc
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
c28142095a
add findClass() to servlet class loader (used in YaCyDefaltServlet)
...
In the 2 cases where servlet calls servlet the jvm classloader chain is
invoked and servlet class loaded by jvm loader (successful while requiring
htroot in system classpath). This patch uses the standard override design
for loaders to handle these cases (making in not longer crucial to have htroot
in system classpath, as this classLoader is mainly used for servlets and
looks in this case for the class in the configured path).
+ As the default classloader is parallelcapable we should register this too.
9 years ago
otter
f6e6250b83
Merge branch 'master' of https://github.com/otteresk/yacy_search_server.git
9 years ago
otter
770bb1d41f
Improved plotting
9 years ago
Andreas
e971f2af4a
Merge pull request #3 from yacy/master
...
Get my fork synced #3
9 years ago
luc
db6fc67b77
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago