reger
7f63fc50f3
prepare a IndexSegment test case for RWI index testing
...
+ prevent NPE in Segment.clear() on missing embedded solr instance.
9 years ago
reger
272cdd496a
reactivate sentence counter in WordTokenizer for phrasepos ranking,
...
by counting punktuation (delivered as 1 char word) again.
9 years ago
Michael Peter Christen
5e165a8150
removed unused imports
9 years ago
reger
e310ec5f70
fix posInText ranking calculation to score 0 on no position info
...
+ fix Word posInText calc in Tokenizer to start with 1
+ test case
9 years ago
reger
39dd244693
fix ConcurrentScoreMap.set() calculation of totalCount()
...
+ test case
9 years ago
reger
ebde21079a
refactor xlsParser to include Excel file attribute (like author) in parser result doc.
...
Similar to ppt and doc parser, completing a TODO in xlsParser.
9 years ago
reger
5e335b32da
fix Blacklist.contains() matching path pattern to string
...
similar to 5e9e871192
+ add proof testcase
9 years ago
reger
f89d4eb51d
fix MultiProtocolURL init (assign of host) for urls with '/' in query part
...
+ add to test case
9 years ago
reger
87fcfc6d78
Adjusted hash computation and toNormalform for file:// protocol to deliver
...
same hash same file on Windows filesystem path with forward- and backslash in path.
Background see http://mantis.tokeek.de/view.php?id=671
+Test case
9 years ago
reger
7b226afc33
fix HostQueueTest - changed open parameter
9 years ago
reger
fcc29c36f0
test case for HostBalancer issue in intranet mode
...
with file:// protocol, 2 hostqueues accessing same cache file concurrently
http://mantis.tokeek.de/view.php?id=668
Reason seems to be diff. hosthash key of hostqueues on reopen.
Internal queue key and external representation (directoryname currently hostname.port) must be adjusted to fix it (not done yet).
9 years ago
reger
a476d06aec
wiki header code test string add "closing" tag
9 years ago
reger
d4da4805a8
internal wiki code, require header line to start with markup
...
(to allow something like "one=two" as text)
+ incl. test case
9 years ago
reger
223071337b
Translator to take caution of word boundaries to identify text portion to
...
be translated. To avoid key="TEST" sourcetext="this is a myTESTcase for it"
translation of partial terms/words.
Add check of word boundary before and after sourcetext (incl. take care
of current praxis for key to be delimetered by > <
+ add test case
9 years ago
reger
a6ba1faa80
introduce a translation edit servlet Translator_p.html YaCy's UI text translation
...
This is the 1st rudimentary approach to support the translatio utilities.
It allows currently to edit untranslated text and save it in a local translation file
in the DATA/LOCALE directory.
+ refactor Translator (less static's) to leverage on class overrides and support garbage collection for this 1 time routine
+ adjust TranslatorXliff to check for local translations in DATA/LOCALE,
this includes storing manually downloaded translation files in DATA as well
(to keep default untouched)
+ on 1st call of Translator_p a master tanslation file is generated, checking
the supported languages for missing translation text (later this masterfile is planned to part of the distribution, to harmonize translation key text between the languages)
Outlook: the local modifications (possibly as translation fragments instead of complete file) to be shared with maintainer using xlif features.
9 years ago
reger
b74cddc49c
upd to Jetty v9.2.16.v20160414
...
- exclude unused mime4j
- remove unused yacy-cora build
9 years ago
reger
24b0fa2a38
extend snapshot Html2Image.pdf2image to use PDFBox image export capability
...
if no external tool installed (and for Win)
Resulting jpg are not always perfect (if graphic included) but imho sufficient.
9 years ago
reger
902e79e261
Introduce a TranslatorXliff wich can read/write xliff from/to internal translation map.
...
This eases up suggested initatives from http://mantis.tokeek.de/view.php?id=649
Allows longer term also to store translation maps for the htroot files
in standardized/reuseable xliff format ( http://docs.oasis-open.org/xliff/xliff-core/xliff-core.html ).
+ added test case creating and comparing xliff file with internal custom prop file.
(currently the introduced class is not used in core code)
9 years ago
reger
ec24a0c85a
add test case for optimized toTokens()
9 years ago
reger
84c970eaec
move test classes to test/java (subdirectory as in Maven standard subdir layout)
...
because ViewImage*Test.java breaks test run
9 years ago
luc
571bc55937
Refactoring : use StandardCharsets constants instead of hard-coded
...
charset names.
9 years ago
reger
1af0e9ef74
remove workaround for Solr bug regarding multivalued date fields
...
fixed in 5.4.0
http://issues.apache.org/jira/browse/SOLR-8050
9 years ago
reger
4d2b934487
prevent mailto links getting into parser result document's in/outbound link collection
...
by checking mailto scheme early.
- fix upper case mailto protocol assignment
- add test case for getProtocol
9 years ago
reger
288acceac3
fix test htmlParserTest, charset parameter
...
+ upd maven templating-plugin version
9 years ago
luc
f01d49c37a
Process large or local file images dealing directly with content
...
InputStream.
9 years ago
luc
0de6988604
Added links to more image test suites.
9 years ago
luc
745e97a575
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc
2895ab552a
Made ViewImagePerfTest extend ViewImageTest to ease automated image
...
render tests
9 years ago
luc
4a03cf06e1
Corrected encoding extension arg parsing
9 years ago
reger
d223cf0ae4
adjust MediaWiki importer geo coordinate calculation
...
- allow lat/long 0.xxx
- south / west assignment
include test class
9 years ago
luc
8da20718aa
Created a class to test ViewImage rendering against multiple image
...
files.
9 years ago
luc
ec04d27473
Corrected APNG test suite link name.
9 years ago
luc
cbb84ba073
Detailed javadoc.
9 years ago
luc
70111876d2
Filled ViewImageTest.html with all remaining IANA image file formats.
...
Added some links to test suites and specifications.
9 years ago
luc
e093fb228d
Created a generic ViewImage performance render test.
9 years ago
luc
3ad564e2e4
Created a ViewImage rendering performance measurement test.
9 years ago
luc
b3f044072e
Updated table headers and SVG file url for case sensitive OS.
9 years ago
luc
f5746b5490
Added ico and bmp sample pictures
9 years ago
luc
baede48161
Added JPEG 2000 and FITS samples
9 years ago
luc
7c9d80c5d0
Added image formats and informations for each format.
9 years ago
luc
0ae9297ca5
Created a html test page to check ViewImage rendering with different
...
file formats.
9 years ago
reger
bad34804fe
optimize parseInt for <img> tag attribute parsing
...
Performance better as using Numberformat.parse or parseInt(substring())
9 years ago
reger
d2cc11ea8f
fix html parser taking <style> content as text.
...
Noticed some result description contain css content from style tag.
Added <style> to tag list to scrape it's content not as text
+ test case included
10 years ago
reger
e594130aec
add test case for partial update - to discover effect on YaCy for update of documents with multivalued date fields (like dates_in_content_dts)
...
current result: loss of fields/information in index document, see EmbeddedSolrConnectorTest.testUdate_withMultivaluedDateField()
10 years ago
reger
d5da9e5a38
fix test methode (add throw for URIMetadataNode)
10 years ago
reger
4cf875336c
complete TODO: getFileExtension handle dot in query part
...
+ testcase
10 years ago
reger
c37dda8849
fix NPE on MultiProtocolURL on url with parameter value and '='
...
in getAttribute
- added test case for it
10 years ago
reger
71bf95af8a
upd parser calls in test cases
10 years ago
reger
f63fff9008
fix snippet containig number with comma as desmo point http://mantis.tokeek.de/view.php?id=344
...
to keep it as one word (by altering the split regex)
- added sniipet test case with number
- regex for word split to match multiple splitcars
10 years ago
reger
2ef8ffdb60
apply UTF-8 encoding
...
copied from escape()
10 years ago