reger
aaae7c6462
adjust ConcurrentScoreMap internal value map to interface and use parameter
...
Long -> Integer (saves some bytes)
8 years ago
reger
31d2a5645e
remove obsolete query variable
...
leftover from 8fb370d9f8 (diff-1d4259005ebfddc11083387857a86175)
harmonize ranking shift parameter to 0xFF
correct addresult weight parameter to long
8 years ago
luccioman
93ea366778
Updated license header file name
8 years ago
luccioman
4c0be4d5d4
Fixed maven compilation error
...
Removed unit test yacysearchitemTest from default maven Junit tests
path, as yacysearchitem class is not in maven build classpath.
8 years ago
reger
ba77e8f8ec
upd to Jetty 9.2.19
8 years ago
luccioman
a588ed7628
Applied image headers customization to the new ViewFavicon servlet.
8 years ago
luccioman
d16e57b41e
Merge pull request #39 from luccioman/master
...
Favicon retrieval and image preview enhancements.
More details on mantis 629 (http://mantis.tokeek.de/view.php?id=629 )
8 years ago
luccioman
7717a3d43d
Fixed license headers on files created to improve favicon management.
8 years ago
luccioman
6e1959f469
Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
...
Conflicts:
htroot/yacysearchitem.java
source/net/yacy/cora/federate/solr/responsewriter/YJsonResponseWriter.java
source/net/yacy/search/schema/CollectionConfiguration.java
source/net/yacy/server/serverObjects.java
8 years ago
luccioman
7136b1ad60
HTML validation : fixed URL encoding of Pictures link.
8 years ago
reger
407563b9f0
add lock symbol to messages UI Trans menu item
8 years ago
reger
685d8e86bf
Avoid frequent data type casting (float/long) for rwi score
...
refactor to using long in URIMetadataNode too (and related call parameters)
As remote rwi score's are not used (since v1.83) skip reading float-score ,
but keep in toString() for communication with older versions.
8 years ago
luccioman
3ccd89e274
Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
8 years ago
luccioman
f1f4459f88
Added some unit tests for Blacklist.isListed()
8 years ago
luccioman
4b699c469a
Blacklist refactoring : extracted a function for easier unit testing
8 years ago
luccioman
54cfcc3f56
CrawlCheck_p.html : also display info about disallowed URLs.
8 years ago
luccioman
8b341e9818
Robots : properly handle URLs including non ASCII characters
...
This fixes GitHub issue 80 (
https://github.com/yacy/yacy_search_server/issues/80 ) reported by
Lord-Protector.
8 years ago
luccioman
75bb77f0cb
Refactoring : extracted a method to handle authorized action links.
8 years ago
luccioman
c996b04741
HTML validation : fixed URL encoding of search results action links.
8 years ago
luccioman
2b81703828
Refactored search result action links construction.
...
These are long URLS with common parts : it is valuable to build the
common parts only one time.
8 years ago
reger
e68b00678e
prevent negative score on URIMetadataNode - in the special case were no
...
solr score is supplied.
+ assert before use & test case
8 years ago
luccioman
242707f9b4
Fixed loadFromCache with strategy IFFRESH.
...
This fixes mantis 695 ( http://mantis.tokeek.de/view.php?id=695 ) :
crawl start with 'Link-List of URL' option on websites using cookies.
8 years ago
reger
c778219768
remove module for swfparser from maven parent pom
...
not longer required for the build
see a4465c97d6
8 years ago
luccioman
094aed8664
Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman
c7402a2f89
Removed invalid empty form action.
...
A form action URL must not be empty (see
https://www.w3.org/TR/html/sec-forms.html#element-attrdef-form-action ).
No action attribute has the same effect (relaunching the same GET
action) but is valid HTML.
8 years ago
luccioman
37df2e19fd
Removed xmlns attribute which no more makes sense in HTML5 pages.
8 years ago
luccioman
94924e288f
Added some accessibility improvements to the main interface.
...
Tested with NVDA screen reader.
8 years ago
luccioman
dd86f7c44e
Fixed HTML validation errors and grouped radios options in fieldsets
8 years ago
luccioman
fc0c72c84b
Switched to the short HTML Doctype
...
This pages were already no more XHTML 1.0 because made use of the HTML5
syntax and elements.
Applied current (2016) HTML standard recommended Doctype declaration
(see https://www.w3.org/TR/html/syntax.html#the-doctype ).
8 years ago
reger
7c81160f45
correct blacklist export as text url to blacklists_p.txt
...
was using servlet for network access and missing network.unit.name
fix for http://mantis.tokeek.de/view.php?id=694
+ prevent unresoved_pattern in yacy/list servlet
8 years ago
reger
b752bcfecb
adjust date in text detection to ignore some program version strings
...
like "3.1.2.0102" see http://mantis.tokeek.de/view.php?id=650
+ expand test case
8 years ago
reger
b017e97421
optimize condenser language detection a little.
...
langdetect probabilities take letter case into account, add words from
description and anchors etc. as is.
+ add it to javadoc
8 years ago
reger
ae3717d087
adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! )
...
+ remove unused sentenceword map (we use only the count)
+ upd test case for sentence count
8 years ago
luccioman
b5eb7a9217
Removed unnecessary crawlingDomFilterDepth hidden field.
...
It had incorrect "-UNRESOLVED_PATTERN-" value (see second part of
mantis 691 http://mantis.tokeek.de/view.php?id=691 )
Note : crawlingDomFilterDepth is apparently unused in current (2016)
YaCy code-base. It was also unnecessary because crawlingDomFilterCheck
hidden field is set to "off".
8 years ago
luccioman
f6d7c6ee1f
Fixed Recorded action URLs beginning displayed in /Table_API_p.html
...
Removed scheme, host and port from URL to avoid dealing with http/https,
external host and port retrieving issues.
What's more, this is consistent with how URL are displayed in
/Tables_p.html?table=api&count=100&reverse=on&search= or
Tables_p.xml?table=api&count=100&search=
This fixes mantis 691 first part
(http://mantis.tokeek.de/view.php?id=691 )
8 years ago
reger
474f0476c6
adjust Tokenizer sentence count on trailing text after last recognized sentence
...
+ upd test case for rwi multi-word-query (leaving results known to fail untested)
8 years ago
luccioman
34658ddb9b
Merge pull request #76 from luccioman/crawler
...
Crawl monitoring : refresh running crawls table
8 years ago
luccioman
0065c9b9ea
Crawl monitoring : refresh running crawls table
...
Fix mantis 690 ( http://mantis.tokeek.de/view.php?id=690 ).
Tested on :
- MS Windows 10 : Edge, Firefox 49, Chrome 53
- Debian Jessie : Firefox ESR 45
8 years ago
luccioman
e1e632ad84
Switched to the short HTML Doctype
...
This page was already no more XHTML 1.0 as it makes use of the HTML5
<progress> element.
Applied current HTML standard recommended Doctype declaration (see
https://www.w3.org/TR/html/syntax.html#the-doctype ).
8 years ago
luccioman
4d8611e5e7
Tables accessibility : added missing <thead> sections.
8 years ago
luccioman
9fb3142317
Restricted variables scope to function handleStatus() in Crawler.js
...
Missing 'var' in declaration was unnecessarily giving global scope to
these variables.
8 years ago
reger
3861ac9293
upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov
...
+ upd unknown ant script with current lib/jsch version
8 years ago
reger
681a61dafb
adjust rwi index result word position handling used for rwi ranking
...
- correct WordReferenceVars.toRowEntry posintext parameter
to set expected min posintext (the difference is on multi-word queries,
while positions are ordered by search word order).
- modified posofphrase/posinphrase join operation
- to set min posofphrase
- and keep posinphrase if not same posofphrase (was set to 0, no differentiation during ranking)
+ fix compiler msg (missing type declaration)
8 years ago
reger
14f7577231
add support for older Word versions (Word6/Word95) to docParser
8 years ago
reger
8794e06721
upd to poi-3.15.jar
8 years ago
reger
e25f2ee88b
mention date search parameter in search option help (index.html)
8 years ago
reger
1a79c64495
generalize DateDetection with holiday date rules readily available in icu
...
to make sure current dates are recognized (was fixed to 2014 - 2016)
+ adjust holiday date parser from pattern.match to pattern.find to deal with leading and trailing text
+ moved relative date recognition (morgen, tomorrow) to parseline (used by query parser only), as not working and problematic for indexing
+ add test case for parseline (used by query parser)
8 years ago
reger
6f68f08354
correct DateDetection Silvester date
...
add Thanksgiving
8 years ago
reger
32a2e3a22a
have RSSFeed.getChannel return empty message on missing channel element,
...
a) required b) prevent NPE in rss servlets
+ add test
8 years ago
reger
fedb9f8151
del double entry in master.lng
8 years ago