reger
aaae7c6462
adjust ConcurrentScoreMap internal value map to interface and use parameter
...
Long -> Integer (saves some bytes)
8 years ago
luccioman
6e1959f469
Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
...
Conflicts:
htroot/yacysearchitem.java
source/net/yacy/cora/federate/solr/responsewriter/YJsonResponseWriter.java
source/net/yacy/search/schema/CollectionConfiguration.java
source/net/yacy/server/serverObjects.java
8 years ago
reger
685d8e86bf
Avoid frequent data type casting (float/long) for rwi score
...
refactor to using long in URIMetadataNode too (and related call parameters)
As remote rwi score's are not used (since v1.83) skip reading float-score ,
but keep in toString() for communication with older versions.
8 years ago
luccioman
3ccd89e274
Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
8 years ago
reger
32a2e3a22a
have RSSFeed.getChannel return empty message on missing channel element,
...
a) required b) prevent NPE in rss servlets
+ add test
8 years ago
reger
862f28eaa6
display number of documents/rss-items for label "docs" in load_rss_p servlet
...
(as replacement for the rarely used "docs" rss-tag for a url to the rss-specification)
8 years ago
reger
70e1eb30a5
prevent StringIndexOutOfBounds in getLocalFile()
...
+ tighten patching of DOS path w/o protocol to drive "LETTER":
8 years ago
luccioman
1bb0b135ac
Avoid duplication of various MS Windows file URLs flavors
...
Fix for mantis 692 (http://mantis.tokeek.de/view.php?id=692 )
8 years ago
luccioman
b9a8476f02
Removed unused import
8 years ago
reger
e73c1eea8c
remove unused rootpattern, leftover from commit
...
9a5ab4e2c1 (diff-d2b184283abed53ae260fc9eabdaef40)
8 years ago
reger
6f8c3ccea4
improve url hash computation for file path with mixed java & windows
...
file.separator to compute equal hashes (by normalizing path for computation)
+ expand test case for to check mixed java / windows file url notation
like e.g. file:///c:/test/file.html vs. file:///c:\test/file.html
- relates partially to http://mantis.tokeek.de/view.php?id=692
8 years ago
luccioman
f2bc1b268d
Updated URL fragment validation rules according to current standards
...
See RFC 3986 (https://tools.ietf.org/html/rfc3986 ) or URL living
standard (https://url.spec.whatwg.org/ )
8 years ago
reger
330768c8a2
fix for solr write.lock after mode change http://mantis.tokeek.de/view.php?id=686
...
The embedded core holds a lock on the index and must be closed. Earlier commit
comment states that core should be closed with solr instance instead on close
of connector.
Adjusted the InstanceMirror.close() to take care of closing the embedded
instance to release the lock.
In 2 routines of fulltext this was already explicite implemented (disconnectLocalSolr).
Now this disconnect is part of the InstanceMirror.close().
8 years ago
luccioman
de5c873e38
Removed unused JavaScript file docs.min.js
...
This file is used by Bootstrap documentation website
(http://getbootstrap.com/ ) but is not part of the Bootstrap distribution
and has not be included in a Bootstrap based application.
8 years ago
luccioman
0a9ff14d96
Fixed NullPointerException case and added Javadoc
8 years ago
Michael Peter Christen
25a3c7a6d0
catch exception and write end of object
8 years ago
Michael Peter Christen
5e165a8150
removed unused imports
8 years ago
Michael Peter Christen
c716648c78
enhanced json encoding of strings
8 years ago
Michael Peter Christen
6139bd85a8
fix for broken facet names
8 years ago
Michael Peter Christen
5060f9fee9
fix for too long snippets
8 years ago
Michael Peter Christen
8681cee3f3
fix for bad comma
8 years ago
Michael Peter Christen
db6d8fc197
fix for bad json
8 years ago
Michael Peter Christen
8f4a341735
Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen
9934f546bb
added default fl to solr query, removed large texts retrieval and
...
changed snippet to description tag if no other description is available
8 years ago
reger
39dd244693
fix ConcurrentScoreMap.set() calculation of totalCount()
...
+ test case
8 years ago
reger
3b47a07dd1
change unused servletProperties entry CONNECTION_PROP_CLIENT_REQUEST_HEADER to
...
use directly HttpServletRequest. This is used to get the http protocol version
in HTTPDProxyHandler.fulfillRequestFromWeb() for error response to client.
- adjust YaCyProxyServlet and UrlProxyServlet accordingly
- use more http_version constants in headerframework and httpdeamon
- equalize servlets (3) use of HeaderFramework.CONNECTION_PROP_HOST to HeaderFramework.HOST
8 years ago
reger
036c1dc6ef
fix CookieTest_p formatting (output of <br> as text),
...
change to dataoutput only by servlet, leave formatting to html.
+ removed link to obsolete env/grafics gif
8 years ago
luccioman
744c9a2615
Opensearch desc : handle https protocol url with default port (443)
...
This completes modifications made for mantis 669
(http://mantis.tokeek.de/view.php?id=669 )
8 years ago
reger
226f81cfcf
declare poison pill url MultiProtocolURL() as protected to make sure not
...
used from outside.
After double checking use of poison url revert path init from commit
f8632ad292
8 years ago
reger
f8632ad292
prevent string index out of bounds MultiProtocolURL.getPaths
...
as path maybe a empty string
+ init path to "" also in init for poison url (to guarantee success for
all existing uses of path w/o check for null)
8 years ago
reger
9b07bbf955
deprecate newurl(), not used and already replaced
...
instead of making it handle all supported the protocols
8 years ago
reger
774b3906a9
fix GenericFormatter.parse ("time","timeoffset")
...
change: UTC offset internally expected in minutes
8 years ago
reger
27163af0e1
improve detection of referenced links by taking http and https link protocol
...
into account
+ correct query start detection of commit f89d4eb51d
8 years ago
reger
f89d4eb51d
fix MultiProtocolURL init (assign of host) for urls with '/' in query part
...
+ add to test case
8 years ago
reger
87fcfc6d78
Adjusted hash computation and toNormalform for file:// protocol to deliver
...
same hash same file on Windows filesystem path with forward- and backslash in path.
Background see http://mantis.tokeek.de/view.php?id=671
+Test case
8 years ago
luccioman
893a40995a
Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman
6e96c7341a
Merge remote-tracking branch 'origin/master'
...
Conflicts:
htroot/Load_MediawikiWiki.java
htroot/Load_PHPBB3.java
htroot/ViewImage.java
8 years ago
luccioman
a73c9327a5
JavaScript License fixes for LibreJS compatibility
9 years ago
reger
b3c9041f79
remove with localHostNames redundant (but unused) publicIPv4HostNames and publicIPv6HostNames
...
to free unused resources
9 years ago
reger
9e94989237
upd to PDFBox 2.0.1
9 years ago
reger
24b0fa2a38
extend snapshot Html2Image.pdf2image to use PDFBox image export capability
...
if no external tool installed (and for Win)
Resulting jpg are not always perfect (if graphic included) but imho sufficient.
9 years ago
reger
3adb670f44
remove never used Domains.myHostNames set
9 years ago
reger
ec24a0c85a
add test case for optimized toTokens()
9 years ago
reger
258cd41577
reduce logging (EmbeddedSolrConnector.query)
...
mainly to reduce the frequent metadat checks like
> EmbeddedSolrConnector.query QUERY: q={!cache=false raw f=id}xXxXxX&rows=1&start=0&fl=id,load_date_dt
(p.s. direct servlet queries logged via AccessTracker.addToDump)
9 years ago
reger
6d56beaed8
fix assertion exception in toString of MultiProtocolURL
...
toString of AnchorURL and MultiProtocolURL are identical code
(no need to override or to protect call to parent)
as reported in https://github.com/yacy/yacy_search_server/issues/43
9 years ago
luc
480772c070
Fixed json search results from commit "Improved URLLicence reliability"
9 years ago
reger
937fbb0b9f
correct isHidden() for smb from last commit
9 years ago
reger
535d4bf75f
respect hidden attribute for file and smb directory listing
...
(hidden directories are not listed, effects crawling of local file system)
9 years ago
luc
f7b854465b
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
a6617ad887
expand initRemoteCrawler() to terminate worker threads if called to deactivate
...
remote crawl.
On startup we save the resources for remote crawler if disabled. Once started
threads are running idle after disable remote crawl. Now threads are terminated
to save the resources also while disabeling during runtime.
+ remove empty class Channels
9 years ago