Michael Peter Christen
582d059fb7
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen
a7b41bd206
use curl downloads in download script with silent mode
9 years ago
reger
46ac0867ff
fix poison mediawikiimporter output queue also after ExecutionException
in worker thread.
The importer's writer needs a poison to close the file. On exception (e.g. OOM)
add a poison marker in the outermost try/catch to ensure the output queue will terminate
in this condition too (and closes+renames the surrogate/in/xxx.prt file)
9 years ago
reger
a7591d3ed0
fix mediawikiimporter number format exception on coordinate parsing
handle incomplete metadata like "NS=43/50//N".
For other {expr ... } type entries a try/catch was added
9 years ago
Michael Peter Christen
fade8452c6
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen
7274e50d7f
Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen
5a39f9f679
stub to use a new app launcher for mac
9 years ago
reger
9da1712a31
increase http header EXPIRES for css and images in DefaultServlet
to increase browser cache hits for unchanging content
9 years ago
reger
6d54eb3d36
skip loading document on crawl start for YMark bookmarks
by adding a constructor giving the already loaded document as parameter.
9 years ago
reger
50f64ddc3b
apply default css styles (class btn) to submit buttons
9 years ago
reger
7f6ee76eb9
upd to ImageIO 3.2.1
9 years ago
reger
09d3dd13d6
limit bookmark tag cloud font-size to 2.0em
9 years ago
reger
3076c87247
fix typo in Steering.html
9 years ago
reger
80e2c82249
fix NPE on empty blog importfile parameter
9 years ago
reger
8a8e53b1a1
apply default css styles (class btn) to blacklist* submit buttons
9 years ago
reger
0fa6340936
apply more default css styles (class btn) to submit buttons
9 years ago
reger
bf098412cf
use input type=file for choosing IndexImportMediawiki_p dump file
9 years ago
reger
e84d94f8ca
fix mime table for ms office / open office documents
(causing wrong parser detect in intranet mode)
9 years ago
reger
7c6d6cd69a
change some more submit buttons to bootstrap btn css class
9 years ago
reger
4eb7fb0ee5
just remove debug leftover
9 years ago
reger
45b9bd8403
adjust MultiProtocolURL.protocol detection to handle mailto with "://" in parameters,
and feeding hyperlinks to webgraph processing.
9 years ago
reger
67f64af4b4
quick fix: go back to display search results favicon via <img> tag and ViewImage,
! until better solution is found !.
Reason: in IE-Browser no individual favicon is displayed with <object> tag
(always the default) and only few individual fav's with Firefox (randomly)
hint: to be able to use <img src=ViewImage/> return of default icon was
added back to it.
relates to http://mantis.tokeek.de/view.php?id=629
9 years ago
reger
d5fd031449
fix reading of ippattern config array in URLProxy
9 years ago
reger
b7e8358645
make use of header.getContentType where possible (mime is normalized afterwards)
otherwise use header.mime() differentiated in prev. commit.
9 years ago
reger
7a8c077838
fix HeaderFramework.mime() to strip charset parameter.
Differentiate mime() and getContentType() which gives the raw header field.
This improves parser detection if charsets are included in http content-type field.
9 years ago
reger
b4b6910d60
fix (todo): correct doc.id of remote search result if it does not match the newly
calculated doc hash.
Testing showed that in some cases the delivered url doesn't match the locally
calculated hash. In this case replace doc.id (and host_id_s) with the calculation
from url.
9 years ago
reger
15e46b2bad
exclude in/outboundlinksnofollowcount_i from default schema fields
(not used in any function)
9 years ago
reger
dec3e6ad96
fix: adjust urlstub for mailto links
(skip protocol)
9 years ago
reger
cb83e65f89
drop returning document language "en" if unknown (fix todo)
which also harmonizes handling of query.modifier for rwi and solr results
(the result must match a given language filter)
9 years ago
reger
0c5548a7ff
fix (todo) remove redundant holding of email link nameproperty in parser document
9 years ago
reger
71c416f383
show mailto links in ViewFile.html linklist
9 years ago
reger
6b7c10cef8
fix dc:date in mediawikiimporter/document.writexml to use lastmodified
9 years ago
reger
14803d58cd
let html scraper accept html5 <link rel="icon"> for favicon links
9 years ago
luc
b4cdacee76
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc
ba0a293f5c
Corrected another case of
"org.apache.lucene.store.AlreadyClosedException" occurring when
SearchEvent.cleanup() was called while committing local solr index.
9 years ago
reger
4d2b934487
prevent mailto links getting into parser result document's in/outbound link collection
by checking mailto scheme early.
- fix upper case mailto protocol assignment
- add test case for getProtocol
9 years ago
luc
befb2415f8
Corrected frames preview possibly displaying incorrectly in local
administration mode.
9 years ago
luc
8c4ab9c76b
Added an option to limit the size of remote solr documents put to
local index. See mantis #626.
9 years ago
luc
8827b86b2a
Added useful debug JVM parameters as comment.
9 years ago
luc
a2c08402af
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
reger
288acceac3
fix test htmlParserTest, charset parameter
+ upd maven templating-plugin version
9 years ago
luc
55a4d15775
Added a note on deprecated default search field and operator.
9 years ago
luc
70595d05d0
Modified MemoryControl.main() test to end properly for better display
of results.
9 years ago
sixcooler
1be67d9ab6
CachedSolrConnector was replaced by ConcurrentUpdateSolrConnector years
ago - time to let it go
Commented out unused table of cache-objects
9 years ago
reger
28b8bc290a
fix use of NETWORK_SEARCHVERIFY for rwi verification
was not used to set the searchevent parameter (done in SearchEventCache.getEvent)
- remove unused corresponding QueryParams.filterfailurls param.
9 years ago
reger
020630efd8
remove unused network scanner parameter from queryparameter
Search event is not using networkscanner
(removed filterscannerfail param always init to false)
9 years ago
Michael Peter Christen
967508a87d
fix in error handling
9 years ago
reger
7cda48a9d6
add hint to "default max results per page" limit on ConfigPortal
(limit is applied in yacysearch & max. total results by sum result-stack size)
- remove obsolete search.navigation prop (has moved to ConfigSearchPage_p)
9 years ago
Michael Peter Christen
b2fac989fd
Merge pull request #32 from luccioman/master
Fix for MediaWiki import (mantis 625)
9 years ago
luc
ad5586f8f6
Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago