Michael Peter Christen
8068e68474
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
bd886054cb
new structure and enhancements for link graph computation:
- added order option to solr queries to be able to retrieve document
lists in specific order, here: link length
- added HyperlinkEdge class which manages the link structure
- integrated the HyperlinkEdge class into clickdepth computation
- extended the linkstructure.json servlet to show also the clickdepth
and other statistic information
11 years ago
reger
f326a67561
fix: typo in default charset in metadata2solr
update pom and NB build to Solr 4.7.1 libs
11 years ago
Michael Peter Christen
df138084c0
do solr optimization independently from memory and load constraints:
- not doing an optimization will likely cause a too many files exception
- without optimization performance will be even worse which would
prevent optimization in the future as well (prevent a deadlock
situation)
11 years ago
Michael Peter Christen
ebd44a7080
replaced solr 4.6.1 with solr 4.7.1 and added index migration to
lucene_47
11 years ago
Michael Peter Christen
734778c0c8
fixed a time-out problem in the default servlet which is also a logging
problem because the error log showed the wrong reason (file not found)
instead of the actual reason (time-out).
11 years ago
Michael Peter Christen
466d90ad42
fixed a problem with resource observer; probably coming from uncaught
exceptions within the apache library which appear only in concurrent
environments.
11 years ago
Michael Peter Christen
e8ddd415a8
enhanced the new link structure graph
11 years ago
Michael Peter Christen
926d28dd3f
fixed a bug which prevented crawl starts after a network switch
11 years ago
Michael Peter Christen
3ce8eff21b
another fix for inbound/outbound detection
11 years ago
Michael Peter Christen
d4b5c457e4
NPE fix
11 years ago
Michael Peter Christen
36a66b0704
fix for parsing of numeric value in case that boolean values are given
11 years ago
orbiter
41730c8048
better logging in template engine: shows filename of servlets where
errors in templates occur
11 years ago
orbiter
3c1274057d
fixed thread dump in case of wrong seeds
11 years ago
orbiter
18f9c40302
moved Edge class out of linkstructure servlet as this does not work on
non-eclipse driven environments (all non-dev cases)
11 years ago
orbiter
de95e5e524
reduced search activity corona strength in network image
11 years ago
reger
da413af664
move baseurl after parsing orig source in urlproxyservlet
to calculate absolute href links for rewrite from unmodified source.
11 years ago
reger
af6ad20728
fix: remove obsolete ref to yacy.home
(use Switchboard instead)
11 years ago
Michael Peter Christen
74ab094587
fix for solr query size; too many documents had been retrieved in case
that fewer than _pagesize_ had been requested.
11 years ago
Michael Peter Christen
c64c10ef00
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
48fbfa60c1
bugfix to inbound/outbound identification
11 years ago
reger
227c42bc96
eliminate obsolete URIMetaDataRow class
by joining it with/into URIMetaDataNode.
11 years ago
Michael Peter Christen
cca851a417
introduced new solr field crawldepth_i which records the crawl depth of
a document. This is the upper limit for the clickdepth_i value which may
be shorter in case that the crawler did not take the shortest path to
the document.
11 years ago
orbiter
b1ba764d81
fix for first start options and added German translation for popup texts
11 years ago
orbiter
429a874222
- added COLS field in GSA response (non-gsa standard by customer
request)
- updated document link in GSA response writer
11 years ago
Michael Peter Christen
1b9ec9a1c5
- added popover to p2p/stealth mode button to explain the peer mode and
privacy issues.
- added popover to first-time use case to explain that specific servlets
are only visible after customization and/or crawl starts
11 years ago
Michael Peter Christen
62a36fa584
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger
c9f92abddc
fix: application link count
(URIMetadataNode)
11 years ago
Michael Peter Christen
a267c46e1a
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
5b83887da8
npe fix
11 years ago
Michael Peter Christen
63c9fcf3e0
free configuration of postprocessing clickdepth maximum depth and time
11 years ago
Michael Peter Christen
39b641d6cd
added tutorial mode - some menu items will only appear if you 'qualify'
for them. Thus, the first-time user will only see four menu items. The
other items will unfold as the user interacts.
11 years ago
sixcooler
f06775850f
fix receiving DHT / parse multipart
+ another close to fix possible resource leak warning
11 years ago
reger
49e76a1c55
make use of detected charset in htmlParser if none is given.
11 years ago
reger
e11504309f
adding a hint about the JavaScript browser shortcut on the Url-Proxy page (AugmentedBrowsing_p.html)
11 years ago
reger
b12200cafe
alternative UrlProxyServlet (for /proxy.html) using different url rewrite rules
- use JSoup parser for selective rewrite of html body <a href= links only,
instead of regex which rewrites also header href/src links
- this improves display of pages which use header <base> tag
- tags with src attribute are taken from original location (like css) improving display and are not routed through the indexer
Disadvantage: scripting links will drop out of proxy
Setting of the servlet through web.xml exclusively (in case one would like to quickly switch back to the YaCyProxyServlet,
leaving the existing code of YaCyProxyServlet untouched and available)
11 years ago
reger
2953ebe701
fix: port in local target address
& button style
11 years ago
Michael Peter Christen
fda591695c
fixed visibility of custom icon
11 years ago
Michael Peter Christen
a9b9950d7f
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
b488f33975
added close to fix possible resource leak warning
11 years ago
Michael Peter Christen
56710ecb26
prevent opening of new files as that could be a cause for the latest
too-many-open-files exception. The old file is just truncated if the
table is cleaned.
11 years ago
Michael Peter Christen
8b44fcf0f4
added missing @Override annotation
11 years ago
reger
d7055904a6
fix: proxyservlet path header setting
11 years ago
Michael Peter Christen
e515dd460d
added linkscount_i and linksnofollowcount_i to the default solr schema
11 years ago
Michael Peter Christen
1a764135be
one more Thread Dump fix for new bootstrap css style
11 years ago
Michael Peter Christen
bb21d825f9
fix for thread dump line spacing
11 years ago
Michael Peter Christen
cbdfef7ce1
changed protocol facet to show also all other counts if one facet is
selected
11 years ago
reger
b9056ef2db
remove unused private header entries (HeaderFramework)
X_YACY_ORIGINAL_REQUEST_LINE
X_YACY_KEEP_ALIVE_REQUEST_COUNT
CONNECTION_PROP_REQUESTLINE
11 years ago
sixcooler
6d16fa993d
make transparent proxy handle https-connections:
the implemented handle for connect did not work for me - so let's try the
connectHandler
11 years ago
Michael Peter Christen
61ad194065
fix for source and target clickdepth in webgraph index
11 years ago