orbiter
3cb6c7861f
fixed shutdown authentication problem
11 years ago
Michael Peter Christen
7005ecdabd
cleanup
11 years ago
Michael Peter Christen
2939b47986
removed non-working realm setting in http client (auth for localhost was
...
added in previous commit)
11 years ago
Michael Peter Christen
9bd71fdbb4
made the access tracker class static because it shall be used by the
...
jetty auth module
11 years ago
Michael Peter Christen
7d6fc79eb8
refactoring (usage of constant names for attributes of authentication
...
check)
11 years ago
reger
cabe0943cd
fix opensearch resultcount in yacysearch.rss
...
see merge request https://gitorious.org/yacy/rc1/merge_requests/24
use result count in searchtrailer.xml, which is more accurate on p2p search (timing)
11 years ago
reger
eaf596a257
adding proxy status to (private) status box
...
(show also transparent and url proxy status)
show search result via url proxy only if status=on
11 years ago
reger
e3d8459906
extend ssl enabled msg on status page
...
- post the portnr
11 years ago
reger
58ecf5e4dd
add to blacklist button in CrawlResults
...
http://bugs.yacy.net/view.php?id=220
introduced Blacklist.add with sourcefile only parameter
11 years ago
reger
17b454f957
fix external link (open in new tab)
11 years ago
reger
dd8ea0cdd6
fix "add to blacklist" button style in IndexControlRWIs_p
...
- added default filename filter to select field (as only addition to *.black list is permanent)
- modified Blacklist_p header/legend to show all active blacklists
(to support understanding that all configured lists are active)
- removed obsolete code in Blacklist_p servlet
11 years ago
orbiter
2861183359
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
4035e20f0b
unescaping the path
11 years ago
orbiter
7e21d1ff70
"inaccessible" better describes the state of a server which cannot be
...
reached (while 30c3: too many users)
11 years ago
reger
7f9b9315fe
Merge origin/master
11 years ago
reger
8eaabb9600
remove dependency on old serverCore.java
...
- remaining getPortNr not needed
(as current release allows only to set plain integer as port,
see ConfigBasic)
11 years ago
orbiter
2018e55f8b
switched back on index deletion (was accidentally off because new jetty
...
framework never delivers null for post arguments; there may be more
problems of that kind)
11 years ago
orbiter
d4942ad5e0
startRecord fix; this is not according to SRU definition because this
...
states that the first record has number 0; but +1 is not consistent with
other places where the number is used.
11 years ago
reger
3d913558ab
display configured adminUserName in ConfigAccounts_p
...
- fix reading of default username in loginservice
11 years ago
reger
fbdd89e198
Merge origin/master
11 years ago
reger
65a2f3d5e7
tweak Jetty credentials to work with YaCy UserDB
...
- user entry in UserDB with admin right can login to access protected pages
- ditto for admin user; chosen username is stored in conf (adminAccountUserName=)
11 years ago
Michael Peter Christen
ee17bd0b69
added option to attach remote solr servers in read-only mode
11 years ago
Michael Peter Christen
25f9c35033
add patch which shall prevent that naive search mistakes like usage of
...
regular expressions cause no results. Usage of '*' followed by a dot or
any expression will now cause that this expression is used as a filetype
search.
11 years ago
reger
e05320b776
upd: to open more external links in new browser-tab
11 years ago
reger
cbb5dc01e4
remove obsolete htroot/solr htroot/gsa YaCy-servlets
...
- now handled by standard servlets
11 years ago
reger
71cac1a278
added SSL/HTTPS connector to support SSL/https connection on port 8443
...
!!! attention !!! to make sure YaCy can start, https will be disabled if port 8443 is used
- added ping test for above to migration
- as of now port for https is hardcoded to default 8443
- if not urgently required I'd leave it this way (it's standard) to use different ports for http and https
- post https port on ConfigBasic.html (if active)
11 years ago
reger
f681ce15ae
remove obsolete HTTPServer input field
11 years ago
Michael Peter Christen
20b48f894f
refactoring: moving all servlets to the same package (the solr servlet
...
is currently actually a filter which should be changed somehow)
11 years ago
Michael Peter Christen
84167adb49
removed unused anomichttpd code after migration to jetty
11 years ago
Michael Peter Christen
b461a27abb
fixed the SolrServlet
11 years ago
Michael Peter Christen
7603e879dc
Merge branch 'master' into HEAD
...
Conflicts:
.classpath
source/net/yacy/cora/federate/solr/SolrServlet.java
11 years ago
Michael Peter Christen
25250405f1
solr servlet preparation for join with jetty branch
11 years ago
reger
c84c313fe1
Merge origin/master into jetty
11 years ago
Michael Peter Christen
74466d731a
use pre-compiled patterns in ymark
11 years ago
Michael Peter Christen
09412ea3a4
counting search requests in solr interface
11 years ago
Michael Peter Christen
67e7dc0cc6
added more properties to seedlist servlet
11 years ago
Michael Peter Christen
79771c60c0
IPv6 fixes
11 years ago
reger
92d9c56f9f
Merge origin/master into jetty
11 years ago
Michael Peter Christen
da380343c2
perform greedy learning heuristic only if load < 1.0
11 years ago
Michael Peter Christen
81926c055d
fixed bug with image search in yacyinteractive
11 years ago
Michael Peter Christen
edda0699e4
changed default timeout for port scanner
11 years ago
Michael Peter Christen
f1b5db2c45
- performance graph does not show peer ping in memory monitor any more
...
- after a forced GC, the PerformanceMemory view switches to automatic
update by default
11 years ago
Michael Peter Christen
0db8e34625
enhanced webgraph processing
11 years ago
Michael Peter Christen
9d8b32c63a
fixed a division by zero
11 years ago
Michael Peter Christen
957f6297fb
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger
effea4bca0
Merge origin/master into jetty
...
Conflicts:
source/net/yacy/cora/federate/solr/SolrServlet.java
11 years ago
reger
b49e90d2e9
remove reference to solrServlet from YaCy servlet select
...
- reference is not used
- solrServlet is used in Jetty branch and adjustments there conflict with unused solrServlet here.
11 years ago
Michael Peter Christen
38e1e3a707
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
sixcooler
2c2ebb0d92
tried some hardening in order not to leave any Solr-Searchers open
11 years ago
Michael Peter Christen
cca79d12ef
setting of some default values to make a client development start easy
...
using the description at
http://www.yacy-websuche.de/wiki/index.php/Dev:APIhello
11 years ago
Michael Peter Christen
3d4b5e66ce
disallow remote robots to crawl the HostBrowser servlet
11 years ago
Michael Peter Christen
234ca720f5
only admins should be able to force a commit
11 years ago
Michael Peter Christen
2c39b65409
fixes for searches containing stopwords. The fix was done using a
...
reconstruction of the search word set access method to prevent words
from being deleted from the sets from outside the QueryGoal class.
11 years ago
orbiter
61409788eb
less word hash computations (removing some overhead because of MD5
...
calcs) using the clear word in a normalized form.
11 years ago
reger
5c4a3d1c01
Merge origin/master into jetty
11 years ago
Michael Peter Christen
caa20d63d9
fixed seedlist (hash was missing)
11 years ago
Michael Peter Christen
ccf2f4e43b
refactoring of seed attributes (introduced more constants)
11 years ago
Michael Peter Christen
c927b428d3
fixed json
11 years ago
Michael Peter Christen
64048ff217
fix for XSS
11 years ago
orbiter
b7f1e5af51
added new servlet which generates the same file as the principal peers
...
upload to a bootstrap position
you can call it either with
http://localhost:8090/yacy/seedlist.html
or to generate json (or jsonp) with
http://localhost:8090/yacy/seedlist.json
http://localhost:8090/yacy/seedlist.json?callback=seedlist
11 years ago
orbiter
3e552550d1
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
c2d720cdaf
purge a lucene cache - possible memory leak fix
11 years ago
reger
f111f30ace
Merge origin/master into jetty
11 years ago
Michael Peter Christen
f4172cbb3d
fix for another XSS bug
11 years ago
orbiter
ff86cb683f
fixed some XSS bugs reported by Marius from http://ctf365.com/
11 years ago
orbiter
19a051bec8
more monitoring for postprocessing and enhanced layout in Crawler
...
monitor page
11 years ago
Michael Peter Christen
fceac8cffd
more monitoring for postprocessing
11 years ago
Michael Peter Christen
9d5895f643
enhanced and fixed postprocessing
11 years ago
Michael Peter Christen
087df05e24
added option to Config_Network_p.html to enable remote search while
...
DHT-Receive is switched off.
11 years ago
Michael Peter Christen
1a4a69c226
set more loggers to 'final static'
11 years ago
Michael Peter Christen
69b8d61c47
fix for search requests in GSA interface which contain 'funny'
...
characters (like ':' etc.)
11 years ago
orbiter
4234b0ed6c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
74c86a72a0
better default value for crawler user agent
11 years ago
reger
1437c45383
merge rc1/master
11 years ago
Michael Peter Christen
87a956e881
calculating and showing the number of files and the average size of a
...
file in the HTCACHE in ConfigHTCache_p.html
11 years ago
Michael Peter Christen
acc1f8a749
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
81bb50118e
found and fixed a huge memory leak in solr caching (inside Solr). The
...
not-flushed Solr cache is now handled in this way:
- it is smaller by default
- a Solr-internal process is started to flush the cache periodically
(this does NOT clean the cache, just removes old objects)
- a Solr-external process (the standard YaCy cleanup-process) now has
direct access to the solr internal cache and flushes it completely.
The time frame for such a flush is defined by the cleanup-process
frequency, by default 10 minutes.
11 years ago
sixcooler
987f410011
URL-export: add query and fix for class-cast-exception
11 years ago
Michael Peter Christen
ffe8276063
replaced referrer link masking to 'pure' links to the referring page
...
(that was more useful during testing)
11 years ago
reger
b38de92a16
Merge origin/master into jetty
11 years ago
Michael Peter Christen
434e13b46d
in host browser also show the properties of failed documents including
...
referrer urls (this is a VERY USEFUL SEO and Web Admin feature!!)
11 years ago
orbiter
1ac504ae51
use html encoding for urls in metadata
11 years ago
reger
f017066197
Merge origin/master into jetty
11 years ago
Michael Peter Christen
25951cee14
- fixed opensearchdescription, this delivered an url with missing
...
'global' option
- added display=2 to compare_yacy to remove the superfluous border
11 years ago
Michael Peter Christen
f1bfe64361
integrated startpage to compare_yacy
11 years ago
Michael Peter Christen
2f57327f20
added boolean load property to CacheResource_p servlet which causes that
...
the servlet loads the page from the web.
11 years ago
Michael Peter Christen
9bb7eab389
hacks to prevent storage of data longer than necessary during search and
...
some speed enhancements. This should reduce the memory usage during
heavy-load search a bit.
11 years ago
Michael Peter Christen
5afa6e3aee
Automatically flush the log cache if a short memory status is reached.
...
For the default of 200 lines this can flush about 10MB.
11 years ago
Michael Peter Christen
030d0776ff
Enhanced crawl start for very, very large crawl lists (i.e. > 5000)
...
which had a problem because of badly used concurrency.
This fix also caused a redesign of the whole host deletion process.
This should fix bug http://bugs.yacy.net/view.php?id=250
11 years ago
Michael Peter Christen
4948c39e48
added concurrency for mass crawl check
11 years ago
Michael Peter Christen
1b4fa2947d
- fixed a problem which occurred when a document was not recognized with
...
the right content domain (i.e. identifying that it is an image, text
etc.) because it used the file extension and not an existing mime type
assignment.
- fixed the new setting that images shall be loaded for a better image
search.
- both fixes together now make it possible to crawl
commons.wikimedia.org which makes use of 'funny' document names (i.e.
ending with .jpg while the document is html)
11 years ago
Michael Peter Christen
16e3b357b3
replaced old tag cloud and adapted design a bit
11 years ago
Michael Peter Christen
dc38d35986
added matching in url field in Table_API_p search
11 years ago
Michael Peter Christen
691d7e70fa
added hint to development/commit rss feed
11 years ago
Michael Peter Christen
b81859c751
Show an RSS icon in the right top corner of search results. This replaces
...
the 'API' icon which was the link for the opensearch result which is an
extension of RSS. Since it is more appropriate to visualize an RSS link
with an RSS icon, this API icon was changed here.
11 years ago
Michael Peter Christen
1a09771be8
fixed sitemap crawl start
11 years ago
orbiter
b743e6d79f
- prevent crawl filters from having empty (never-match) content
...
- rewrite the description of the options "Restrict to start domain(s)"
and "Restrict to sub-path(s)" to an explanation, that the restriction
applies to all links in the link list of the option "From Link-List of
URL" if this option is selected
- allow "Restrict to sub-path(s)" if the "From Link-List of URL" is
selected. This is supported in the crawl start.
11 years ago
orbiter
f597fdb602
make it easier to filter properties (case insensitive)
11 years ago
reger
f46c723398
allow to choose used http server, YaCy-Anomic or Jetty
...
- defaults to Jetty (in this branch)
- add server version info & config option -> Admin Console -> Advanced Settings -> Http Networking
11 years ago
reger
1adb4b8741
merge rc1/master
11 years ago