Michael Peter Christen
74466d731a
use pre-compiled patterns in ymark
11 years ago
Michael Peter Christen
09412ea3a4
counting search requests in solr interface
11 years ago
Michael Peter Christen
67e7dc0cc6
added more properties to seedlist servlet
11 years ago
Michael Peter Christen
79771c60c0
IPv6 fixes
11 years ago
Michael Peter Christen
da380343c2
perform greedy learning heuristic only if load < 1.0
11 years ago
Michael Peter Christen
81926c055d
fixed bug with image search in yacyinteractive
11 years ago
Michael Peter Christen
edda0699e4
changed default timeout for port scanner
11 years ago
Michael Peter Christen
f1b5db2c45
- performance graph does not show peer ping in memory monitor any more
- after a forced GC, the PerformanceMemory view switches to automatic
update by default
11 years ago
Michael Peter Christen
0db8e34625
enhanced webgraph processing
11 years ago
Michael Peter Christen
9d8b32c63a
fixed a division by zero
11 years ago
Michael Peter Christen
957f6297fb
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger
b49e90d2e9
remove reference to solrServlet from YaCy servlet select
- reference is not used
- solrServlet is used in Jetty branch and adjustments there conflict with unused solrServlet here.
11 years ago
Michael Peter Christen
38e1e3a707
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
sixcooler
2c2ebb0d92
tried some hardening to avoid leaving any Solr-Searchers open
11 years ago
Michael Peter Christen
cca79d12ef
set some default values to make starting client development easy
using the description at
http://www.yacy-websuche.de/wiki/index.php/Dev:APIhello
11 years ago
Michael Peter Christen
3d4b5e66ce
disallow remote robots to crawl the HostBrowser servlet
11 years ago
Michael Peter Christen
234ca720f5
only admins should be able to force a commit
11 years ago
Michael Peter Christen
2c39b65409
fixes for searches containing stopwords. The fix was done using a
reconstruction of the search word set access method to prevent words
from being deleted from the sets from outside the QueryGoal class.
11 years ago
orbiter
61409788eb
fewer word hash computations (removing some overhead because of MD5
calcs) using the clear word in a normalized form.
11 years ago
Michael Peter Christen
caa20d63d9
fixed seedlist (hash was missing)
11 years ago
Michael Peter Christen
ccf2f4e43b
refactoring of seed attributes (introduced more constants)
11 years ago
Michael Peter Christen
c927b428d3
fixed json
11 years ago
Michael Peter Christen
64048ff217
fix for XSS
11 years ago
orbiter
b7f1e5af51
added new servlet which generates the same file as the principal peers
upload to a bootstrap position
you can call it either with
http://localhost:8090/yacy/seedlist.html
or to generate json (or jsonp) with
http://localhost:8090/yacy/seedlist.json
http://localhost:8090/yacy/seedlist.json?callback=seedlist
11 years ago
orbiter
3e552550d1
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
c2d720cdaf
purge a lucene cache - possible memory leak fix
11 years ago
Michael Peter Christen
f4172cbb3d
fix for another XSS bug
11 years ago
orbiter
ff86cb683f
fixed some XSS bugs reported by Marius from http://ctf365.com/
11 years ago
orbiter
19a051bec8
more monitoring for postprocessing and enhanced layout in Crawler
monitor page
11 years ago
Michael Peter Christen
fceac8cffd
more monitoring for postprocessing
11 years ago
Michael Peter Christen
9d5895f643
enhanced and fixed postprocessing
11 years ago
Michael Peter Christen
087df05e24
added option to Config_Network_p.html to enable remote search while
DHT-Receive is switched off.
11 years ago
Michael Peter Christen
1a4a69c226
set more loggers to 'final static'
11 years ago
Michael Peter Christen
69b8d61c47
fix for search requests in GSA interface which contain 'funny'
characters (like ':' etc.)
11 years ago
orbiter
4234b0ed6c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
74c86a72a0
better default value for crawler user agent
11 years ago
Michael Peter Christen
87a956e881
calculating and showing the number of files and the average size of a
file in the HTCACHE in ConfigHTCache_p.html
11 years ago
Michael Peter Christen
acc1f8a749
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
81bb50118e
found and fixed a huge memory leak in solr caching (inside Solr). The
not-flushed Solr cache is now handled in this way:
- it is smaller by default
- a Solr-internal process is started to flush the cache periodically
(this does NOT clean the cache, just removes old objects)
- a Solr-external process (the standard YaCy cleanup-process) now has
direct access to the Solr-internal cache and flushes it completely.
The time frame for such a flush is defined by the cleanup-process
frequency, by default 10 minutes.
11 years ago
sixcooler
987f410011
URL-export: add query and fix for ClassCastException
11 years ago
Michael Peter Christen
ffe8276063
replaced referrer link masking with 'pure' links to the referring page
(that was more useful during testing)
11 years ago
Michael Peter Christen
434e13b46d
in host browser also show the properties of failed documents including
referrer urls (this is a VERY USEFUL SEO and Web Admin feature!!)
11 years ago
orbiter
1ac504ae51
use html encoding for urls in metadata
11 years ago
Michael Peter Christen
25951cee14
- fixed opensearchdescription, which delivered a URL with a missing
'global' option
- added display=2 to compare_yacy to remove the superfluous border
11 years ago
Michael Peter Christen
f1bfe64361
integrated startpage to compare_yacy
11 years ago
Michael Peter Christen
2f57327f20
added boolean load property to CacheResource_p servlet which causes
the servlet to load the page from the web.
11 years ago
Michael Peter Christen
9bb7eab389
hacks to prevent storage of data longer than necessary during search and
some speed enhancements. This should reduce the memory usage during
heavy-load search a bit.
11 years ago
Michael Peter Christen
5afa6e3aee
Automatically flush the log cache if a short memory status is reached.
For the default of 200 lines this can flush about 10MB.
11 years ago
Michael Peter Christen
030d0776ff
Enhanced crawl start for very, very large crawl lists (i.e. > 5000)
which had a problem because of badly used concurrency.
This fix also caused a redesign of the whole host deletion process.
This should fix bug http://bugs.yacy.net/view.php?id=250
11 years ago
Michael Peter Christen
4948c39e48
added concurrency for mass crawl check
11 years ago