Michael Peter Christen
67e7dc0cc6
added more properties to seedlist servlet
11 years ago
Michael Peter Christen
79771c60c0
IPv6 fixes
11 years ago
reger
92d9c56f9f
Merge origin/master into jetty
11 years ago
Michael Peter Christen
da380343c2
perform greedy learning heuristic only if load < 1.0
11 years ago
Michael Peter Christen
81926c055d
fixed bug with image search in yacyinteractive
11 years ago
Michael Peter Christen
edda0699e4
changed default timeout for port scanner
11 years ago
Michael Peter Christen
f1b5db2c45
- performance graph does not shop peer ping in memory monitor any more
...
- after a forced GC, the PerformanceMemory view switches to automatic
update by default
11 years ago
Michael Peter Christen
0db8e34625
enhanced webgraph processing
11 years ago
Michael Peter Christen
9d8b32c63a
fixed a division by zero
11 years ago
Michael Peter Christen
957f6297fb
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger
effea4bca0
Merge origin/master into jetty
...
Conflicts:
source/net/yacy/cora/federate/solr/SolrServlet.java
11 years ago
reger
b49e90d2e9
remove reference to solrServlet from YaCy servlet select
...
- reference is not used
- solrServlet is used in Jetty branch and adjustments there conflict with unused solrServlet here.
11 years ago
Michael Peter Christen
38e1e3a707
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
sixcooler
2c2ebb0d92
tried some hardening in order not letting any Solr-Searchers open
11 years ago
Michael Peter Christen
cca79d12ef
setting of some default values to make an client development start easy
...
using the description at
http://www.yacy-websuche.de/wiki/index.php/Dev:APIhello
11 years ago
Michael Peter Christen
3d4b5e66ce
disallow remote robots to crawl the HostBrowser servlet
11 years ago
Michael Peter Christen
234ca720f5
only admins should be able to force a commit
11 years ago
Michael Peter Christen
2c39b65409
fixes for searches containing stopwords. The fix was done using a
...
reconstruction of the search word set access method to protect that
words are deleted from the sets from the outside of the QueryGoal class.
11 years ago
orbiter
61409788eb
less word hash computations (removing some overhead because of MD5
...
calcs) using the clear word in a normalized form.
11 years ago
reger
5c4a3d1c01
Merge origin/master into jetty
11 years ago
Michael Peter Christen
caa20d63d9
fixed seedlist (hash was missing)
11 years ago
Michael Peter Christen
ccf2f4e43b
refactoring of seed attributes (introduced more constants)
11 years ago
Michael Peter Christen
c927b428d3
fixed json
11 years ago
Michael Peter Christen
64048ff217
fir for XSS
11 years ago
orbiter
b7f1e5af51
added new servlet which generates the same file as the principal peers
...
upload to a bootstrap position
you can call it either with
http://localhost:8090/yacy/seedlist.html
or to generate json (or jsonp) with
http://localhost:8090/yacy/seedlist.json
http://localhost:8090/yacy/seedlist.json?callback=seedlist
11 years ago
orbiter
3e552550d1
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
c2d720cdaf
purge a lucene cache - possible memory leak fix
11 years ago
reger
f111f30ace
Merge origin/master into jetty
11 years ago
Michael Peter Christen
f4172cbb3d
fix for another XSS bug
11 years ago
orbiter
ff86cb683f
fixed some XSS bugs reported by Marius from http://ctf365.com/
11 years ago
orbiter
19a051bec8
more monitoring for postprocessing and enhanced layout in Crawler
...
monitor page
11 years ago
Michael Peter Christen
fceac8cffd
more monitoring for postprocessing
11 years ago
Michael Peter Christen
9d5895f643
enhanced and fixed postprocessing
11 years ago
Michael Peter Christen
087df05e24
added option to Config_Network_p.html to enable remote search while
...
DHT-Receive is switched off.
11 years ago
Michael Peter Christen
1a4a69c226
set more logger to 'final static'
11 years ago
Michael Peter Christen
69b8d61c47
fix for search requests in GSA interface which contain 'funny'
...
characters (like ':' etc.)
11 years ago
orbiter
4234b0ed6c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter
74c86a72a0
better default value for crawler user agent
11 years ago
reger
1437c45383
merge rc1/master
11 years ago
Michael Peter Christen
87a956e881
calculating and showing the number of files and the average size of a
...
file in the HTCACHE in ConfigHTCache_p.html
11 years ago
Michael Peter Christen
acc1f8a749
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen
81bb50118e
found and fixed a huge memory leak in solr caching (inside Solr). The
...
not-flushed Solr cache is now handled in this way:
- it is smaller by default
- an Solr-internal process is started to flush the cache periodically
(this does NOT clean the cache, just removes old objects)
- a Solr-external process (the standard YaCy cleanup-process) now has
direct access to the solr internal cache and flushes them completely.
The time frame for such a flush is defined by the cleanup-process
frequency, by default 10 minutes.
11 years ago
sixcooler
987f410011
URL-export:add query and fix for cast-class-exception
11 years ago
Michael Peter Christen
ffe8276063
replaced referrer link masking to 'pure' links to the referring page
...
(that was more useful during testing)
11 years ago
reger
b38de92a16
Merge origin/master into jetty
11 years ago
Michael Peter Christen
434e13b46d
in host browser also show the properties of failed documents including
...
referrer urls (this is a VERY USEFUL SEO and Web Admin feature!!)
11 years ago
orbiter
1ac504ae51
use html encoding for urls in metadata
11 years ago
reger
f017066197
Merge origin/master into jetty
11 years ago
Michael Peter Christen
25951cee14
- fixed opensearchdescription, this delivered an url with missing
...
'global' option
- added display=2 to compare_yacy to remove the superfluous border
11 years ago
Michael Peter Christen
f1bfe64361
integrated startpage to compare_yacy
11 years ago
Michael Peter Christen
2f57327f20
added boolean load property to CacheResource_p servlet which causes that
...
the servlet loads the page from the web.
11 years ago
Michael Peter Christen
9bb7eab389
hacks to prevent storage of data longer than necessary during search and
...
some speed enhancements. This should reduce the memory usage during
heavy-load search a bit.
11 years ago
Michael Peter Christen
5afa6e3aee
Automatically flush the log cache if a short memory status is reached.
...
For the default of 200 lines this can flush about 10MB.
11 years ago
Michael Peter Christen
030d0776ff
Enhanced crawl start for very, very large crawl lists (i.e. > 5000)
...
which had a problem because of badly used concurrency.
This fix also caused a redesign of the whole host deletion process.
This should fix bug http://bugs.yacy.net/view.php?id=250
11 years ago
Michael Peter Christen
4948c39e48
added concurrency for mass crawl check
11 years ago
Michael Peter Christen
1b4fa2947d
- fixed a problem which ocurred when a document was not recognized with
...
the right content domain (i.e. identifying that it is an image, text
etc.) because it used the file extension and not an existing mime type
assignment.
- fixed the new setting that images shall be loaded for a better image
search.
- both fixes together makes it now possible to crawl
commons.wikimedia.org which makes use of 'funny' document names (i.e.
ending with .jpg while the document is html)
11 years ago
Michael Peter Christen
16e3b357b3
replaced old tag cloud and adopted design a bit
11 years ago
Michael Peter Christen
dc38d35986
added matching in url field in Table_API_p search
11 years ago
Michael Peter Christen
691d7e70fa
added hint to development/commit rss feed
11 years ago
Michael Peter Christen
b81859c751
Show a RSS icon in the right top corner of search results. This replaces
...
the 'API' icon which was the link for the opensearch result which is an
extension of RSS. Since it is more appropriate to visualize a RSS link
with an RSS icon, this API icon was changed here.
11 years ago
Michael Peter Christen
1a09771be8
fixed sitemap crawl start
11 years ago
orbiter
b743e6d79f
- prevent that crawl filter have empty (never-match) content
...
- rewrite the description of the options "Restrict to start domain(s)"
and "Restrict to sub-path(s)" to an explanation, that the restriction
applies to all links in the link list of the option "From Link-List of
URL" if this option is selected
- allow "Restrict to sub-path(s)" if the "From Link-List of URL" is
selected. This is supported in the crawl start.
11 years ago
orbiter
f597fdb602
make it easier to filter properties (case insensitive)
11 years ago
reger
f46c723398
allow to choose used http server, YaCy-Anomic or Jetty
...
- defaults to Jetty (in this branch)
- add server version info & config option -> Admin Console -> Advanced Settings -> Http Networking
11 years ago
reger
1adb4b8741
merge rc1/master
11 years ago
reger
37d24f3318
make use of declared static string ACTION_LOCATION
11 years ago
reger
eea504c117
update Info.plist
...
small DefaultServlet refactoring
11 years ago
reger
a44eede8b8
merge rc1/master
11 years ago
reger
54a0272338
searchpage javascript (latestinfo) causes reset of search statistic after moving to next page
...
- disabled call via setTimeout in yacysearch.html
11 years ago
Michael Peter Christen
91fa99e9bb
added new icon/image for latest commit
11 years ago
Michael Peter Christen
9fac9249bc
- replaced 'edit' link with a clone symbol in Table_API_p since that is
...
what it does: it clones the crawl, it does not change the crawl.
- moved the appearance of this clone link to the type column since this
makes it visible also if the URL column is not visible.
11 years ago
Michael Peter Christen
0f6db6ad5b
Merge remote-tracking branch 'jensbees/crawlexpert-post'
11 years ago
Jens Bertram
3252c1ec39
Merge upstream/master into crawlexpert-post
11 years ago
Michael Peter Christen
90c8577840
enhanced ranking; patches to replace old ranking
11 years ago
bhoerdzn
a3824dfbaa
check URL on inital load, if set
11 years ago
bhoerdzn
52f49d475b
add a hidden field for "crawlingstart" since jQuery omits the submit button value
11 years ago
bhoerdzn
b0c0ec2dec
link recorded crawl starts back to "CrawlStartExpert_p" in "Process Scheduler"
11 years ago
bhoerdzn
d64d45361c
use integer types for boolean values
11 years ago
bhoerdzn
eda123d6fd
remove debugging code intercepting post requests
11 years ago
bhoerdzn
5057f27bbd
fix typo in parsing "cachePolicy" parameter
11 years ago
bhoerdzn
98f5c9018d
Fixed template vars for "deleteold". Fixed parsing "deleteold" parameter. Stop "setState" overwriting "deletold" state on load.
11 years ago
bhoerdzn
a6a62986d4
correct state handling for country code restriction
11 years ago
bhoerdzn
4066b85155
correctly set initial state for load filters
11 years ago
bhoerdzn
8c91c3e7cd
set form boolean values to 0 & 1 instead of false & true
11 years ago
bhoerdzn
c27fabc88e
fixed wrong parameter check
11 years ago
bhoerdzn
2214bf5396
Remove some post parameters, if they are set to default values, as their values are already set by YaCy. Added some documentation.
11 years ago
reger
71d2655c02
downgrade to Jetty 8 to assure support of JRE 1.6
...
- introduce a YaCyHttp interface to modulize/separate http server
- adjust the Jetty version specific implementation part (in package net.yacy.http)
- putting the version specific code in classes starting with Jetty8xxxx
- moved existing Jetty9xxx implementation into a test class (to keep the code)
- adjust build to the changed jars
- make use of the introduced YaCyHttpServer interface in related htroot servlets
- adjust other test cases/classes
11 years ago
orbiter
705b3338ee
list more fields available for search and for ranking boosts
11 years ago
bhoerdzn
405878182f
Use list template for all other option lists. Fixed some template expressions.
11 years ago
bhoerdzn
8e74098cd4
Use list template for "reloadIfOlderNumber".
11 years ago
bhoerdzn
52bad7b908
Dynamic toggling of form fields, based on passed in and selected values. This will also cut down the post string by disabling not needed fields.
11 years ago
Michael Peter Christen
e56aa4fe93
fixed search navigation
11 years ago
Michael Peter Christen
4fbc4740df
removed warnings
11 years ago
bhoerdzn
45cf553bc3
try to guess default crawling mode, if none set
11 years ago
bhoerdzn
b4f0c822f2
assign strings before checking contents
11 years ago
bhoerdzn
499abe8f91
set default values for string parameters
11 years ago
bhoerdzn
42ea56eaad
made crawStartExpert_p aware of post variables; extended template where needed
11 years ago
reger
c7c706fd9f
merge with rc1/master
11 years ago
Michael Peter Christen
82bfd9e00a
- crawl profiles shall be deleted from active and passive stacks if they
...
are deleted to terminate the crawl because otherwise the crawl will go
on after the load-from-passive stack policy.
- better check if a crawl is terminated using the loader queue.
12 years ago
orbiter
8ac2e8c8c9
added location navigator which causes that the image to the map search
...
is visible whenever a location is available in the search result.
To activate this, the search.navigation property in yacy.conf must be
modified to the new default values.
12 years ago