Michael Peter Christen
1fd006cc56
fixes using the embedded connector
12 years ago
Roland Haeder
59b4fdd5ad
Merge remote-tracking branch 'upstream/master'
12 years ago
orbiter
5493389576
stealth mode shall only be available for authorized users, because
...
unauthorized users can otherwise be monitored by authorized users
12 years ago
Roland Haeder
ebbb3bc5c1
Fixed CHMOD on many files + added missing loggers (e.g. jena) and made some noisy loggers quiet
12 years ago
Michael Peter Christen
bcc623a843
refactoring of load_delay: this is a matter of client identification
12 years ago
orbiter
2be456e7fb
added a postprocessing field into api/status_p.xml to show if the
...
postprocessing task is running at that time (status: busy) or not
(status:idle)
12 years ago
orbiter
575f913154
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter
c4efb612e2
added list of crawls to status_p.xml
12 years ago
Lotus
bb6caa346c
Do not allow automatic update in case YaCy is installed to the Program
...
Files folder on Windows. There are no permissions to write that folder
and update would fail.
12 years ago
orbiter
dac88561ae
minimum access time has a tight connection to ClientIdentification,
...
therefore it is defined there.
12 years ago
sixcooler
bff8c753c6
re-insert this file - was deleted by mistake
...
+ correct an other case-typo
12 years ago
Michael Peter Christen
5878c1d599
- refactoring of log to ConcurrentLog:
...
jdk-based logger tend to block
at java.util.logging.Logger.log(Logger.java:476) in concurrent
environments. This makes logging a main performance issue. To overcome
this problem, this is a add-on to jdk logging to put log entries on a
concurrent message queue and log the messages one by one using a
separate process.
- FTPClient uses the concurrent logging instead of the log4j logger
12 years ago
orbiter
c79f687110
enhanced the network scanner: find more hosts automatically by removal
...
of common subdomains before application of protocol-specific prefix
12 years ago
orbiter
b4677d1cad
fix for bug #252
...
the naming of the servlet was wrong, the bug may not be present on
systems where upper/lowercase matching is lazy (windows)
12 years ago
Michael Peter Christen
07261fe274
Merge remote-tracking branch 'nutomics/blacklist_structure'
12 years ago
Michael Peter Christen
dea71851d2
- better concurrency for network scanner
...
- network scanner can now start from the list of all hosts in the search
index
12 years ago
orbiter
9f0cc9b401
enhanced network scanner
...
- textarea input field can now be used to paste in a large list of hosts
- /31er subnet is possible (only one host)
- auto-detect subdomains for ftp and www subdomains
12 years ago
orbiter
f8c28efd66
fix for rssTerminal coloring
12 years ago
Felix Ableitner
44f8fcf62e
Changed class structure of Blacklist.
12 years ago
Michael Peter Christen
3054a6d4b9
added a patch from Sebastian M.B., submitted by email for coloring of
...
rss terminal
12 years ago
Michael Peter Christen
78af998f8f
Merge commit 'fd90fcc4e08f80acbfd1c9a7ec62ce04cd309594'
12 years ago
Michael Peter Christen
57ffdfad4c
added a crawl option to obey html-meta-robots-noindex. This is on by
...
default.
12 years ago
Felix Ableitner
fd90fcc4e0
Fixes #196 .
12 years ago
Michael Peter Christen
f1c5338210
prepartion for greedy crawl profiles and refactoring
12 years ago
Michael Peter Christen
e6f361f474
adding the canonical tag to crawl queues
12 years ago
Michael Peter Christen
203921006a
redesign of citation index storage
12 years ago
Michael Peter Christen
e92b9275ce
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen
56cdcfa2fa
fixed greedy learning mode - global is not a search attribute in
...
searchitems
12 years ago
Michael Peter Christen
32aa1d4569
removed unused option for queries
12 years ago
Michael Peter Christen
0c5bed7e2c
added configuration option for greedy learning function to ConfigPortal
...
servlet
12 years ago
sixcooler
5d1f619f07
possible helpful closing of solr-requests
12 years ago
Michael Peter Christen
9d291764d1
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
sixcooler
e5abccdfe4
added optimize-option
12 years ago
Michael Peter Christen
8ea6ddf636
removed attributes from ConfigPortal.html which are redundant to
...
ConfigSearchPage_p.html
12 years ago
Michael Peter Christen
64140f35cd
fix for solr requests if no query part is given (prevent npe)
12 years ago
Michael Peter Christen
23fb458963
- fix to gsa searchresult answer in case that no query part is given
...
- fix to gsa default number of results (is 'num')
12 years ago
Michael Peter Christen
660a196989
refactoring
12 years ago
Michael Peter Christen
54024958ac
added url_file_name_s in qeury for live-search of urls
12 years ago
Michael Peter Christen
16d1d744fa
added url_file_name_s in default collection schema for the file name
...
without the file extension. This part of the file path is removed from
the multi-field url_paths_sxt, which has now not the file name as last
part of the path list.
The same applies to the new fields source_file_name_s and
target_file_name_s in the webgraph schema.
12 years ago
Michael Peter Christen
f542cf7d9c
fix for daterange: the to-date is inclusive
12 years ago
Michael Peter Christen
c36720d45f
added daterange option to gsa api
12 years ago
Michael Peter Christen
4e3007f4a0
typo
12 years ago
Michael Peter Christen
2cb6b6bc21
added target="_blank" to shutdown links
12 years ago
orbiter
c8e94ad7c7
fix for citation search in case that the citation is very fresh
12 years ago
orbiter
57dcf68665
added a feed-back message inside the shutdown page
12 years ago
Michael Peter Christen
0600d510e1
show the citation report also in ViewFile
12 years ago
Michael Peter Christen
1a92b61d69
fixed usage of ViewFile which needs a commit before showing latest crawl
...
result pages.
12 years ago
Michael Peter Christen
570511f3c8
removed fields references_internal_id_sxt and
...
references_internal_url_sxt because they had been shown to be
superfluous. The citation of referrer in the host browser is possible
without them. Therefore now the host browser does not only show
internal, but also external referrer to each link.
12 years ago
Michael Peter Christen
fd1776a3b0
added a new 'Citations' function: each search result item can now be
...
explored for citations within other documents. A click on the
'Citations' link shows an analysis with all text lines in the document
each with a complete list of documents which contain the same line. A
second section shows the linking documents in ascending order of number
of citations from the original document. Because documents from
different hosts are most interesting here, they are listed at the top of
the page as possible 'copypasta' source.
12 years ago
Michael Peter Christen
1762911f57
added synchronizations and timeouts in solr api; missing
...
synchronizations in index modification methods causes deadlocks inside
solr.
12 years ago