Michael Peter Christen
a3badd3205
changed search process for images: no more media snippet load process,
show only links from index which had been on the text search page
before. This creates a superfast search process for images!
13 years ago
Michael Peter Christen
4aa0eedead
one more scroogle...
13 years ago
Michael Peter Christen
347612ddd4
removed scroogle parser
13 years ago
Michael Peter Christen
f8cd57c92f
new indexing strategy: ALL links that appear anywhere are indexed, not
only links where the content can be parsed. All non-parseable links are
placed into the noload queue. The search process must therefore be able
to filter out non-text search results.
- This fixes the problem that image search results appeared in the text
search.
- The interactive search can now retrieve ALL types of links
- The p2p interface is now extended to retrieve only certain types of
links (text, image, video, apps)
- The search process has an extension to filter the right document type
according to the search query
13 years ago
Michael Peter Christen
14f67f217c
refactoring of ContentDomain: now subclass of Classification
13 years ago
Michael Peter Christen
a5d7da68a0
refactoring: removed dependency from switchboard in Balancer/CrawlQueues
13 years ago
Michael Peter Christen
33d1062c79
refactoring: the cache belongs to the crawler
13 years ago
Michael Peter Christen
8429967ea7
no more SVN
13 years ago
Michael Peter Christen
0466bb0ddf
no more SVN..
13 years ago
Michael Peter Christen
4844e124b1
one more warning in case crawling is paused because of low disk space
13 years ago
Michael Peter Christen
0ec2713af8
'download'
13 years ago
Michael Peter Christen
f30c577fdb
add hint to speed up search results
13 years ago
Michael Peter Christen
6b133de3e9
add hint for consulting support
13 years ago
Michael Peter Christen
eb2c8ffa62
display is not used any more
13 years ago
Michael Peter Christen
91a86f0b06
fixed to network graph testing
13 years ago
Michael Peter Christen
f31ad84d98
automatic generation of blacklist pattern, see
http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2685&p=25305#p25305
13 years ago
Michael Peter Christen
7b5b9baee0
added citation rank to ranking profile
13 years ago
reger
06951ef751
remove heuristic scroogle from search option help text in index.html
13 years ago
Michael Peter Christen
e377092198
fix to xml output format
13 years ago
Michael Christen
41be98dc9d
extended webstructure api to show outgoing links together with
incoming links
13 years ago
Michael Christen
8f89c8ef07
added information about inbound, outbound and citation links into
yacydoc api servlet
13 years ago
Michael Christen
71649a1296
added an api to retrieve the new citation.index with the
webstructure.xml api. This api will respond with details about a single
URL if requested with 'webstructure.xml?about=[url|urlhash|host]'.
13 years ago
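A request against the 'about' parameter described in commit 71649a1296 could be built roughly as follows. This is only a sketch: the class and method names are hypothetical, and the servlet URL passed in (host, port, path prefix) is an assumption that depends on the peer. The only part taken from the commit message is the 'webstructure.xml?about=' query form.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper, not part of the YaCy source.
class WebStructureQuery {
    // servletUrl: full URL of the webstructure.xml servlet on some peer (assumed by the caller).
    // about: a url, a url hash or a host name, as listed in the commit message.
    static String aboutUrl(String servletUrl, String about) {
        try {
            // percent-encode the value so that a full URL can be passed as a parameter
            return servletUrl + "?about=" + URLEncoder.encode(about, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is always available on a Java platform
            throw new IllegalStateException(e);
        }
    }
}
```

The value is percent-encoded because the 'url' variant of the parameter contains characters such as ':' and '/' that would otherwise be ambiguous inside a query string.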
Lotus
3e61287326
some better feedback on properties change
13 years ago
Lotus
96ac95cff9
added hint on how to change integration options
13 years ago
Thomas
4f61b8fd82
Fixes for compare-search
13 years ago
Thomas
e0680de7b3
Remove Scroogle from compare-search, Scroogle is dead
13 years ago
Lotus
78f0d8f046
no focus on preview frames for search integration
fixes bug http://bugs.yacy.net/view.php?id=161
13 years ago
Lotus
7792ac6406
fix links & bug #163
13 years ago
Michael Peter Christen
532c7cf827
added physics experiment to the graph plotter. not active by default
13 years ago
Michael Peter Christen
aba9b1bfa0
better names for elements of a linked graph
13 years ago
Michael Peter Christen
2fc8ecee36
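ConcurrentLinkedQueue has a VERY long return time on the .size() method
(it traverses the whole queue, so it is not a constant-time operation).
See
http://docs.oracle.com/javase/6/docs/api/java/util/concurrent/ConcurrentLinkedQueue.html
and the following test program:
import java.util.Queue;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.LinkedBlockingQueue;

public class QueueLengthTimeTest {
    // adds c elements, calling size() before each add; returns the elapsed milliseconds
    public static long countTest(Queue<Integer> q, int c) {
        long t = System.currentTimeMillis();
        for (int i = 0; i < c; i++) {
            q.add(q.size());
        }
        return System.currentTimeMillis() - t;
    }
    public static void main(String[] args) {
        int c = 1;
        // stop at 2^16 elements: larger runs take very long on ConcurrentLinkedQueue,
        // and the original bound of 100 doublings would overflow the int counter
        for (int i = 0; i < 17; i++) {
            Runtime.getRuntime().gc();
            long t1 = countTest(new ArrayBlockingQueue<Integer>(c), c);
            Runtime.getRuntime().gc();
            long t2 = countTest(new LinkedBlockingQueue<Integer>(), c);
            Runtime.getRuntime().gc();
            long t3 = countTest(new ConcurrentLinkedQueue<Integer>(), c);
            System.out.println("count = " + c + ": ArrayBlockingQueue = " + t1
                + ", LinkedBlockingQueue = " + t2 + ", ConcurrentLinkedQueue = " + t3);
            c = c * 2;
        }
    }
}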
13 years ago
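One common way to avoid the slow size() call noted in commit 2fc8ecee36 is to keep the element count in a separate counter. The sketch below illustrates that idea only; the class name CountedQueue is hypothetical, and nothing here claims to be what the commit actually changed.

```java
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical wrapper, not from the YaCy source: tracks the number of
// elements in an AtomicInteger so that size() never traverses the queue.
class CountedQueue<E> {
    private final ConcurrentLinkedQueue<E> queue = new ConcurrentLinkedQueue<E>();
    private final AtomicInteger count = new AtomicInteger();

    void add(E e) {
        queue.add(e);
        count.incrementAndGet();
    }

    // returns null if the queue is empty, like ConcurrentLinkedQueue.poll()
    E poll() {
        E e = queue.poll();
        if (e != null) count.decrementAndGet();
        return e;
    }

    // constant time; under concurrent access the value may be momentarily
    // off, because the queue and the counter are not updated atomically
    int size() {
        return count.get();
    }
}
```

The trade-off is that the reported size is only approximate while other threads are adding or polling, which is usually acceptable for monitoring and load decisions.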
Michael Peter Christen
8aba045ba1
if a new pop-up page is set in config portal, then this page applies
also to the default page configuration for the httpd if no path is
given.
13 years ago
Michael Peter Christen
fa7b3481b3
better navigation in file search: fewer results on the first try, but
much faster. after the first search is done, buttons appear to get more
results for the same search
13 years ago
Michael Peter Christen
8c06925984
animation of the web structure picture
13 years ago
Michael Peter Christen
99c74699de
removed scroogle (scroogle is dead)
13 years ago
Michael Peter Christen
6e51a00a2f
Revert "fix for page navigation: show only as many pages as are available for given navigation constraints, not as given by total results size"
This reverts commit 73f5a9e8b3.
13 years ago
Michael Peter Christen
73f5a9e8b3
fix for page navigation: show only as many pages as are available for
given navigation constraints, not as given by total results size
13 years ago
Michael Peter Christen
9c51dc0f13
fixed a bug with navigation: if a navigation was applied to file type or
protocol, then it was not possible to remove that again. This is the fix
for that.
13 years ago
Michael Peter Christen
8bfc987374
enhanced hint how to enter file:// urls
13 years ago
Michael Peter Christen
c6c61be3f0
fix for http://bugs.yacy.net/view.php?id=148
13 years ago
Michael Peter Christen
edaa8ac94c
Merge commit 'e15e633a0128b8d31011283a65b4ef26a6dddcd8'
13 years ago
reger
e15e633a01
Bugfix for IE9 (doesn't accept html form within form)
changed the form inputs of the API schedule row data to unique field names
using row pk.
Fix for issue 96 http://bugs.yacy.net/view.php?id=96
IE9-64bit doesn't interpret iframe with align parameter as desired and
misaligns following content (in CrawlProfileEditor_p.html)
13 years ago
Michael Peter Christen
a9b4d49b75
removed debug output
13 years ago
Michael Peter Christen
8d63a5887c
bugfixes
13 years ago
Michael Peter Christen
9ad1d8dde2
complete redesign of crawl queue monitoring: do not look at a
ready-prepared crawl list but at the stacks of the domains that are
stored for balanced crawling. This also affects the balancer since that
does not need to prepare the pre-selected crawl list for monitoring. As
a result:
- it is no longer possible to see the correct order of next to-be-crawled
links, since that depends on the actual state of the balancer stack the
next time another url is requested for loading
- the balancer works better since the next url can be selected according
to the current situation and not according to a pre-selected order.
13 years ago
Michael Peter Christen
e2f8f263e8
changed storage of search words: keep order
13 years ago
Michael Peter Christen
c166eb68b6
fixes in solr schema file
13 years ago
Michael Peter Christen
2e5cd6a1b2
fixed parser extension deny list generation and usage
13 years ago
david
f077b11d38
Merge branch 'master' of git://git.gitorious.org/yacy/rc1.git
13 years ago
Lotus
29675d9766
more label on search options (usability)
13 years ago
Michael Peter Christen
b4bc1e2875
remote search does not do snippet generation
13 years ago
Lotus
335a776351
xss hardening on Status.html
13 years ago
Michael Peter Christen
4c5edab1ec
added option to have exception search result windows
13 years ago
Michael Peter Christen
32adad7dd5
show less navigation by default
13 years ago
Michael Peter Christen
88b86afc89
no DoS protection for intranet mode
13 years ago
Michael Peter Christen
0f443ac755
automatic switching off of navigation that is not useful
13 years ago
Michael Peter Christen
852ce43d99
better rules for default open/close of navigation objects
13 years ago
Michael Peter Christen
e8d24fd802
author navigator can be switched off
13 years ago
Michael Peter Christen
558ab7bd4e
made the protocol navigator reversible
13 years ago
Michael Peter Christen
96cb75f1d4
made the filetype navigator be able to deselect the search constraint
13 years ago
Michael Peter Christen
30891d026f
added a remove-navigation for vocabularies
13 years ago
Michael Peter Christen
d5ead5314d
changed navigation links: now using checkboxes.
This looks better and allows negative checkboxes (those that remove
the navigation). These are not yet implemented (coming next)
13 years ago
Michael Peter Christen
ef5192f8c9
using the generic document parser for crawl starts instead of the html
parser. This makes it possible that every type of document can be a
crawl start point, not only text documents or html documents. Tested
this with a pdf document.
13 years ago
Michael Peter Christen
ce620be783
fix for crawl start with smb url
13 years ago
Lotus
18227790ca
variable search works at least on Firefox again
13 years ago
Lotus
c73af39e54
refactoring of tray icon class,
now uses Java 6 methods natively
13 years ago
Michael Peter Christen
5afec4af39
fix for http://bugs.yacy.net/view.php?id=131
13 years ago
Michael Peter Christen
dc165275ad
bugfix for usage of multiple vocabulary navigators
13 years ago
Michael Peter Christen
190b77c55e
added Ukrainian translation
13 years ago
Michael Peter Christen
7f9b6b7a0c
added switches to ConfigParser to accept/deny documents by their
extension
13 years ago
Michael Peter Christen
83009d86f7
added the vocabulary navigator. It can be very simply tested by
switching on the locale dictionaries.
13 years ago
Lotus
37d43e5589
jump back to where we want to modify the scheduler in api steering after
submit
13 years ago
Michael Peter Christen
26e227f17d
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen
7053f8ab46
added automatic generation of a solr schema.xml file
13 years ago
apfelmaennchen
c7f88f3fd1
fix for http://bugs.yacy.net/view.php?id=101 - the default crawl
depth for bookmarks is now editable.
13 years ago
Michael Peter Christen
c602eaaf46
enhanced search process
13 years ago
Michael Peter Christen
a71f751cc8
Merge branch 'master' of git://gitorious.org/~apfelmaennchen/yacy/apfelmaennchens-rc1
13 years ago
sof
a94ced7cb2
Solved ( http://bugs.yacy.net/view.php?id=124 ) accessibility issue with
page 'Ranking_p.html'
13 years ago
Michael Christen
eff966f396
fix for search process (it was aborted too early during remote search)
13 years ago
Marek Otahal
6d8faa8066
small fix
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
Marek Otahal
1dc5d9f0f3
make ConnectionInfo comparable and sort list of connections in Connections_p
ConnectionInfo compare by initTime
Connections_p implement wish to sort connections, descending
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
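The change described in commit 1dc5d9f0f3 (compare by initTime, list newest first) can be illustrated with a small stand-in class. This is a sketch under assumptions: ConnInfo, its fields and newestFirst() are hypothetical names and do not reproduce the actual ConnectionInfo or Connections_p code.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-in for ConnectionInfo: natural order is ascending initTime.
class ConnInfo implements Comparable<ConnInfo> {
    final String name;
    final long initTime;

    ConnInfo(String name, long initTime) {
        this.name = name;
        this.initTime = initTime;
    }

    public int compareTo(ConnInfo other) {
        return Long.compare(this.initTime, other.initTime);
    }
}

class ConnSort {
    // returns a sorted copy, most recently initialised connection first
    static List<ConnInfo> newestFirst(List<ConnInfo> connections) {
        List<ConnInfo> sorted = new ArrayList<ConnInfo>(connections);
        // reverse of the natural order gives a descending sort
        Collections.sort(sorted, Collections.reverseOrder());
        return sorted;
    }
}
```

Sorting a copy leaves the caller's list untouched, which matters if the original list is shared with other threads.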
Marek Otahal
c1973c78a3
Blog + BlogComments: make propose() methods similar + fix anonymous poster's IP resolution in Blog
these methods share a lot of common code in both classes,
would be nice to unify
+fix some ugly code
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
Marek Otahal
7189a70a62
BlogComments dateString() - use localized date/time format
use same method from Blog class, reduce code duplication
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
Marek Otahal
f727c35722
Blog dateString() print in localized date/time format
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
Marek Otahal
8318103f25
Blacklist addBlacklistEntry() - don't add an empty entry
Signed-off-by: Marek Otahal <markotahal@gmail.com>
13 years ago
Michael Peter Christen
972d411d99
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Christen
bd40a10230
added autotagging stub .. only reading and parsing of vocabularies at
this time
13 years ago
Michael Peter Christen
f214f6ebb4
added no-load queues to the crawler monitor
13 years ago
Michael Peter Christen
2ee8cbeb2c
Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
Conflicts:
source/net/yacy/search/Switchboard.java
13 years ago
Michael Peter Christen
992dbdf4bb
added noload statistic to servlets
13 years ago
Michael Christen
216a287a85
Merge commit '6d4e08ed06c5cd28c45981b2ebe31c7f7ec6fd83' into quix0r
Conflicts:
source/de/anomic/crawler/CrawlQueues.java
13 years ago
stbrumm
d18095dc48
Patch for Issue 0000102
and fixes to Patch (private peer status is a property of a peer, not a
status)
13 years ago
Michael Christen
ce289eb64f
remove heuristics if index receive is switched off
13 years ago
Roland 'Quix0r' Haeder
fa08ed5ae5
Fixed a lot of CHMOD rights (no need for execute flag on *.java/*.html) and introduced local/remote crawl size ratio based check
13 years ago
Michael Christen
1d66496362
added more properties to opensearchdescription, see also:
http://bugs.yacy.net/view.php?id=114
13 years ago
Michael Christen
1cf0f35621
the link to the path shall be the path
13 years ago
Michael Christen
87c16fb67f
show server list as server side include
13 years ago
Michael Christen
5bfb287753
make a bad fix even worse
13 years ago
Michael Christen
4656bb0365
allow html in greeting line since that is set by administrator only.
html in that line may be wanted
13 years ago
Michael Christen
85bd4cc8bc
better lookup for peer names
13 years ago