theli
eb9b138986
*) next step of restructuring for new crawlers
...
- conversion of the crawler pool into a keyed object pool
- crawlers are now loaded based on the url protocol (of course works only for http now)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2473 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
1395aae742
*) starting restructuring which is needed to add crawlers for additional protocols
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2472 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7df572756a
fist step+attempt so solve the snippet marking problem.
...
See: http://www.yacy-forum.de/viewtopic.php?p=22855#22855
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2469 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
3879a0ecd0
replaced java.net.URL usage by use of new class de.anomic.net.URL
...
This shall be seen as an experiment to exclude all cases where
there could be a DNS lookup during URL comparisment.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2290 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
933a9e02ab
fix for broken build
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2284 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
360056b30c
fix ajax bug (no valid xml)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2283 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
90d569d70f
refactoring of index management:
...
url storage is part of index management; moved plasmaURL to indexURL
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@2122 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
44d72f06c4
more Caching
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1965 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
1a13c8b78e
right wordCachesize after orbiters commit.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1882 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
6b056610e3
updated watchcrawler for the recent changes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1881 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
bcd99fe83e
introduced a second RAM cache for DHT transfer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1880 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
bae3783d38
added a snippet marking
...
(search words are now bold in snippets)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1823 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
fb5d8fdc59
removed encoding attribute
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1776 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
f1b91b1266
xml with right encoding
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1766 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3703f76866
- fixed re-search bug: after a search with several words, a second search could not
...
find the same words as before. This was caused because indexContaines stored the url references
with a hashtable. A tree was needed to work with the index conjunction-by-numeration
- added permanent ram cache flush (again)
- removed direct flush of ram cache after a large container is added.
this happens especially during DHT transmission and therefore this fix should
speed up DHT transmission on server side.
- removed unused and out-dated methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1765 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
dc9174c809
*) Implementing snippet fetching via ajax
...
Snippets that are not available on page load time will be fetched using ajax requests.
see: http://www.yacy-forum.de/viewtopic.php?p=16479
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1748 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
7e7a72b108
display wordcaches number on WatchCrawler.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1746 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
3fd1641893
queuesizes in queues_p.xml
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1714 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
62664d7252
AJAX Check for robots.txt before crawling.
...
Icons from herrlich
TODO: Style it nicely ;-)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1689 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
26d7e8dd0d
more escapes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1677 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
127396436f
more queues in the xml backend
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1674 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
99a970eda1
xml backend with verifyAuthentication
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1652 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
73f18ed5b2
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1627 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
a4513523d6
hide add/edit/import bookmarks per default.
...
xml-bookmark import (this does not work, yet)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1619 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
27b6b3d714
public Tags.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1589 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
a8eff9a0ae
xml/bookmarks/posts/all.xml to list all public Bookmarks
...
bookmarksIterator now accepts an option, if you want all(with private) or only public bookmarks.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1577 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
62a0bb475a
More values displayed on WatchCrawler.html
...
status_p.xml: to be extended.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1561 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
1fce0ea263
xml api for deleting bookmarks.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1546 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
ba96cefe0c
packages for xml/*
...
bugfix for servlets with packages from theli.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1272 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
26bab876db
more del.icio.us Api
...
Bugfix for http in gettitle_p
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1268 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
2e2fa99501
bookmarksManager:
...
-gettitle_p.xml and AJAX to use it
-classpath change
httpc:
-simple wget function
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1267 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
ca3a513aee
del.icio.us style api
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1265 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
75b9163242
xmlapi for bookmarktags
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1263 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
37f88b4017
code cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1176 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a04930f025
code cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1158 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
89a4cca4df
max. num of Entries
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1043 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
cb69047b91
*)cleanup access static methods and fields
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1016 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
5526d2ff73
Basic WatchCrawler.html, which loades the Queues using AJAX.
...
ajax.js: generic Ajax-Functions
xml/queues/indexing_p*: the Backend
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1002 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago