karlchenofhell
e97b6f0458
- we still use Java 1.4 ...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3386 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
0c7b8cf632
- added first version of new wiki-parser
...
- added blacklist support to manual URLFetcher stack fill
- fix for NPE: http://www.yacy-forum.de/viewtopic.php?t=3559
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3385 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f7803a6ce4
enhanced crawl balancer
...
- new domains now get a chance to get crawled early
- less IO operations
- new balancing method
- better dump order at shutdown time
- bugfixes regarding not found url hashes (no more superfluous cache kill)
- domain access time is now shared over all balancer stacks
- viewing the stack no longer disturbs the balancing algorithm as much
- intelligent selection of best next domain using domain access times
- extra double-check (to double-check the double-check)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3384 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
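The "intelligent selection of best next domain using domain access times" item above could look roughly like the following. This is only a sketch in modern Java, not the balancer's actual code: the class name DomainPicker, its methods, and the rule "pick the domain whose last access is oldest, with unseen domains first" are assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper for illustration; not the real YaCy balancer class.
class DomainPicker {
    // Last access time per domain in milliseconds, shared by all callers.
    private final Map<String, Long> lastAccess = new HashMap<>();

    void recordAccess(String domain, long timeMillis) {
        lastAccess.put(domain, timeMillis);
    }

    // Returns the candidate whose last access lies furthest in the past.
    // Domains never accessed before are treated as oldest, so they go first.
    String pickBest(Iterable<String> candidates) {
        String best = null;
        long bestTime = Long.MAX_VALUE;
        for (String domain : candidates) {
            Long t = lastAccess.get(domain);
            long time = (t == null) ? Long.MIN_VALUE : t;
            if (best == null || time < bestTime) {
                best = domain;
                bestTime = time;
            }
        }
        return best; // null when there are no candidates
    }
}
```

Treating unseen domains as oldest is one way to give new domains a chance to be crawled early, as the commit message describes.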
low012
801eea8849
*) Fixed bug where pairReplace() got caught in infinite recursion. ( http://www.yacy-forum.de/viewtopic.php?t=3466 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3383 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
c8862e47fb
*) adding mimetype for svg
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3382 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
39b0658839
Redesign of Webinterface menu structure
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3381 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c3e8c23f5d
fix for 'CANNOT FETCH ENTRY: hash is null' bug
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3380 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
badab8d924
fixed some more bugs in new db handling
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3379 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
e72d253577
fixed problem with initial cache load
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3378 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
2d8e472cfd
emergency bugfix for last commit
...
(kelondroTree should work again)
the cache prefill is broken and will be fixed later
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3377 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
868aaabf88
documentation update
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3376 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
dc0c06e43d
PLEASE MAKE A BACK-UP OF YOUR COMPLETE DATA DIRECTORY BEFORE USING THIS
...
redesign for better IO performance
improved database seek time by avoiding write operations at distant
positions of a database file. Until now, a USEDC counter was written
at the head section of a kelondroRecords database file (which is the
basic data structure of all kelondro database files) to store the
actual number of records contained in the database. Now, this
value is computed from the database file size. This is done either
only once at start time, or continuously when run with asserts enabled.
The counter is then updated only in RAM and written when the file is
closed. If the close fails, the correct number can be computed from the
file size, and if this is not equal to the stored number, it is strong
evidence that YaCy was not shut down properly.
To preserve consistency, the complete storage routine had to be rewritten.
Another change speeds up reading of nodes in some cases where the data tail
can be read together with the data head. This saves another IO lookup during
each DB node fetch.
Also includes many small bugfixes.
IF ANYTHING GOES WRONG, ALL YOUR DATA IS LOST: PLEASE MAKE A BACK-UP
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3375 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
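The idea of deriving the record count from the file size, as described in the commit above, can be sketched as follows. The layout assumed here (one fixed-size header followed by fixed-size records) and the names RecordCount, fromFileSize and looksUncleanlyClosed are hypothetical; the real kelondroRecords file format may differ.

```java
// Illustration only: assumes a fixed-size header followed by fixed-size records.
class RecordCount {
    // Number of complete records that fit behind the header.
    static long fromFileSize(long fileSize, long headerSize, long recordSize) {
        if (recordSize <= 0) {
            throw new IllegalArgumentException("recordSize must be positive");
        }
        if (fileSize < headerSize) {
            throw new IllegalArgumentException("file is shorter than its header");
        }
        return (fileSize - headerSize) / recordSize;
    }

    // A stored counter that disagrees with the computed one suggests that
    // the file was not closed properly.
    static boolean looksUncleanlyClosed(long storedCount, long fileSize,
                                        long headerSize, long recordSize) {
        return storedCount != fromFileSize(fileSize, headerSize, recordSize);
    }
}
```

With this approach the counter in the file head only needs to be written at close time, which avoids a seek to the start of the file on every insert.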
hydrox
5af76fccd7
*) peer-search on Network.html now is case-insensitive
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3374 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
c016fcb10f
- added streaming-support to CrawlURLFetchStack_p servlet
...
- fix for NPE in list.java
- use more constants
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3373 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
65af9d3215
- continue shifting even if the stacked URL could not be found
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3372 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
rramthun
fdd1180ac5
Adding two icon files, both containing different sizes from 16x16 to 128x128 pixels in one file.
...
The .icns is for Macintosh
Both made by Philipp Redeker
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3371 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
d114a0136e
- crawl profile: don't add null-values
...
- added some settings and statistics for url-fetcher 'server'-mode
- added own stack for fetchable URLs
- added possibility to fill stack via shift from peer's queues, via POST (addurls=$count and url$num=$url) or via file-upload
- added "htroot" to classpath of linux start-script
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3370 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
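The POST format mentioned above (addurls=$count and url$num=$url) could be read along these lines. This is a sketch under assumptions: the commit does not say whether $num starts at 0 or 1 (0 is assumed here), and the class UrlPostReader and the use of a plain Map for the POST arguments are made up for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical reader for the POST arguments; not the servlet's real code.
class UrlPostReader {
    // Assumption: indices run from 0 to count-1 (url0, url1, ...).
    static List<String> read(Map<String, String> post) {
        List<String> urls = new ArrayList<>();
        String countValue = post.get("addurls");
        if (countValue == null) {
            return urls;
        }
        int count;
        try {
            count = Integer.parseInt(countValue.trim());
        } catch (NumberFormatException e) {
            return urls; // malformed count: nothing to add
        }
        for (int i = 0; i < count; i++) {
            String url = post.get("url" + i);
            if (url != null && url.length() > 0) {
                urls.add(url);
            }
        }
        return urls;
    }
}
```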
karlchenofhell
a46dc43f45
- added lock symbol for restart- and shutdown-buttons on Status-page (see http://www.yacy-forum.de/viewtopic.php?p=31444#31444 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3369 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
b2a9d32f29
why do I always forget some lines? sorry...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3368 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
e6ddf135bb
- enabled fetching new crawls via /yacy/list.html?list=queueUrls for testing purposes
...
- sent URLs are taken off the limit-stack (of the global crawl trigger) (may be moved somewhere else in future versions)
- added option to set the requested chunk-size
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3367 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
67d96249b4
- fix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3366 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
c5a2ba3a23
- prepared URL fetch from other peers
...
- more feedback for user
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3365 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
daburna
661a7bb702
*updated translation for
...
-network
-wiki
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3364 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
auron_x
5ba531a722
*) higher precision for QPH also on status-page
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3363 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
4e5eda6ef9
oops...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3362 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
50b59e312f
- added experimental CrawlURLFetch_p-Servlet to fetch new URLs from a specified location (\n-separated list). Requested by Theli.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3361 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
6c6375577e
- fix for http://www.yacy-forum.de/viewtopic.php?t=3523
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3360 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
ea20d8d7c5
- return to edited wiki-page after submit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3359 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
e1edb23689
*) Bugfix for IllegalMonitorStateException
...
See: http://www.yacy-forum.de/viewtopic.php?t=3522
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3358 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
bf69a721cb
more protection against mis-use of YaCyHop interface:
...
- target must not be at port 80
- target may be accessed at most once every 3 seconds
- requester may access at most once every 10 seconds
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3357 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
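The three YaCyHop restrictions listed above can be pictured as one access check. The sketch below is an assumption about how such a check might be structured, not the code from this commit; the class HopGuard, its method allow and the way targets are keyed by host and port are all hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical access guard for illustration; not the real YaCyHop code.
class HopGuard {
    static final long TARGET_INTERVAL_MS = 3000;     // per target
    static final long REQUESTER_INTERVAL_MS = 10000; // per requester

    private final Map<String, Long> lastTargetAccess = new HashMap<>();
    private final Map<String, Long> lastRequesterAccess = new HashMap<>();

    // Returns true and records the access only if all three rules hold.
    synchronized boolean allow(String requester, String targetHost,
                               int targetPort, long nowMillis) {
        if (targetPort == 80) {
            return false; // targets at port 80 are refused
        }
        String target = targetHost + ":" + targetPort;
        Long lastT = lastTargetAccess.get(target);
        if (lastT != null && nowMillis - lastT < TARGET_INTERVAL_MS) {
            return false;
        }
        Long lastR = lastRequesterAccess.get(requester);
        if (lastR != null && nowMillis - lastR < REQUESTER_INTERVAL_MS) {
            return false;
        }
        lastTargetAccess.put(target, nowMillis);
        lastRequesterAccess.put(requester, nowMillis);
        return true;
    }
}
```

Rejected requests are not recorded in this sketch, so a refused requester does not push back its own next allowed time.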
orbiter
a15963ff98
better balancing: if the element from the top would force busy waiting,
...
an element from the bottom of the stack is used instead.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3356 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
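The top-or-bottom rule from the commit above might be sketched like this. The names TopOrBottom and next, the use of host names as stack entries and the minDeltaMillis parameter are assumptions for the example; the actual balancer stores different entries and may decide differently.

```java
import java.util.Deque;
import java.util.Map;

// Hypothetical illustration of "take from the bottom if the top would wait".
class TopOrBottom {
    // Removes and returns the next host. If the host on top was accessed
    // less than minDeltaMillis ago, the bottom element is taken instead.
    static String next(Deque<String> stack, Map<String, Long> lastAccess,
                       long nowMillis, long minDeltaMillis) {
        if (stack.isEmpty()) {
            return null;
        }
        Long last = lastAccess.get(stack.peekFirst());
        boolean topWouldWait = last != null && nowMillis - last < minDeltaMillis;
        return topWouldWait ? stack.pollLast() : stack.pollFirst();
    }
}
```

Note that the bottom element is not checked in this sketch; if it belongs to the same busy host, the caller would still have to wait.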
orbiter
dda24fcb85
oops
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3355 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
8c1d2e0227
protection against crawl balancer failure:
...
a minimum of 500 milliseconds distance between two accesses
to the same domain is now ensured
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3354 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
1f1f398bfa
enhanced speed of RAM cache flush by factor 20 (twenty times faster)
...
- the speed was doubled by avoiding read access during the dump
- the speed was dramatically increased, at least by factor 10,
by using a temporary RAM file where the structures are flushed to
before they are dumped as a single byte chunk to the file system.
The speed enhancements also affect some other parts of the database.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3353 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
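The "temporary RAM file" technique from the commit above, collecting everything in memory and writing it to disk in one go, can be sketched with standard java.io classes. The class BufferedDump and the record layout (key as UTF, length, payload) are invented for the example and are not the format the cache dump actually uses.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.util.Map;

// Hypothetical illustration; the record layout here is made up.
class BufferedDump {
    // Serializes all entries into one in-memory byte array.
    static byte[] serialize(Map<String, byte[]> cache) {
        ByteArrayOutputStream ram = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(ram)) {
            for (Map.Entry<String, byte[]> e : cache.entrySet()) {
                out.writeUTF(e.getKey());          // 2-byte length + key bytes
                out.writeInt(e.getValue().length); // 4-byte payload length
                out.write(e.getValue());           // payload
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return ram.toByteArray();
    }

    // Writes the whole buffer to the file with a single write call.
    static void dump(Map<String, byte[]> cache, File target) throws IOException {
        byte[] chunk = serialize(cache);
        try (OutputStream file = new FileOutputStream(target)) {
            file.write(chunk);
        }
    }
}
```

The trade-off is memory: the whole dump must fit in RAM before it is written.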
orbiter
30d79d69a6
fix for wrong display of search statistics
...
see http://www.yacy-forum.de/viewtopic.php?p=31242#31242
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3352 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
ac376662cc
*) changing alternate link to relative link
...
*) fix for wrong date
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3351 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
b4981187c5
*) adding alternate link to rss
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3350 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
24e3dd4734
*) first version of yacy changelog RSS Feed
...
See: http://www.yacy-forum.de/viewtopic.php?t=3462
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3349 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
daf2e15f59
some storage process enhancements (write without preceding read)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3348 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
hydrox
faad869865
*) added peer-search to Network.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3347 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9c2101a852
small enhancement to cache dump
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3346 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c464157a6e
replaced some toString()
...
see http://www.yacy-forum.de/viewtopic.php?p=31151#31151
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3345 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7673f0869b
minor enhancements
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3344 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
b4aa195c27
added user-agent check for yacy-hop proxy authentication
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3343 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
2d7f7da7ce
fix for null pointer exception
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3342 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
6256d89883
*) bugfix for reg.exp to determine svn rev. nr
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3341 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
d25caa07bf
redesigned some parts of http authentication
...
added another access check for peer hops
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3340 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
588e48ce0b
*) Part II of last commit. Note to myself: check svn commandline syntax :-(
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3339 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
0d2431d6f7
*) removed printed out '<br />' in row Hit-Size Miss-Size by moving <br /> from Java file to HTML file.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3338 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
hydrox
ff829e97f8
*) fixed headlines in blog (see: http://www.yacy-forum.de/viewtopic.php?t=3442 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3337 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago