orbiter
258fd9eb8e
adding missing file for websearch refactoring
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@894 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
77ae30063d
refactoring of websearch process
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@893 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
579b22d8ff
small update to network drawing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@892 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
2b5829c3da
small fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@891 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
e85989510a
update to network image; added disconneced peers by disconnection time and changed colors
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@890 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
4c7918f5b5
added shotdown to crawl stacker (moved from 882)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@889 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
2851658c2a
re-integrated Martins last change to crawl stacker from svn 882 that I had deleted accidently
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@888 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c83594528c
integrated crawl stacker into thread control
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@887 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
16a49c1c9d
fix for graphics generation bug, see http://www.yacy-forum.de/viewtopic.php?p=10987#10987
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@886 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
d666b61b83
fix for news-deletion, see also http://www.yacy-forum.de/viewtopic.php?p=11000#11000
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@885 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
4180c422e8
cleaned, finals, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@884 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
45f55a6fad
*) Bugfix for wrong index-queue size displayed on status page
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@883 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
959eefbc4f
*) Robots.txt parser/ppt
...
cutting of comments at the line end
*) Adding Threadpool for stackCrawl Thread to speedup robots.txt download
and double url checks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@882 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f6024792b1
download-link update
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@881 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c1c94111b0
added new network picture at Network menu using the new image-servlet method
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@880 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
a1777788a5
small change
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@879 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
d98ed41f31
update to image painter and test on http://localhost:8080/imagetest.png
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@878 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
ddb0876d01
Installer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@875 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
f65c939a60
userDB Auth
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@874 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
5153ec0f3e
update to image painter
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@873 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
7c6b85aea2
Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@872 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c6f0e35a32
removed notifier.gif; this file is created automatically after startup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@871 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
1b2db0b52a
fix for file-share access; damaged some commits before by me :-(
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@870 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
f875eda9b7
changed 2, added 5 new picture for more information
...
added runtime to the list
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@869 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
1a5d98cd6d
better imagePainter example and fix for typo http://www.yacy-forum.de/viewtopic.php?p=10920#10920
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@868 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
afc5ef2819
cleaned, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@867 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
68aa215479
cleaned
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@866 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
60ab32c313
removed obsolete htmlPlotter
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@865 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
06f4ac494a
image-painter test; try with http://localhost:8080/imagetest.png
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@864 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
fb27428674
added restart to Status.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@863 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
0dcfc25d98
cleaned, finals, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@862 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
13dd16d1dd
small change
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@861 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
5c4db6968e
sorted URL list, StringBuffers, finals, cleaned, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@860 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
2a18e6486b
cleaned, finals, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@859 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
4191b21e73
cleaned, finals, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@858 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
64acb46a91
cleaned, finals, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@857 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
52168fab9b
cleaned, finals, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@856 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
be60b512b7
fix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@855 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f6cf3967de
fix for compile-bug in svn 583 (Martin guck mal ob das richtig ist: fifo oder filo-stack?)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@854 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
a2fa75e688
*) Asynchronous queuing of crawl job URLs (stackCrawl)
...
various checks like the blacklist check or the robots.txt disallow check are now
done by a separate thread to unburden the indexer thread(s)
TODO: maybe we have to introduce a threadpool here if it turn out that this single
thread is a bottleneck because of the time consuming robots.txt downloads
*) improved index transfer
The index selection and transmission is done in parallel now to improve index
transfer performance.
TODO: maybe we could speed up performance by unsing multiple transmission threads in
parallel instead of only a single one.
*) gzip encoded post requests
it is now configureable if a gzip encoded post request should be send on
intex transfer/distribution
*) storage Peer (very experimentell and not optimized yet)
Now it's possible to send the result of the yacy indexer thread to a remote peer
istead of storing the indexed words locally.
This could be done by setting the property "storagePeerHash" in the yacy config file
- Please note that if the index transfer fails, the index ist stored locally.
- TODO: currently this index transfer is done by the indexer thread.
To seedup the indexer
a) this transmission should be done in parallel and
b) multiple chunks should be bundled and transfered together
*) general performance improvements
- better memory cleanup after http request processing has finished
- replacing some string concatenations with stringBuffers
- replacing BufferedInputStreams with serverByteBuffer
- replacing vectors with arraylists wherever possible
- replacing hashtables with hashmaps wherever possible
This was done because function calls to verctor or hashtable functions
take 3 time longer than calls to functions of arraylists or hashmaps.
TODO: we should take a look on the class serverObject which is inherited from hashmap
Do we realy need a synchronization for this class?
TODO: replace arraylists with linkedLists if random access to the list elements is not needed
*) Robots Parser supports if-modified-since downloads now
If the downloaded robots.txt file is older than 7 days the robots parser tries to
download the robots.txt with the if-modified-since header to avoid unnecessary downloads
if the file was not changed. Additionally the ETag header is used to detect changes.
*) Crawler: better handling of unsupported mimeTypes + FileExtension
*) Bugfix: plasmaWordIndexEntity was not closed correctly in
- query.java
- plasmaswitchboard.java
*) function minimizeUrlDB added to yacy.java
this function tests the current urlHashDB for unused urls
ATTENTION: please don't use this function at the moment because
it causes the wordIndexDB to flush all words into the
word directory!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@853 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
01db66dc69
implemented image-servlets. the imagetest will stay there only for a limited time. Now images can be generated on-the-fly from servlets
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@852 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f6a0e0f162
small bugfix to readFully
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@851 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
6d5d0ac801
bugfix for startup problems
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@850 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
a9c466ef21
cleaned, finals, StringBuffer, Properties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@849 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
37ee53856c
cut for release 0.41
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@848 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
5dc0d41900
bugfix in kelondroRA (hint by Martin)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@847 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0c3a20d44f
more + changed log for better understanding of outOfMemory bug and others
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@846 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
e5f8163203
fixed a bug with news; news moving could lead to shurtcut loop / 100% CPU; appeared when clicked on a 'Profile' news in Network menu
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@845 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
d8544e8416
cosmetic
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@844 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fbb5e36b80
documentation update
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@843 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago