low012
33e2423bde
*) Tool to find out the percentage of letters in a text. First step to build a tool which detects the language a text is written in.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4010 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
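The letter-percentage idea in the commit above can be sketched as follows. This is a minimal illustration, not the tool from the commit: the function name `letter_percentages` is hypothetical, and folding letters to lower case is an assumption.

```python
from collections import Counter


def letter_percentages(text: str) -> dict[str, float]:
    """Return each letter's share of all letters in `text`, in percent."""
    # Keep alphabetic characters only; case-folding is an assumption of this sketch.
    letters = [ch.lower() for ch in text if ch.isalpha()]
    if not letters:
        return {}
    total = len(letters)
    return {letter: 100.0 * n / total for letter, n in Counter(letters).items()}


if __name__ == "__main__":
    for letter, pct in sorted(letter_percentages("Hello World").items()):
        print(f"{letter}: {pct:.1f}%")
```

A language detector could then compare such a profile against reference profiles per language, which is the follow-up step the commit message mentions.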
low012
3ee9a26ef2
*) deleted unused program
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4009 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
d36786bf37
*) startYACY.sh is compatible with OpenSolaris now (see http://forum.yacy-websuche.de/viewtopic.php?t=203 for details)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4007 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
8d6aa7a66d
replaced detailed search page by ranking definition page (this is what it essentially is)
...
the ranking definition there will influence the normal web search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4006 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7ff4357184
fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=206&hilit=&p=1130#p1130
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4004 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
75d1437340
fix for http://forum.yacy-websuche.de/viewtopic.php?p=1123#p1123
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4002 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9ca46a8c69
indexing of local (intranet) urls enabled
...
To do this, one must create a separate YaCy network that has a local URL domain
A description of how to do this is here: http://www.yacy-websuche.de/wiki/index.php/De:Netzdefinition
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4001 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c48c9735cb
bugfix for property table loader (did not exclude comment lines)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4000 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
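A loader that excludes comment lines, as the bugfix above describes, might look like this. It is a sketch under assumptions: the `#` comment marker, the `key=value` layout, and the name `load_properties` are hypothetical and not taken from the YaCy source.

```python
def load_properties(lines):
    """Parse key=value lines into a dict, skipping blank and comment lines."""
    props = {}
    for raw in lines:
        line = raw.strip()
        # Assumption: comment lines start with '#'.
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a key=value line, ignore it
        props[key.strip()] = value.strip()
    return props
```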
orbiter
6758beae9c
fix for http://forum.yacy-websuche.de/viewtopic.php?p=1092#p1092
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3999 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
511dcbb172
fixed encoding bug made in SVN 3993
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3998 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
ec817a2ff5
removed JAR command from ftpc - produces warnings and is not used by YaCy
...
see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=193&hilit=
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3997 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f5a4efb76e
fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=192&hilit=&p=1034#p1034
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3996 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
bf6f6820ec
fix for another BMP parser problem. This is not a bug, I guess it was caused by a corrupted graphics file.
...
see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=180&hilit=&p=1017#p1017
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3995 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
40b0547611
- documentation changes (removed old forum links)
...
- different handling of link quotation
- different handling of link normalization
- enhanced html/unicode en/de-coding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3993 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
dcb8687904
fix to update cycle
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3992 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
cff019464e
fix for BMP/ICO magic detection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3991 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
557f8d80e4
- better logging
...
- fixed bugs in auto-update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3990 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f323e1813d
added commons.logging again (is used by mimeTypeParser)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3989 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
michitux
a695c93662
Fix of the Status-page for IE:
...
- reverted revision 3982 and 3979
- added a real fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=163
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3988 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
b6d9cca67e
- fixed problem with yacyVersion and own version generation
...
- within this context: generalized date format handling
- extended Update interface:
* a version lookup can be triggered manually
* a complete lookup + download + re-boot process can be triggered with one click
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3986 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
b1630e73c8
addition to fix 3983
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3985 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
6071668c3b
better error message in case that a mime type cannot be found.
...
see also http://forum.yacy-websuche.de/viewtopic.php?f=6&t=132&p=587#p587
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3984 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
d477773fa3
fix for b/w color mode in BMP parser http://forum.yacy-websuche.de/viewtopic.php?f=6&t=180&hilit=&p=942#p942
...
see also
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3983 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
e7a99d765e
fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=171
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3977 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
03847bebc1
removed unused libs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3971 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9da0e53fe8
repaired rss feed reader
...
- removed old rss parser
- removed unused rss parser libraries
- added new rss reader
- added previously removed FeedReader_p.java and adapted it to new rss parser
- adapted parser interface for rss indexing to new rss parser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3970 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
26ddf797eb
added bmp and ico image format to all parser/viewing methods
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3969 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
5444b07674
fixed bug with decompression of index abstracts
...
this fixes a problem that occurred when searching for several words
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3968 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
924ae39170
replaced old map loading method with new implementation which is more robust against change of line termination methods
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3967 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
ac7b6b9c10
added a default favicon for search results from domains that do not have a favicon
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3966 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
89e1848db6
fixed problem with favicons:
...
target servers were able to see search words in the referrer of the favicon fetch.
This has been fixed by using the getImage servlet for the favicon fetch.
Since Java does not support loading of BMP and ICO images, parsers for these formats were added.
The image parsers were coded from the original Microsoft documentation.
This also affects the image-search functionality: there can now be a preview
of found BMP images. Another benefit: favicons for search results are now cached with the HTCACHE.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3965 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
d95a06e2e4
*) cosmetics: fixed a typo
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3964 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7c5c814a47
- simplified code (removed exception handling where not necessary)
...
- added confirmation dialog for shutdown and restart
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3962 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
a4e8ad95ab
enhancements to news and switchboard queue processing
...
removed direct access and replaced by iteration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3961 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
a45216b479
fix to prevent malformed news messages
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3960 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
bec4dbc753
added options and execution methods for automated updates
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3959 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
527b3decde
- re-structuring of configuration menus
...
- added new system update configuration page
- moved system update from status page to system update page
- moved shutdown and restart from status page to main menu
- added new configuration properties to yacy.init (not yet actively used)
- added some methods to handle new automatic update process
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3958 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
4968556668
- fix for broken news queue during iteration
...
- enhancement for searching special news (usage of new iterator)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3957 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
1ea5fa2c04
*) Changed a comment to get rid of this message:
...
[javac] /home/low012/subversion/yacy/trunk/source/de/anomic/http/httpc.java:1117: warning: unmappable character for encoding UTF8
[javac] // if download == null, the get result is stored to a byte[]�and returned,
*) Changed broken link (see: http://forum.yacy-websuche.de/viewtopic.php?t=128 )
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3956 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
5702257a5f
fix for elementCount bug when db was reset
...
possible solution for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=124&hilit=
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3955 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
208b5297f1
enhanced handling of news records:
...
result is a speedup of Surftips, Supporter, and Network page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3954 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f40566f9bb
separate YaCy networks:
...
- added server-side network unit identification
- added server-side network access authorization
- enhanced client-side network authentication essentials generation
- implemented first peer-peer salted-magic authentication method
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3953 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
a9e73b6852
fixed great mess with localization paths. the problem was:
...
automatic re-translation after update did not work. hopefully it does now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3952 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
53266e6497
does not belong to yacy, just a test file (committed by mistake)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3951 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
36a37f758b
fix for oom exception during release download
...
see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=101&hilit=
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3950 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
3421c64d26
implemented update function:
...
after downloading a release using the download button on the status page
the user can choose any of the downloaded versions for an update.
this also enables a downgrade to an older version.
when the update button is pushed, yacy terminates, installs the chosen version
and restarts
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3948 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
21fabe259b
another fix to the restart function; now tested under linux
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3947 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
2158f83d43
*) cosmetics, changed a character to get rid of "warning: unmappable character for encoding UTF8" during compilation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3946 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
28baecd41b
another fix for the concurrentModificationException in AccessTracker
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3944 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
84be912e90
fix for null pointer exception that occurred when missing user-agent in request header
...
see also http://forum.yacy-websuche.de/viewtopic.php?f=6&t=78&hilit=
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3943 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c1aad9e508
added parameter for network graphic background
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3942 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
19786b73b6
next try for a better restart
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3941 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
08d5db6bb4
next try to fix the restart
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3939 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
2ff94b2fb4
another try to fix the restart on linux (it works on mac)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3938 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c5c268c43e
tried to fix restart button
...
** can someone please test this on their linux platform **
** and give feedback on whether the restart works? **
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3937 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
e03fcf4627
SSI fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=29
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3936 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
1a45ecb356
- fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=14&p=137#p137
...
- fix for missing restart script in ant built target
- removed some more synchronization for size() operations
- removed blocking statement on search page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3935 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
1fa4feb8e6
added restart button. should work on linux and mac, but was only tested on mac
...
should of course work on windows as before
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3934 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f1ed91a8e4
added option to allow/disallow DHT transmission during indexing
...
see also http://forum.yacy.de/viewtopic.php?f=9&t=8
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3933 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
87afdfc2a7
fix for long waiting time during deletion of processed news
...
see http://forum.yacy.de/viewtopic.php?f=6&t=6
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3932 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9bbd39b67c
- removed unfinished auto-updater from roland and martin
...
- added new download-option for releases on the status page
still missing:
- thomas-style restart for linux/mac
- untar/gunzip on shell basis
(comes next)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3931 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
154ffd7c2c
fix for wrong http connection version and SSIs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3928 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
1782ef57e5
- added SSI parser and include directive for <!--# include virtual="<file>" -->
...
- added chunked file transfer for non-yacy clients
- SSIs are streamed using chunked transfer, partly delivered pages can be seen in browser before transmission is finished
- added client-side network unit identification
- cleaned up code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3926 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
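The two mechanisms named in the commit above, an SSI include directive and chunked transfer, can be illustrated with a short sketch. It is not the YaCy implementation: `expand_includes`, `chunked`, and the `load` callback are hypothetical names, and only the directive syntax comes from the commit message. The chunk framing follows the standard HTTP/1.1 chunked encoding (hex length, CRLF, data, CRLF, terminated by a zero-length chunk).

```python
import re

# Matches the directive quoted in the commit: <!--# include virtual="<file>" -->
INCLUDE = re.compile(r'<!--#\s*include\s+virtual="([^"]*)"\s*-->')


def expand_includes(page: str, load) -> str:
    """Replace each include directive with the text returned by load(path)."""
    return INCLUDE.sub(lambda m: load(m.group(1)), page)


def chunked(parts):
    """Yield HTTP/1.1 chunked-encoding frames for an iterable of byte strings."""
    for part in parts:
        if part:  # an empty chunk would terminate the stream early
            yield b"%x\r\n" % len(part) + part + b"\r\n"
    yield b"0\r\n\r\n"  # final zero-length chunk, no trailers
```

Because each frame is emitted as soon as its part is available, a browser can render the beginning of a page while later includes are still being produced, which matches the behaviour the commit describes.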
orbiter
6b4cfbd2d6
new network bootstrapping method
...
- no more contact to yacy.net (no remote superseed any more)
- moved superseed file into new network unit definition
- fixed build; includes new network bootstrapping files now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3922 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
0e57a8062b
added network definition for different YaCy networks
...
(needs much more work)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3919 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
auron_x
1d41ebf489
*) made age for deletion of too old seeds configurable
...
*) changed naming-scheme of seed-deletion-properties
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3918 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
auron_x
52cb3208d0
*) old (lastseen > 7d) peers are now automatically removed from passive and potential seed-dbs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3917 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
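The age-based seed cleanup from the two commits above could be sketched like this. The data layout (a dict from peer hash to last-seen timestamp) and the names `drop_old_seeds` and `MAX_AGE` are assumptions for illustration; only the 7-day threshold and its configurability come from the commit messages.

```python
from datetime import datetime, timedelta

# Default taken from the commit message ("lastseen > 7d"); configurable via max_age.
MAX_AGE = timedelta(days=7)


def drop_old_seeds(seeds: dict, now: datetime, max_age: timedelta = MAX_AGE) -> dict:
    """Return only the seeds whose last-seen time is within max_age of now."""
    return {peer: seen for peer, seen in seeds.items() if now - seen <= max_age}
```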
orbiter
815e3da62f
fix for http://www.yacy-forum.de/viewtopic.php?p=37353#37353
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3913 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
michitux
25529290ca
- 2 small changes in documentation
...
- hopefully fixed logging of GCs (in order to avoid things like "performed necessary GC, freed 18014398509481565 KB (requested/available/average: 4096 / 1631 / 2957 KB)") with the help of KoH
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3909 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
c59a7ce5c2
*) hopefully fixed a stupid bug (my fault of course) that sometimes messed up the marking of search words in the snippets (see http://www.yacy-forum.de/viewtopic.php?p=37329#37329 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3908 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
6518bb6c08
changed release strategy:
...
we will provide two different releases in the future, one standard release and one 'pro'-release.
the 'pro'-release contains all additional parsers AND has different default performance values.
The pro-version therefore differs from the previous 'all'-version by these default values.
The pro-configuration is automatically chosen if the libx-folder exists. Once a version is initialized, its configuration stays the same regardless of whether a libx folder exists.
The ant targets have been changed. There are now 3 different targets: one each to create standard and pro-releases, and one to upgrade:
- dist: creates a standard release (only, no libx target any more)
- distPro: creates a pro-release (includes the libx)
- distExt: creates a libx-release which includes the libx-folder only. It may be used to upgrade from standard to pro
Furthermore, the 'dev' naming of releases has been removed.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3902 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
069562a14d
fixed problem with re-crawl; replaced error file-db with ram-db
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3900 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
c7a614830a
several bugfixes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3899 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
465145cb6f
revert to insecure, but dau-proof defaults
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3898 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
7ad11ceaaa
security fix for peers without password. allow access only from localhost
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3897 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
2784820ee3
*) moving sleep to a better place
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3895 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
7a1b811d18
*) bugfix for SocketException:
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3893 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
71fd972ac0
- reduced default search time
...
- caught the case where the web structure cannot be painted because of too little data
- better logging when balance fails
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3892 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
2b937abef1
slightly different behavior in shutdown sequence for http server threads:
...
- first close streams
- make pause (that one that was made in httpdFileHandler)
- close sockets
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3890 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
e4aa8f2a08
disabled more sleep(200)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3889 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
cb38e57622
reduced httpd final waiting time
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3888 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
b4585ad67d
In summer 2005 the first pings were exchanged between YaCy peers.
...
Strangely, this did not always work. To test the protocol I wrote a simple message function, the same one that is still in YaCy today.
But the messages did not work properly either. Alex and I searched for a long time and never found the bug. It turned out that a timing detail could solve the problem; we have not found the cause to this day.
The solution consisted of a short sleep just before the httpd wrote data back to the client. That is of course a terribly bad solution.
Until today this was in the httpd. With this commit I have commented out the sleep, and it is to be feared that something will stop working again.
If the network now breaks down, pings no longer arrive or the like, then it was this sleep that prevented it.
Suggestions welcome.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3887 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
f64d9f1c6c
removed forced termination in case that a previous bad termination is detected
...
this would cause many users to be unsure what to do next and leave them helpless
simply deleting the control file is the same thing that the user is otherwise forced to do
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3885 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
e1d809d5f1
- more detailed logging of MEMORY messages
...
- forced GCs don't contribute to heuristics anymore
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3881 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9f7765863b
bugfix for seed length control routine
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3879 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
0b10ef64ba
better server access tracking
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3878 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
4f5496062c
protection against too large seeds
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3877 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
684ded0e09
added new news types
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3876 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
669f840eab
- added ViewProfile / Impressum (default on) to local peer's robots.txt
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3874 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
d7de0938a6
fix for http://www.yacy-forum.de/viewtopic.php?p=36587#36587
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3870 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
22ee85ca02
- specified exceptions thrown by ResourceInfoFactory and plasmaHTCache.loadResourceInfo()
...
- caught possible NPE in CacheAdmin_p and added more error-cases
- sped up deletion of entries in the local crawl queue by crawl profile (it has been noted often that this deletion is slow)
- added a bit of javadoc
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3868 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
dfd5e823c3
automatic limitation of web structure host count
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3867 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
8b0aea6910
fixed automatic deletion of too many referenced hosts in web structure
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3866 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
5dd9acc2a7
removed calls to deprecated methods
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3865 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
66ec8b63c1
added a httpd access tracker:
...
- all requests to the own httpd can now be listed in the access tracker menu
- the search statistics have been renamed to access tracker and extended by this tracker
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3861 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
71ca9aa6d4
- fix for changed blacklist types
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3857 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
9a8a87612d
added new qph column to search tracker servlet
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3854 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
e07458bad4
added time-out function to web analysis
...
the default time-out is 1 second
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3852 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
hydrox
4a1bc4743a
*)News-entries with blacklisted URLs are now ignored
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3849 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
6074264267
dynamic rights.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3847 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
99062c0c9e
*) SOAP should support authentication against the user-DB now (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3846 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
339153d40e
*) favicons that are specified in the document content via html link-tags
...
are now detected and displayed on the search page (requested by allo).
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3845 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
854eb1492f
.yacy /.yacyh urls for the feedreader
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3844 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
7a5b22a0b8
Integration of FeedReader in Bookmarks.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3841 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
6265d321bd
- more constants
...
- display why global search is not available on search page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3839 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
7921f07c9d
userDB fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3837 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
7b2e1bb8f2
Feedparser with reflection.
...
TODO: This needs a special build.xml entry
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3832 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
8bff810d19
- fixed logging output of serverMemory.request()
...
- don't start up if DATA/yacy.running exists as this is usually a sign of an already started yacy-instance
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3831 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
rramthun
18a5380ee3
*) situation-dependent lock-buttons for search-page
...
*) removed one unused import and a double definition of "ogg" as media-type
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3817 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
f05ca43780
- the wiki-parser works for remote wiki-code now, not displaying links anymore as if they were local (ViewProfile comment)
...
- fixed wrong link to CrawlStart on Status-page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3816 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
9d6605a83c
- fixed NPE in Blacklist Cleaner during deletion of more than one duplicate entry
...
- don't display responseHeader1.db in CacheAdmin_p anymore
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3814 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
594ff95955
:-(
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3801 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
4ca797401e
fix for ConcurrentModificationException
...
see http://www.yacy-forum.de/viewtopic.php?p=36566#36566
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3800 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
7b904e0077
integrated robots.txt crawlDelay into the crawl balancer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3797 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
52cb033f01
- slightly different painting of web structure picture:
...
hosts that have many own connections are painted farther away (this is not yet cato's idea, this will be implemented in another step)
- doc update
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3796 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
30c3d909b1
- fixed charset problem in ConfigProfil_p.html (use accept-charset="UTF-8" in forms)
...
- fixed wrong XML output if no peers are known in Network.xml
- simplified parsing of table properties in wikiCode and ZTableToken
- reimplemented GC heuristics. They are needed to constantly ensure that an amount of free memory is available which is higher than Java's max. limit for performing a Full GC (please use serverMemory.request(long, boolean) rather than serverMemory.available(long, boolean) to provide data for averaging over the last GCs)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3793 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
6c9df13552
more debugging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3791 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
4392ee0c51
BugFix for typo and wrong include
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3789 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
d1e1580223
Surftips Blacklist
...
Blacklists List Hardcoded instead of only updated on firststart / migration.java
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3788 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
e1a5babff1
*) Logging GUI handler: line-size is now set to max-size if max-size was exceeded
...
See: http://www.yacy-forum.de/viewtopic.php?p=36355
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3786 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
94cc9f05f5
*) Improvements for restart via update wrapper
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3785 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
hydrox
44bac7dea1
*) blog-comments can now be moderated
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3778 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
borg-0300
2ab020445a
bugfix, i think - http://www.yacy-forum.de/viewtopic.php?t=4059
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3777 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
f89517203d
*) SOAP: new function to get the Performance Settings of Queues and Processes
...
No items left in the yadmin SOAP-TODO :-)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3776 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
92351c4dcb
*) SOAP: bookmarks list now indicates if a bookmark is private (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3775 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
957a25afff
getRight(rightName) instead of get...Right()
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3774 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
e0e46d3aec
*) SOAP: new function doGarbageCollection (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3773 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
1efe607c34
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3771 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
low012
a0149317ac
*) fixed bug where headlines were added to directory of a wiki page multiple times ( http://www.yacy-forum.de/viewtopic.php?t=4034 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3762 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
ef24bed406
Sorry...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3760 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
(no author)
a29cb2e1af
blupp
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3759 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
651b05ba43
*) wsdl file updated (requested by KoH)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3758 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
0ae6664ad8
enhanced web structure picture
...
- hand-over of get properties from web front-end to graphics generation
- added depth-control buttons
- added marking of anchor-points to highlight relation order
- enhanced ymage graphics library
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3757 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
a585b4d41b
added web structure image
...
see http://localhost:8080/WatchWebStructure_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3747 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
33ad0c8246
added a web structure computation and logging:
...
- every web page parsing operation now adds to a web structure file
- the file is computed in memory and dumped at shutdown-time to PLASMASB/webStructure.map in readable form (not a database)
- the file can be used externally to analyse the link structure of the crawled pages
- the web structure can also be retrieved using an XML interface at http://localhost:8080/xml/webstructure.xml
- the short-term purpose is the computation of a link-graph image (before linuxtag!)
- a long-term purpose could be a decentralized computation of the citation rank
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3746 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
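The web structure commit above describes a link structure that is built in memory and dumped in readable form. A minimal sketch of that idea follows. The class name WebStructureSketch, its methods, and the host=target:count line format are assumptions for illustration only; they are not YaCy's actual classes or the real webStructure.map format.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch, not YaCy's implementation.
final class WebStructureSketch {
    // source host -> (target host -> number of links seen)
    private final Map<String, Map<String, Integer>> links = new TreeMap<>();

    // Called once per link found while parsing a page.
    void addLink(String sourceHost, String targetHost) {
        links.computeIfAbsent(sourceHost, k -> new TreeMap<>())
             .merge(targetHost, 1, Integer::sum);
    }

    // Renders the structure as readable text, one source host per line.
    // The line format is an assumption chosen for this example.
    String dump() {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, Map<String, Integer>> e : links.entrySet()) {
            sb.append(e.getKey()).append('=');
            boolean first = true;
            for (Map.Entry<String, Integer> t : e.getValue().entrySet()) {
                if (!first) sb.append(',');
                sb.append(t.getKey()).append(':').append(t.getValue());
                first = false;
            }
            sb.append('\n');
        }
        return sb.toString();
    }
}
```

In such a design the dump() result would be written to a file once at shutdown, which keeps the file readable by external tools at the cost of losing unsaved data on a crash.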
karlchenofhell
7904175338
- sorry for typos
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3743 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
baa9402b97
- wiki-parser is now configurable via the config setting wikiParser.class which holds the class-name for the parser to use
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3742 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
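The commit above makes the wiki parser selectable through the config setting wikiParser.class. A common way to do this in Java is to load the named class by reflection. The sketch below assumes a hypothetical interface WikiParserSketch and a hypothetical default implementation PlainWikiParser; only the setting name wikiParser.class comes from the commit message.

```java
import java.util.Properties;

// Hypothetical parser interface, for illustration only.
interface WikiParserSketch {
    String transform(String text);
}

// Hypothetical default parser that returns its input unchanged.
final class PlainWikiParser implements WikiParserSketch {
    public String transform(String text) {
        return text;
    }
}

final class WikiParserLoader {
    // Instantiates the class named by "wikiParser.class",
    // falling back to PlainWikiParser if the setting is absent.
    static WikiParserSketch load(Properties config) {
        String className = config.getProperty(
                "wikiParser.class", PlainWikiParser.class.getName());
        try {
            Class<?> cls = Class.forName(className);
            return (WikiParserSketch) cls.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException | ClassCastException e) {
            throw new IllegalStateException("cannot load wiki parser " + className, e);
        }
    }
}
```

The named class needs a no-argument constructor and must implement the interface; otherwise load fails with an IllegalStateException.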
karlchenofhell
601fc7d1c5
- added source to J7Zip-modifed.jar and its license (changelog is still to come)
...
- moved HTML-*replace-methods from wikiCode to de.anomic.data.htmlTools
- prepared use of different wiki parsers as suggested here: http://www.yacy-forum.de/viewtopic.php?p=34444#34444
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3741 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
0a64047081
- plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method)
...
- added 7zip parser
- added 'text/sgml' to realtime parseable mimetypes (sometimes returned by the mime type parser)
- added new cached output stream class, very suitable for parsers because of limited memory
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3740 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
b1680ab71f
*) bugfix for ArrayIndexOutOfBoundsException in robots-parser (thanks to low012)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3739 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
b30e64daab
*) passing homepath to serverLog.configureLogging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3738 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
theli
9a4375b115
*) robots.txt: adding support for crawl-delay
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3737 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
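The commit above adds support for the Crawl-delay field in robots.txt. As a rough illustration of what reading that field involves, here is a simplified sketch. The class CrawlDelaySketch and its method are hypothetical and not YaCy's robots parser; the sketch ignores User-agent grouping and returns the first Crawl-delay value it finds.

```java
// Hypothetical helper, for illustration only.
final class CrawlDelaySketch {
    // Returns the first Crawl-delay value in seconds, or -1 if none is found.
    static int parseCrawlDelay(String robotsTxt) {
        for (String rawLine : robotsTxt.split("\r?\n")) {
            // Drop trailing comments, then surrounding whitespace.
            int hash = rawLine.indexOf('#');
            String line = (hash >= 0 ? rawLine.substring(0, hash) : rawLine).trim();
            int colon = line.indexOf(':');
            if (colon < 0) continue;
            String field = line.substring(0, colon).trim();
            if (!field.equalsIgnoreCase("Crawl-delay")) continue;
            try {
                return Integer.parseInt(line.substring(colon + 1).trim());
            } catch (NumberFormatException e) {
                // Malformed value: keep scanning for a usable line.
            }
        }
        return -1;
    }
}
```

A crawler would then wait at least that many seconds between two requests to the same host.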
orbiter
11ac7688d5
reverted a part of last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3736 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
b3f97b5c38
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3735 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
3c5ff7f735
adopted kelondroBytesIntMap to kelondroIntBytesMap
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3734 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
5551ff5306
enhanced index storage data structure kelondroBytesIntMap
...
this now stores two index structures, one for data that is acquired during start-up
and one for data that is acquired during run-time. This reduces the grow factor, and should reduce the amount of memory needed if an index reorganisation happens.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3733 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
orbiter
872eb46cb9
some redesign of the handling of the index for kelondroFlexTable
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3732 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
086239da36
- added servlet: remote crawler queue overview
...
- added servlet: crawl profile editor
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3731 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
karlchenofhell
8ba81e0995
- added some comments (will get more in the near future)
...
- added missing <label> to the search field in Network.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3728 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago
allo
65a8a9fc58
fix for nullpointer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3726 6c8d7289-2bf4-0310-a012-ef5d649a1542
18 years ago