danielr
d32fe84472
added default User-Agent
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4763 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
8c5f062e0b
corrected YaCy version in HTTP User-Agent
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4762 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
d7b21bc90c
re-added gzip POST for transferRWI/URL (HTTP/1.1 compliant)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4761 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
f1ori
a5a1f19368
* allow forcing login for xbel, needed for yacybar
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4757 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
low012
8d83febb95
*) BlacklistCleaner_p.java reports exception to log instead of System.err
...
*) changes in formatting for better readability in BlacklistCleaner_p.java
*) replaced test for necessary Java version (was 1.4.2, is 1.5 now)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4756 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
d4bce6affd
refactoring (initialized static fields, removed empty if/else, serialized some fields in serializable classes)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4755 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
daburna
19ca452666
updated language file
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4754 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
be2c9c07ff
escape some unescaped characters in URLs (fixes problems with proxy)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4753 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d0678f7ab9
refactoring as result of
...
http://forum.yacy-websuche.de/viewtopic.php?f=6&t=959&p=7560#p7560
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4752 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
483e9a2066
- shifted tld recognition methods from yacyURL to serverDomains
...
- changed the isLocal property so that it is possible to see whether a domain is on the internet (and not in an intranet)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4751 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a3df23659c
re-implementation of charset checking
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4750 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
75a1702133
- fix for ConcurrentModificationException during shutdown
...
- fix for Ranking distribution problem (suma-lab peer does not exist any more)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4749 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
27ab0a5f89
fixed XSS problem in ConfigProperties
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4748 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
32b5b057b9
- modified, simplified old kelondroHTCache object; I believe it should be replaced by something completely new
...
- removed tree data type in kelondroHTCache
- added new class kelondroHeap; may be the core for a storage object that will one day replace the many-files strategy of kelondroHTCache
- removed compatibility mode in indexRAMRI
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4747 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d3715e02ae
removed double/redundant servlet Config_p
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4746 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
ec84a52adb
change for problem with NPE (seen as "PROXY Unknown Error while processing request")
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4745 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
5813cc149f
fix for bad rssTerminal behavior
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4744 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
88216c1f1f
fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1103&hilit=&p=7362#p7362
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4743 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d0b893523e
- protection against RAM overflow caused by new peer rss news
...
- more XSS protection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4742 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
685794e7e7
fix for parser/encoding Exception
...
see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1111&hilit=&sid=55a320b54e1e3bda9410e7c50b5147f1&p=7431#p7431
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4741 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
cf042e6957
reverted a change made by mistake in yacyVersion
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4740 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9935e83c86
added a new news window to the status page. At the moment it is just a test.
...
The news in the window covers peer arrivals and departures, remote search accesses and crawls
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4739 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
bac38cfa18
added very rudimentary peer news as rss feed. An example can be retrieved with
...
http://localhost:8080/xml/feed.rss?channel=PEERNEWS
to be extended and integrated in interface ...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4738 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
6495227ad6
the class rssReader is replaced by RSSReader, RSSFeed and RSSMessage
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4737 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
724bbdf9b2
refactoring of RSS reader
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4736 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
b9a2a2d287
more search performance hacks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4735 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
ff755fb858
small corrections and enhancements after search timing profiling
...
search should be a little bit faster now
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4734 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
0702dd2507
added a profiling script to analyse search process timing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4733 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d0e2830e01
enhanced the thread dump to make it usable for scripted remote-debugging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4732 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
e024e3b9cf
added new default profiles to distinguish snippet fetch for local and global search
...
the difference is that a local search will not cause a re-indexing of loaded pages
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4731 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2c0c8f0f0c
SRU compliance according to
...
http://www.loc.gov/standards/sru/specs/search-retrieve.html
The example given on this page can be used to retrieve opensearch-compatible rss pages with YaCy
Try it:
The transcription to YaCy's search servlet address is
http://localhost:8080/yacysearch.rss?version=1.1&operation=searchRetrieve&query=dinosaur&maximumRecords=1&recordSchema=dc
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4730 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
9b03310f8a
I'm awake now :/
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4729 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
7bd8601f04
made deletion of old releases compatible with Java 1.5 ;)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4728 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
e90282da1c
added experimental javascript terminal for rss feeds (not used anywhere yet, expect the worst)
...
.. possibly to be used as content for iframes within monitoring pages
not ready yet!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4727 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
da386a1924
fixed deleteOldDownloads if there are no downloads
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4726 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
21418a22a3
removed DEBUG output
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4725 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
79a3edeeef
deleting downloaded releases after x days (default 30)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4724 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
763f9d4f5d
serverCore: setting timeout for new connection before SSLDetect
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4723 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
1995faef8d
- refactoring of Collage back-end: moved to plasma package
...
- also renamed plasmaCrawlResults for consistent naming of the url and image queues
- added a double-check for the images
- added additional queues for the images: all worse-quality images go there, so the queue can be used also if no sizes are given; no image is lost
- added a cleanup for the stacks so they cannot flood the memory
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4722 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d7e89c2aca
fixed near-deadlock situation when deleting crawl profiles
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4721 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
5e3ce46339
- better logging when rejecting a url because it is not in declared domain
...
- more XSS attack protection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4720 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
6d1be66822
- longer refresh rate for reload of WatchCrawler page forwarding to indexing start (does not work in IE)
...
- better names for search pages
- Release 0.58
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4719 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
apfelmaennchen
2149728227
- major rework on YaCy-UI
...
- search results are retrieved from rss/xml, no other servlet needed
- added double accordion sidebar menus
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4718 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
c270d02176
Reverting SVN 4716
...
Accidentally setting a 1.6 target means that, with automatic release generation and updates for users
who only have Java 5 installed (which is the case for all Mac users), large parts of the network can crash and have to be
brought back up manually.
New users who start with a dev release cannot start a web search at all with the intranet setting.
Please always check after a commit what you have actually checked in.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4717 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
48ffd61e6a
changed "patched wrong" to warning, so it goes to the logfile
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4716 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2f629d20a7
- tried to fix the '4217666-problem'
...
- removed more unused code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4715 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
512f48e7d6
- removed unused methods
...
- fixed XSS attack on peer list in CrawlStartSimple
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4714 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
14384e7a45
deactivated unnecessary and very CPU-intensive deletion check for blacklisted URLs in index receive
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4713 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
f1ori
701f769c66
* removed comma, which caused invalid xml
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4711 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
3c76342619
- added servlet to configure the search page greeting line
...
- added information output about the current network definition in the network servlet
- better description and usage of profile entries in User Profile servlet regarding FOAF format
- reformatting of menus on the status page
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4710 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago