low012
48463c4507
*) General private License? ;-)
...
*) minor code changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7432 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
c93f4dda72
- cleaned up yacy news
...
- removed unused methods
- avoid news generation in case that the peer runs in robinson mode
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7431 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
6c1b14c8e1
- more control in access tracker: count number of returned search results (not only info how much is in the index)
...
- extended query params for this
- enhanced cora
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7430 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
9f38c0023d
*) Minor changes, mainly cleaning up a little bit, no functional changes.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7428 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
54e77e6255
refactoring
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7426 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
feefe17568
npe assert fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7424 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
733903f2c9
fix for http://forum.yacy-websuche.de/viewtopic.php?p=21489#p21489
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7422 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
24e4126eee
added JSON parser code from json.org (added generics to it)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7421 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
10ae8d961b
- cora package now has no dependencies on other yacy packages and becomes a 'base' package (refactoring)
...
- cleaned up (removed special code and documentation for 27c3)
- added remote search functions to be used within cora
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7420 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
lotus
0e54233408
UPnP: map port again if we are not reachable (e.g. when router rebooted)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7419 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
lotus
b1484299b2
same units for memory observer configuration (MiB)
...
old setting for DHT (RAM) will be lost after update
can be set on /Performance_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7418 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
387db84087
maybe found bug in non-working index dumper
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7414 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
89ae6101b9
fix for NPE and added comment in search result
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7412 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
0769f4caa6
added search suggestions for interactive search: is only shown if there are no search results
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7411 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
a4c9d27287
- moved some variables from Switchboard to new class AccessTracker
...
- added a limitation in access tracking to delete queries which are older than 10 minutes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7410 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
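The 10-minute limit on tracked queries described above can be sketched roughly as follows. This is only an illustration: the class QueryLog, its methods, and the assumption that entries arrive in chronological order are hypothetical and are not taken from the AccessTracker class.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical sketch, not the YaCy AccessTracker implementation.
final class QueryLog {
    // Queries older than 10 minutes are dropped.
    static final long MAX_AGE_MS = 10L * 60L * 1000L;

    // Timestamps in milliseconds, oldest first (assumes chronological inserts).
    private final Deque<Long> timestamps = new ArrayDeque<>();

    void add(long timeMs) {
        timestamps.addLast(timeMs);
    }

    // Remove entries from the head of the queue until the oldest one is young enough.
    void prune(long nowMs) {
        while (!timestamps.isEmpty() && nowMs - timestamps.peekFirst() > MAX_AGE_MS) {
            timestamps.removeFirst();
        }
    }

    int size() {
        return timestamps.size();
    }
}
```

Because the queue is ordered by time, pruning only ever has to look at the oldest entries.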
f1ori
e4aabaa1c3
* fix negative filelength for files >2G
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7408 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
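A negative length for files larger than 2 GB is the typical symptom of a 64-bit byte count being narrowed to a 32-bit int. The sketch below only illustrates that failure mode; the class and method names are hypothetical and this is not the actual patch.

```java
// Hypothetical illustration of the overflow, not YaCy code.
final class FileLengthDemo {

    // Buggy variant: narrowing a long byte count to int wraps around above 2^31 - 1.
    static int lengthAsInt(long bytes) {
        return (int) bytes;
    }

    // Safe variant: keep the byte count as a long from end to end.
    static long lengthAsLong(long bytes) {
        return bytes;
    }

    public static void main(String[] args) {
        long threeGiB = 3L * 1024 * 1024 * 1024; // 3221225472 bytes
        System.out.println(lengthAsInt(threeGiB));  // negative after narrowing
        System.out.println(lengthAsLong(threeGiB)); // unchanged
    }
}
```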
orbiter
cdfe8afe3f
fix for really bad table iteration implementation: reduction of IO
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7407 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
9eae33f886
*) Ooops...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7406 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
a001e8075c
*) minor enhancements
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7405 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
11ea966f9e
*) added SID file (Commodore 64) sound file parser
...
*) minor changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7403 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
ee3cef91e8
* fix filesize in ftp crawls
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7402 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
b2ed4cfaf8
more small bugfixes and light refactoring
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7401 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
3ca06d6290
patch for http://forum.yacy-websuche.de/viewtopic.php?p=21460#p21460
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7399 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
903c824c2c
- allow only scanned resources with granted status
...
- increased time-out when scanning an ip range
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7398 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
936e976c23
*) added FreeMind ( http://freemind.sourceforge.net/ ) mindmap parser
...
*) minor changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7397 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
3d95981f7d
*) cleaning up the code a little bit
...
*) minor changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7396 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
2a6499364d
*) minor changes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7395 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
c0274bd123
*) minor changes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7394 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
fe46536f6e
enhanced network scanner (less name resolving during scanning and no name resolving during search)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7392 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e753027c43
fix for http://forum.yacy-websuche.de/viewtopic.php?p=21439#p21439
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7390 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
bf4ef1513e
- fix for map view
...
- remove some UNRESOLVED PATTERN
- maybe a fix for non-flushing cache
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7389 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
6b70393d1d
- new java version 1.6
...
- replaced old gif animator by java 1.6 gif animator
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7388 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e88c428008
fix to ftp loader
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7387 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
59b70a5a92
another fix to the ftp crawler: now correct directory listings according to rfc2640 (path with spaces) and better title names for such files
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7386 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
9b25a33fd9
- fixed numerous bugs
...
- better document names
- fixed problem with ftp crawling
- added automatic removal of search results from services that are not online according to the latest network scan: this does not delete the index but only hides the results. After the next network scan in which the server is available again, the results are shown again.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7385 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
7bdb13bf7f
more fixes to smb crawling: better file names
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7384 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
94c48500cc
several fixes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7383 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
0ac7311a62
fix for token parser
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7382 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
58b59f9bc8
- a collection of bug fixes and some redesign of the Scanner class
...
- fixed smb crawling
- added smbget to download script generation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7381 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
c54170421a
fix for npe
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7379 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
c288fcf634
redesigned CrawlStartScanner user interface and added more features:
...
- multiple hosts for environment scans can be given (comma-separated)
- each service (ftp, smb, http, https) for the scan can be selected
- the scan result can be accumulated or refreshed each time a network scan is made
- a scheduler was added to repeat a scan and add all found urls to the indexer automatically
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7378 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
6f4f957e50
*) cleaning up the code a little bit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7377 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
2521677a45
* deny adminForLocalhost and intranet network setup also on bootup and not only on network switch
...
* require authentication for yacybot whatever adminForLocalhost is set to
(after this patch, is the rule from above really necessary?
the crawler also checks the robots.txt)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7376 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
9d2159582f
* fix system update if urls are in blacklist (for example for very general blacklists like *.de)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7375 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
56264dcc17
- added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls
...
- integrated new parser into loader processes: enrich document parser
- fixed a concurrent modification exception in kelondro iterator
- hand-over of document size from crawler to indexer
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7374 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
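The idea of deriving index words from CamelCase URL parts can be sketched as below. The class CamelCaseSplitter and its splitting rule are assumptions made for illustration (and the code uses current Java syntax); the parser added to MultiProtocolURI may work differently.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch, not the parser from MultiProtocolURI.
final class CamelCaseSplitter {

    // Splits a token before every upper-case letter that follows a
    // lower-case letter or digit, then lower-cases the parts.
    static List<String> split(String token) {
        List<String> words = new ArrayList<>();
        for (String part : token.split("(?<=[a-z0-9])(?=[A-Z])")) {
            if (!part.isEmpty()) {
                words.add(part.toLowerCase(Locale.ROOT));
            }
        }
        return words;
    }
}
```

With this rule a run of capitals such as "URI" stays together as one word, while "MultiProtocol" is split in two.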
orbiter
99a7fe87f9
- removed old intranet scanner (the generic scanner now completely subsumes the old one)
...
- added information about granted access
- enhanced servlet design
- added submit-feedback (because it is a long-running task)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7372 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
acab6801d9
added new network scanner
...
- you can scan any ip or host in the internet for services
- this replaces the intranet scanner
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7371 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
14e4fae8e9
fixes to ftp client
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7369 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
a563b05b60
enhanced crawler:
...
- added a new queue 'noload' which can be filled with urls where it is already known that the content cannot be loaded. This may be because there is no parser available or the file is too big
- the noload queue is emptied with the parser process which indexes the file names only
- the 'start from file' functionality now also reads from ftp crawler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7368 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
c36da90261
added a very fast ftp file list generator to site crawler:
...
- when a site-crawl for ftp sites is started, a special directory-tree harvester now gets the complete directory structure of an ftp server at once
- the harvester runs concurrently and feeds into the normal crawl queue
also in this:
- fixed the 'start from file' crawl function
- added a link detector for the html parser. The html parser can now also extract links that are not included in <a> tags.
- as a result, a crawl can now also be started from plain-text link files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7367 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
4565b2f2c0
removed the display option from index.html, yacysearch.html and yacyinteractive.html
...
instead, a setting at ConfigPortal.html can be made to define if the topmenu shall be shown at these pages or if there is no navigation at all.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7366 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
fc2e41e691
added a forwarder for the default page. The forwarder forwards a browser to a different page if the root file index.html is accessed. This can be done by setting the name of the forwarder page to the field
...
"Default index.html Page (by forwarder)" in /ConfigPortal.html
The purpose is to forward to /yacyinteractive.html for the 27C3 FTP search platform
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7365 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
db99db4be9
some redesign of the search-fail-response mechanism:
...
when a search fails for a single url because the snippet cannot be generated, then the url reference is deleted from the index. This mechanism was redesigned and enhanced. The process now also writes into the table searchfl in the work tables to prepare a re-indexing mechanism.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7364 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
4915d1781a
* use local backup-file, if remote network-definition is not available
...
* resolve single point of failure in networks, managed by central network-definitions
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7363 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
18d33b5c6d
fixed several search result navigation bugs
...
fixed bad behaviours during search result collection
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7362 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
49b5a206cd
- better calculation of search result size
...
- predefined search recommendations
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7361 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
4e2c14efbb
fixed bugs in parser and ftp client
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7360 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
d78e322e84
added a directory-structure reader to ftp client
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7359 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
f0651e5f2f
added image search to yacyinteractive.html
...
as a result, the search result view switches from list format to image preview format when a search is restricted to png, gif or jpg documents
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7358 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
fffb91447a
fixed crawl queue delete function
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7357 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
b769cce433
- added a catch-all parser for all documents that cannot be parsed: they will contribute only their document url to the search index
...
- enhanced the pdf and torrent parser: better documents titles
- enhanced the ftp client: more time-out time
- fixed bugs in json for search results
- enhanced yacyinteractive.html: added a file type navigator and a download-script generator for search result files
Please have a look at yacyinteractive.html: this will become the hacker-download tool for 27c3!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7355 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
21e84539e8
one more fix to Domains
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7353 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e192d61972
fix for latest commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7352 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
22453b13ad
implemented local host address discovery as posted in
...
http://forum.yacy-websuche.de/viewtopic.php?p=21310#p21310
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7351 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
cc6499bf8d
- added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed
...
- renamed YaCy's search result modifier keywords for RECENT, NEAR and language: to the blekko slashtag naming scheme. YaCy now supports the following built-in blekko-like slashtags:
/date
- for search results ordered by date (most recent up)
/near
- for search results where search words appear near to each other (closest up)
/language/<lang>
- for a sorting by language where the wanted language gets up. Example: /language/de
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7350 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
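How the three slashtags listed above could be separated from the ordinary search words is sketched below. The class SlashtagQuery and its fields are hypothetical and only illustrate the naming scheme; this is not YaCy's query parser.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not YaCy's query parser.
final class SlashtagQuery {
    final List<String> words = new ArrayList<>(); // ordinary search words
    boolean byDate = false;                       // set by /date
    boolean near = false;                         // set by /near
    String language = null;                       // set by /language/<lang>

    static SlashtagQuery parse(String query) {
        final String langPrefix = "/language/";
        SlashtagQuery q = new SlashtagQuery();
        for (String token : query.trim().split("\\s+")) {
            if (token.equals("/date")) {
                q.byDate = true;
            } else if (token.equals("/near")) {
                q.near = true;
            } else if (token.startsWith(langPrefix) && token.length() > langPrefix.length()) {
                q.language = token.substring(langPrefix.length());
            } else if (!token.isEmpty()) {
                q.words.add(token);
            }
        }
        return q;
    }
}
```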
orbiter
a9f754c45f
removed unused CR accumulation and distribution process
...
this was never used or extended in recent years. The resulting YBR ranking criterion
is still a good idea and will be used in the future. Possible generation methods for YBR
ranking are:
- "trust-rank" using the link structure as can be discovered in a single crawl (idea from FSCONS)
- "block-rank" calculated from the local link structure
- a distributed "block-rank" using the xml API to the link structure from other peers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7349 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
3d945bb442
fix for ftp client: suppress bad directory listing time-out
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7348 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
d4a1a1850b
removed warnings
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7347 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
3b5830b7d4
*) Fixed typo.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7346 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
9b3fae9496
*) cleaning up the code a little bit
...
*) program to interface, not implementation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7345 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
7bb4b001ed
- view image files from cache
...
- fixed generic header settings; affects CORS functionality
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7344 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
e7552bd719
*) cleaning up the code a little bit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7343 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
321eb012fe
removed two warnings and reverted one change
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7340 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
737aaf6952
various small changes to ymarks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7339 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
8a50670546
some code clean up for the last post
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7338 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
442497868d
another step towards an auto tagging function for YMarks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7337 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
741a87a3e9
* make .yacy-domains crawlable (.yacy-domains are local domains, so only in custom networks/peers)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7334 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
fd74bc388c
* fix small bug in sessionid-removal
...
* add testcase for sessionid-removal
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7333 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
dca9e16f51
* don't index pages, which redirect, twice
...
* therefore auto-redirection of HTTPClient for crawling is disabled and the old code is reactivated
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7332 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
eb79b952ef
*) cleaner code
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7331 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
38fdf43587
*) renamed classes according to standard Java coding conventions
...
*) String.isEmpty() was introduced in Java 1.6, but we still use Java 1.5
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7330 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
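Since String.isEmpty() only exists from Java 1.6 on, code that must still compile against 1.5 can compare the length instead. The helper name below is hypothetical; the commit itself may simply have inlined the length check.

```java
// Hypothetical helper, compatible with Java 1.5.
final class StringCompat {

    // Equivalent to s.isEmpty() for a non-null string, without the 1.6-only method.
    static boolean isEmpty(String s) {
        return s.length() == 0;
    }
}
```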
low012
025e3f4790
*) renamed classes according to standard Java coding conventions
...
*) removed unused code
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7328 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
3b9aa0504e
*) removed unused code
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7327 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
db3db0fdb9
*) trying to make this class less confusing (probably failing)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7326 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
54e63b556e
intermediate step for a YMark auto-tagging function based on word frequencies.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7325 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
403ee9c014
added a drill-down for metadata and word count to /api/ymarks/test_treeview.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7324 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
a025b1da89
* fix bug when browsing local filesystem (e. g. repository) with yacy
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7323 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
11ae5b108e
enabled rebuildIndex for /Table_YMark_p.html (rebuilds the tags and folders index)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7320 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
sixcooler
b87bf88ac8
using less memory on merging and rewriting blobs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7317 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
94a9be18a4
added a ymark table administration: /Table_YMark_p.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7316 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
25339f93c7
more updates to ymarks
...
- working xbel import/export
- exported xbel includes yacy specific metadata but still validates against PUBLIC DTD
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7315 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
d62e449a11
* fix FilterEngine, forgot comparison operator
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7314 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
cdd65aca71
update to ymarks
...
- get_xbel.xml is almost working
- started ymark api documentation info.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7313 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
808edffaf6
ymarks
...
- some refactoring
- working xbel and html import (/api/ymarks/test_import.html)
- working treeview (/api/ymarks/test_treeview.html)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7312 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
2c539b514a
* add domaincheck (local/global/domainlist) to urlcleaner
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7311 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
117fc86b3d
fix for http://forum.yacy-websuche.de/viewtopic.php?p=21199#p21199
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7308 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
441fbc26e2
security patch for WeakPriorityBlockingQueue (produced a deadlock)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7307 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
5dcb838293
- removed thread overhead when calling dns services
...
- fixed localsearch (changed it by accident)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7306 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
4c50d3428e
smaller file size for array stacks to support smaller deletion sizes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7305 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
09badc697b
- low-memory patch for crawler
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7304 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
becc463d8a
enhanced did-you-mean
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7300 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
43586a2ace
an update to ymarks (please test if you wish):
...
- import HTML (e.g. FF export) via /api/ymarks/import.html
- view your import via /api/ymarks/test.html
- get a xml list via /api/ymarks/get_ymark_list.xml?tags=&folders=
- delete bookmark tables via standard interface /Tables_p.html
it is still very experimental!!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7299 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
93c535d111
fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113
...
fixed a concurrent modification exception during search and a time-out problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7298 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
04932dc268
added rdf data structure for rss feeds
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7297 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
84f2953cd8
fix for rss loader / rss type recognition
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7296 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
4c72885cba
added a sitemap entry parser and loader for sitemaps
...
(a recursion if a sitemap refers to another sitemap)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7295 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
790e0b1894
- enhanced index deletion in IndexControlRWIs_p: delete also robots.txt database and cache if demanded
...
- added option for details of deletion
- added deletion to new ConfigHTCache_p servlet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7294 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
f5324b27f2
more updates to the new bookmarks (ymarks)....
...
- split YMarkTables and YMarkIndex in two different classes
- HTML import is working properly
- XBEL import is still broken
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7292 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
445619f3ec
added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7291 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
sixcooler
85c65475fa
small but important correction of last commit @ HTTPClient
...
(if there is a response it really should be taken to its end)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7290 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
acd93b1b31
* add failsafe mechanism to domainlist retrieval
...
domainlist is saved locally, if none of the given urls in network.unit.domainlist
could be retrieved, the file from the last boot is used instead
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7289 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
70c95608d4
Added CORS Access header for yacysearch.rss output
...
used some of the recommendations from Copro:
http://forum.yacy-websuche.de/viewtopic.php?p=21015#p21015
Original Request:
http://forum.yacy-websuche.de/viewtopic.php?p=20829#p20829
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7288 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
lotus
18729351e7
upnp: hint for wrongly detected local ip address
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7286 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
def4253555
* add option to network definition to provide a domainlist (syntax like in blacklists)
...
* crawler and search allow only urls matching one in domainlist (if list is provided)
* this may be useful to prevent dedicated networks from being "polluted"
* FilterEngine is an improved Blacklist object; Blacklist may inherit from FilterEngine in the future
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7285 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
ac6b503adf
untar files without gzip decompression even if the file has gz extension. this is done when the decompression fails.
...
decompressed gzip files with gz extension may appear if the server sets a gzip compression header
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7282 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
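The fallback described above (try gzip decompression first, use the bytes unchanged if that fails) can be sketched as follows. The class and method names are hypothetical and the untar step is left out; this is not the code from the commit.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.GZIPInputStream;

// Hypothetical sketch, not the YaCy tar parser.
final class GzipFallback {

    // Returns the decompressed bytes, or the input unchanged if it is not valid gzip
    // (for example because the transfer layer already removed the compression).
    static byte[] gunzipOrRaw(byte[] data) {
        try (InputStream in = new GZIPInputStream(new ByteArrayInputStream(data))) {
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            byte[] buf = new byte[8192];
            int n;
            while ((n = in.read(buf)) != -1) {
                out.write(buf, 0, n);
            }
            return out.toByteArray();
        } catch (IOException e) {
            // GZIPInputStream rejects data without a valid gzip header or trailer.
            return data;
        }
    }
}
```

The result can then be handed to the tar reader regardless of whether the file really was compressed.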
apfelmaennchen
efe0667fdd
more new bookmark (ymarks) code with experimental html and xbel import
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7281 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
mikeworks
caabebf9be
Fixed spelling mistake omiting -> omitting in debug messages in ConfigUpdate_p.java and Switchboard.java
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7280 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
fb92f9ae8e
added mime type image/jpeg (image/jpg is wrong but it is left here because it does no harm and this error also exists in the configuration of web servers)
...
see also:
http://forum.yacy-websuche.de/viewtopic.php?p=21129#p21129
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7279 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
155d556568
- better memory protection
...
- more logging
- little bit of refactoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7278 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
7d8de34778
* add a bit of documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7276 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
25a8e55bc9
more logging about bad seeds
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7275 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
959b8c6fa0
- allow greater seed size
...
- more logging for bad seeds
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7274 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e103419a56
- removed <3 peers barrier for peer ping feedback
...
- more logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7273 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
d0e6c03b51
some updates to the new bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7272 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
facfd204e9
added a parent configuration option.
...
see /ConfigPortal.html
requested here:
http://forum.yacy-websuche.de/viewtopic.php?p=21099#p21099
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7271 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e3964f2c31
better catch of network definition load error; continue with secondary network load definition location
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7270 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
65a0381f76
*) cleaning up code (still not done)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7267 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e3e3b49d52
- enhanced main release recognition
...
- yacybot user agent now includes the yacy network name (not the peer name!)
- refactoring and clean-up (mostly turned tabs into spaces)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7266 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
9c94ebdee4
small changes to new bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7265 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
244b56e9d3
an update to the new bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7264 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
dc40f51b8d
*) added headlines as proposed by Vega
...
*) <pre> will be displayed monospaced in wiki and blog again
*) bugfix for <pre> spanning multiple lines
*) replaced deprecated <s> tag with <span> equivalent
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7262 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
f035f257da
added some more bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7261 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
low012
22ed9c380c
*) fixed a bug introduced in r7226 (shame on me) that made the wiki unusable (all entries were stored with an empty subject as key -> edits were lost)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7260 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
f1ori
60fd2e549d
* log failures when writing config file
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7259 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
58e74282af
added a word counter statistic in the condenser, which did-you-mean uses to calculate the best matches for given search words.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7258 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
863065abc4
added user agent logging to access tracker
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7256 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
a79728b97d
some updates to experimental bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7254 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
ef782cd026
and even more experimental bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7253 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
ed4371dcf3
enhanced navigation implementation and enhanced tag cloud computation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7252 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
ca738ac924
- added a tag cloud to search results (using the topics)
...
- some refactoring of score classes
- added default package for new classes add_ymark and delete_ymark
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7251 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
7aca763ca8
Some more experimental bookmark code...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7250 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
apfelmaennchen
4270ed696c
Experimental code (I need to transfer the code to my macbook, sorry) for the new bookmarks API based on the Tables concept (same as for crawl starts). Currently you can add a bookmark by api/ymarks/add_ymark.xml?url=http://www.yacy.net&title=YaCy and watch the result via the standard view Tables_p.html.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7249 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e4d561971e
added more score cluster options and made score cluster usage more transparent
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7248 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
e8f90201a5
fix for scheduling of rss feeds
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7247 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
7cd9d9d22a
- enhanced DidYouMean computation using a faster count on index entries; this allows results to be ranked better
...
- added limitations on DidYouMean result sets according to input and output string length
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7246 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
de722090b5
enhancements in did-you-mean guessing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7243 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
a59c885ee0
autocomplete and did-you-mean can now understand _all_ languages and can generate suggestions in all languages and character types
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7242 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
b7acd92ce4
Auto-Suggestions for YaCy Search:
...
- added a suggest servlet according to the opensearch and firefox standards
- integrated the suggest servlet into opensearch description file
- integrated an autocomplete plugin for jquery
- added an autocomplete addition to the yacy search windows showing autosuggest queries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7241 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
24f1cba7b2
performance hacks:
...
- faster generation of index abstract compression during remote search
- less synchronization in IO record reading
- request index abstract generation only if necessary and faster time-out in remote search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7239 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
6a166c2040
patches for bad proxy behaviour
...
- accept ipv6 localhost clients
- index media files (url only)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7238 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago