Michael Peter Christen
a6ad1d6fd1
update to search tests (use yacy interface and a bugfix)
12 years ago
orbiter
f86d469973
more search command tools
12 years ago
orbiter
1228a5798d
you can now search for '*' to get just ALL entries in the search index
...
as result list. This makes sense if you intend to search just by using
the navigation tools to cut the data set into navigation 'slices'.
12 years ago
Michael Peter Christen
68c7ed5ce9
added a shell script which can be used to delete the api action steering
...
table. This may be necessary if the api is called by remote command and
the recordings are not used. Then they can be deleted frequently by
calling this clear command using a cron job
12 years ago
Michael Peter Christen
ed803708ab
added a shell script which can be used to add a rss feed to the index.
...
All pages linked in the rss feed are added. The process is not repeated
automatically. If you want to repeat this, add the command to a cron
job.
12 years ago
Michael Peter Christen
f3fc8eac80
fixed clear scripts
12 years ago
Michael Peter Christen
640339ee21
added the indexrestore.sh script which must be called with the path of
...
the index dump. This is the reverse of indexdump.sh which takes the
output of indexdump.sh as input to restore an index.
Now it should be possible to transfer a complete YaCy Solr index from
one peer yacy1 to another peer yacy2 with the following command:
yacy2/bin/indexrestore.sh ´yacy1/bin/indexdump.sh´
12 years ago
Michael Peter Christen
15ea053c3a
- added xml output in IndexControlURLs to get the storage page of index
...
dump commands
- adjusted the apicall.sh script to get the downloaded text as output to
stdout which is necessary to parse the content out of it
- added indexdump.sh script which creates a solr dump and prints out the
storage path for the index dump
- added synchronization to the Fulltext class to prevent that data is
stored to a non-existing solr index while this index is disabled during
the storage of the dump
12 years ago
Roland 'Quix0r' Haeder
aef9dd0350
- removed cleaning of blacklist cache on startup
...
- added cleaning of blacklist cache if cache is modified in interface
- extended cache saving to all cache types
- moved cache location to DATA/LISTS
- fixed static file path which was relative to the application path but
should be relative to data path - which is different in debian and mac
implementations
13 years ago
Michael Peter Christen
cc1b6762bb
root, not yacy
13 years ago
Michael Peter Christen
2589158f44
changed recommended line in /etc/crontab for high-availability
13 years ago
Michael Peter Christen
2823e33531
fixes for wrong port number in shell scripts
13 years ago
Michael Peter Christen
8b974905ee
changed log-in text for all servlets with authentication:
...
- added hint how to set the password using a shell script
- added a shell script to change the password
13 years ago
Michael Peter Christen
6c4f8fdc44
removed superfluous files
13 years ago
Lotus
e14eb9de82
checkalive.sh: try to fetch only once (default: 20)
13 years ago
Michael Peter Christen
5e18f54a8c
added shell script to get a servlet. this is the same as apicall.sh but it prints the result to stdout
13 years ago
Michael Christen
7a5ff9b808
added checkalive script that can be called frequently to restart crashed
...
peers automatically
13 years ago
orbiter
3ac6fb0baf
added dump check script
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7978 6c8d7289-2bf4-0310-a012-ef5d649a1542
13 years ago
orbiter
fabda9ad31
added script that can be used to delete a single url from the index
...
call:
bin/deleteurl.sh <url>
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7969 6c8d7289-2bf4-0310-a012-ef5d649a1542
13 years ago
orbiter
2e8c8ebf54
fix for clear commands
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7939 6c8d7289-2bf4-0310-a012-ef5d649a1542
13 years ago
low012
24e76a7b69
*) Replaced occurrences of "Wikimedia" with "MediaWiki" where applicable. (Thanks to the folks of 0x20.be for pointing this out.)
...
*) Added description of where to place MediaWiki dump for import.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7905 6c8d7289-2bf4-0310-a012-ef5d649a1542
13 years ago
orbiter
d0d6123b18
added a deploy script that can be used to deploy yacy releases into the current release for testing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7768 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
265b7ce4f9
removed pause in search test
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7767 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
752576b521
- localsearch test script does also a snippet-fetch
...
- killYACY.sh does not need a sleep between kill -3 and kill -9
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7750 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
f6077b3cc0
added more attributes for html parser and enhanced data structures
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7679 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
9b25d07295
- added geo information parsing to html parser
...
- extended metadata information in index with geolocalisation
- added display of location in yacydoc and ViewFile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7629 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
18ec7fe53c
added a clearall.sh script that deletes the complete index and everything else that belongs to crawling
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7611 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
a50f28e6e7
- fixed missing save operation for peer name change
...
- fixed import of mediawiki dump files
- added script to add mediawiki dump files
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7609 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
88773e4daa
changed the default port from 8080 to 8090
...
see also: http://forum.yacy-websuche.de/viewtopic.php?p=21683#p21683
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7454 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
5dcb838293
- removed thread overhead when calling dns services
...
- fixed localsearch (changed it by accident)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7306 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
becc463d8a
enhanced did-you-mean
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7300 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
fc924f024e
import of oai sources from a list using a command line interface:
...
if you have a list of oai servers you can import them all using the linux command:
bin/importOAIList.sh <name-of-oai-list-file>
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7141 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
3197ca42ed
preparations to move the HTCache into cora:
...
- move the header framework classes to cora
- move the ARC caching classes to cora
- refactoring of code to call these classes from cora
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7068 6c8d7289-2bf4-0310-a012-ef5d649a1542
14 years ago
orbiter
b68deb407a
- moved test data from /bin to /test/words
...
- refactoring of stopYACY.sh by introduction of /bin/apicall which is able to call any api file with attached authorization
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6691 6c8d7289-2bf4-0310-a012-ef5d649a1542
15 years ago
orbiter
5d7045387b
added more word lists and a multi-access search test tool for high-performance query testing:
...
run searchtestmulti.sh; then 10 concurrent processes fire 1000 requests each to the local peer.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6045 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter
8ce5bb4f31
added shell scripts that list host addresses
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5793 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter
51ea865569
small fix for localsearch shell script
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5792 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter
d39d420b39
performance hacks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5376 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter
bb5c898441
enhancements to localsearch behavior
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5131 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter
42e2d195ac
added hint from http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1294
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@5130 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
f1ori
f0012554de
* improved localsearch.sh:
...
* read port from DATA/SETTINGS/yacy.conf
* use curl or wget, depending on what is available
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4972 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
994c609cf8
added new shell script to do a web search from the terminal
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4916 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
e9dd0e6831
added a test script for profiling of local search speed and a list of search words
...
current result: 120 milliseconds for a single search during a sequence of 100 searches,
while the crawler is running with 500 PPM!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4913 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
724bbdf9b2
refactoring of RSS reader
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4736 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
0702dd2507
added a profiling script to analyse search process timing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4733 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
40621a5663
anhancements in ranking preparation and fixed problem with parser/mime recognition
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1132 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0e25020f51
added first generation and usage of YBR index-files. Enhanced overall ranking of search results.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1118 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0ec54d9c5f
enhanced CR-file handling and added first RCI-evaluation tests
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1110 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
88e3234393
fine-tuning of rci-generation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1105 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a12759c1bf
first try to implement a rci-computation from cr-files
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1103 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago