Yacy Search Server
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Go to file
Michael Peter Christen c0d9a3e9a7
turned HostBrowser into a admin-only page, now called IndexBrowser
4 years ago
.github added patreon 5 years ago
.settings migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8 8 years ago
addon Added a stop command using the existing stop script to the snap package 6 years ago
bin Going back to basic authentication for console/shell commands 4 years ago
debian Updated Debian optional dependencies with the ones used for snapshots 7 years ago
defaults turned HostBrowser into a admin-only page, now called IndexBrowser 4 years ago
dictionaries changed way to integrate dictionary files: 16 years ago
docker do better documentation on how to set a default password 4 years ago
examples move example code SearchClient out of yacycore package 9 years ago
htroot turned HostBrowser into a admin-only page, now called IndexBrowser 4 years ago
langdetect Use language-detection library for increased accuracy 10 years ago
lib updated jetty from 9.4.17 to 9.4.35 4 years ago
libbuild bump to commons-codec-1.14, commons-compress-1.20, 5 years ago
libt replace org.junit.Assert.assertThat by 5 years ago
locales turned HostBrowser into a admin-only page, now called IndexBrowser 4 years ago
skins switched url and snippet position 5 years ago
snap Added a stop command using the existing stop script to the snap package 6 years ago
source turned HostBrowser into a admin-only page, now called IndexBrowser 4 years ago
test removes some warning and unused objects 5 years ago
vocabularies moved yacy.logging to defaults according to request in 13 years ago
.checkstyle problems with code style 16 years ago
.classpath updated jetty from 9.4.17 to 9.4.35 4 years ago
.env Removed unnecessary variables for local heroku run 9 years ago
.gitignore Ignore generated Javadoc with git SCM. 8 years ago
.project removes some warnings: unused imports, params 5 years ago
.travis.yml trying to fix 'Expected feature release number in range of 9 to 14, but got: 8' error while travis is building 5 years ago
AUTHORS added flori and me 17 years ago
CONTRIBUTING.md Add contributor guidelines; closes #214 7 years ago
COPYRIGHT updated copyright message; included LGPL for 'cora' and a warranty 12 years ago
Heroku.md Simplified Heroku variables configuration 9 years ago
NOTICE * updated jxpath to latest v1.3 16 years ago
Procfile Simplified Heroku variables configuration 9 years ago
README.md Fix formatting errors in the README file 5 years ago
app.json Updated YaCy home page embedded links from http to https scheme 7 years ago
assembly.xml Fixed maven assembly base directory to match last main YaCy binaries. 8 years ago
build.nsi upd windows installer script to Java 8 update 211 5 years ago
build.properties code freeze for release 5 years ago
build.xml updated jetty from 9.4.17 to 9.4.35 4 years ago
getWin32MaxHeap.bat update for memory observer algorithm 15 years ago
gpl.txt initial load with yacy 0.36 20 years ago
installYaCyWindowsService.bat upd classpath in batches (remove not necessary htroot) 9 years ago
killYACY.sh Fix for http://mantis.tokeek.de/view.php?id=432 11 years ago
lgpl21.txt migrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2 15 years ago
pom.xml updated jetty from 9.4.17 to 9.4.35 4 years ago
reconfigureYACY.sh Fix for http://mantis.tokeek.de/view.php?id=432 11 years ago
startYACY.bat Made SNI extension user configurable without the need for server restart 6 years ago
startYACY.sh fix case esac in start.sh 5 years ago
startYACY_debug.bat fixed many links to old forum, now https://searchlab.eu 6 years ago
stopYACY.bat upd classpath in batches (remove not necessary htroot) 9 years ago
stopYACY.sh Going back to basic authentication for console/shell commands 4 years ago
uninstallYaCyWindowsService.bat added Windows Service installer 11 years ago
updateYACY.sh Fix for http://mantis.tokeek.de/view.php?id=432 11 years ago
yacy-packages.readme bump to commons-codec-1.14, commons-compress-1.20, 5 years ago
yacy.yellow performance setting for remote indexing configuration and latest changes for 0.39 20 years ago

README.md

YaCy

Gitter Build Status

Deploy

What is this?

The YaCy search engine software provides results from a network of independent peers, instead of a central server. It is a distributed network where no single entity decides what to list or order it appears in.

User privacy is central to YaCy, and it runs on each user's computer, where search terms are hashed before they being sent to the network. Everyone can create their individual search indexes and rankings, and a truly customized search portal.

Each YaCy user is either part of a large search network (search indexes can be exchanged with other installation over a built-in peer-to-peer network protocol) or the user runs YaCy to produce a personal search portal that is either public or private.

YaCy search portals can also be placed in an intranet environment, making it a replacement for commercial enterprise search solutions. A network scanner makes it easy to discover all available HTTP, FTP and SMB servers.

To create a web index, YaCy has a web crawler for everybody, free of censorship and central data retention:

  • Search the web (automatically using all other YaCy peers)
  • Co-operative crawling; support for other crawlers
  • Intranet indexing and search
  • Set up your own search portal
  • All users have equal rights
  • Comprehensive concept to anonymise the users' index

To be able to perform a search using the YaCy network, every user has to set up their own node. More users means higher index capacity and better distributed indexing performance.

License

The (GPLv2+) source code used to build YaCy is distributed with the package (in /source and /htroot).

Where is the documentation?

All these have (YaCy) search functionality combining all these locations into one search result.

Dependencies? What other software do I need?

You need Java 1.8 or later to run YaCy. (No Apache, Tomcat or MySQL or anything else)

YaCy also runs on IcedTea 3. See https://icedtea.classpath.org

Start and stop it

Startup and shutdown:

  • GNU/Linux and OpenBSD:

    • Start by running ./startYACY.sh
    • Stop by running ./stopYACY.sh
  • Windows:

    • Start by double-clicking startYACY.bat
    • Stop by double-clicking stopYACY.bat
  • macOS: Please use the Mac app and start or stop it like any other program (double-click to start)

The administration interface

A web server us brought up after starting YaCy. Open this URL in your web-browser:

http://localhost:8090

This presents you with the personal search and administration interface.

(Headless) YaCy server installation

YaCy will authorize users automatically if they access the server from its localhost. After about 10 minutes a random password is generated and then it is no longer possible to log in from a remote location. If you install YaCy on a server that is not your workstation you must set an admin account immediately after the first start-up. Open:

http://<remote-server-address>:8090/ConfigAccounts_p.html

and set an admin account.

YaCy in a virtual machine or a container

Use virtualization software like VirtualBox or VMware.

The following container technologies can deployed locally, on remote machines you own, or in the 'cloud' using a provider by clicking "Deploy" at the top of the page:

Docker

More details in the docker/Readme.md.

Heroku

PaaS (Platform as a service) More details in Heroku.md.

Port 8090 is bad, people are not allowed to access that port

You can forward port 80 to 8090 with iptables:

iptables -t nat -A PREROUTING -p tcp --dport 80 -j REDIRECT --to-port 8090

On some operation systems, access to the ports you are using must be granted first:

iptables -I INPUT -m tcp -p tcp --dport 8090 -j ACCEPT

Scaling, RAM and disk space

You can have many millions web pages in your own search index. By default 600MB RAM is available to the Java process. The GC process will free the memory once in a while. If you have less than 100000 pages you could try 200MB till you hit 1 million. Here you can adjust it. Several million web pages may use several GB of disk space, but you can adjust it Here to fit your needs.

Help develop YaCy

Join the large number of contributors that make YaCy what it is; community software.

To start developing YaCy in Eclipse:

  • Clone https://github.com/yacy/yacy_search_server.git
  • File → Import as Git → Projects from Git → Existing local repository
  • → Add → Your Git clone of yacy_search_server
  • "Import existing Eclipse projects" → Finish
  • Run → External Tools → External Tools Configuration → Double-click "Ant Build"
  • → Name: "YaCy Build" → Buildfile: Browse Workspace → build.xml → Run
  • In the Package Explorer, right-click on YaCy → Run as → Java Application -> Select "yacy - net.yacy" -> OK

To join our development community, got to https://searchlab.eu

Send pull requests to https://github.com/yacy/yacy_search_server

Compile from source

The source code is bundled with every YaCy release. You can also get YaCy from https://github.com/yacy/yacy_search_server by cloning the repository.

git clone https://github.com/yacy/yacy_search_server

Compiling YaCy:

  • You need Java 1.8 or later and Apache Ant.
  • Compile: ant clean all - then you can ./startYACY.sh or ./startYACY.bat.
  • Create a release tarball: ant dist.
  • Create a macOS release: ant distMacApp (only works on macOS).
  • Create a Debian release: ant deb.
  • Work with Eclipse: Within Eclipse you also need to start the ant build process. because the servlet pages are not compiled by the Eclipse build process. after the dist procedure, the release can be found in the RELEASE subdirectory.

Build with Maven:

  • For the first time go to the subdirectory libbuild (which contains the Maven parent POM).
  • Compile with mvn clean install -DskipTests, this will create all needed modules.
  • After this you can use just use the POM in the main directory to build YaCy with Maven.

APIs and attaching software

YaCy has many built-in interfaces, and they are all based on HTTP/XML and HTTP/JSON. You can discover these interfaces if you notice the orange "API" icon in the upper right corner of some web pages in the YaCy web interface. Click it and you will see the XML/JSON version of the respective webpage. You can also use the shell script provided in the /bin subdirectory. The shell scripts also calls the YaCy web interface. By cloning some of those scripts you can easily create more shell API access methods.

Contact

Visit the international YaCy forum where you can start a discussion there in your own language.

Questions and requests for paid customization and integration into enterprise solutions. can be sent to the maintainer, Michael Christen per e-mail (at mc@yacy.net) with a meaningful subject including the word 'YaCy' to prevent it getting stuck in the spam filter.

  • Michael Peter Christen