Commit Graph

169 Commits (06352b8d6b3bbc24fe9f0281ff3e8380b7502265)

Author SHA1 Message Date
orbiter 78ce3b13be typo
13 years ago
orbiter cf4fd525ee added directDocByURL attribute in crawl profile
13 years ago
orbiter 5ad7f9612b added crawl settings for three new filters for each crawl:
13 years ago
orbiter e48ce5d80e - style change for search box: larger font, selected by default
13 years ago
sixcooler ecb4986b38 refactored stuff from last commit to ReferenceContainer
13 years ago
orbiter 49e5ca579f added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), then all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
13 years ago
orbiter 9a8937f8b6 be more liberal when evaluating search results. This may make it possible to place fraudulent content on fresh peers, but that is better than long waiting times for the evaluation of every link, which cause everybody to reject YaCy as 'too slow'. But this is only because of the high standards that YaCy sets for itself. If we are able to gain more users by lowering the standard, then that is useful. The option to set that flag to verify each link is still there.
13 years ago
sixcooler 4fec99115b Implementation of strategies for controlling memory resources.
13 years ago
orbiter 77a9af99f1 same values for Xmx and Xms: memory extension may be difficult if the OS does not have the remaining memory available, and the OS may kill the JVM. If the memory is reserved at the start but never used, the OS may handle that as well and leave the unused space in the swap area (and never swap)
13 years ago
orbiter 768c59740c - replaced solrj 3.1 with solrj 3.3
14 years ago
lotus fa6f2c2b44 use proxy accounts by default for more security
14 years ago
orbiter b6f09a475d - added an index profile editor in the /indexFederated_p.html servlet for solr indexes
14 years ago
f1ori fdc84d8319 small pi link on index page to administration pages
14 years ago
orbiter 84c9658644 added a file type navigator
14 years ago
f1ori 900dacbf97 * improve link rewriting in proxy-url
14 years ago
orbiter cc239b18cd fix for IPv6 localhost proxy client
14 years ago
orbiter 10e2f588f8 - enhanced ybr ranking computation
14 years ago
orbiter 3ed4a09368 small features, some bug fixes and performance hacks
14 years ago
orbiter d8e934c085 better abstraction of http client identification
14 years ago
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
14 years ago
orbiter 19fd13d3bc Added federated index storage to solr.
14 years ago
orbiter b1a8d0c020 enhancements to web cache and less strict caching rules
14 years ago
orbiter ba03ca8620 added more configuration options for search:
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
f1ori 59dea3a284 * implement url proxy, a proxy via the url http://peer:port/proxy.html?url=http://domain.tld/path
14 years ago
orbiter e3ef4e3021 - increased default peer ping time from 2 minutes to 1 minute
14 years ago
orbiter d28f8040e0 removed unnecessary recording function that also caused a performance problem after serving too many files
14 years ago
orbiter 6c52e31993 new methods to open a browser
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago
low012 64f32e8f00 *) replaced all IPs in IP filters for proxy with the proper regular expression
14 years ago
orbiter fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 6c35b68f17 - removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file
14 years ago
orbiter 786166041a - added recording of all accessed and submitted servlets
14 years ago
orbiter 3fe03f153d - search page becomes default start page (new users are not forced to do configuration since this is not necessary)
14 years ago
orbiter 3ae8f40fc8 removed yacy.network.group - this feature was never used
14 years ago
f1ori 4e29e9712a * create cleanupjob for cached failed urls
14 years ago
lotus b1484299b2 same units for memory observer configuration (MiB)
14 years ago
orbiter 4565b2f2c0 removed the display option from index.html, yacysearch.html and yacyinteractive.html
14 years ago
orbiter fc2e41e691 added a forwarder for the default page. The forwarder forwards a browser to a different page if the root file index.html is accessed. This can be done by setting the name of the forwarder page to the field
14 years ago
orbiter cc6499bf8d - added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
f1ori 442bebca2b * %0 does not belong to the IPv6-Address -> entry does not work on some systems
14 years ago
f1ori 6ac4f8142e * allow proxy requests from localhost via ipv6
14 years ago
orbiter facfd204e9 added a parent configuration option.
14 years ago
orbiter 6a166c2040 patches for bad proxy behaviour
14 years ago
orbiter 45b1ab3d07 custom + generic skins:
14 years ago
orbiter 2c549ae341 fixed a number of small bugs:
14 years ago
orbiter 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification
14 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
14 years ago