Commit Graph

716 Commits (c88c30a5c52dafb46c6d3eb401d23aa5feed63f1)

Author SHA1 Message Date
Michael Peter Christen b28d43decc added two more fields source_cr_host_norm_i,target_cr_host_norm_i in
11 years ago
Michael Peter Christen 4f83d5f18c added the new field harvestkey_s to the collection index and the
11 years ago
orbiter 8ac2e8c8c9 added location navigator which causes that the image to the map search
11 years ago
reger 5111841e5b - reduce Jetty debug logging
11 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
11 years ago
Michael Peter Christen 85456f46b2 added two new fields, exact_signature_copycount_i and
11 years ago
Michael Peter Christen a2511b5600 turned images_alt_txt back to images_alt_sxt because it is not necessary
11 years ago
Michael Peter Christen 69f85265e1 added an option to put image links to the crawl queue and handle these
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
orbiter deadeb406e image alt tag strings should be tokenized
11 years ago
Michael Peter Christen 1a3e42eca4 index migration to lucene 4.4
11 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
11 years ago
sixcooler 1bc6003057 raise autoCommit maxTime to 3 minutes to reduce IO
11 years ago
orbiter 944ae5686c added donation plea to the about box as default (you can replace this in
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
orbiter e7fcb81cea we should not do too much greedylearning at this time as we don't have
11 years ago
orbiter bf0ad04e1b apply load limitation also to dht-in
11 years ago
orbiter f50b596e0b do not run dht distribution if system load is over 2.5
11 years ago
orbiter e24016e30a added the property federated.service.solr.indexing.timeout to yacy.init
11 years ago
Roland Haeder 98e10f95e2 Added some cora package loggers
11 years ago
orbiter 1b43e02b86 Merge branch 'master' of git://gitorious.org/~quix0r/yacy/quix0rs-yacy-rc1
12 years ago
orbiter a548354c71 replaced type of solr schema object sku of text_en_splitting_tight by
12 years ago
Roland Haeder ebbb3bc5c1 Fixed CHMOD on many files + added missing loggers (e.g. jena) and made some noisy loggers quiet
12 years ago
orbiter e609ec388a metager whitelist update
12 years ago
Michael Peter Christen 2716dfc46c increase crawler speed by reduction of the busysleep time
12 years ago
Michael Peter Christen 57ffdfad4c added a crawl option to obey html-meta-robots-noindex. This is on by
12 years ago
Michael Peter Christen 5a5d411ec0 new robots_i attribute fields
12 years ago
orbiter 7c6ccc426c set crawlingQ to true by default because most webpages are dynamic and
12 years ago
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name
12 years ago
orbiter 8792e6c6e9 stub for better image indexing
12 years ago
Michael Peter Christen 570511f3c8 removed fields references_internal_id_sxt and
12 years ago
Michael Peter Christen fd1776a3b0 added a new 'Citations' function: each search result item can now be
12 years ago
Michael Peter Christen 7754a1263b switching back to the merge factor 10; the solr default.
12 years ago
Michael Peter Christen 1762911f57 added synchronizations and timeouts in solr api; missing
12 years ago
Michael Peter Christen 959ccc4675 increased the solr merge factor because 4 was too much IO load for
12 years ago
Michael Peter Christen 20fab1feb6 allip net has greedy learning disabled
12 years ago
Michael Peter Christen 6115bef335 added a 'greedy learning' mechanism which will cause that a 'fresh'
12 years ago
Michael Peter Christen 856e5c42ae the line "Web Search by the People, for the People" is more generic for
12 years ago
Michael Peter Christen 713a6199ef activated citation ranking by default
12 years ago
Michael Peter Christen f7a4377812 usage of the new normalized link popularity CRn as default ranking
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
reger 8a7fcb391d enable use of solrcore.properties for property substitution of solrconfig.xml
12 years ago
Michael Peter Christen eb9d0ba5b1 ranking and boost function update, small bugfixes, better default search
12 years ago
Michael Peter Christen a8dc4346e8 default configuration of MMapDirectoryFactory for solr, increased lock
12 years ago
Michael Peter Christen 0c1a018bbd removed 'later' tactic because it used too much RAM, reduced number of
12 years ago
Michael Peter Christen 536fd1450e added new keys for update locations
12 years ago
orbiter a83c2fe833 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 4baa0d4a97 Added a default keystore for ssl encryption of the YaCy web interface.
12 years ago
reger da191c839d reduce SolrConnectorLogging setting (from default ALL to INFO)
12 years ago
Michael Peter Christen 9bd2aee180 migrated to solr 4.3.0
12 years ago
Michael Peter Christen cca19d94d4 re-declared some fields to be of type string rather than text which
12 years ago
Michael Peter Christen cc90f82dbb increased default proxy client timeout to one minute
12 years ago
Michael Peter Christen 50421171c3 added new schema fields:
12 years ago
Michael Peter Christen d05dc07cff setting of new default values for ranking
12 years ago
Michael Peter Christen 97775fbebc fixed ranking for add-function queries: this did not work. The option
12 years ago
Michael Peter Christen 7ab5093321 added new solr title_exact_signature_l and
12 years ago
Michael Peter Christen 27d6222880 added new field host_extent_i which, after a crawl and postprocessing,
12 years ago
Michael Peter Christen ada3f27de7 added three new fields for a better ranking: references_internal_i,
12 years ago
reger e89491271f - fix opensearch discover err msg - webgraph not enabled - if no opensearchdescription link found in index
12 years ago
orbiter 17ae51e741 increased number of links limitation from 1000 to 10000 for rss feeds
12 years ago
Michael Peter Christen 2d36a7eaf5 - do not create a new query for all remote peers
12 years ago
Michael Peter Christen 4af0839be2 use appropriate ranking for each search situation:
12 years ago
Michael Peter Christen 2080fc7406 removed unused tag fields
12 years ago
orbiter 6b13dd0d3d added clickdepth field writing for webgraph core (unfinished)
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 25300913fa fixes to search debugging after testing with the different search
12 years ago
orbiter b1140e3d82 added debug switches for detailed search testing
12 years ago
Michael Peter Christen 0d7b4bc891 better protection against OOM during search flush and fixed missing
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrently. This prevents
12 years ago
orbiter 0f7ea7ad9f - enhanced solr.add procedure for mass adds
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 56d5946a59 - added flags in IndexFederated_p.html to switch on or off the webgraph
12 years ago
Michael Peter Christen 461d46101d - Removed log4j from libraries. This can be removed because the package
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr fields in the core 'webgraph'.
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Michael Peter Christen 4111606654 removed the commitWithin attribute because that is not the way how the
12 years ago
Michael Peter Christen d70d99fab5 added more metadata fields and facets to OpensearchResponseWriter.
12 years ago
Michael Peter Christen 8651ec35fe turned author_s into the multi-valued field author_sxt
12 years ago
Michael Peter Christen 4735bd47f4 - changed solr commit call and added an optimize option. Since Solr
12 years ago
Michael Peter Christen db024a4e19 added new solr fields (unused yet; implementation will follow)
12 years ago
Michael Peter Christen 9b5bdae1b4 Reverted setting of MMapDirectoryFactory from solrconfig; see
12 years ago
orbiter eb68a30947 solr performance settings
12 years ago
Michael Peter Christen f53703df62 using MMapDirectoryFactory as solution for ClosedChannelException given
12 years ago
Michael Peter Christen 22c694f906 activated the clickdepth_i attribute for solr again because the
12 years ago
Michael Peter Christen 5a0eb1b268 clickpath should not be active by default because it needs extensive
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago
Michael Peter Christen 295884fd54 - Merge commit '168b1d130d9d67b5e8855a0b50c4ba7ad4a416f8'
12 years ago
reger 168b1d130d Adding heuristic to get search results from configured systems which support opensearch specification
12 years ago
reger 7761b60325 fix: Broken Link on Crawler_p.html - issue 218
12 years ago
reger e9e0d63897 Add config option to show HostBrowser link in search result
12 years ago
Michael Peter Christen 98819ec3d9 use solr boost configuration to select search fields. At this time it is
12 years ago
Michael Peter Christen 01200f06cc using the author field as solr-native facet. this makes it necessary to
12 years ago
Michael Peter Christen eac9650b31 added another solr field clickdepth_i which reflects the number of
12 years ago
Michael Peter Christen 1052263af3 - added a new solr field references_i which stores the number of
12 years ago
Michael Peter Christen 72f165d58b added a Boost class which stores solr query boost values. The class can
12 years ago
Michael Peter Christen ea033f8f8e added number of characters in url to default index to be able to use
12 years ago
Michael Peter Christen efd2c4622d added a new fail type attribute for the index to distinguish two
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
reger 328ce0b297 fix: remove fixed individual testing IP (85.25.151.30 = server4you.de) from default/yacy.network.freeworld.unit
12 years ago
Michael Peter Christen e2c4c3c7d3 migration to solr 4.0.0
12 years ago
sixcooler 2d972f289a raise commitWithinMs to default-value from SwitchBoard
12 years ago
Michael Peter Christen 1baf498d59 - show more lines in online log
12 years ago
sixcooler 206e7bcf94 whitelist yacyportalsearch aka search.yacy.net
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 7e3e45fd04 added Open Graph Metadata default fields, see http://ogp.me/ns#
12 years ago
Michael Peter Christen c3e5f667a7 added schema.org breadcrumb counter to parser and solr schema
12 years ago
Michael Peter Christen 42e525ca9a enhanced the host browser
12 years ago
sof 5cb244b79b Merge remote branch 'origin/master'
12 years ago
apfelmaennchen 88b062210c Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based
12 years ago
Michael Peter Christen 3d33a5bdf6 turned the synonyms_t Text field into a multi-valued String field
12 years ago
Michael Peter Christen 3b959ee002 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 3190347814 added a synonyms_t field to solr and a process to read synonym files.
12 years ago
Michael Peter Christen 411d0e839b added an underline text field to solr to record all underlined texts
12 years ago
Michael Peter Christen f45f7fc12e added new Host Browser to main menu:
12 years ago
Michael Peter Christen 80edd8ecd7 some more after-refactoring fixes
12 years ago
Michael Peter Christen 562183932b - removed ip_s from default profile since that needs a DNS lookup to
12 years ago
Michael Peter Christen 0504b01bdc Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 9413f77b65 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter a55e77a115 added twitter search heuristic
12 years ago
Michael Peter Christen 62add1d564 added the protocol and the file name extension to the solr fields since
12 years ago
Michael Peter Christen 9db032664e activate two solr fields which will be used by administration interface
12 years ago
Michael Peter Christen 10b911eed4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen be67c70a47 added Solr fields:
12 years ago
orbiter d73fff0e0e added solr field images_withalt_i
12 years ago
Michael Peter Christen ee23fc7a32 added h1..h6 counter fields
12 years ago
Michael Peter Christen b2b516cc3e added a collection attribute to crawls and searches:
12 years ago
Michael Peter Christen 528d6763fa - added new solr fields:
12 years ago
Michael Peter Christen 2ddc33646a added new field for solr:
12 years ago
Michael Peter Christen 75d5e3475d Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
cominch dc468dad01 add content control features for custom filter lists
12 years ago
Michael Peter Christen 316b5fe116 - added a solr type definition verifier
12 years ago
Michael Peter Christen 4c79ddb91e switched off some solr logging
12 years ago
Michael Peter Christen e8acd542b5 - added faceted drill-down for host and geolocation to solr queries
12 years ago
Michael Peter Christen af764c106c re-activated audio and video search because they obviously work (!)
12 years ago
orbiter 716ea0cfe2 sorted the solr schema into mandatory and optional fields; reduced
12 years ago
orbiter db6863db77 reduced solr cache sizes to check if that solves memory problems a bit
12 years ago
Michael Peter Christen 23226676c6 FOR THE BRAVE.. this is a forced migration to solr which is now ready
12 years ago
Michael Peter Christen a1b2c9a67d doctype2mime fix, influences metadata conversion between old metadata
12 years ago
Michael Peter Christen 703f427303 fixed some peer-ping connection details
12 years ago
Michael Peter Christen ea49a8aa8c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen aab0b680c3 - added xslt support for solr result formats.
12 years ago
cominch e2119f4e76 augmented browsing: replace htmlparser by jsoup, which is more stable
12 years ago
Michael Peter Christen b51df6c7e8 - added coordinate storage in solr schema
12 years ago
Michael Peter Christen f9c0e6e950 - Implemented and integrated the URIMetadataNode object which is a
12 years ago
Michael Peter Christen bca4a16603 replaced the multivalue generic string field name suffix _ss by _txt
12 years ago
orbiter 67edfd991c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All
12 years ago
Michael Peter Christen 3ce04cecf3 bad hack to prevent a bug appearing in solr
12 years ago
Michael Peter Christen 826967513b changed options in IndexFederated_p to switch on/off parts of the index
12 years ago
Michael Peter Christen 1517a3b7b9 added webm mime-type
13 years ago
Michael Peter Christen 0301aba1e9 removed unused method parameters
13 years ago
Michael Peter Christen 4de50fe808 adding more principal peers for bootstrapping
13 years ago
reger 067728bccc add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages)
13 years ago
Michael Peter Christen 508a81b86c added solr field 'refresh_s' which stores the refresh url contained in
13 years ago
Michael Peter Christen 9116013c64 - allow lazy initialization of solr value (if using 'lazy', then no
13 years ago
Michael Peter Christen c03d306afa shorter autocommit time (now: 1 second) to prevent that user cannot see
13 years ago
Michael Peter Christen 3fd4a01286 added option to record urls that are forwarded to the solr index
13 years ago
Michael Peter Christen 8dd469b9dd added option to configure the autocommit delay time of solr on-the-fly
13 years ago
Michael Peter Christen b9dfca4b0a - fixed IndexFederated Servlet / an embedded Solr can now be selected
13 years ago
Michael Peter Christen 1be0025a9c - added test for EmbeddedSolrConnector
13 years ago
Michael Peter Christen dbdd697f4d moved RDFaParser.xsl configuration file to defaults
13 years ago
Michael Peter Christen 8738336408 set Xms lower than Xmx
13 years ago
Michael Peter Christen 96f6a5869f more robust OAI-PMH client (large time-out, three re-tries). OAI-PMH
13 years ago
Michael Peter Christen 6d17686258 made triplestore persistent by default
13 years ago
cominch 3c255c025b Show tags in search results (if activated in ConfigPortal_p.html)
13 years ago
Michael Peter Christen a5cdfb91de - fixed Cache link (below snippet)
13 years ago
Roland 'Quix0r' Haeder af5a597e47 Scroogle is not coming back, remove dead code
13 years ago
cominch 90512640bf Added config switches for custom parser
13 years ago
cominch 5d20cd324a Add Triplestore and RDF query interface
13 years ago
cominch a32943b382 add json mimetype
13 years ago
Michael Peter Christen 41c02cb10e - less restrictions for usage of Table RAM copy
13 years ago
Michael Peter Christen 8002fd2578 use less cache space since a large cache would cause more memory usage
13 years ago
Michael Peter Christen 5aee19daa4 added show from cache in search results (not yet finished)
13 years ago
Michael Peter Christen 0d32a766ed relax verify attribute for search widget to make it faster:
13 years ago
Michael Peter Christen 7eece0256f moved yacy.logging to defaults according to request in
13 years ago
Michael Peter Christen db9d81cb7a ups
13 years ago
Michael Peter Christen e7e381d110 added configuration to switch off redirection following in crawler
13 years ago
Michael Peter Christen 2be327b5ab update location update
13 years ago
Michael Peter Christen 99c74699de removed scroogle (scroogle is dead)
13 years ago
Michael Peter Christen 8bee1472c9 there is no noindex, only nofollow in links
13 years ago
Michael Peter Christen 4c5edab1ec added option to have exception search result windows
13 years ago
Michael Peter Christen 696ee5fc16 removed pdf from default parser deny list
13 years ago
Lotus c73af39e54 refactoring of tray icon class,
13 years ago
Michael Peter Christen 987b412491 updated solr scheme: generic declaration of solr schemes
13 years ago
Michael Peter Christen 0bcef2d156 added feature as requested in
13 years ago
Michael Christen 17f962fceb translator updates:
13 years ago
Michael Christen c715d19c09 fixes for dependency on svn
13 years ago
Michael Christen f62e6fb438 less frequent DHT distribution to reduce the load a bit on every peer
13 years ago
Michael Christen 9dbc93613e now that the whole world knows that we actually do p2p and not
13 years ago
orbiter f9216e388c - faster ping to clean up old peers faster
13 years ago
orbiter ac5bda205f - removed lower page navigation (it never looks nice)
13 years ago
orbiter c659310e89 - removed option to search for audio, video and applications. These things are still experimental and should not be shown to new users since this would cause them to argue that YaCy does not work. The functions are still available, because:
13 years ago
orbiter 6cd27473f5 - better default values for caching and cache usage
13 years ago
orbiter 5866c73a09 fix for compare search: use scroogle instead of bing and get a default search if configured search engine is not available
13 years ago
orbiter e4a82ddd8b produce a bookmark entry from every crawl start. these bookmarks are always private.
13 years ago
orbiter f183d3822c added a default accept header in http requests since some http fraud detection functions check that this header field exist
13 years ago
orbiter 78ce3b13be typo
13 years ago
suessthomas 887f088dad The IP address of the YaCy-Demo portal added to Whitelist.
13 years ago
orbiter 1b45e33f04 added robots tag parser to solr scheme
13 years ago
orbiter cf4fd525ee added directDocByURL attribute in crawl profile
13 years ago
orbiter 5ad7f9612b added crawl settings for three new filters for each crawl:
13 years ago
orbiter 2c3161b4ac refactoring:
13 years ago
orbiter 6b22865dbc - removed some warnings
13 years ago
orbiter e48ce5d80e - style change for search box: larger font, selected by default
13 years ago
sixcooler ecb4986b38 refactored stuff from last commit to ReferenceContainer
13 years ago
orbiter 49e5ca579f added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), then all embedded image, audio and video links from all parsed documents are added to the search index as individual documents. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
13 years ago
orbiter 9a8937f8b6 be more liberal when evaluating search results. This may cause that it is possible to fraud content on fresh peers, but that is better than looong waiting times for the evaluation of every link which causes that everybody rejects YaCy as 'too slow'. But this is only because of the high standards that YaCy sets to itself. If we are able to gain more users by lowering the standard, then that is useful. The option to set that flag to verify each link is still there.
13 years ago
orbiter 1c007188ad bugfixes in html parser
13 years ago
orbiter 5dd2efc9a2 - bugfixes in html parser
13 years ago
sixcooler 4fec99115b Implementation of strategies for controlling memory resources.
13 years ago
orbiter 77a9af99f1 same values for Xmx and Xms: memory extension may be difficult if the OS has not the remaining memory available and may kill the jvm. If the memory is reserved at the start but never used the OS may handle that as well and leave non-used space in swap area (and never swap)
13 years ago
orbiter 768c59740c - replaced solrj 3.1 with solrj 3.3
14 years ago
orbiter e7c7598923 docfix
14 years ago
orbiter b84089ff04 fix for solr scheme list definition
14 years ago
orbiter 2d4bb139d3 - added counting of links with noindex tag for solr index
14 years ago
lotus fa6f2c2b44 use proxy accounts by default for more security
14 years ago
orbiter bda3eec0ff added parsing of canonical link element to html parser
14 years ago
orbiter b6f09a475d - added an index profile editor in the /indexFederated_p.html servlet for solr indexes
14 years ago
orbiter 6deef60bc0 added keyword list for solr index attributes
14 years ago
f1ori fdc84d8319 small pi link on index page to administration pages
14 years ago
orbiter 84c9658644 added a file type navigator
14 years ago
suessthomas 66c477129e Creates a new network definition, yacy.networks.metager.unit.
14 years ago
f1ori 900dacbf97 * improve link rewriting in proxy-url
14 years ago
orbiter cc239b18cd fix for IPv6 localhost proxy client
14 years ago
orbiter 10e2f588f8 - enhanced ybr ranking computation
14 years ago
orbiter 3ed4a09368 small features, some bug fixes and performance hacks
14 years ago
orbiter d8e934c085 better abstraction of http client identification
14 years ago
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
14 years ago
orbiter 19fd13d3bc Added federated index storage to solr.
14 years ago
orbiter b1a8d0c020 enhancements to web cache and less strict caching rules
14 years ago
orbiter ba03ca8620 added more configuration options for search:
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
f1ori 59dea3a284 * implement url proxy, a proxy via the url http://peer:port/proxy.html?url=http://domain.tld/path
14 years ago
orbiter e3ef4e3021 - increased default peer ping time from 2 minutes to 1 minute
14 years ago
orbiter d28f8040e0 removed unnecessary recording function that also caused a performance problem after serving too many files
14 years ago
orbiter addbd5b482 moved the main update url - because of the many languages we support now on yacy.net
14 years ago
orbiter 6c52e31993 new methods to open a browser
14 years ago
orbiter 5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a multi-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased.
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago
low012 64f32e8f00 *) replaced all IPs in IP filters for proxy with the proper regular expression
14 years ago
orbiter fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 6c35b68f17 - removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file
14 years ago
orbiter 786166041a - added recording of all accessed and submitted servlets
14 years ago
orbiter 3fe03f153d - search page becomes default start page (new users are not forced to do configuration since this is not necessary)
14 years ago
orbiter 59d9fe1bd7 added more php mime types
14 years ago
orbiter 3ae8f40fc8 removed yacy.network.group - this feature was never used
14 years ago
orbiter efb4ca8fa8 modified auto-delete of search failure-words:
14 years ago
f1ori 4e29e9712a * create cleanupjob for cached failed urls
14 years ago
lotus b1484299b2 same units for memory observer configuration (MiB)
14 years ago
low012 11ea966f9e *) added SID file (Commodore 64) sound file parser
14 years ago
low012 936e976c23 *) added FreeMind (http://freemind.sourceforge.net/) mindmap parser
14 years ago
orbiter 4565b2f2c0 removed the display option from index.html, yacysearch.html and yacyinteractive.html
14 years ago
orbiter fc2e41e691 added a forwarder for the default page. The forwarder forwards a browser to a different page if the root file index.html is accessed. This can be done by setting the name of the forwarder page to the field
14 years ago
orbiter cc6499bf8d - added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
f1ori 442bebca2b * %0 does not belong to the IPv6-Address -> entry does not work on some systems
14 years ago
f1ori 6ac4f8142e * allow proxy requests from localhost via ipv6
14 years ago
orbiter 917d715374 lulabad found his signature
14 years ago
f1ori def4253555 * add option to network definition to provide a domainlist (syntax like in blacklists)
14 years ago
orbiter 482127e777 removed release key from location 2 because the signature of that source can not be verified. But the source is ok.
14 years ago
orbiter facfd204e9 added a parent configuration option.
14 years ago
orbiter 6a166c2040 patches for bad proxy behaviour
14 years ago
orbiter 45b1ab3d07 custom + generic skins:
14 years ago
orbiter 2c549ae341 fixed a number of small bugs:
14 years ago
orbiter 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification
14 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
14 years ago
orbiter 670ba4d52b - removed the remote crawl option from the network configuration submenu and
14 years ago
orbiter 4c21d8dc9d - changed default values for online caution (the pausing may not be necessary any more)
14 years ago
orbiter 0ab6a462ee - added a missing entry in YaCy interface robots.txt for bookmarks
14 years ago
orbiter 3f1d5a061f by default store crawled pages to HTCache to support verify=false snippet generation
14 years ago
lotus 23ba107834 UPnP port forwarding default on now. This also displays a message on the entry settings page if not successful, so the user gets an extra hint to open his ports.
14 years ago
orbiter 70dd26ec95 added the new crawl scheduling function to the crawl start menu:
14 years ago
orbiter 59c035c40b changed explanation of Xmx and Xms
14 years ago
orbiter 171f2bd84e - removed unused network oanet
14 years ago
low012 8e88fa4a62 *) fixed indentation (tab vs. spaces)
15 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
orbiter 11b7853940 added a configuration page for search heuristics. currently you can switch on there:
15 years ago
orbiter 353a924760 - changed default memory to 500m
15 years ago
orbiter 11639aef35 - added new protocol loader for 'file'-type URLs
15 years ago
orbiter 90fa8fd4d4 - support gpx file extension
15 years ago
orbiter cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data
15 years ago
orbiter a7d038bb7a The oai ListFriends source list becomes configurable: just write them into defaults/oaiListFriendsSource.xml
15 years ago
orbiter cf13c65bdd added another network definition file for the open access (decentral OAI) search network
15 years ago
orbiter 5efc0dce0b fix for domain options in search box
15 years ago
orbiter f83b1b91b9 increased dht busy sleep time to 10 seconds to reduce TCP/IP traffic for default settings. 2 seconds had been too much traffic for home-use routers.
15 years ago
orbiter 9623d9e6d2 added a smb loader component for the YaCy crawler
15 years ago
orbiter 72f00dee59 removed never-used server access account function
15 years ago
orbiter 30c8185139 fix for sid check
15 years ago
orbiter ef62d017e5 integrated session id filtering for crawler
15 years ago
orbiter d8d9984913 added framework for session id filtering (not ready yet)
15 years ago
lotus 945e0ba5a5 allow global search if res. observer disabled index transmission
15 years ago
lotus 5cbef63c37 fixed bad ip pattern
15 years ago
orbiter 8df1694288 - added options to switch on/off search domains (text, image, audio, video, app)
15 years ago
orbiter dff4f95c78 some patches to get the torrent parser working
15 years ago
lotus 12dd8ece3e enabled memory protection from 6459 with 50000kb (disables dht-in)
15 years ago
orbiter 2bab0679e0 lost my key :-(
15 years ago
lotus 6edc168cfe option to disable dht by memory limit:
15 years ago
lotus 79251e6f60 configurable disk space hardlimit for dht
15 years ago
orbiter 8a1046feaa less maximum file size, too many problems with larger size
15 years ago
orbiter 3d5eeb842a new default skin 'pdblue'
15 years ago
orbiter c864901087 - moved httpd.mime to defaults path
15 years ago
orbiter 735e2737e3 * added index segments
15 years ago
orbiter 6e0dc39a7d - some fixes to prevent blocking situations
15 years ago
orbiter 23ab6fbca4 - navigation appears at correct position when opengeodb-results are also presented after a search
15 years ago
orbiter 721b88efbd - fixed a problem loading blacklists with new yacycore.jar
15 years ago
orbiter 573d03c7d7 added configuration to enable ram table copy
15 years ago
orbiter d656a94f55 fix for bad paths in dictionary processing
15 years ago
orbiter 39ae96450b draw more peers in network picture
15 years ago
orbiter c6c97f23ad - added cache usage properties to crawl start
16 years ago
orbiter 5bb8074150 removed the indexing queue. This queue was superfluous since the introduction of the blocking queues last year, where documents are parsed, analysed and stored in the index with concurrency.
16 years ago
orbiter b2263bc720 enhanced document type recognition
16 years ago
orbiter 57a88d435b redesign of parser mime type detection and parser steering
16 years ago
orbiter 8ca1f5d400 - some work to integrate the html parser the same way as the other parsers are integrated (not finished)
16 years ago
orbiter 801aa08162 added f1oris update location
16 years ago
lotus ec2970cc76 higher dht distribution speed by default
16 years ago
orbiter 995da28c73 all stack/heap files that had been stored in DATA/PLASMA are now stored in the network-specific QUEUES path
16 years ago
low012 457b6c0d6d *) updated Apache POI library to be able to parse Visio files
16 years ago
lotus db70badcf0 possibility to set remote host on upnp device
16 years ago
lotus aec3e7995a autoconfig.pac can be used to browse .yacy-domains only
16 years ago
orbiter f348190566 tried to insert a database dump import method to the phpBB3 import function. Reason: imports of large database dumps cannot be handled with phpMyAdmin and this should be an easy way to load the database dumps into a MySQL database where it can be exported again with the phpBB3 content integration adapter. Completion or removal of this function stub will follow before next main release.
16 years ago
orbiter d50be59088 - added an automatic re-construction of the domain stack after 10 minutes. This then adds urls to the domain stack that were left over in case of stack size limitations when the domain stack was created the last time
16 years ago
orbiter 5fdba0fa51 - fixed a not working selection rule in balancer
16 years ago
orbiter 4522c13ee7 added option for a table prefix when importing phpbb3
16 years ago
orbiter 4b4bddca00 added new submenu to crawler menu: import of phpbb3 forum postings from mysql
16 years ago
orbiter 26a46b5521 increased default maximum file size for database files to 2GB
16 years ago
orbiter 27eb8d62cb - new development cycle
16 years ago
lotus 13fb84ab81 you can define your default number of search results displayed by search.items
16 years ago
orbiter a642d6a7b5 - added navigation icons for search result pages
16 years ago
lotus bad7ce9286 experimental option trayIcon.force for unsupported platforms. java 1.6 needed
16 years ago
f1ori bde88b684a * split off yacyRelease from yacyVersion
16 years ago
orbiter d2ac0aa682 - fixed possible bugs in Stack (may affect Crawler reset) and RandomAccess handling
16 years ago
orbiter c10c257255 attempt to fix a deadlock situation where the IODispatcher did not work.
16 years ago
orbiter 54773ad4d4 added release keys
16 years ago
orbiter 138422990a - removed useCell option: the indexCell data structure is now the default index structure; old collection data is still migrated
16 years ago
lulabad 06c878ed11 moved update_key to correct position in file
16 years ago
lulabad fadf311b97 added sign key for yacystats updates
16 years ago
orbiter bd5f4c78d8 - added default profile for surrogate indexing
16 years ago
orbiter b3f75e48fa - enhanced balancer: auto-solving of waiting-deadlocks
16 years ago
orbiter a9cea419ef Integration of the new index data structure IndexCell
16 years ago
orbiter aa44d9bad9 more refactoring of kelondro.text / deleted de.anomic.index
16 years ago
lotus 6117e083e5 option to customize tray label (tooltip) with tray.label
16 years ago
lotus 9519d84372 changed "dooble" variable to "browserintegration" to be less specific
16 years ago
lotus 8429083972 adjusted tray for dooble:
16 years ago
orbiter c12bb8a6d0 - refactoring of the http client
16 years ago
orbiter 94c42691d8 - reject less transmissions as transmission receiver
16 years ago
orbiter 26978b2a25 - better memory protection in kelondro caches: computation of needed memory for cache grow
16 years ago
lotus 4aad461100 added UPnP support
16 years ago
orbiter 40d9849aa4 - better control of chunk size in dht selection
16 years ago