Commit Graph

321 Commits (dd241d03bb014449cc63237a932911a0fb6cd1ee)

Author SHA1 Message Date
Michael Peter Christen bca4a16603 replaced the multivalue generic string field name suffix _ss by _txt
12 years ago
orbiter 67edfd991c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All
12 years ago
Michael Peter Christen 3ce04cecf3 bad hack to prevent a bug appearing in solr
12 years ago
Michael Peter Christen 826967513b changed options in IndexFederated_p to switch on/off parts of the index
12 years ago
Michael Peter Christen 1517a3b7b9 added webm mime-type
13 years ago
Michael Peter Christen 0301aba1e9 removed unused method parameters
13 years ago
Michael Peter Christen 4de50fe808 adding more principal peers for bootstraping
13 years ago
reger 067728bccc add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages)
13 years ago
Michael Peter Christen 508a81b86c added solr field 'refresh_s' which stores the refresh url contained in
13 years ago
Michael Peter Christen 9116013c64 - allow lazy initialization of solr value (if using 'lazy', then no
13 years ago
Michael Peter Christen c03d306afa shorter autocommit time (now: 1 second) to prevent that user cannot see
13 years ago
Michael Peter Christen 3fd4a01286 added option to record urls that are forwarded to the solr index
13 years ago
Michael Peter Christen 8dd469b9dd added option to configure the autocommit delay time of solr on-the-fly
13 years ago
Michael Peter Christen b9dfca4b0a - fixed IndexFederated Servlet / a embedded Solr can now be selected
13 years ago
Michael Peter Christen 1be0025a9c - added test for EmbeddedSolrConnector
13 years ago
Michael Peter Christen dbdd697f4d moved RDFaParser.xsl configuration file to defaults
13 years ago
Michael Peter Christen 8738336408 set Xms lower than Xmx
13 years ago
Michael Peter Christen 96f6a5869f more robust OAI-PMH client (large time-out, three re-tries). OAI-PMH
13 years ago
Michael Peter Christen 6d17686258 made triplestore persistent by default
13 years ago
cominch 3c255c025b Show tags in search results (if activated in ConfigPortal_p.html)
13 years ago
Michael Peter Christen a5cdfb91de - fixed Cache link (below snippet)
13 years ago
Roland 'Quix0r' Haeder af5a597e47 Scroogle is not comming back, remove dead code
13 years ago
cominch 90512640bf Added config switches for custom parser
13 years ago
cominch 5d20cd324a Add Triplestore and RDF query interface
13 years ago
cominch a32943b382 add json mimetype
13 years ago
Michael Peter Christen 41c02cb10e - less restrictions for usage of Table RAM copy
13 years ago
Michael Peter Christen 8002fd2578 use less cache space since a large cache would cause more memory usage
13 years ago
Michael Peter Christen 5aee19daa4 added show from cache in search results (not yet finished)
13 years ago
Michael Peter Christen 0d32a766ed relax verify attribute for search widget to make it faster:
13 years ago
Michael Peter Christen 7eece0256f moved yacy.logging to defaults according to request in
13 years ago
Michael Peter Christen db9d81cb7a ups
13 years ago
Michael Peter Christen e7e381d110 added configuration to switch off redirection following in crawler
13 years ago
Michael Peter Christen 2be327b5ab update location update
13 years ago
Michael Peter Christen 99c74699de removed scroogle (scroogle is dead)
13 years ago
Michael Peter Christen 8bee1472c9 there is no noindex, only nofollow in links
13 years ago
Michael Peter Christen 4c5edab1ec added option to have exception search result windows
13 years ago
Michael Peter Christen 696ee5fc16 removed pdf from default parser deny list
13 years ago
Lotus c73af39e54 refactoring of tray icon class,
13 years ago
Michael Peter Christen 987b412491 updated solr scheme: generic declaration of solr schemes
13 years ago
Michael Peter Christen 0bcef2d156 added feature as requested in
13 years ago
Michael Christen 17f962fceb translator updates:
13 years ago
Michael Christen c715d19c09 fixes for dependency on svn
13 years ago
Michael Christen f62e6fb438 less frequent DHT distribution to reduce the load a bit on every peer
13 years ago
Michael Christen 9dbc93613e now that the whole world knows that we actually do p2p and not
13 years ago
orbiter f9216e388c - faster ping to clean up old peers faster
13 years ago
orbiter ac5bda205f - removed lower page navigation (it never looks nice)
13 years ago
orbiter c659310e89 - removed option to search for audio, video and applications. These things are still experimental and should not be shown to new users since this would cause them to argue that YaCy does not work. The functions are stil available, because:
13 years ago
orbiter 6cd27473f5 - better default values for caching and cache usage
13 years ago
orbiter 5866c73a09 fix for compare search: use scroogle instead of bing and get a default search if configured search engine is not available
13 years ago
orbiter e4a82ddd8b produce a bookmark entry from every crawl start. these bookmarks are always private.
13 years ago
orbiter f183d3822c added a default accept header in http requests since some http fraud detection functions check that this header field exist
13 years ago
orbiter 78ce3b13be typo
13 years ago
suessthomas 887f088dad The IP address of the YaCy-Demo portal added to Whitelist.
13 years ago
orbiter 1b45e33f04 added robots tag parser to solr scheme
13 years ago
orbiter cf4fd525ee added directDocByURL attribute in crawl profile
13 years ago
orbiter 5ad7f9612b added crawl settings for three new filters for each crawl:
13 years ago
orbiter 2c3161b4ac refactoring:
13 years ago
orbiter 6b22865dbc - removed some warinings
13 years ago
orbiter e48ce5d80e - style change for search box: larger font, selected by default
13 years ago
sixcooler ecb4986b38 refactored stuff from last commit to ReferenceContainer
13 years ago
orbiter 49e5ca579f added new configuration property "crawler.embedLinksAsDocuments". If this is switched on (this is default now), the all embedded image, audio and video links from all parsed documents are added to the search index as individual document. This will increase the search index size dramatically but will also enable us to create a much faster image, audio and video search. If the flag is switched on, the index entries are also stored to a solr index, if this is also enabled.
13 years ago
orbiter 9a8937f8b6 be more liberal when evaluating search results. This may cause that it is possible to fraud content on fresh peers, but that is better than looong waiting times for the evaluation of every link which causes that everybody rejects YaCy as 'too slow'. But this is only because of the high standards that YaCy sets to itself. If we are able to gain more users by lowering the standard, then that is useful. The option to set that flag to verify each link is still there.
13 years ago
orbiter 1c007188ad bugfixes in html parser
13 years ago
orbiter 5dd2efc9a2 - bugfixes in html parser
13 years ago
sixcooler 4fec99115b Implementation of strategies for controlling memory resources.
13 years ago
orbiter 77a9af99f1 same values for Xmx and Xms: memory extension may be difficult if the OS has not the remaining memory available and may kill the jvm. If the memory is reserved at the start but never used the OS may handle that as well and leave non-used space in swap area (and never swap)
13 years ago
orbiter 768c59740c - replaced solrj 3.1 with solrj 3.3
14 years ago
orbiter e7c7598923 docfix
14 years ago
orbiter b84089ff04 fix for solr scheme list definition
14 years ago
orbiter 2d4bb139d3 - added counting of links with noindex tag for solr index
14 years ago
lotus fa6f2c2b44 use proxy accounts by default for more security
14 years ago
orbiter bda3eec0ff added parsing of canonical link element to html parser
14 years ago
orbiter b6f09a475d - added an index profile editor in the /indexFederated_p.html servlet for solr indexes
14 years ago
orbiter 6deef60bc0 added keyword list for solr index attributes
14 years ago
f1ori fdc84d8319 small pi link on index page to administration pages
14 years ago
orbiter 84c9658644 added a file type navigator
14 years ago
suessthomas 66c477129e Creates a new network definition, yacy.networks.metager.unit.
14 years ago
f1ori 900dacbf97 * improve link rewriting in proxy-url
14 years ago
orbiter cc239b18cd fix for IPv6 localhost proxy client
14 years ago
orbiter 10e2f588f8 - enhanced ybr ranking computation
14 years ago
orbiter 3ed4a09368 small features, some bug fixes and performance hacks
14 years ago
orbiter d8e934c085 better abstraction of http client identification
14 years ago
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
14 years ago
orbiter 19fd13d3bc Added federated index storage to solr.
14 years ago
orbiter b1a8d0c020 enhancements to web cache and less strict caching rules
14 years ago
orbiter ba03ca8620 added more configuration options for search:
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
f1ori 59dea3a284 * implement url proxy, a proxy via the url http://peer:port/proxy.html?url=http://domain.tld/path
14 years ago
orbiter e3ef4e3021 - increased default peer ping time from 2 minutes to 1 minute
14 years ago
orbiter d28f8040e0 removed unnecessary recording function that caused also a performance problem after serving too much files
14 years ago
orbiter addbd5b482 moved the main update url - because of the many languages we support now on yacy.net
14 years ago
orbiter 6c52e31993 new methods to open a browser
14 years ago
orbiter 5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a muti-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased.
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago
low012 64f32e8f00 *) replaced all IPs in IP filters for proxy with the proper regular expression
14 years ago
orbiter fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 6c35b68f17 - removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file
14 years ago
orbiter 786166041a - added recording of all accessed and submitted servlets
14 years ago