yacy_search_server

Commit Graph

Author	SHA1	Message	Date
reger	29ccbf6491	seedUploadUrl config is lost on restart if no publish event occured -add a saveMySeed() on uploadurl changes (to keep url setting without retyping even if network down)	11 years ago
reger	e033e79826	remove old description for proxy port settings (Settings_p.html?page=ProxyAccess) - The options were not current (only port number accepted, which is part of ConfigBasic.html) - Deleted options and the port number input field from the proxyaccess page. - joined both transparent proxy setup pages (Settings_Http.inc & Settings_ProxyAccess.inc) in one page - adjustments to the related/linked pages	11 years ago
orbiter	e4e1bdeba0	added 0x40 to image of lockopen-gif image palette (light grey)	11 years ago
orbiter	7028a39abb	changed lock/unlock image design	11 years ago
orbiter	b4f2a1db6e	added a unlock icon for all protected pages that are unlocked because the administrator is logged in.	11 years ago
reger	7267c76881	set default "Search Interfaces"."Solr RSS/Opensearch" query to show latest 10 addition to index	11 years ago
reger	f76d81f5c9	fix: hanging text in input fields of WatchWebStructure_p.html in IE11	11 years ago
orbiter	cf9e7fdbb8	reverted template from latest cherry-picked commit	11 years ago
Alex	f6c7467a90	updated some french translations	11 years ago
reger	19e35a9126	add type attribute to atom feed <link> tag (for /yacysearch.atom)	11 years ago
reger	0a2f4a0e2f	eliminate lat/lon type conversion in osm (define as double)	11 years ago
Michael Peter Christen	01bbb20666	increased default logging line count to max	11 years ago
Michael Peter Christen	9bc3e457dd	fix for termination of all crawls	11 years ago
Michael Peter Christen	8d650ca225	added hint to port forwarding videos	11 years ago
reger	3963bca3b6	catch IndexControlRWIs_p error if RWI not connected	11 years ago
orbiter	2371d6b8db	target linktexts must be string to enable search facets on these fields	11 years ago
Michael Peter Christen	05d58e4df0	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	98f45c9032	fix for image alt attachment to AnchorURLs in html parser.	11 years ago
orbiter	22ce4fb4dd	better error handling for remote solr queries and exists-checks	11 years ago
orbiter	161a11070c	yacystats is gone :(	11 years ago
Michael Peter Christen	c115f3869c	enhanced snippet computation and test method in ViewFile	11 years ago
Michael Peter Christen	6e1dc444c3	added a snippet test function in ViewFile: you can now search for a specific word on the document; the servlet returns the snippet in the same way as it would be shown in a search result.	11 years ago
reger	29d1945c16	fix double &query parameter (index.html) ?query=word&query=	11 years ago
Michael Peter Christen	542c20a597	changed handling of crawl profile field crawlingIfOlder: this should be filled with the date, when the url is recognized as to be outdated. That field was partly misinterpreted and the time interval was filled in. In case that all the urls which are in the index shall be treated as outdated, the field is filled now with Long.MAX_VALUE because then all crawl dates are before that date and therefore outdated.	11 years ago
reger	7f0e757bb5	fix bookmark.rss - channel end tag postion - link with html entity	11 years ago
orbiter	e441831a24	reverted toString() change in AnchorURL to prevent mistakenly used toString(). This fixes also the update link bug.	11 years ago
reger	697b9743e7	Add link to RemoteCrawl_p suggestion http://mantis.tokeek.de/view.php?id=277	11 years ago
reger	47f201a6b8	Add Solr default query fields (&qf) to select servlet according to the ranking profiles boost fields defined by the peer (if df/qf is not specified in query). This allows for pretty simple queries ( q=word) without the need to know about the specific index configuration. Making sure all relevant fields (as determined by the index owner) are searched, still maintaining the option to query specific fields and does not relay on the duplication of text to text_t. - add author to reset-default boost fields (support results for author nav)	11 years ago
reger	8004cfc961	fix input boostfield factor of 0.0 in RankingSolr - input was accepted and stored but not editeable (added check factor >0.0 during edit) - make use of some more predefined solr constants	11 years ago
reger	a2cb366b25	Combine /heuristic search modifier with opensearch configured targets - with search modifier /heuristic a request is send to all configured opensearch target systems (old /heuristic/blekko modifier not longer valid) - this allows to use opensearch heuristic on individual search request (in contrast to configuration HEURISTIC_OPENSEARCH=true which sends a osd request on all global searches - the index.html searchoption text adjusted to be displayed only if option configured - add Archive-It to predefined systems	11 years ago
Michael Peter Christen	2de159719b	added an option to set 'obey nofollow' for links with rel="nofollow" attribute in the <a> tag for each crawl. This introduces a lot of changes because it extends the usage of the AnchorURL Object type which now also has a different toString method that the underlying DigestURL.toString. It is therefore not advised to use .toString at all for urls, just just toNormalform(false) instead.	11 years ago
Michael Peter Christen	87f8118108	added option to delete documents from the webgraph	11 years ago
Michael Peter Christen	32a2ff925c	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	d07cdd8c3b	added SolrCloud access mode and configuration	11 years ago
Michael Peter Christen	8514bffc22	enhanced postprocessing status report	11 years ago
reger	f99f3d5cf2	fix button (clear list) text color in CrawlResults	11 years ago
Michael Peter Christen	b5fc2b63ea	removed exist() retrieval functions from error cache and replaced it with metadata retrieval from connectors directly. This should cause better usage of the cache. Automatically increase the metadata cache if more memory is available.	11 years ago
Michael Peter Christen	62c72360ee	cleanup of checkAcceptanceInitially in CrawlStacker, should avoid double-calling of solr	11 years ago
orbiter	dab9a0786a	Merge branch 'master' of git@gitorious.org:yacy/rc1.git	11 years ago
orbiter	51bf5c85b0	Renamed the transmission cloud to buffer in dispatcher since the name 'cloud' was a bad idea. Changed also the accumulation process for peer targets so that every dht chunk is not assigned the set of redundant targets but they are assigned to redundant targets individually. This enhances the granularity of the target accumulation and should enhance the efficiency of the process. Finally the dht protocol client was enriched with the ability to remove the 'accept remote index' flag from peers or remove peers completely if they do not answer at all.	11 years ago
reger	7057e0b3e2	catch input file not found in Mediawiki import	11 years ago
Michael Peter Christen	f384fd624b	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
reger	ba5a59a28d	make search result also avail. as atom feed via /yacysearch.atom - fix logo in rss feed	11 years ago
orbiter	59160984cc	timeline performance update	11 years ago
orbiter	54bea96e67	Merge branch 'master' of git@gitorious.org:yacy/rc1.git	11 years ago
Michael Peter Christen	15b2fad6a2	reverted latest change for reindexing because that works actually only for internal Solr indexes. This is mainly caused by the fact that an external Solr may be also a SolrCloud which do not support LukeRequests, which are needed to request the old Schema.	11 years ago
Michael Peter Christen	841cc77391	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	e09218129c	remove check for local solr. This check was made during a time when Solr was optional and another alternative metadata store was available. Since that store is now removed, Solr is always available (internally or externally)	11 years ago
orbiter	2073e69034	fix for long periods in timeline	11 years ago
reger	1f94df29e7	fix NPE in solr rss where snippet contains only the title text and adjusted xslt, for solr snippets (&hl=true) to decode the xml encoded html <b> tag by adding disable-output-escaping (still open item description may be double as dc: tag and rss.description tag)	11 years ago
Michael Peter Christen	8c52f0651b	refactoring of AccessTracker events & timeline fix	11 years ago
Michael Peter Christen	1b279d7a7e	fixed external link	11 years ago
Michael Peter Christen	74206a10c7	refactoring	11 years ago
Michael Peter Christen	36e623d8bf	enhanced metadata enrichment for media file type search: - Web servers may now deliver YaCy-specific http header field with a title and keywords. The new http header fields are: X-YaCy-Media-Title - to be used for media (image, audio, video) titles X-YaCy-Media-Keywords - to be used for media (image, audio, video) keywords - both fields are written to document fields title and keywords and are searched also during image search. - to make the usage of arbitrary http header fields (including this new fields) possible in the /api/push_p.json servlet, a new POST argument is also introduced to push http header fields. The new POST attribute is named "responseHeader-X" (where X is the counter). It is allowed to use this attribute as multi-attribute several times, each can be filled with a http header line. - see /api/push_p.html for examples	11 years ago
reger	a88ea14e09	harmonize use of style for "delete" button - apply the monstly used btn-danger class	11 years ago
Michael Peter Christen	8fd72b5e8b	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	81d0f01a6f	added 'synchronous' and 'commit' flags in push api	11 years ago
reger	5043eff33a	move page navigation below results (image search) force page navigation to be displayed below results in image search for any number of displayed images instead to be displayed to the right of last image.	11 years ago
Marc Nause	f443cfa32d	Improvements and bugfixes for recording actions of blacklist API.	11 years ago
Michael Peter Christen	0ba6b98d5b	fix for broken json	11 years ago
orbiter	4177c9cf05	fix for crawl start check	11 years ago
orbiter	0bbb5040b8	Merge branch 'master' of git@gitorious.org:yacy/rc1.git	11 years ago
orbiter	9d5d86cd03	Added filter query options to the ranking servlet /RankingSolr_p.html. Filter queries are not actually related to ranking, but user requests have pointed out that specific boost queries to move results to the end of the result list are not sufficient. Such boost filters may be better executed as actual filter and therefore such a filter can now be statically applied to every search request. A typical use could be the expression "http_unique_b:true AND www_unique_b:true" which uses the recently introduced fields http_unique_b and www_unique_b which are true only for one of the alternatives with/without http(s) and with/without prefix 'www.' in host names.	11 years ago
Michael Peter Christen	d2151857f1	Added collection navigation: The collection field (can be filled i.e. in Crawl Start) can be used to add categories to YaCy index entries. The usage of that field was restricted to solr searches and post argument filters as implemented in commit `f7571386a3`. This commit extends collections to a full navigation option in the standard YaCy search interface. The field is not active by default but can be activated easily in the /ConfigSearchPage_p.html servlet (just check the 'Collection' facet field). Collections can now be used for (at least) two purposes: - to provide search tenants (through post argument collection) - to provide self-made category navigation Search requests may now have (independently from switched on or off collection facet) a "collection:<collection-name>" modifier attached; firthermore collection names may use disjunctions using the '\|' pipe symbol. For example, this is a valid search request: www collection:user\|proxy	11 years ago
Michael Peter Christen	74c249288a	added a push api to make it possible to upload files directly without crawling to the YaCy indexer. Files are uploaded using POST multipart requests; multiple file uploads are possible as well. Each file has attached the file date and mime type which is used to get the right parser for the submitted data. Also an url is submitted which is assigned to the document. The CrawlSwitchboard has a new option for default Crawl Profiles which are assigned dynamically from the new push interface.	11 years ago
reger	c798a9d1bb	fix unresolved pattern in yacysearch.rss title and rss xml error due to html & encoding in url entries	11 years ago
Michael Peter Christen	e64be5dcad	in case that the network is switched to any other than freeworld, RWIs are disabled. This is a temporary fix. There must be a better way to determine if RWIs are to be switched on or of.	11 years ago
Michael Peter Christen	87f171675b	doing index deletions using a get string which makes it easier to copy-paste deletion examples (see: #EuGH :( )	11 years ago
Michael Peter Christen	a2f800cd8f	fix for bad String conversion	11 years ago
Michael Peter Christen	b3b174e2b8	fixed webgraph postprocessing and status display in Crawler_p servlet	11 years ago
reger	7a52a6ba3f	add links to port config in status panel - pom upd to match javadoc location	11 years ago
reger	c3e40c82fe	make https port setting changeable via front end somewhere (chosen Http Networking page /Settings_p.html?page=http )	11 years ago
Michael Peter Christen	698f053658	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	f23c4142e0	added option to configure a custom user agent within allip networks	11 years ago
reger	8e233e2eb4	- fix typo in Message_p (defaultpath) - use more existing switchboardconstants for getproperties - replace depriciated call defaultservlet	11 years ago
Michael Peter Christen	8ad41a882c	fixed several problems with postprocessing: - unique-postprocessing was destroying results from other postprocessings; removed cross-updates as they had been not necessary - unique-postprocessing did not restrict on same protocol - inefficient concurrent update cache was redesigned completely - increased limits for concurrent blocking queues to prevent early time-out	11 years ago
Michael Peter Christen	640b684bb6	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	11 years ago
Michael Peter Christen	2f5477ea59	a try to fix the mixed up terms 'Active' -> 'Senior' and 'Passive' -> 'Junior'	11 years ago
reger	ca5437dd50	fix crawl of file:// , also http://mantis.tokeek.de/view.php?id=149 local files can be crawled (intranet mode) url parsing fixed according to RFC 1738 (for unix and windows) for win like file:///c:/tmp or file://localhost/c:/tmp for linux like file:///tmp or file://localhost/tmp Host is ignored and path must be absolute	11 years ago
reger	66f6797f52	make config search page layout closer to actual page appearance	11 years ago
sixcooler	5b1c4ef191	Monitoring and limit connection-count for Jetty	11 years ago
orbiter	ce1dbfeb0f	fix appearance of image search thumbnails.	11 years ago
orbiter	6daae59479	switch on core.service.rwi when switching back from portal mode to p2p mode	11 years ago
Michael Peter Christen	f0db501630	better handling of ranking parameters and new default values for date navigation which is done using ranking in solr.	11 years ago
Michael Peter Christen	2520590b45	migrated from pdfbox 1.8.4 to 1.8.5. They have a very long bugfix list for that update: http://www.apache.org/dist/pdfbox/1.8.5/RELEASE-NOTES.txt	11 years ago
Michael Peter Christen	6634b5b737	debug code for index distribution testing	11 years ago
Michael Peter Christen	89e13fa34e	fixed bug in test function	11 years ago
Marc Nause	4723329e29	Improved blacklist XML/JSON API.	11 years ago
reger	f91b2f51ae	fix: load_Rss remove feed to many parameter for get use form post methode	11 years ago
orbiter	c028ae9b09	Merge branch 'master' of git@gitorious.org:yacy/rc1.git	11 years ago
reger	e31493e139	"Use remote proxy for yacy" has no function, remove option and related config item see/fix bug http://mantis.tokeek.de/view.php?id=23 http://mantis.tokeek.de/view.php?id=189	11 years ago
reger	89e2c5e884	fix: allow enable of CrawlStartExpert.html #file	11 years ago
reger	1b37b12998	fix: CrawlStartExpert.html # From File with missing filename - crawlName must not be empty - crawlingFile must not be empty	11 years ago
orbiter	0d8072aa99	removed warnings	11 years ago
orbiter	be7c99dbe8	switched menu position of ConfigPortal.html and ConfigSearchBox.html	11 years ago
Michael Peter Christen	a1ac4c3b76	automatically clear graphics cache	11 years ago
reger	f87ac716f3	improve IndexDeletion by query adding transparently text_t as pseudo default search field if no fieldname (no : ) is included. adressing bug report http://mantis.tokeek.de/view.php?id=274	11 years ago
reger	e9060d31bd	update to Jetty 9 besides adjustments in code it makes the servlet settings in web.xml significant. This applies to solr, gsa and proxy servlet. There is no longer a default setup in code during init (as jetty 9 checks for double definition).	11 years ago
orbiter	b9c1a61814	added a peername=<peername> property in the seedlist API	11 years ago
orbiter	c637955e67	fix for navigation steering / p2p mode see also: http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5198&p=29958#p29958	11 years ago

1 2 3 4 5 ...

4972 Commits (6491270b3a17a26834956d9aaf396b211e0d6b2b)