yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	fc98ca7a9c	removed ContentControl servlet and functionality This was not used at all (as far as I know) and was blocking a smooth integration of ivy in the context of an existing JSON parser.	2 years ago
sgaebel	fc03c4b4fe	removes some warning and unused objects	4 years ago
luccioman	9782a98a9c	Added the possibility to customize facets sort type and direction Previously search navigators/facets elements were sorted only by counts. Now from the ConfigSearchPage_p.html admin page, sort direction (ascending/descending) and type (on counts or labels) can be customized independently for each navigator.	6 years ago
luccioman	fa4399d5d2	Small perf improvement : initialize threads names early when possible Initializing Thread names using the Thread constructor parameter is faster as it already sets a thread name even if no customized one is given, while an additional call to the Thread.setName() function internally does synchronized access, possibly runs an access check on the security manager and performs a native call. Profiling a running YaCy server revealed that the total processing time spent on Thread.setName() for a typical p2p search was in the range of seconds.	7 years ago
luccioman	f511e16d50	Prevent duplication of Solr query highlight fields parameters That was caused by concurrent modifications (with addHighlightField() function) to the same SolrQuery instance when requesting Solr on remote peers in p2p search.	7 years ago
luccioman	e115e57cc7	Reduced text snippet extraction processing time. By not generating MD5 hashes on all words of indexed texts, processing time is reduced by 30 to 50% on indexed documents with more than 1Mbytes of plain text.	7 years ago
luccioman	3b89c232db	Easier tracking of longest text snippets initializations When text snippets statistics are enabled and FINE log level is enabled on the TextSnippetStatistics class.	7 years ago
luccioman	a3ec7a7a5f	Added analysis optional setting to compute statistics on text snippets Thus producing some basic stats on processing times for snippets generation and counts on snippets per source type.	7 years ago
luccioman	26d8ad591c	Adjusted Solr select servlet output when using an external Solr only - Use the EnhancedXMLResponseWriter only when requested output is "exml" - Use the Standard Solr writers when possible, for example for json, xml or javabin output formats - Return an error when the requested format cannot be rendered with an external Solr server only Important : this modification is necessary for peers using exclusively an external Solr server to be reachable as robinson targets in p2p search, as the binary format ("javabin") is the default Solr exchange format for peers. Before this, when a peer requested a remote one attached only to an external Solr (no embedded one), it ended with "Invalid type" error, as the remote peer answered with xml although binary format was requested.	7 years ago
luccioman	1b90479a76	Added missing vocabulary navigator increment on results from RWI	7 years ago
luccioman	3a973dbb23	Removed unused import	7 years ago
reger	87077b8fb6	Adjust and move Language Navigator to be member of the navigator plugin list.	7 years ago
luccioman	0cdee4e26a	Fixed loss of "meanCount" search param when using facets or page buttons Then on new search queries, no suggestions at all could be displayed.	7 years ago
luccioman	117a859879	Do not clear all search modifiers when unselecting one modifier. Previously, when clicking a selected facet in the search results page to unselect it, any other selected modifiers/facets were also removed.	7 years ago
luccioman	33593c22e9	Fixed loss of other modifiers on keywords/tags search navigation links	7 years ago
luccioman	a9dc0874c0	Remove old query terms from search results suggestions links. Especially when old terms were misspelled, suggestions links then provided most of the time empty results.	7 years ago
luccioman	8a4ea1c11e	Added UI switch to control content domain constraint per search request	7 years ago
luccioman	e6907fdab3	Added optional search parameter/setting to control content domain filter Thus allowing to choose at configuration or per search request, whether or not to extend results beyond strict content domain filter (image, video, audio or application). Related graphical controls to be added to user interface.	7 years ago
luccioman	a4494d6e01	Improved support for internationalized domain names on "site:" modifier Allow typing directly internationalized domain names including non ASCII characters in the search field. Search is done using the ASCII Compatible Encoding (ACE) representation.	7 years ago
luccioman	d07006bac4	Do locale independent case conversion on "filetype:" query modifier.	7 years ago
luccioman	8fbf25d1ed	Made "site:" query modifier case insensitive.	7 years ago
luccioman	867388e05b	Refactored 'site:' query modifier parsing into a dedicated function.	7 years ago
luccioman	c9d80b5b77	Prefer fine URL match over approximate URL mask regex on final filtering Also prevent adding a redundant and CPU costly Solr url mask filter query when possible	7 years ago
luccioman	0a120787e3	Improved accuracy of URLs search filters : protocol, tld, host, file ext	7 years ago
luccioman	e07ef1b610	Apply tld query modifier on Solr host_s mandatory field. The filter has thus much more chances to be effective than when applied on the optional field host_dnc_s.	7 years ago
luccioman	478e92deff	Fixed url mask filter generated when protocol modifier is not null	7 years ago
luccioman	29de4a65d7	Refactored url mask filter build from query modifiers For better readability and easier unit testing.	7 years ago
luccioman	8303e15419	Reduced number of search navigators refresh requests in JS resort mode The SearchEvent listens to changes on each of its navigators, and the information about their overall state is sent with each fetched search item (as a "data-nav-generation" attribute). Then the browser can regularly fetch a fresh version of yacysearchtrailer.html only if necessary (when that nav-generation value changes).	7 years ago
luccioman	27ab733685	Ensure private search features are not lost on Digest auth timeout This is a fix for mantis 766 ( http://mantis.tokeek.de/view.php?id=766 ) Since the upgrade to Digest authentication, access to protected search features was indeed disabled once the Digest nonce timed out. After Digest auth timeout the browser no longer sent authentication information and as the search results page is not private, protected features were simply hidden without asking browser again for authentication. Adding a supplementary parameter when accessing the search results as authenticated fixes this.	7 years ago
reger	ba60f65040	Adjust filetype: query modifier parameter to lower case to prevent mismatch on user input with mixed case Internally file extensions are always compared in lowercase.	7 years ago
luccioman	ef8aea7f8d	Made the dates navigator max elements number user configurable. Also used object properties on QueryParams instances, rather than using mutable class (static) properties.	7 years ago
luccioman	66cb9c4ff9	Added Solr filter queries for audio, video and application domains Inspired from the existing one used on image search, and consistent with post filtering on content domain applied in SearchEvent.addNodes(). These filters are quite simplistic but at least audio, video or application search now return results. Previously, when filtering on these content domains, many results pages (and often even the first page) were empty while the total results count suggested that results should be available. This was because filtering on domain was only applied AFTER requesting Solr indexes.	7 years ago
luccioman	5d3ceb31b7	Improved search navigators counters accuracy and consistency. - added some missing increments from RWI results - decrement relevant navigator counts when solr or RWI results are evicted because of duplicates detection or constraints checked belatedly - do not compute facets when unnecessary to avoid unwanted CPU load - do not increment from facets when already done - do not rely on facets on remote solr peers requests, as most of the time only a limited part of their total results is fetched (thus also preventing unnecessary load on remote peers) - use a concurrency friendly score map for the dates navigators to prevent unwanted ConcurrentModificationExceptions This improves the situation for the most obvious inconsistencies in search navigators counts, but more has to be done for a true accuracy (notably when query modifiers constraints are applied belatedly - after the solr or RWI retrieval request - such as the content domain constraint)	7 years ago
luccioman	a28428047a	Fixed count of filtered results from local solr. Was inadequately modified in my previous related commits (making next pages buttons unavailable in Search portal mode), as SearchEvent.local_solr_available did not count the total filtered results but only the ones within the currently fetched result page(s).	7 years ago
luccioman	3c9df6e0ce	Use local solr filtered results in total search results count. This modification has indeed little impact, as any query modifiers are already applied when requesting the local solr index. It mainly impacts duplicates detected with results from remote peers. Also updated javadocs for clarification.	7 years ago
luccioman	a1a0515312	Added a button to manually refresh sorting of p2p search results. As a server-side oriented alternative to the JavaScript realtime resorting feature proposed in PR #104. The goal is the same as in this PR : having the possibility to compensate the network latency of various peers results fetching and obtain as soon as possible a consistently ranked result set.	7 years ago
luccioman	4eba88f2ff	Removed some unnecessary uses of java.lang.reflect api. This improves code browsing and readability, making search by references or call hierarchy IDE features more accurate.	7 years ago
luccioman	da3dbf9ea1	Use Javadoc style comments on SearchEvent properties. For better code readability and understanding.	7 years ago
reger	ddd13b776d	Add keyword constraint to rwi query result filter To discard rwi results not matching query keyword: parameter	7 years ago
luccioman	8da3174867	Ensure lower case conversion consistency with any default locale. Especially for Turkish speaking users using "tr" as their system default locale : strings for technical stuff (URLs, tag names, constants...) must not be lower cased with the default locale, as 'I' doesn't become 'i' like in other locales such as "en", but becomes 'ı'.	7 years ago
Michael Peter Christen	6fe735945d	migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8 Also: now Version 1.921	8 years ago
reger	a814f3d885	Introduce keyword query parameter This enables keyword navigator to filter on keywords. Added search page output and layout config for keywords, allowing e.g. in Intranet use to display the keywords. No styling or links applied to the keyword text (but is desirable possibly in combination with bootstrap-tagsinput for future/intranet).	8 years ago
luccioman	31ad043bb9	Added user interface feedback on results feeding termination status. Added as an additional icon with title in the search progress bar, to inform about background search feeder threads terminated or still running. While giving a bit more information to users about the p2p search process, this can help choosing whether or not to wait a little bit more time before going to the next page, in order to get results from various sources sorted as best as possible (see #91 for a discussion about sorting accuracy and network latency). Other related modifications included : - regular updates to statistics in the progress bar until the background feeders are completely terminated. - removed some uses of insecure and discouraged JavaScript elements	8 years ago
reger	7b80189bda	Activate hosts navigator plugin. This includes rwi results in the navigator count. This might be tangentially related to http://mantis.tokeek.de/view.php?id=736 as the example includes a local index search, while rwi results are not counted.	8 years ago
Burkhard	4fdc11cae8	Update SearchEvent.java Fix NPE on disabled local SolrIndex, occurring on search moving to the 2nd result page. The debug-purpose-only setting for disabling local SolrIndex (System Admin -> Debug Settings) should long term probably be removed from production code.	8 years ago
reger	334c70c37a	correct fromDate init value on missing param in api/timeline_p servlet revert test modification from last commit in AccessTracker.main	8 years ago
reger	cc770512d5	add hint of query syntax in AccessTracker log (qs=normal querystring, sq=solr-querystring) to allow to filter simple text queries for processing, remove toString for counter parameter use more predefined constants in solrservlet	8 years ago
luccioman	526f2d6a8b	Fixed NPE case occurring when local solr index is disabled in search.	8 years ago
luccioman	08de58b6d3	Named a Thread without name for easier monitoring	8 years ago
luccioman	fc01b69eca	Fixed local image search pagination regression. As reported by @tglman on issue #90, when searching images on the local index only, pages next to the first were always empty. This was a regression from commit `c25e48e969`.	8 years ago


468 Commits (5db97a892856265737f67978a5175c36c46e5cee)