yacy_search_server

Commit Graph

Author	SHA1	Message	Date
luccioman	2da5f339f8	Fixed /News.html and /Wiki.html pages in Search Portal mode (issue #87 ). Also fixes theses pages rendering when the peer is not online. Re-factored code in common with /opensearchdescription.xml and ConfigPortal.html.	8 years ago
reger	8fe28a83f2	harmonize used lastmodified date for rwi and fulltext in storeDocument	8 years ago
reger	da0f4ee599	include navigator-plugin output data in json and rss/xml output fix encoding of url for rss fix unresolved-pattern of url in json & xml for domain navigator	8 years ago
reger	3d1d297308	refactor namespace navigator as part of navigatorplugin map, this allows the navigator to include counts all matches (rwi+fulltext). Fixing also unresolved_pattern in navigators title (of the counter) The use of inurl: query modifier as filter has not been changed keeping it as soft (unsharp) filter facet. Upd StringNavigator to prevent empty string form multivalued solr fields, removed date value conversion (better handled elsewhere, not need here).	8 years ago
reger	c8983805f2	upd IndexControlRWIs servlet, url list table remove unused word distance column (table lists always refs for one word). upd master.lng with recent text changes	8 years ago
reger	67f660523b	Make navigators underlaying indexfield name accessible in interface use interface in declaration and extend facet check to include navigator field.	8 years ago
reger	5eb3ee4e20	Add search navigator interface to allow for additional navigators (plugins) Prepared the first basic navigators (for authors and collections) for the list of SearchEvent.navigatorPlugins and adjusted servlet to use these. - this allows to configure display order of these navigators (by ordering config string) - eventually allows for additional and/or custom navigators using any available index field without need for changing servlets - the Collection navigation has been adjusted to exclude the internal, default robot_* and dht collections from displaying - rwi results are now also checked for navigatior by the refactored navi's So far no config options were added to customize or add navigators (may come later if route of upcoming modularization/plugin system is defined).	8 years ago
reger	fd3f58fcaa	improve query modifier parsing of "collection:" and possible collision with "on:" in case multiple collection modifier were entered (by mistake) http://mantis.tokeek.de/view.php?id=702	8 years ago
reger	4c7e515769	correct Collection navigatior - search servlet modifier parameter (navigator entries are single collection names, spaces are removed by crawlstart) preparation: for abstraction of navi's	8 years ago
reger	af39a76bf6	Reduce number of default max. search navigator lines (from 10000) to 100 + make it configurable	8 years ago
Sudheesh Singanamalla	065bcfba75	Merge pull request #88 from sudheesh001/Patch16 Fixes #16 Updates documentation about cloning and build from source	8 years ago
sudheesh001	d97da1ddb7	Fixes #16 Updates documentation about cloning and build from source	8 years ago
reger	20a1b29ed3	add simple test case for ReferenceContainer helpful for debugging calculated ranking parameter	8 years ago
reger	3cc2af8f92	reduce the mix of absolute and relative internal html page links (prefer relative for same pg or neighbors) to ease proxied access e.g. http://mantis.tokeek.de/view.php?id=701	8 years ago
reger	3c7220bc7b	Refacture rwi reference word position and word distance calculation used for rwi ranking. Main changes: - introduce a posintext() to access the stored value. This reduces also mem alloc of position array for WordReferenceRow (index access) - use the positions() array for joined references on multi-word queries if needed (otherwise allow positions() to be null - adjust assignments and the min() max() and distance() calculation accordingly	8 years ago
luccioman	f0639d810c	Customized name for Threads still using the default "Thread-n" pattern. This makes threads monitoring easier to read.	8 years ago
luccioman	c0379c3cd3	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git	8 years ago
luccioman	db3b9db9c2	Crawl from local file : faster task end when manually terminating crawl.	8 years ago
luccioman	78085fad8d	Fixed NullPointerException case. As reported by @reger24 , search in Intranet mode was failing due to this error.	8 years ago
reger	4c67ed3f8d	catch rwi ranking div by zero exception during rwi search result processing worddistance calculation is effected by concurrent update (normalization) of min/max ranking parameter for wordpositions. On update of min/max the exception is raised in distance calc and now catched. This concurrent update and change of ranking results is needed for speed but should be further checked for optimization	8 years ago
luccioman	47af33a04c	Advanced Crawl from local file : better processing of large files. Applied strategy : when there is no restriction on domains or sub-path(s), stack anchor links once discovered by the content scraper instead of waiting the complete parsing of the file. This makes it possible to handle a crawling start file with thousands of links in a reasonable amount of time. Performance limitation : even if the crawl start faster with a large file, the content of the parsed file still is fully loaded in memory.	8 years ago
luccioman	ee92082a3b	Updated javadocs : warning about closing stream responsibility.	8 years ago
luccioman	6f49ece22f	Fixed redirected URLs processing as crawl start point. See mantis 699 (http://mantis.tokeek.de/view.php?id=699) for details.	8 years ago
reger	68217465fe	div by null in word distance calculation (again, description in http://mantis.tokeek.de/view.php?id=698) as root cause was not seen, added just workaround reducing in favour over a try catch (for easier followup).	8 years ago
luccioman	7263d17436	Removed mentions of deprecated LURL-db. Thanks to LA_FORGE asking about if on YaCy forum ( http://forum.yacy-websuche.de/viewtopic.php?f=5&t=5895 )	8 years ago
luccioman	c3c4a52408	Added more examples in Blacklist JUnit test.	8 years ago
reger	8b74a6bf57	fix min/max calculation of WordReferenceVars.distance() Issue was the calculation in AbstractReference with positions.clear() call, this made distance result always 0 (distance needs min 2 positions) and created concurrency issues. + unit test of changes	8 years ago
luccioman	da362628fb	Added fine log level for too long blacklist matching processing.	8 years ago
reger	aaae7c6462	adjust ConcurrentScoreMap internal value map to interface and use parameter Long -> Integer (saves some bytes)	8 years ago
reger	31d2a5645e	remove obsolete query variable leftover from `8fb370d9f8 (diff-1d4259005ebfddc11083387857a86175)` harmonize ranking shift parameter to 0xFF correct addresult weight parameter to long	8 years ago
luccioman	93ea366778	Updated license header file name	8 years ago
luccioman	4c0be4d5d4	Fixed maven compilation error Removed unit test yacysearchitemTest from default maven Junit tests path, as yacysearchitem class is not in maven build classpath.	8 years ago
reger	ba77e8f8ec	upd to Jetty 9.2.19	8 years ago
luccioman	a588ed7628	Applied image headers customization to the new ViewFavicon servlet.	8 years ago
luccioman	d16e57b41e	Merge pull request #39 from luccioman/master Favicon retrieval and image preview enhancements. More details on mantis 629 (http://mantis.tokeek.de/view.php?id=629)	8 years ago
luccioman	7717a3d43d	Fixed license headers on files created to improve favicon management.	8 years ago
luccioman	6e1959f469	Merge branch 'master' of https://github.com/yacy/yacy_search_server.git Conflicts: htroot/yacysearchitem.java source/net/yacy/cora/federate/solr/responsewriter/YJsonResponseWriter.java source/net/yacy/search/schema/CollectionConfiguration.java source/net/yacy/server/serverObjects.java	8 years ago
luccioman	7136b1ad60	HTML validation : fixed URL encoding of Pictures link.	8 years ago
reger	407563b9f0	add lock symbol to messages UI Trans menu item	8 years ago
reger	685d8e86bf	Avoid frequent data type casting (float/long) for rwi score refactor to using long in URIMetadataNode too (and related call parameters) As remote rwi score's are not used (since v1.83) skip reading float-score , but keep in toString() for communication with older versions.	8 years ago
luccioman	3ccd89e274	Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments	8 years ago
luccioman	f1f4459f88	Added some unit tests for Blacklist.isListed()	8 years ago
luccioman	4b699c469a	Blacklist refactoring : extracted a function for easier unit testing	8 years ago
luccioman	54cfcc3f56	CrawlCheck_p.html : also display info about disallowed URLs.	8 years ago
luccioman	8b341e9818	Robots : properly handle URLs including non ASCII characters This fixes GitHub issue 80 ( https://github.com/yacy/yacy_search_server/issues/80 ) reported by Lord-Protector.	8 years ago
luccioman	75bb77f0cb	Refactoring : extracted a method to handle authorized action links.	8 years ago
luccioman	c996b04741	HTML validation : fixed URL encoding of search results action links.	8 years ago
luccioman	2b81703828	Refactored search result action links construction. These are long URLS with common parts : it is valuable to build the common parts only one time.	8 years ago
reger	e68b00678e	prevent negative score on URIMetadataNode - in the special case were no solr score is supplied. + assert before use & test case	8 years ago
luccioman	242707f9b4	Fixed loadFromCache with strategy IFFRESH. This fixes mantis 695 ( http://mantis.tokeek.de/view.php?id=695 ) : crawl start with 'Link-List of URL' option on websites using cookies.	8 years ago

1 2 3 4 5 ...

12880 Commits (4eeb448eb3d0b0fda80375aae866a5a6c914e30f) All Branches Search

12880 Commits (4eeb448eb3d0b0fda80375aae866a5a6c914e30f)

All Branches