yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	7ee71c2354	changed administration page headline to 'admnistration'	12 years ago
Michael Peter Christen	b4f0cac102	added the reindexing job servlet to the submenu structure	12 years ago
Michael Peter Christen	1b102d98d8	- added index deletion to index administration submenu - added index deletion processes to the process scheduler/recorder	12 years ago
reger	40b3f2c5fe	comment out dead menue link	12 years ago
Michael Peter Christen	addba047e2	changes in ranking computation - an existing ranking servlet for solr was extended. It is now possible to set boost values for fields, boost functions and boost queries. - The ranking can have different instances, but currently only the first one is used - added an abstraction layer for fields which can be used for search and those fields can be edited in the solr ranking configruation - the ranking value from solr within the field score is used to combine remote search requests, which all are created using the same locally defined boost values - reduced the number of fields which are used for search (makes it faster) - replaced some text fields by string fields (makes indexing faster) - removed classes which had no use - made a large number of experiments for a better ranking and created a temporary setting which prefers hits inside titles - adjusted also the RWI-based ranking computation to 'prefer title' - made special cases like for portal search where no post-processing and post-ranking is wanted: this keeps the original ranking order as done by Solr - fixed many bugs with old settings for ranking	12 years ago
Michael Peter Christen	56d5946a59	- added flags in IndexFederated_p.html to switch on or off the webgraph index (new solr core webgraph) .. this is now off by default - completely redesigned this servlet - added description how to attach a remote solr - adjusted naming of servlet and menues - moved 'lazy initialization' attribut from IndexSchema to IndexFederated (this is a general option) back again.	12 years ago
Michael Peter Christen	788288eb9e	added the generation of 50 (!!) new solr field in the core 'webgraph'. The default schema uses only some of them and the resting search index has now the following properties: - webgraph size will have about 40 times as much entries as default index - the complete index size will increase and may be about the double size of current amount As testing showed, not much indexing performance is lost. The default index will be smaller (moved fields out of it); thus searching can be faster. The new index will cause that some old parts in YaCy can be removed, i.e. specialized webgraph data and the noload crawler. The new index will make it possible to: - search within link texts of linked but not indexed documents (about 20 times of document index in size!!) - get a very detailed link graph - enhance ranking using a complete link graph To get the full access to the new index, the API to solr has now two access points: one with attribute core=collection1 for the default search index and core=webgraph to the new webgraph search index. This is also avaiable for p2p operation but client access is not yet implemented.	12 years ago
Michael Peter Christen	b6de1f42dc	Full redesign of solr connection architecture. This was done to support multiple solr cores instead of just one. Therefore it is now necessary to distuingish between solr server connections (called an 'Instance') and a connection to a single solr core. One Instance may now have multiple connector classes assigned to it, each connecting to a single core. To support multiple cores it is also necessary to distinguish between the connection configuration and the configuration of the index schema. We will have multiple schema configurations in the future, each for every solr core. This caused that the IndexFederated servlet had to be split into two parts, the new Servlet for the Schema editor is now in the IndexSchema Servlet.	12 years ago
Michael Peter Christen	51e7ab4f70	moved bookmarks back to more prominent location (even if this does not fit to the 'Search Interfaces' headline)	12 years ago
reger	3b6e08b49f	prevent checking of urldb if empty - disconnect urlIndexFile if empty - add missing lock class in submenuSearchConfiguration	12 years ago
reger	f143804382	fix configuration for search page navigators - added additional config page (ConfigSearchPage_p) for easy setup of search page layout (to not overload ConfigPortal page) - currently redundant setting with part of ConfigPortal page - added missing config for filetype and protocol navigator - adjusted init of SearchEvent to check navigation config setting - renamed RankigProcess.getTopicNavigator to getTopics (to distiguish between added SearchEvent.getTopicNavigator)	12 years ago
Michael Peter Christen	8ae08a2cac	moved HTCache, Heuristics and Parser servlet to a more appropriate menu location	12 years ago
Michael Peter Christen	908ad2f174	Added a new servlet to configure the solr ranking using field boosts	12 years ago
Michael Peter Christen	a598fb6227	renamed Ranking_p.html to RankingRWI_p.html because there will be another Ranking servlet as well at next	12 years ago
Michael Peter Christen	4c4e0eece2	added new submenu 'Target Analysis' with three servlets which are useful to analyse the target servers: robots.txt table, mass target analysis and a regex tester	13 years ago
Michael Peter Christen	51f420e4f5	removed location search because it is only working in special cases	13 years ago
Michael Peter Christen	a15819fbec	fix for some interface problems	13 years ago
Michael Peter Christen	64ac2b7b7d	new submenu template	13 years ago
Michael Peter Christen	5e77801aac	update to web interface structure	13 years ago
Michael Peter Christen	ce3fed8882	added the Google Search Appliance (GSA) api interface to the main menu. See: https://developers.google.com/search-appliance/documentation/68/xml_reference#request_overview	13 years ago
Michael Peter Christen	3d3d654e88	if a network configuration is choosed which does not allow DHT and no P2P communication is in robinson mode) then some menu entries are disabled which have no use in this mode.	13 years ago
Michael Peter Christen	cc98496ff3	enhanced the HostBrowser: - showing also outbound links to other domains if there are any - the outbound links browser shows also the link structure image - showing even inbound links if the web structure graph has information about that - removed the left menu and made the HostBrowser a part of the top menu for search - moved the file search also to the top menu - added hover information in the HostBrowser to explain what the click means - because the HostBrowser also links to the Metadata viewer ViewFile, there should be a button to switch back to the HostBrowser: added that also.	13 years ago
Michael Peter Christen	abebb3b124	added a crawl start checker which makes a simple analysis on the list of all given urls: shows if the url can be loaded and if there is a robots and/or a sitemap.	13 years ago
Michael Peter Christen	941873fba4	moved the index deletion functions from IndexControlRWIs to IndexControlURLs where it appears more naturally. Because the RWI administration is less important in the presence of Solr, the IndexControlURL is now the default servlet when the Index Administration button on the main menu is selected.	13 years ago
Michael Peter Christen	f45f7fc12e	added new Host Browser to main menu: this new search interface is something completely new for search, but completely common on desktops: browser a web space like one would browse a file system in a file browser. The file listing is created using the search index and a faceted restriction to specific domains.	13 years ago
Michael Peter Christen	174530a9e0	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
apfelmaennchen	43f3a932fd	removed jquery.slider as it is already included as part of jquery-ui package	13 years ago
apfelmaennchen	a01eb1b7fe	removed unused jquery plugin slider as it is part of jquery-ui package	13 years ago
cominch	dc468dad01	add content control features for custom filter lists	13 years ago
orbiter	7ac259477f	added a direct access to solr search api to enhance the visibility if the embedded solr	13 years ago
Michael Peter Christen	3bcd9d622b	cleaned up classes and methods which are either superfluous at this time or will be superfluous or subject of complete redesign after the migration to solr. Removing these things now will make the transition to solr more simple.	13 years ago
cominch	c63c3a4495	Show additional interaction elements in footer section on each page, if activated in ConfigPortal.html. This footer is also visible in augmented browsing proxy mode.	13 years ago
cominch	011f8a5818	Auto Tagging: Add hyperlinks to tags (provisional)	13 years ago
Michael Peter Christen	fbded1f466	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
Michael Peter Christen	e806106b10	jquery bugfix	13 years ago
Michael Peter Christen	a0f1decd82	- added loading of the dbpedia pnd triplestore in the dictionary loader - renamed the dictionary loader to knowledge loader - some refactoring in the library provider method names	13 years ago
cominch	bddac2839e	add missing files for tag display	13 years ago
Michael Peter Christen	eca38c53e7	added a vocabulary editor	13 years ago
Michael Peter Christen	80e8aaabc8	moved new servlets into one submenu "Content Semantic"	13 years ago
Michael Peter Christen	dd020a1a8a	removed autocrawler and feedback servlet link since that was not cherry-picked	13 years ago
cominch	90512640bf	Added config switches for custom parser Conflicts: source/net/yacy/document/TextParser.java	13 years ago
cominch	bde07ed7a8	Add tagging overlay element Conflicts: htroot/env/templates/jqueryheader.template htroot/yacysearchitem.java source/net/yacy/interaction/Interaction.java	13 years ago
cominch	e859481889	Add Triplestore settings functionality Conflicts: htroot/env/templates/header.template	13 years ago
cominch	1626be7916	Add menu entries for urlproxy / augmented browsing	13 years ago
Michael Peter Christen	5b25272f40	added location search to main menu	13 years ago
Michael Peter Christen	c846e9ca14	redesign of the crawler monitor page: show crawled pages instead of queue of urls that shall be crawled	13 years ago
Michael Peter Christen	0d32a766ed	relax verify attribute for search widget to make it faster: set to "cacheonly"	13 years ago
Michael Peter Christen	0ec2713af8	'download'	13 years ago
Michael Peter Christen	8c06925984	animation of the web structure picture	13 years ago
Michael Peter Christen	9ad1d8dde2	complete redesign of crawl queue monitoring: do not look at a ready-prepared crawl list but at the stacks of the domains that are stored for balanced crawling. This affects also the balancer since that does not need to prepare the pre-selected crawl list for monitoring. As a effect: - it is no more possible to see the correct order of next to-be-crawled links, since that depends on the actual state of the balancer stack the next time another url is requested for loading - the balancer works better since the next url can be selected according to the current situation and not according to a pre-selected order.	13 years ago

1 2 3 4 5 ...

259 Commits (823ae4d6a770c55bd3ea6bb26c69536a3fe9f5cf)