yacy_search_server

Commit Graph

Author	SHA1	Message	Date
reger	96ae332427	revert del _blank (last commit) in template	12 years ago
reger	43348a98a9	add some href target=_blank to ext. links with external icon	12 years ago
reger	070bf85b33	css fix for IE10 showing border on all img within <a /> tag since introduction of external link icon (commit `112836dcc9`)	12 years ago
Marc Nause	112836dcc9	Improved external links. ) image links will not be marked (if they have class "yacylogo" or "forceNoExternalIcon") ) external links in menu on left (and "fork me"-banner) will open in new tab/window now	12 years ago
Marc Nause	d64a094f0e	External links in HTML interface are marked as external with small icon. ) added new icon ) added CSS rules to mark all external links except search results (target="_self")	12 years ago
Roland Haeder	ebbb3bc5c1	Fixed CHMOD on many files + added missing loggers (e.g. jena) and made some noisy loggers quiet	12 years ago
orbiter	b4677d1cad	fix for bug #252 the naming of the servlet was wrong, the bug may not be present on systems where upper/lowercase matching is lazy (windows)	12 years ago
Michael Peter Christen	23fb458963	- fix to gsa searchresult answer in case that no query part is given - fix to gsa default number of results (is 'num')	12 years ago
Michael Peter Christen	7ee71c2354	changed administration page headline to 'admnistration'	12 years ago
Michael Peter Christen	5132bf719c	added new buttons to search result page in p2p mode which show the switch between p2p search and the 'stealth mode' which is simply a non-p2p search within the p2p network. The functionality was there all the time, but the switch to this was not very visible.	12 years ago
Michael Peter Christen	b4f0cac102	added the reindexing job servlet to the submenu structure	12 years ago
Michael Peter Christen	f965d04496	added new peer icons for Mentor peers and Mentee peers (not used yet)	12 years ago
Michael Peter Christen	1b102d98d8	- added index deletion to index administration submenu - added index deletion processes to the process scheduler/recorder	12 years ago
Michael Peter Christen	e4f7e5bcfe	fixed bad css change	12 years ago
Michael Peter Christen	25499eead5	- added a new field for the regular expression in crawl start - added the field in crawl profile - adopted logging end error management - adopted duplicate document detection - added a new rule to the indexing process to reject non-matching content - full redesign of the expert crawl start servlet The new filter field can now be seen in /CrawlStartExpert_p.html at Section "Document Filter", subsection item "Filter on Content of Document"	12 years ago
reger	40b3f2c5fe	comment out dead menue link	12 years ago
Michael Peter Christen	addba047e2	changes in ranking computation - an existing ranking servlet for solr was extended. It is now possible to set boost values for fields, boost functions and boost queries. - The ranking can have different instances, but currently only the first one is used - added an abstraction layer for fields which can be used for search and those fields can be edited in the solr ranking configruation - the ranking value from solr within the field score is used to combine remote search requests, which all are created using the same locally defined boost values - reduced the number of fields which are used for search (makes it faster) - replaced some text fields by string fields (makes indexing faster) - removed classes which had no use - made a large number of experiments for a better ranking and created a temporary setting which prefers hits inside titles - adjusted also the RWI-based ranking computation to 'prefer title' - made special cases like for portal search where no post-processing and post-ranking is wanted: this keeps the original ranking order as done by Solr - fixed many bugs with old settings for ranking	12 years ago
Michael Peter Christen	56d5946a59	- added flags in IndexFederated_p.html to switch on or off the webgraph index (new solr core webgraph) .. this is now off by default - completely redesigned this servlet - added description how to attach a remote solr - adjusted naming of servlet and menues - moved 'lazy initialization' attribut from IndexSchema to IndexFederated (this is a general option) back again.	12 years ago
Michael Peter Christen	788288eb9e	added the generation of 50 (!!) new solr field in the core 'webgraph'. The default schema uses only some of them and the resting search index has now the following properties: - webgraph size will have about 40 times as much entries as default index - the complete index size will increase and may be about the double size of current amount As testing showed, not much indexing performance is lost. The default index will be smaller (moved fields out of it); thus searching can be faster. The new index will cause that some old parts in YaCy can be removed, i.e. specialized webgraph data and the noload crawler. The new index will make it possible to: - search within link texts of linked but not indexed documents (about 20 times of document index in size!!) - get a very detailed link graph - enhance ranking using a complete link graph To get the full access to the new index, the API to solr has now two access points: one with attribute core=collection1 for the default search index and core=webgraph to the new webgraph search index. This is also avaiable for p2p operation but client access is not yet implemented.	12 years ago
Michael Peter Christen	b6de1f42dc	Full redesign of solr connection architecture. This was done to support multiple solr cores instead of just one. Therefore it is now necessary to distuingish between solr server connections (called an 'Instance') and a connection to a single solr core. One Instance may now have multiple connector classes assigned to it, each connecting to a single core. To support multiple cores it is also necessary to distinguish between the connection configuration and the configuration of the index schema. We will have multiple schema configurations in the future, each for every solr core. This caused that the IndexFederated servlet had to be split into two parts, the new Servlet for the Schema editor is now in the IndexSchema Servlet.	12 years ago
Michael Peter Christen	51e7ab4f70	moved bookmarks back to more prominent location (even if this does not fit to the 'Search Interfaces' headline)	12 years ago
reger	3b6e08b49f	prevent checking of urldb if empty - disconnect urlIndexFile if empty - add missing lock class in submenuSearchConfiguration	12 years ago
reger	f143804382	fix configuration for search page navigators - added additional config page (ConfigSearchPage_p) for easy setup of search page layout (to not overload ConfigPortal page) - currently redundant setting with part of ConfigPortal page - added missing config for filetype and protocol navigator - adjusted init of SearchEvent to check navigation config setting - renamed RankigProcess.getTopicNavigator to getTopics (to distiguish between added SearchEvent.getTopicNavigator)	12 years ago
Michael Peter Christen	8ae08a2cac	moved HTCache, Heuristics and Parser servlet to a more appropriate menu location	12 years ago
Michael Peter Christen	908ad2f174	Added a new servlet to configure the solr ranking using field boosts	12 years ago
Michael Peter Christen	a598fb6227	renamed Ranking_p.html to RankingRWI_p.html because there will be another Ranking servlet as well at next	12 years ago
Michael Peter Christen	074dfd297b	added icons and a selection for hosts with urls pending for crawler or with errors	13 years ago
Michael Peter Christen	4c4e0eece2	added new submenu 'Target Analysis' with three servlets which are useful to analyse the target servers: robots.txt table, mass target analysis and a regex tester	13 years ago
Michael Peter Christen	29fbbb49dc	better colors for host browser and corrected document count	13 years ago
Michael Peter Christen	51f420e4f5	removed location search because it is only working in special cases	13 years ago
Michael Peter Christen	d481abd087	added the visualization of error-urls to host browser - only visible for admins - a faceted search generates a huge list for all hosts in the host list - the faceted search algorithms had to be modified for that - within the browsing of the directory path, the error cause is written to the url which is presented as error-url - the errors are also accumulated for directory sums	13 years ago
Michael Peter Christen	a15819fbec	fix for some interface problems	13 years ago
Michael Peter Christen	64ac2b7b7d	new submenu template	13 years ago
Michael Peter Christen	5e77801aac	update to web interface structure	13 years ago
Michael Peter Christen	40df2fd193	added the host browser as link to search results. that means you can select a browsing position after a search is done on the search results.	13 years ago
Michael Peter Christen	ce3fed8882	added the Google Search Appliance (GSA) api interface to the main menu. See: https://developers.google.com/search-appliance/documentation/68/xml_reference#request_overview	13 years ago
Michael Peter Christen	3d3d654e88	if a network configuration is choosed which does not allow DHT and no P2P communication is in robinson mode) then some menu entries are disabled which have no use in this mode.	13 years ago
Michael Peter Christen	1baf498d59	- show more lines in online log - reverse order is default now	13 years ago
Michael Peter Christen	cc98496ff3	enhanced the HostBrowser: - showing also outbound links to other domains if there are any - the outbound links browser shows also the link structure image - showing even inbound links if the web structure graph has information about that - removed the left menu and made the HostBrowser a part of the top menu for search - moved the file search also to the top menu - added hover information in the HostBrowser to explain what the click means - because the HostBrowser also links to the Metadata viewer ViewFile, there should be a button to switch back to the HostBrowser: added that also.	13 years ago
Michael Peter Christen	abebb3b124	added a crawl start checker which makes a simple analysis on the list of all given urls: shows if the url can be loaded and if there is a robots and/or a sitemap.	13 years ago
Michael Peter Christen	941873fba4	moved the index deletion functions from IndexControlRWIs to IndexControlURLs where it appears more naturally. Because the RWI administration is less important in the presence of Solr, the IndexControlURL is now the default servlet when the Index Administration button on the main menu is selected.	13 years ago
orbiter	be4c96f3b1	The HostBrowser now offers to index files that are discovered because they are linked in the web interface.	13 years ago
Michael Peter Christen	c4a3d8870f	fixed computation of links in host browser which are not indexed but knwon by the crawler. Such links are now displayed in grey color.	13 years ago
Michael Peter Christen	97a47319c8	added nice links to the host browser: - click on the file icon to get the metadata of the file - click on the link icon behind the link to open the original file in the browser	13 years ago
Michael Peter Christen	f45f7fc12e	added new Host Browser to main menu: this new search interface is something completely new for search, but completely common on desktops: browser a web space like one would browse a file system in a file browser. The file listing is created using the search index and a faceted restriction to specific domains.	13 years ago
Michael Peter Christen	00c1c777fa	refactoring	13 years ago
Michael Peter Christen	a30653a864	added a regular expression test servlet which is linked within the parser/crawler error page whenever a problem with regular expression occurs. This makes it easy to correct and enhance the must-match and must-not-match patterns just by trying out which pattern could be correct.	13 years ago
Michael Peter Christen	4b36a2c3b4	small style changes	13 years ago
Michael Peter Christen	174530a9e0	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
apfelmaennchen	43f3a932fd	removed jquery.slider as it is already included as part of jquery-ui package	13 years ago

1 2 3 4 5 ...

465 Commits (697613170daa8f1c6b6c0464631ec84a20b3a4ea)