yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	00c1c777fa	refactoring	13 years ago
orbiter	563d584420	removed more dependencies in cora from kelondro	13 years ago
orbiter	63762d8f89	removed kelondro dependencies from cora	13 years ago
orbiter	089a03114e	full memory usage for debian and when changing the size: debian seems to dislike the big difference between xmx and xms (I have crashes here which stop if both values are same)	13 years ago
orbiter	60b1e23f05	added new crawl options: - indexUrlMustMatch and indexUrlMustNotMatch which can be used to select loaded pages for indexing. Default patterns are in such a way that all loaded pages are also indexed (as before) but when doing an expert crawl start, then the user may select only specific urls to be indexed. - crawlerNoDepthLimitMatch is a new pattern that can be used to remove the crawl depth limitation. This filter a never-match by default (which causes that the depth is used) but the user can select paths which will be loaded completely even if a crawl depth is reached.	13 years ago
Michael Peter Christen	6ec02deec6	added new crawl attributes in crawl profile (not active yet)	13 years ago
Michael Peter Christen	a13e5153ac	- added the possibility to have not one but a list of crawl start urls - the list of urls is entered in the expert crawl start in a textfield; the one-line input field was replaced with a text box - start urls can also be given in one single line where the urls are separated by a '\|'-character - as an effect, the crawl profile cannot carry a single start url for identificaton because it is possible to have more. Therefore the url was removed from the crawl profile - this affect all servlets which display a crawl profile: removed the url field from all there servlets - to work consistently with several start urls and the other crawl starts which computed crawl start url lists from sitelists or sitemaps, the crawl start servlet was restructured completely - new rules for must-match patterns were created to make it possible that site crawl starts also work with several crawl starts at once	13 years ago
Michael Peter Christen	975bc95ddf	added default facet fields for json response format (stub)	13 years ago
Michael Peter Christen	2f218df55d	added missing license headers	13 years ago
Michael Peter Christen	a30653a864	added a regular expression test servlet which is linked within the parser/crawler error page whenever a problem with regular expression occurs. This makes it easy to correct and enhance the must-match and must-not-match patterns just by trying out which pattern could be correct.	13 years ago
Michael Peter Christen	0504b01bdc	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
orbiter	a55e77a115	added twitter search heuristic	13 years ago
Michael Peter Christen	e54ac38095	- some corrections in usage of getFile() and getFileName() - added more attributes in json response writer according to yacy servlet	13 years ago
Michael Peter Christen	9644c186a4	added search functionality to ViewFile.html servlet	13 years ago
Michael Peter Christen	b69ed96f0b	- added collections to yacydoc - changed yacydoc.htm to yacydoc.json - added query logging in solr and gsa search result	13 years ago
Michael Peter Christen	5df553c152	- added a json writer for solr (yes there was one using xslt but this one writes the same way as yacysearch.json) - using the new json solr result to change the ajax search in IndexControlURLs to the new solr search	13 years ago
Michael Peter Christen	4d29f59a27	removed warnings	13 years ago
Michael Peter Christen	8c099d2106	Merge remote-tracking branch 'origin/master' Conflicts: htroot/api/ymarks/import_ymark.java source/de/anomic/data/ymark/YMarkEntry.java source/de/anomic/data/ymark/YMarkTables.java	13 years ago
apfelmaennchen	59bd478ed1	Added more sophisticated RDF output for YMarks, including the folder structure (b:Topic) and support for multiple tags (dc:subject) and folders (b:hasTopic) via rdf:Bag container.	13 years ago
apfelmaennchen	d31a632951	- added dmoz RDF dump importer - added indexing to Tables columns to support larger bookmark collections - added RDF output (HTTP) for public bookmarks at /YMarks.rdf - YMarkRDF also provides a Jena RDF Model as "internal" API - various other changes/fixes for YMarks (mainly backend)	13 years ago
orbiter	66ac4076c2	added disjunction '\|' option to site parameter in GSA API	13 years ago
sixcooler	9ee2e09983	statistics for solr-cache	13 years ago
Michael Peter Christen	d8425e6809	added collections to crawl monitor	13 years ago
Michael Peter Christen	4b36a2c3b4	small style changes	13 years ago
Michael Peter Christen	8ca842b137	added new button design to more buttons	13 years ago
Michael Peter Christen	b2b516cc3e	added a collection attribute to crawls and searches: - a solr field collection_sxt can be used to store a set of crawl tags - when this field is activated, a crawl tag can be assigned when crawls are started - the content of the collection field can be comma-separated, all of them are assigned to the documents when they are indexed as result of such a crawl start - a search result can be drilled down to a specific collection; this is currently only available in the solr interface and also in the gsa interface using the 'site' option - this adds a mandatory field for gsa queries (the google api demands that field all the time)	13 years ago
Michael Peter Christen	174530a9e0	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
apfelmaennchen	43f3a932fd	removed jquery.slider as it is already included as part of jquery-ui package	13 years ago
apfelmaennchen	a01eb1b7fe	removed unused jquery plugin slider as it is part of jquery-ui package	13 years ago
Michael Peter Christen	f75b3f8a47	added more patches to work without RWI data structure	13 years ago
Michael Peter Christen	a427a68bac	removed many warnings	13 years ago
Michael Peter Christen	c72c435517	- moved the gsa search interface from /gsa/searchresult? to /gsa/search? - fixed the NB field data	13 years ago
Michael Peter Christen	31d4d38804	- extended the solr interface by a references-by-word-count method - reduced danger that a non-existing RWI database causes NPEs - added Solr queries to did-you-mean: this makes it possible that our did-you-mean algorithm works together with only Solr and without RWIs	13 years ago
Michael Peter Christen	528d6763fa	- added new solr fields: title_count_i, title_chars_val, title_words_val description_count_i, description_chars_val, description_words_val - added many asserts to ensure data type correctness from YaCy to Solr and vice versa - made many fixes according to new findings from these asserts (!)	13 years ago
Michael Peter Christen	3142e675e8	fixed problems with GSA api: - better FS attribute - highlightning of searched words in title	13 years ago
Michael Peter Christen	3b19fe7b52	- fixed num parameter in GSA api - changed FS attribute in GSA api	13 years ago
Michael Peter Christen	75d5e3475d	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
cominch	dc468dad01	add content control features for custom filter lists	13 years ago
Michael Peter Christen	316b5fe116	- added a solr type definition verifier - fixed type definition found by the verifier - added multivalue-string fields for solr with extension 'sxt' - added multivalue-integer fields for solr with extension 'val' - renamed some solr attributes from txt to sxt - changed solr query line to an explicit AND/OR structure - added a country code second level domain list to Domains class; with parser - added a host string parser to get domain class name, country-code second-level domain and subdomain out of it - removed old coordinate attributes	13 years ago
reger	2d2be546fe	fix path to env/grafics to display api icon on meta data page	13 years ago
orbiter	7ac259477f	added a direct access to solr search api to enhance the visibility if the embedded solr	13 years ago
orbiter	67f2866cd0	small fixes	13 years ago
orbiter	479bfca571	refctoring	13 years ago
Michael Peter Christen	48a82bc705	log queries anonymous from gsa+solr requests	13 years ago
Michael Peter Christen	ab6ec4ec52	added snippet computation to solr/rss and gsa result writer	13 years ago
Michael Peter Christen	4716546ef5	- reduced memory usage in index transmission using a transformation of Node to Row objects - removed peerDeparture in solr remote search in case that peer does not answer (this may be normal because it is allowed to switch this off)	13 years ago
Michael Peter Christen	0ad52ac4c3	gsa bugfix for date parser	13 years ago
Michael Peter Christen	3ce4c2f937	fixes for gsa result format	13 years ago
Michael Peter Christen	2d5fdfeb65	added authorization-based maximum results limitation to solr and gsa search	13 years ago
Michael Peter Christen	6fc5400f91	added a tooltip for search navigation to mention that search pages can be navigated using the TAB key	13 years ago
Michael Peter Christen	a06123aec6	more abstraction and less parameter overhead for remote search	13 years ago
Michael Peter Christen	f00733186b	code simplifications	13 years ago
orbiter	780f8974e7	added ramaining iteration methods for solr in fulltext class	13 years ago
orbiter	6f01542aaa	explicit double-check in transferURL	13 years ago
Michael Peter Christen	d54b80327a	refactoring	13 years ago
Michael Peter Christen	0cab06c47c	refactoring	13 years ago
Michael Peter Christen	40c0856489	refactoring	13 years ago
Michael Peter Christen	e651d3e320	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
Michael Peter Christen	06a78eecb7	code simplification	13 years ago
cominch	8a91f4fa42	local robots.txt: disallow external crawlers to follow the URL proxy	13 years ago
Michael Peter Christen	18f989dfb1	- refactoring (load -> getMetadata) - added getDocument to retrieve Solr documents which shall replace getMetadata	13 years ago
Michael Peter Christen	6197caf698	added clear-text search words in query params	13 years ago
Michael Peter Christen	23226676c6	FOR THE BRAVE.. this is a forced migration to solr which is now ready for production as a replacement of the metadata-db. This intermediate release 1.041 will switch on the previously optional solr index and the old metadata-db will still work as it did before. Solr+metadata are accessed in mixed mode, no migration is done yet. If this causes not a catastrophe until the end of the weekend, we will do a YaCy 1.1 main release containing this as default.	13 years ago
Michael Peter Christen	7c31be1c80	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
cominch	6456a1656a	changed local robots.txt to prevent external crawlers to submit random search queries	13 years ago
Michael Peter Christen	703f427303	fixed some peer-ping connection details - larger time-out - removed too old seedlist - fixed a bug in connection test	13 years ago
Michael Peter Christen	597bb76e4f	get the peer location more quickly	13 years ago
orbiter	156d457aec	fix for Index out of bounds exception in Network servlet	13 years ago
Lotus	ae9cd7a118	fix xss bug #204	13 years ago
Michael Peter Christen	d988ba50cf	added a very rudimentary, incomplete, non-verified GSA response writer for solr. Try this: http://localhost:8090/gsa/searchresult?q=pdf&site=col1&num=10	13 years ago
Michael Peter Christen	aab0b680c3	- added xslt support for solr result formats. try i.e. http://localhost:8090/solr/select?q=:&start=0&rows=10&wt=xslt&tr=json.xsl - added servlet-side mime-type configuration for streamed servlets. this is used for the result formatters in solr result formats	13 years ago
cominch	ad62609ec7	added a possibility to define a custom network definition URL for remote management	13 years ago
cominch	fb0f430685	Merge remote-tracking branch 'original yacy/master'	13 years ago
Michael Peter Christen	b51df6c7e8	- added coordinate storage in solr schema - fixed shutdown process - fixed some solr-to-metadata reading - added a large number of metadata attributes in ViewFile.html	13 years ago
orbiter	9b88433f45	patch from hint in http://forum.yacy-websuche.de/viewtopic.php?p=26858#p26858 from gaston	13 years ago
orbiter	e816b88b55	changed behaviour of metadata storage: in case that any solr is attached, the metadata is not written to the metadata-db, even if it is enabled but instead to solr. This prevents that metadata is written in two store systems at the same time. It is also the next step to migrate the current metadata-db to solr.	13 years ago
Michael Peter Christen	f9c0e6e950	- Implemented and integrated the URIMetadataNode object which is a metadata representation from the solr index. This shall replace metadata from the built-in database in the future. - added the Solr-driven metadata into the search index of YaCy which makes it now possible to run YaCy without the old metadata index. This is a major stept forward to a full migration to Solr.	13 years ago
Michael Peter Christen	b2b480fff2	more abstraction of the YaCySchema -> Opensearch matching process	13 years ago
Michael Peter Christen	73f6d69d03	more abstraction for solr query params parsing	13 years ago
Michael Peter Christen	24462e9baa	set the title every time, it is possible that it has changed	13 years ago
Michael Peter Christen	136fcb1ad9	refactoring	13 years ago
Michael Peter Christen	a12f693ec9	added two response writer for embedded solr interface: a rss/opensearch writer and an enhanced solr xml writer. The enhanced solr writer has less configuration overhead than the original writer and should by slightly faster. The rss/opensearch writer is at this time slightly incomplete compared with the already existing rss search result form YaCy and also snippets are missing at this time. To test the new interface, open for example: http://localhost:8090/solr/select?wt=rss&q=olympia The wt-code for the new result writers are= wt=rss for opensearch wt=exml for the enhanced solr xml writer. Additionally, the SRU search parameters had been added to the solr interface which can now also be used for a normal solr/xml search.	13 years ago
orbiter	67edfd991c	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
orbiter	d9173ba7ed	added more solr fields to integrate values from URIMetadataRow. All writings to the Metadata-DB are now also done to solr. This includes metadata transfer during search and rwi transfer. The new/added solr fields are: ## time when resource was loaded load_date_dt ## date until resource shall be considered as fresh fresh_date_dt ## id of the host, a 6-byte hash that is part of the document id host_id_s ## ids of referrer to this document referrer_id_ss ## the md5 of the raw source md5_s ## the name of the publisher of the document publisher_t ## the language used in the document; starts with primary language language_ss ## an external ranking value ranking_i ## the size of the raw source size_i ## number of links to audio resources audiolinkscount_i ## number of links to video resources videolinkscount_i ## number of links to application resources applinkscount_i	13 years ago
Michael Peter Christen	70b10e8316	added the JSON response writer to solr interface, add &wt=json to the servlet GET properties to use this format	13 years ago
Michael Peter Christen	8d944f6517	nowrap from gaston in forum http://forum.yacy-websuche.de/viewtopic.php?p=26815#p26815	13 years ago
Michael Peter Christen	24d9db1613	snippet retrieval loading processes may use a smaller minimum load time value than crawling processes. This speeds up the search result preparation dramatically.	13 years ago
Michael Peter Christen	1687737771	Abstraction of HandleMap and HandleSet	13 years ago
Michael Peter Christen	3bcd9d622b	cleaned up classes and methods which are either superfluous at this time or will be superfluous or subject of complete redesign after the migration to solr. Removing these things now will make the transition to solr more simple.	13 years ago
Michael Peter Christen	6f1ddb2519	Moved solr index-add method to the same method where the YaCy index is written. Also done some code-cleanup.	13 years ago
Michael Peter Christen	315d83cfa0	cleanup	13 years ago
Michael Peter Christen	76202f068e	extended abstraction of local and remote solr index using one front-end for index administration and querying.	13 years ago
Michael Peter Christen	7ec7341f60	added user-authentication protection to solr search (same as implemented for yacysearch)	13 years ago
Michael Peter Christen	e2a97ef8f6	better explain how to access the embedded solr	13 years ago
Michael Peter Christen	826967513b	changed options in IndexFederated_p to switch on/off parts of the index individually. The settings are experimental and the values of the settings will be overwritten when an index migration from urldb to solr starts.	13 years ago
Michael Peter Christen	cba4ab862e	fix for http://bugs.yacy.net/view.php?id=202	13 years ago
reger	36c9875b6e	removed localized number formatting from num-results_totalcount response (this is only used in xml and json where localized format is not valid)	13 years ago
orbiter	69e743d9e3	- more abstraction for the RWI index as preparation for solr integration - added options in search index to switch parts of the index on or off	13 years ago
orbiter	6cc5d1094e	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
orbiter	05a3ffd03a	patches to ensure that solr connectors are active ony if they have a solr object assigned and vice versa	13 years ago
orbiter	5a3c829872	embedded solr is only initiated if it is activated with IndexFederated_p.html	13 years ago
Lotus	3a350a2f83	partial html fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4454	13 years ago
Michael Peter Christen	97b7bcf2a6	added a solr search index - by default, a (empty) solr storage instance is created at SEGMENTS/solr_36 - the index is written if in /IndexFederated_p.html the flag "embedded solr search index" is switched on - a standard solr query interface is available now with a new servlet at http://127.0.0.1:8090/solr/select To test this, do the following: - switch to webportal mode - switch on the feature as described - do a crawl. this fills the solr index. The normal YaCy search will NOT work now! - do a solr query, like: http://127.0.0.1:8090/solr/select?q=: http://127.0.0.1:8090/solr/select?q=text_t:Help play with different search fields as you can see in /IndexFederated_p.html You can use the standard solr query attributes as described in http://wiki.apache.org/solr/SearchHandler	13 years ago
Michael Peter Christen	f78ce93a80	collection of speed and memory saving hacks	13 years ago
orbiter	c00a3cf74d	less usage of generic logger to avoid logger generation overhead	13 years ago
orbiter	e76159040b	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
orbiter	bbfa497a3c	replaced more size() > 0 by !isEmpty()	13 years ago
Michael Peter Christen	e3aa05b9dd	added creation of subpath pattern when crawl start is 'from file'	13 years ago
orbiter	0cbda0b2b8	- replaced all length() == 0 and size() == 0 with isEmpty() - replaced some length() > 0 and size() > 0 with !isEmpty() - cannot be done automatically - implemented some isEmpty() methods	13 years ago
Roland 'Quix0r' Haeder	aef9dd0350	- removed cleaning of blacklist cache on startup - added cleaning of blacklist cache if cache is modified in interface - extended cache saving to all cache types - moved cache location to DATA/LISTS - fixed static file path which was relative to the application path but should be relative to data path - which is different in debian and mac implementations	13 years ago
orbiter	c7afa8bc48	using SwitchboardConstants for solr attributes	13 years ago
orbiter	62202e2d71	refactoring of query attribute variable names for better consistency with (next) stored query words	13 years ago
Michael Peter Christen	91f14ea38e	fix to solr configuration (case where the external solr was not online)	13 years ago
sixcooler	2c5b68d932	more abstraction of error message	13 years ago
Michael Peter Christen	9758c521ab	abstraction of error message	13 years ago
sixcooler	9b6e4e46ca	fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=4430	13 years ago
Michael Peter Christen	b0c408788b	made class methods static where possible	13 years ago
Michael Peter Christen	5bd3c90907	- removed unnecessary semicolons - added default case for switch	13 years ago
Michael Peter Christen	7c1ba99755	removed more unused method parameters	13 years ago
Michael Peter Christen	0301aba1e9	removed unused method parameters	13 years ago
Michael Peter Christen	241dd8410a	removed snippet pattern filter - it was not used	13 years ago
Michael Peter Christen	d3964253ae	- added @SuppressWarnings to unused servlet method parameters - removed unnecessary casts - removed unnecessary throw statements	13 years ago
Michael Peter Christen	ea10766bfd	cleaned unnecessary nested code	13 years ago
orbiter	78fc3cf8f8	refactoring and new usage of SentenceReader: this class appeared as one of the major CPU users during snippet verification. The class was not efficient for two reasons: - it used a too complex input stream; generated from sources and UTF8 byte-conversions. The BufferedReader applied a strong overhead. - to feed data into the SentenceReader, multiple toString/getBytes had been applied until a buffered Reader from an input stream was possible. These superfluous conversions had been removed. - the best source for the Sentence Reader is a String. Therefore the production of Strings had been forced inside the Document class.	13 years ago
Michael Peter Christen	276a66a793	Adding a limit of 1000 links that a parser shall store during indexing. A limit was necessary because some web pages have such huge numbers of links that it can easily cause a OOM just by the number of links. The quesion if the number of 1000 links is sufficient or too weak must be answered with the result of testing this feature.	13 years ago
Michael Peter Christen	1825f165b8	better integration of blacklist according to use case	13 years ago
Michael Peter Christen	c18fa9fa75	Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1	13 years ago
Michael Peter Christen	ce8d4b87d9	fixes for new eclipse 'Juno' warning 'Resource leak'.	13 years ago
reger	067728bccc	add search result heuristic. adding a crawl job with depth-1 for every displayed search result (crawling every external linked page of displayed search result pages)	13 years ago
Michael Peter Christen	03280fb161	removed segments-concept and the Segments class: the segments had been there to create a tenant-infrastructure but were never be used since that was all much too complex. There will be a replacement using a solr navigation using a segment field in the search index.	13 years ago
Michael Peter Christen	9116013c64	- allow lazy initialization of solr value (if using 'lazy', then no 0-values and no empty strings are written). This may save a lot of memory (in ram and on disc) if excessive 0-values or empty strings appear) - do not allow default boolean values for checkboxes because that does not make sense: browsers may omit the checkbox attribute name if the box is not checked. A default value 'true' would not comply with the semantic of the browsers response. - add a checkbox in IndexFederated_p for the lazy initialization of solr fields.	13 years ago
cominch	e6792ed37d	Merge remote-tracking branch 'original yacy/master'	13 years ago
Michael Peter Christen	96aeb127e3	generalized localhost naming. this is also a preparation for a better IPv6 implementation.	13 years ago
Michael Peter Christen	77f795756c	fixing redirects and status codes: storing of status code in ResponseHeader to make it available for late evaluations, like storage in solr.	13 years ago
Michael Peter Christen	8dd469b9dd	added option to configure the autocommit delay time of solr on-the-fly	13 years ago
Michael Peter Christen	b9dfca4b0a	- fixed IndexFederated Servlet / a embedded Solr can now be selected - added code stub for an embedded Solr but generation of Solr store is still commented out (it works but is not yet ready for usage)	13 years ago
Michael Peter Christen	fad3b14813	added jetty libraries, needed for future use as web server and as application server for the solr search interface	13 years ago
Michael Peter Christen	b9d42fd9c8	using com.google.common.io.Files instead of homebrew methods	13 years ago
Michael Peter Christen	a5eb91fa60	refactoring	13 years ago
cominch	c1ba58ae51	Augmented browsing: Small CSS fix	13 years ago
cominch	b2b205aa38	Augmented browsing: small js fix	13 years ago
cominch	dc9ee0cdb3	Augmented browsing: CSS fix	13 years ago
cominch	74fcc6f8c5	Augmented browsing: small UI modifications	13 years ago
cominch	c63c3a4495	Show additional interaction elements in footer section on each page, if activated in ConfigPortal.html. This footer is also visible in augmented browsing proxy mode.	13 years ago
cominch	fa98657bb3	Augmented Browsing: changed the settings page	13 years ago
cominch	751eeade0d	Merge remote-tracking branch 'original yacy/master'	13 years ago
cominch	84a11ec48c	Corrected loading of default page settings on ConfigPortal.html	13 years ago
sixcooler	bea002dc15	correct table in new look of Crawler_p	13 years ago
Michael Peter Christen	8738336408	set Xms lower than Xmx	13 years ago
cominch	6b4545d6b0	Only load tag information if necessary	13 years ago
cominch	011f8a5818	Auto Tagging: Add hyperlinks to tags (provisional)	13 years ago
Michael Peter Christen	1d4e206b2b	bugfix in vocabulary generation	13 years ago
cominch	2c89975378	Merge remote-tracking branch 'original yacy/master'	13 years ago
cominch	71047fe63a	Augmented browsing: CSS fix	13 years ago
Michael Peter Christen	52f5d40043	better abstraction of document model generation	13 years ago
Michael Peter Christen	8b7c4d3144	produce a rdf output containing the triplestore with yacydoc; ie: http://localhost:8090/api/yacydoc.rdf?urlhash=yOiCM7Fh1hyQ	13 years ago
cominch	f7160dae5c	Merge remote-tracking branch 'original yacy/master'	13 years ago
cominch	e4555cbee3	Augmented browsing: Pass on additional action parameter	13 years ago
Michael Peter Christen	24bbe359ca	integrate also geonames library files for less cities. these are more useful for tagging since less normal words are false-identified as location	13 years ago
Michael Peter Christen	5a41e739b4	better apilink description	13 years ago
Michael Peter Christen	e16e4bd2ba	added ontology extraction in xml as api call for vocabularies	13 years ago
cominch	8cf47a8335	Merge remote-tracking branch 'original yacy/master'	13 years ago
cominch	b85f01a14e	Augmented browsing: small UI fix	13 years ago
Michael Peter Christen	26cb1c65c2	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Conflicts: source/net/yacy/document/importer/OAIPMHLoader.java	13 years ago
Michael Peter Christen	963f92ed9a	- merged files - changed behaviour of delete button in vocabulary edit - fixed size numbe in vocabulary listing	13 years ago
cominch	d8815db877	Merge remote-tracking branch 'original yacy/master'	13 years ago
cominch	e4dab19045	Augmented Browsing: added template for document info bar	13 years ago
Michael Peter Christen	743b0ec89f	- added size of vocabulary to vocabulary view - fixed bad terms in vocabulary-from-titles autogeneration	13 years ago
Michael Peter Christen	22d5e33c5e	added more methods to vocabulary generation: scrape document title and document author to vocabulary	13 years ago
Michael Peter Christen	b2d1c25ebb	removed warnings/unused entities	13 years ago
Michael Peter Christen	f1aa4c4390	- accept only location names wit a minimum length - remove comma from synonym terms	13 years ago
Michael Peter Christen	cc9ad7198a	- use only names which consists of at least two parts - remove word from derewo from locations	13 years ago
Michael Peter Christen	9264d8b4af	removed old navigation practice using subject tags in favor of triplestore-tags	13 years ago
Michael Peter Christen	eeb4fd8b8c	refactoring (geolocalzation -> geolocation)	13 years ago
Michael Peter Christen	64c0268b2b	show triplestore metadata in yacydoc and viewfile	13 years ago
Michael Peter Christen	c2f0d16d2c	fixed vocabulary initialization	13 years ago
Michael Peter Christen	fbded1f466	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
Michael Peter Christen	df3531f8d5	added the generation of virtual vocabularies using the pnd	13 years ago
Michael Peter Christen	e806106b10	jquery bugfix	13 years ago
Michael Peter Christen	a0f1decd82	- added loading of the dbpedia pnd triplestore in the dictionary loader - renamed the dictionary loader to knowledge loader - some refactoring in the library provider method names	13 years ago
Michael Peter Christen	6d17686258	made triplestore persistent by default added a size display in triplestore servlet	13 years ago
Michael Peter Christen	8d6e77ad0c	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
cominch	2ac7a5c1f2	Augmented browsing: Add overlay bar which shows the vocabulary tags	13 years ago
Michael Peter Christen	777d22e145	renamed "augmented proxy" to "augmented browsing"	13 years ago
cominch	bddac2839e	add missing files for tag display	13 years ago
cominch	441430f507	Merge remote-tracking branch 'original yacy/master'	13 years ago
cominch	3c255c025b	Show tags in search results (if activated in ConfigPortal_p.html)	13 years ago
Michael Peter Christen	1f9120d189	create new vocabularies also without an objectspace. this creates an empty vocabulary	13 years ago
Michael Peter Christen	a5cdfb91de	- fixed Cache link (below snippet) - added 'Augmented Proxy' link below snippet - added configuration options for augmented proxy	13 years ago
Michael Peter Christen	492b3e09f2	added api icon to triplestore	13 years ago
Michael Peter Christen	16d8f33795	added objectlink generation to vocabulary generation and editor	13 years ago
Michael Peter Christen	f1f97b7c95	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	13 years ago
Michael Peter Christen	b3eaaf5ebc	check also delete triplestore by default	13 years ago
cominch	f2f07a11f1	hotfix for unresolved pattern	13 years ago
cominch	5fd1a15fcf	hotfix until we have updated query routine for tags	13 years ago
cominch	f49d92d8da	Cleanup of interaction class and helper routines	13 years ago
cominch	56b0115054	Triplestore: modify routines to access per user store	13 years ago
Michael Peter Christen	d45718251e	refactoring (Localization -> Location)	13 years ago
Michael Peter Christen	b8b3c87ba7	- renamed localization to location (that was confusing) - renamed 'Locale' navigator to 'Location' - produce Location navigation only if geolocation libraries are loaded	13 years ago
sixcooler	f64e78497a	fix for reload-feature in Crawler_p	13 years ago
Michael Peter Christen	e89747bb67	- added automated generation of vocabularies from url stubs - added clear of all terms for vocabularies - added deletion of vocabularies	13 years ago
Michael Peter Christen	79464189a4	The 'Locale' vocabulary, which is generated by geo data, has now the objectspace "http://dbpedia.org/resource/"	13 years ago
Michael Peter Christen	eca38c53e7	added a vocabulary editor	13 years ago
Michael Peter Christen	80e8aaabc8	moved new servlets into one submenu "Content Semantic"	13 years ago
Michael Peter Christen	2bbb6c52cf	added option to clean the triplestore when deleting the index	13 years ago
Michael Peter Christen	8b53771db2	changed behavior of navigation processing: - vocabulary annotation is not done any more into the metadata of urldb - vocabularies are written into the jena triplestore using a rdf vocabulary - vocabularies for rdf tripel must be updated; refactoring done - with the new navigation tags in the triplestore a faster pre-urldb-lookup is possible: navigation is processed now within the RWI during pre-ranking retrieval - added also a Owl vocabulary stub to add the plain-text url to the triplestore using the owl:sameas predicate	13 years ago
Michael Peter Christen	5fc6524ca8	- moved triple store to net.yacy.cora.lod (should be generalized there later - added abstract add, delete, get methods in the triplestore - added generation of triples after auto-annotation - migrated all MultiProtocolURI objects to DigestURI in the parser since the url hash is needed as subject value in the triples in the triple store	13 years ago
cominch	c90f174799	preparation and generalization of augmented browsing methods	13 years ago
Roland 'Quix0r' Haeder	edaa09b9b1	Rewrote all String blacklist types to enum 'BlacklistType', closes bug #143 Conflicts: htroot/Supporter.java htroot/yacy/crawlReceipt.java htroot/yacy/transferRWI.java htroot/yacy/transferURL.java source/de/anomic/crawler/CrawlStacker.java source/de/anomic/data/ListManager.java source/net/yacy/peers/Protocol.java source/net/yacy/repository/Blacklist.java source/net/yacy/repository/LoaderDispatcher.java source/net/yacy/search/Switchboard.java source/net/yacy/search/index/MetadataRepository.java source/net/yacy/search/index/Segment.java source/net/yacy/search/query/RWIProcess.java source/net/yacy/search/snippet/MediaSnippet.java	13 years ago
Roland 'Quix0r' Haeder	213f006bf1	One is okay ... Conflicts: htroot/Trails.html	13 years ago
Roland 'Quix0r' Haeder	af5a597e47	Scroogle is not comming back, remove dead code Conflicts: source/net/yacy/search/Switchboard.java	13 years ago
cominch	7a4dab6d1d	- removed unused variables - do not replace malformed or invalid URLs in urlproxy git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@7835 6c8d7289-2bf4-0310-a012-ef5d649a1542 Conflicts: source/de/anomic/http/server/HTTPDFileHandler.java	13 years ago
Michael Peter Christen	90c6fc4b63	load all - but not the persistent local.rdf - triples from DATA/TRIPLESTORE at startup time. The local.rdf is loaded only if the persistent switch is on (as before).	13 years ago
Michael Peter Christen	a9eb40c160	fix for autocomplete in index.html	13 years ago
Michael Peter Christen	dd020a1a8a	removed autocrawler and feedback servlet link since that was not cherry-picked	13 years ago
cominch	aa0295917c	augmentation Conflicts: source/net/yacy/interaction/AugmentHtmlStream.java	13 years ago
cominch	87a3fbb3c2	interaction javascript	13 years ago
cominch	ed2ea0f08e	augmented browsing modification Conflicts: htroot/interaction/OverlayInteraction.html source/net/yacy/interaction/AugmentHtmlStream.java	13 years ago
cominch	d4802dc8d5	small change	13 years ago
cominch	a120ef660b	RDF demo servlet	13 years ago
cominch	09a34cfe1b	prepare RDF dump routines	13 years ago
cominch	300b235ce8	Updated Demo Servlet Conflicts: htroot/About.html htroot/DemoServlet.html htroot/DemoServlet.java htroot/interaction/interaction.js source/net/yacy/interaction/Interaction.java	13 years ago
cominch	90512640bf	Added config switches for custom parser Conflicts: source/net/yacy/document/TextParser.java	13 years ago
cominch	a12cbcba36	Add a global value store	13 years ago
cominch	e14f2881ae	interaction: add special table interaction Conflicts: source/net/yacy/interaction/Interaction.java	13 years ago
cominch	4e4e7a99f8	interaction: add global variable store Conflicts: source/net/yacy/interaction/Interaction.java	13 years ago
cominch	bde07ed7a8	Add tagging overlay element Conflicts: htroot/env/templates/jqueryheader.template htroot/yacysearchitem.java source/net/yacy/interaction/Interaction.java	13 years ago
cominch	bee3bee8f3	Small fix - return value of JSON should be empty	13 years ago
cominch	ff4ba3ee05	Small fix Conflicts: htroot/yacysearchitem.java	13 years ago
cominch	f05e3968f7	Quick fix	13 years ago
cominch	e859481889	Add Triplestore settings functionality Conflicts: htroot/env/templates/header.template	13 years ago
cominch	b0bc0b4572	Add new demonstration module for client-side key-value store (backend: triplestore): /DemoServletInteraction.html Conflicts: source/net/yacy/interaction/Interaction.java	13 years ago
cominch	c9dc6cda02	Demonstration: include value from interaction in search results Conflicts: htroot/interaction/OverlayInteraction.html htroot/yacysearchitem.java	13 years ago
cominch	ae8adb0e58	Small changes	13 years ago
cominch	bcbd8eee33	Add several parsers, for RDFa and rdf files. Conflicts: source/net/yacy/document/TextParser.java	13 years ago
cominch	9ef5a80f4e	add interaction for triples and selector for augmented browsing Conflicts: htroot/interaction/interaction.js source/net/yacy/interaction/Interaction.java	13 years ago
cominch	5d20cd324a	Add Triplestore and RDF query interface Conflicts: build.xml defaults/yacy.init source/net/yacy/interaction/AugmentHtmlStream.java	13 years ago
cominch	bc9a618e0a	augmented browsing: ignore js and css, integrate more user interaction Conflicts: htroot/interaction/Footer.html source/net/yacy/interaction/AugmentHtmlStream.java	13 years ago
cominch	9cbfc1a1c0	augmentedProxy, which forwards every proxy request to a rewrite engine to customize existing webpages. originally implemented by Florian Richter. Conflicts: source/de/anomic/http/server/HTTPDProxyHandler.java	13 years ago
cominch	1626be7916	Add menu entries for urlproxy / augmented browsing	13 years ago
Michael Peter Christen	5b25272f40	added location search to main menu	13 years ago
Michael Peter Christen	ea0dceb55d	bugfix: do not switch off standard memory strategy when performing a forced GC PLEASE CHECK if your peer has standard memory switched on!	13 years ago
Michael Peter Christen	dd14b19c26	lazy initialization of block rank table ... only normal web search uses this. When interactive search or location search is used, the block rank is switched off	13 years ago
Michael Peter Christen	701b9a28a0	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Conflicts: htroot/PerformanceMemory_p.java	13 years ago
Michael Peter Christen	ab7107b34b	fixed RWIProcess queue limits: now discovering hidden results for mass result retrieval	13 years ago
Michael Peter Christen	10c9c17d51	fixed handlemap spread factor and null iterator handling	13 years ago
Michael Peter Christen	a61f44f9e4	lazy initialization of block rank table. this causes that the table is not initialized when there is no search is done. the effect is most strong if YaCy is started headless which causes no browser pop-up which otherwise would load the search page and therefore trigger the initialization of the table.	13 years ago
Michael Peter Christen	c8bbd180e4	enhanced hint for debian package automatic update	13 years ago
Michael Peter Christen	9ad84c5e9f	fix for NPE in PerformanceMemory	13 years ago
Michael Peter Christen	96e9d77270	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git Conflicts: source/net/yacy/cora/sorting/WeakPriorityBlockingQueue.java	13 years ago

... 3 4 5 6 7 ...

4191 Commits (3bc5ee6e3d0d707c477521b27ca4f85633aa5179)