yacy_search_server

Commit Graph

Author	SHA1	Message	Date
reger	c454ef69c6	add shortMemory check to heuristic search and skip operation on shortMemory (no request to remote openserch systems)	10 years ago
reger	11b21308c0	fix: malformed filename in image search fix for http://mantis.tokeek.de/view.php?id=533	10 years ago
reger	9e1ec5fec4	refactor: just some more useages of constant for term ":[* TO *]"	10 years ago
reger	8c491f51a5	remove hardcoded initialization of language nav if not used	10 years ago
Marc Nause	a311c97c9b	Added & in start script for *NIX which was lost a few commits ago.	10 years ago
Michael Peter Christen	b5ac29c9a5	added a html field scraper which reads text from html entities of a given css class and extends a given vocabulary with a term consisting with the text content of the html class tag. Additionally, the term is included into the semantic facet of the document. This allows the creation of faceted search to documents without the pre-creation of vocabularies; instead, the vocabulary is created on-the-fly, possibly for use in other crawls. If any of the term scraping for a specific vocabulary is successful on a document, this vocabulary is excluded for auto-annotation on the page. To use this feature, do the following: - create a vocabulary on /Vocabulary_p.html (if not existent) - in /CrawlStartExpert.html you will now see the vocabularies as column in a table. The second column provides text fields where you can name the class of html entities where the literal of the corresponding vocabulary shall be scraped out - when doing a search, you will see the content of the scraped fields in a navigation facet for the given vocabulary	10 years ago
Michael Peter Christen	1cb290170e	refactoring of autotagging code (combined same code pieces)	10 years ago
Michael Peter Christen	c3b55455fc	enhanced initialization speed of vocabularies by using better normalization and by removal of unused data structures	10 years ago
Michael Peter Christen	68c605d637	replace with CommonPattern.SPACE for split	10 years ago
Michael Peter Christen	de3e373913	using precompiled CommonPattern.TAB for split	10 years ago
Michael Peter Christen	1f5047b15f	using precompiled pattern CommonPattern.SEMICOLON for splits	10 years ago
Michael Peter Christen	a8a2b7a803	persistency for vocabulary facet switch	10 years ago
Michael Peter Christen	efbc9a3561	introducting a new getConfig method which parses comma-separated llists from setting fields; refactoring for all places where such lists are parsed	10 years ago
Michael Peter Christen	69eacdf4eb	applying precompiled CommonPattern.COMMA.split to all places where split(",") was used	10 years ago
Michael Peter Christen	ac19690d30	refactoring with CommonPattern.COMMA	10 years ago
Michael Peter Christen	cf9b22ca5c	do not reindex based on vocabulary fields (there are meanwhile many of them) and some default settings	10 years ago
Michael Peter Christen	5a060c9f26	refactoring of reindexSolr (just replaced constant string)	10 years ago
Michael Peter Christen	b5a55c8b3d	fix for wkhtmltopdf (custom header does not work)	10 years ago
Michael Peter Christen	3d717b749a	fix for urlmaskfilter	10 years ago
Michael Peter Christen	2636582435	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	10 years ago
reger	0260d3d800	Allow to hide linkstructure graphic in crawl monitor using/setting the config param DECORATION_GRAFICS_LINKSTRUCTURE	10 years ago
Michael Peter Christen	bee5ee7cce	removed some warnings	10 years ago
Michael Peter Christen	783cf6fbc7	the LinkedBlockingQueue is much faster than the ArrayBlockingQueue (strange but this is the result of a test: ArrayBlockingQueue: 39461 lines / second; LinkedBlockingQueue: 60774 lines / second)	10 years ago
Michael Peter Christen	6390454652	fix for vocabulary on/off setting	10 years ago
Michael Peter Christen	a3c5995bde	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	10 years ago
reger	5ca0762179	fix: eom on parsing ico file by genericImageParser trace: java.lang.OutOfMemoryError: Java heap space at java.awt.image.DataBufferInt.<init>(DataBufferInt.java:75) at java.awt.image.Raster.createPackedRaster(Raster.java:467) at java.awt.image.DirectColorModel.createCompatibleWritableRaster(DirectColorModel.java:1032) at java.awt.image.BufferedImage.<init>(BufferedImage.java:331) at net.yacy.document.parser.images.bmpParser$IMAGEMAP.<init>(bmpParser.java:149) at net.yacy.document.parser.images.bmpParser.parse(bmpParser.java:69) at net.yacy.document.parser.images.genericImageParser.parse(genericImageParser.java:116)	10 years ago
Michael Peter Christen	4cd2d68e03	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	10 years ago
Michael Peter Christen	dc5700148f	update to latest code changes from json.org	10 years ago
reger	42b0672be3	Let auto-disabled crawls recover if low resource condition vanished. Analog to autodisabled DHT switch autodisabled crawls back on upon mem ok by remembering the autodisable by conf parameter.	10 years ago
Michael Peter Christen	b32e0b5457	fix for shell script	10 years ago
Michael Peter Christen	29f6e9db7a	write java version to status page	10 years ago
Michael Peter Christen	604ccd8072	new development cycle	10 years ago
Michael Peter Christen	287c528f46	replaced old JavaApplicationStub for Mac Application framework with new script. Adopted the YaCyApp environment and fixed a problem in the startYACY.sh application wrapper which caused wrong usage of logging option -l which caused that files had been written to the YaCy application folder. As a result of this fix, it is not necessary any more to change path settings in Info.plist if libraries are changed.	10 years ago
Michael Peter Christen	2bc2564668	Release 1.82	10 years ago
Michael Peter Christen	4c9d2a7c64	reverted 'do not show all options' strategy. This is actually confusing new users. Will be activated maybe again if there is an optional tutorial mode which can be switched on for this special purpose of running a tutorial.	10 years ago
Michael Peter Christen	7db2888336	fixed font size and print page generation in pdf snapshots	10 years ago
reger	24f68a4eb7	refactor opensearch heuristic introduce FederateSearchManager handling search heuristic to external systems via specific FederateSearchConnectors, which provide the query() functionallity, the translation to YaCy schema .toYaCySchema() and the search() routine to deliver results to searchevents, which is generally implemented in Abstract connector. The manager enforces now a min 15s delay between calls to external systems. Besides the OpensearchConnector a SolrFederateSearchConnector is available. It uses a additional config file for fieldname translation. default heuristicopensearch.conf: - openbdb.com removed - seems not longer to deliver results - config via solrconnector to datacite.org added (large technical library archive)	10 years ago
Michael Peter Christen	3b51636ecb	fix for mediawiki import	10 years ago
Michael Peter Christen	b07afbc115	a test with http://validator.w3.org/feed/#validate_by_input shows that the time format was wrong; we must use RFC-822	10 years ago
Michael Peter Christen	8cafdb989a	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	10 years ago
reger	66839f73fa	remove debug limit from commit before	10 years ago
reger	4214f250d0	Add option for extended search (Autosearch) to Bookmark.html asking all connected peers for the searchterm added as description to the bookmark created by the bookmark icon. Intended for searches/research projects with not sufficient results from local and DHT selected remote target peers. Function: the process checks newly created bookmarks for description starting with "query=..." and takes this to ask every peer for 20 search results and adds it to the local index in a background job. link to start/stop the process added to /Bookmarks.html	10 years ago
reger	bb37cb32e4	Add title import for bookmark icon if avail in index	10 years ago
reger	8e751d754a	- add javadoc to busythread with hint about the init parameter useage - remove obsolete 10_httpd config parameter	10 years ago
Michael Peter Christen	3e6c3e2237	documents pushed over the api/push_p.html interface will have their unique flag set by default	10 years ago
Michael Peter Christen	0871e43fcc	better scale	10 years ago
Michael Peter Christen	35c24608cc	fix for division by zero (rare cases)	10 years ago
Michael Peter Christen	4144c7cc52	do not write frame links to webgraph	10 years ago
reger	4eb89d7f15	revert clickservlet (default was indeed a mistakenly)	10 years ago
Michael Peter Christen	61ae9d2d11	do not use the clickservlet by default. From my personal view, this technique should not be used at all! This project is about privacy, the existence of a click servlet is one example why people should NOT use a search portal if such exists.	10 years ago

... 2 3 4 5 6 ...

11736 Commits (9d8f426890a4926db2debdb279e6999e2328459a) All Branches Search

11736 Commits (9d8f426890a4926db2debdb279e6999e2328459a)

All Branches