yacy_search_server

Commit Graph

Author	SHA1	Message	Date
Michael Peter Christen	6842783761	fixed and enhanced postprocessing	12 years ago
Michael Peter Christen	219d5934a4	fixed termination bug in Solr Connector	12 years ago
Michael Peter Christen	bf1bdd52a6	prevent requesting of 0-facets (which actually exist)	12 years ago
Michael Peter Christen	9d5895f643	enhanced and fixed postprocessing	12 years ago
Michael Peter Christen	f86fe90eda	enhanced mass storage speed to remote solr servers	12 years ago
Michael Peter Christen	6ed9821209	fixed several problems in solr connectors	12 years ago
Michael Peter Christen	191fd3d7e7	added an optimization option to HandleSet mass data storage structure	12 years ago
Michael Peter Christen	94b565ea0d	fixed keepalive min value	12 years ago
Michael Peter Christen	5ec5be5769	fixed logging for remote solr configuration	12 years ago
Michael Peter Christen	24a052ecb9	removed debug code for existsByIds	12 years ago
Michael Peter Christen	087df05e24	added option to Config_Network_p.html to enable remote search while DHT-Receive is switched off.	12 years ago
Michael Peter Christen	1a4a69c226	set more logger to 'final static'	12 years ago
Michael Peter Christen	c60947360d	logger should be static	12 years ago
Michael Peter Christen	69b8d61c47	fix for search requests in GSA interface which contain 'funny' characters (like ':' etc.)	12 years ago
orbiter	b085cb522b	replaced old existsByIds for embedded Solr with obviously much faster new selection method (including stil existing debug code to test that this is in fact better)	12 years ago
orbiter	4234b0ed6c	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
orbiter	909bbb49d8	added (partly commented) test code for url rewrite methods .. to be completed	12 years ago
orbiter	74c86a72a0	better default value for crawler user agent	12 years ago
Michael Peter Christen	899e7e92b0	added debug code	12 years ago
Michael Peter Christen	a5c1249ee2	reverted autowarming setting in solrconfig	12 years ago
Michael Peter Christen	87a956e881	calculating and showing the number of files and the average size of a file in the HTCACHE in ConfigHTCache_p.html	12 years ago
Michael Peter Christen	acc1f8a749	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	81d9e23532	fixed another memory leak in the PDF parser: the class org.apache.pdfbox.pdmodel.font.PDFont occupies 8MB of space which cannot be cleaned if PDFont.clearResources is called. The attempt to clean the class cache therefore causes that the class is loaded and this cache is initialized with some rubbish. I tried to prevent to instantiate this class by usage of a hacked findLoadedClass call to the SystemClassLoader (which is protected ...). Now, without using the PDF parser at all, 8MB of RAM space is not occupied, however, when the first PDF arrives this space will be taked and never given back to GC. WAKE UP YOU LAZY PDFBOX HACKER AND FIX THIS SHIT!	12 years ago
Michael Peter Christen	c152d996e6	reduced footprint of BookmarksDB which can take quite a lot of memory if the number of bookmarks is high (i.e. > 2000 URLs)	12 years ago
Michael Peter Christen	81bb50118e	found and fixed a huge memory leak in solr caching (inside Solr). The not-flushed Solr cache is now handled in this way: - it is smaller by default - an Solr-internal process is started to flush the cache periodically (this does NOT clean the cache, just removes old objects) - a Solr-external process (the standard YaCy cleanup-process) now has direct access to the solr internal cache and flushes them completely. The time frame for such a flush is defined by the cleanup-process frequency, by default 10 minutes.	12 years ago
reger	7b17cdf6dd	add content_type:image/* to image search - see numerous idx entries with content_type image without url_file_ext_s (for various reason) which should be included in result - try it yourself with following sample query /solr/select?q=content_type:image/* AND -url_file_ext_s:[* TO *]&defType=edismax&fl=sku,url_file_ext_s,content_type adresses also possible url without or deviating extension.	12 years ago
sixcooler	987f410011	URL-export:add query and fix for cast-class-exception	12 years ago
Michael Peter Christen	ffe8276063	replaced referrer link masking to 'pure' links to the referring page (that was more useful during testing)	12 years ago
Michael Peter Christen	a8253ca49c	added missing unicode transformation in href link contents during parsing	12 years ago
Michael Peter Christen	0cf9e9580b	added clickdepth and CR computation debug code to verify that the process is complete	12 years ago
Michael Peter Christen	7f768b42d3	we do not need the load-image flag any more since this is now controlled by parser switches	12 years ago
Michael Peter Christen	234a974955	load image only if their parser flag is activated	12 years ago
Michael Peter Christen	b2c329929f	Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git	12 years ago
Michael Peter Christen	60187a4ec2	fix in html parser	12 years ago
Michael Peter Christen	e1c1e57877	less overhead calling exist() with only one hash	12 years ago
reger	3d5d366f1c	fix html header in Solr HTMLResponseWriter - move 1st body content after </head> tag - add closing <span> tag	12 years ago
Michael Peter Christen	5a02d650ee	avoid cloning	12 years ago
reger	a09e70cd68	fix typo in GitRevTask (branch)	12 years ago
Michael Peter Christen	cc39667399	Speed enhancements and less CPU usage during Solr searches when using the embedded Solr (the default). This was obtained by cirumventing solrj search encapsulation and the implementation of direct index access methods to Solr. The effect will not only be seen during search, but this has also a strong effect on suggestions (much more) and less CPU power usage during index distribution (which needs many search requests)	12 years ago
Michael Peter Christen	434e13b46d	in host browser also show the properties of failed documents including referrer urls (this is a VERY USEFUL SEO and Web Admin feature!!)	12 years ago
orbiter	176acce5cb	version number change for next development cycle	12 years ago
orbiter	1ac504ae51	use html encoding for urls in metadata	12 years ago
reger	69599566f9	catch one more malformed url in proxy url rewrite	12 years ago
reger	605530fec5	catch proxy url rewrite exception malformed url (" http:\/\/" ) may cause error response testcase http://localhost:8090/proxy.html?url=http://dictionary.reference.com/browse/test	12 years ago
orbiter	aaa945518d	next intermediate release 1.64	12 years ago
Michael Peter Christen	25951cee14	- fixed opensearchdescription, this delivered an url with missing 'global' option - added display=2 to compare_yacy to remove the superfluous border	12 years ago
Michael Peter Christen	f1bfe64361	integrated startpage to compare_yacy	12 years ago
Michael Peter Christen	2f57327f20	added boolean load property to CacheResource_p servlet which causes that the servlet loads the page from the web.	12 years ago
Michael Peter Christen	9bb7eab389	hacks to prevent storage of data longer than necessary during search and some speed enhancements. This should reduce the memory usage during heavy-load search a bit.	12 years ago
orbiter	3c3cb78555	- removed a lot of garbage and bloated code from GuiHandler. - transformed log lines to String before they are stored because the storage space is about 1:250 (45kb for one line before transformation, 180 bytes afterwards) - this saves up to 10MB RAM so we can increase the number of lines to 1000 again.	12 years ago

1 2 3 4 5 ...

9998 Commits (6842783761021b6dafe33c42a58f284256a1aae0) All Branches Search

9998 Commits (6842783761021b6dafe33c42a58f284256a1aae0)

All Branches