9984 Commits (b085cb522b7db8f431df5c831fd40383997f742f)

Author SHA1 Message Date
orbiter b085cb522b replaced old existsByIds for embedded Solr with obviously much faster
11 years ago
orbiter 4234b0ed6c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter 909bbb49d8 added (partly commented) test code for url rewrite methods .. to be
11 years ago
orbiter 74c86a72a0 better default value for crawler user agent
11 years ago
Michael Peter Christen 899e7e92b0 added debug code
11 years ago
Michael Peter Christen a5c1249ee2 reverted autowarming setting in solrconfig
11 years ago
Michael Peter Christen 87a956e881 calculating and showing the number of files and the average size of a
11 years ago
Michael Peter Christen acc1f8a749 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 81d9e23532 fixed another memory leak in the PDF parser:
11 years ago
Michael Peter Christen c152d996e6 reduced footprint of BookmarksDB which can take quite a lot of memory if
11 years ago
Michael Peter Christen 81bb50118e found and fixed a huge memory leak in solr caching (inside Solr). The
11 years ago
reger 7b17cdf6dd add content_type:image/* to image search
11 years ago
sixcooler 987f410011 URL-export:add query and fix for cast-class-exception
11 years ago
Michael Peter Christen ffe8276063 replaced referrer link masking to 'pure' links to the referring page
11 years ago
Michael Peter Christen a8253ca49c added missing unicode transformation in href link contents during
11 years ago
Michael Peter Christen 0cf9e9580b added clickdepth and CR computation debug code to verify that the
11 years ago
Michael Peter Christen 7f768b42d3 we do not need the load-image flag any more since this is now controlled
11 years ago
Michael Peter Christen 234a974955 load image only if their parser flag is activated
11 years ago
Michael Peter Christen b2c329929f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 60187a4ec2 fix in html parser
11 years ago
Michael Peter Christen e1c1e57877 less overhead calling exist() with only one hash
11 years ago
reger 3d5d366f1c fix html header in Solr HTMLResponseWriter
11 years ago
Michael Peter Christen 5a02d650ee avoid cloning
11 years ago
reger a09e70cd68 fix typo in GitRevTask (branch)
11 years ago
Michael Peter Christen cc39667399 Speed enhancements and less CPU usage during Solr searches when using
11 years ago
Michael Peter Christen 434e13b46d in host browser also show the properties of failed documents including
11 years ago
orbiter 176acce5cb version number change for next development cycle
11 years ago
orbiter 1ac504ae51 use html encoding for urls in metadata
11 years ago
reger 69599566f9 catch one more malformed url in proxy url rewrite
11 years ago
reger 605530fec5 catch proxy url rewrite exception
11 years ago
orbiter aaa945518d next intermediate release 1.64
11 years ago
Michael Peter Christen 25951cee14 - fixed opensearchdescription, this delivered an url with missing
11 years ago
Michael Peter Christen f1bfe64361 integrated startpage to compare_yacy
11 years ago
Michael Peter Christen 2f57327f20 added boolean load property to CacheResource_p servlet which causes that
11 years ago
Michael Peter Christen 9bb7eab389 hacks to prevent storage of data longer than necessary during search and
11 years ago
orbiter 3c3cb78555 - removed a lot of garbage and bloated code from GuiHandler.
11 years ago
Michael Peter Christen 5afa6e3aee Automatically flush the log cache if a short memory status is reached.
11 years ago
Michael Peter Christen 030d0776ff Enhanced crawl start for very, very large crawl lists (i.e. > 5000)
11 years ago
Michael Peter Christen 6aabc4e5c8 reduced logging line memory, 10000 lines had filled up 450MB! grrr.
11 years ago
Michael Peter Christen 1a8783147b enhanced computation of number of solr documents.
11 years ago
Michael Peter Christen 4948c39e48 added concurrency for mass crawl check
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which occurred when a document was not recognized with
11 years ago
Michael Peter Christen 82621bead0 When doing bootstrapping, always accept one seedlist-File without
11 years ago
Michael Peter Christen 16e3b357b3 replaced old tag cloud and adopted design a bit
11 years ago
Michael Peter Christen dc38d35986 added matching in url field in Table_API_p search
11 years ago
Michael Peter Christen 691d7e70fa added hint to development/commit rss feed
11 years ago
Michael Peter Christen b81859c751 Show a RSS icon in the right top corner of search results. This replaces
11 years ago
Michael Peter Christen 1a09771be8 fixed sitemap crawl start
11 years ago
orbiter b743e6d79f - prevent that crawl filter have empty (never-match) content
11 years ago
orbiter 20bbde8665 fix for mustmatch regex computation: result had correct semantic, but
11 years ago