Commit Graph

9505 Commits (b9b446bca6ec750472b8fb2f1b34f1da2585aa44)

Author SHA1 Message Date
Michael Peter Christen 791e1dcfdf when a new crawl is started, delete all entries about error-urls for
12 years ago
Michael Peter Christen c6a6f4c4e6 added a hack which makes the HostBrowser more performant when the given
12 years ago
Michael Peter Christen 619bf7e875 fixed filetype modified for media types in text search
12 years ago
Michael Peter Christen 97f82994a6 automatically pause the crawler if there is a problem with solr
12 years ago
Michael Peter Christen 64ac2b7b7d new submenu template
12 years ago
Michael Peter Christen 5e77801aac update to web interface structure
12 years ago
Michael Peter Christen 8fb370d9f8 renovated the way how search results are count. should be correct now...
12 years ago
Michael Peter Christen 7bec253bb0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen d88eb657fd Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1
12 years ago
orbiter 354ef8000d - added 'deleteold' option to crawler which causes that documents are
12 years ago
reger 633fbe9188 Fix Metadata handling
12 years ago
Michael Peter Christen 19d1f474ce host browser now shows also number of pending files per subdirectory +
12 years ago
Michael Peter Christen 75dd706e1b update to HostBrowser:
12 years ago
Michael Peter Christen e2c4c3c7d3 migration to solr 4.0.0
12 years ago
Michael Peter Christen b764de424a code cleanup
12 years ago
Michael Peter Christen 69aa39d664 update to libraries required by solr 4.0.0
12 years ago
Michael Peter Christen 9330ad4838 - fixed the delete option in host browser
12 years ago
Michael Peter Christen a63179f3f9 added the MIME attribute for the R tag in GSA search result writer
12 years ago
Michael Peter Christen 40df2fd193 added the host browser as link to search results. that means you can
12 years ago
Michael Peter Christen 1168d09de8 more refactoring - integrated the code of SnippetProcess into
12 years ago
Michael Peter Christen 6629e37685 tried to clean up the search process mess
12 years ago
Michael Peter Christen c5f67a5d6d fixed a problem with local search from solr results: now all results
12 years ago
sixcooler 02957d5982 missing license-files
12 years ago
Michael Peter Christen 16216c2344 added missing libraries
12 years ago
sixcooler 9d062873d2 bump to httpclient-4.2.2
12 years ago
Michael Peter Christen f8f05ecba7 - added a delete button in host browser to delete a complete subpath
12 years ago
Michael Peter Christen 0716a24737 added more / all new crawl profile fields into crawl profile editor
12 years ago
Michael Peter Christen 4a14122ba7 in case that a crawl profile has a collection assigned, use the
12 years ago
Michael Peter Christen 0fe8be7981 enhaced data structures for balancer and latency computation which
12 years ago
Michael Peter Christen ac9540dfb6 removed options for stopwords which are not used
12 years ago
Michael Peter Christen ce3fed8882 added the Google Search Appliance (GSA) api interface to the main menu.
12 years ago
Michael Peter Christen b2ffd49817 less latency
12 years ago
Michael Peter Christen 0833937c1c better balancing and duetime-cumputation also for no-delay intranet
12 years ago
Michael Peter Christen c326aa8f67 disabled writing new entries to crawl stacks to prevent that a domain
12 years ago
Michael Peter Christen 6905182d41 - fix for number of words log message
12 years ago
Michael Peter Christen c25d7bcb80 - added concurrency for robots.txt loading
12 years ago
Michael Peter Christen a94c537afc fixed getSize() which can use the cache size while the crawl is running
12 years ago
Michael Peter Christen 96912c9471 enhancement to solr caching: consider that during a get() the document
12 years ago
Michael Peter Christen a87811bc38 more auto-commit calls when a search interface is opened, but not when a
12 years ago
Michael Peter Christen 3d3d654e88 if a network configuration is choosed which does not allow DHT and no
12 years ago
Michael Peter Christen 2d9e577ad0 replaced the custom robots.txt loader by the standard http loader
12 years ago
Michael Peter Christen 799d71bc67 enhanced solr caching:
12 years ago
Michael Peter Christen a33e2742cb - removed unnecessary synchronized and deadlock in crawler
12 years ago
orbiter 8952153ecf update to Balancer algorithm:
12 years ago
orbiter 354f0d9acd moved static method from ClusteredScoreMap to MapDataMining because it
12 years ago
reger 722a447b0d - optimize code of augmented parsing to enhence document tags
12 years ago
Michael Peter Christen 8e1248ffe3 force a commit in advance of a search for the administrator to get most
12 years ago
Michael Peter Christen 3b48c78190 added an option to force a commit to solr.
12 years ago
sixcooler 2d972f289a rise commitWithinMs to default-value from SwitchBoard
12 years ago
orbiter 8fde1dd3b6 another performance and memory hack to graphics: this makes it possible
12 years ago