Commit Graph

4868 Commits (49886fab08bef6816b7a3e627fb567f77453731a)

Author SHA1 Message Date
Michael Peter Christen a2b66fe2eb Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen d8e79731df fixed wrong used memory display
11 years ago
orbiter da5d4128bf prevent npe
11 years ago
Michael Benz 072d4aa0c0 Updated German translation and Blacklist_p.html
11 years ago
orbiter f6e441dd77 refactoring
11 years ago
orbiter c3f6c06f2c removed host increment on stored documents from crawler (that was wrong)
11 years ago
Michael Peter Christen a86c2fe77d fixed usage of media flag when started by automated process
11 years ago
Michael Benz f11314aae7 Improved German de.lng translation and fixed adresses -> addresses in \htroot\CrawlStartScanner_p.html
11 years ago
Michael Peter Christen f0eec6d0f3 Merge branch 'master' of git://gitorious.org/~copro/yacy/copros-rc1
11 years ago
Michael Benz 6278af4993 Edit German de locale and improved translation
11 years ago
Michael Peter Christen 69391e5d9e changed strategy to test existence of documents in Solr: using the
11 years ago
reger a02e33dcb6 add edit-link to PK field of table admin
11 years ago
Michael Peter Christen 9eb668e951 enhanced the resource observer
11 years ago
Michael Peter Christen cb2c25d930 in case that the crawler is running and the search user is the peer
11 years ago
Michael Peter Christen bf97e38b83 removed clearURLIndex, which is a stub remaining from the old metadata
11 years ago
Michael Peter Christen bc28247089 Added methods in resource observer to calculate the available and the
11 years ago
reger 365f77ea8c make internal page links relative to ease any future development for context aware servlets
11 years ago
Michael Peter Christen d9858e1b8a removed warnings and superfluous logging
11 years ago
Michael Peter Christen 7e71dcc417 removed interaction fragments
11 years ago
Michael Peter Christen 94245ce0a8 fixed "Size in KBytes" calculation in PerformanceQueues_p.html,
11 years ago
Michael Peter Christen 726e8c3ad5 removed unused classes and servlets
11 years ago
Michael Peter Christen 6e59ca4ebf removed jena library and all code that depended on jena. When jena was
11 years ago
Michael Peter Christen 0e6729f9bc Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 9228214f9b enrichment of PerformanceMemory display of SolrInfoMBean table
11 years ago
Michael Peter Christen e8bdf16ea7 added statistic information for solr resources in PerformanceMemory
11 years ago
reger 1a2b298a65 fix: select all checkbox Tables_p
11 years ago
Michael Peter Christen 931541d198 re-inserted default value re-set button to performance queues and
11 years ago
reger bd1685c94a fix not needed getFileExtension().toLower (double)
11 years ago
orbiter a11f072504 enhanced didyoumean
11 years ago
Michael Peter Christen bc395c7439 reduced color depth of star icons (for smaller file sizes)
11 years ago
Michael Peter Christen 9e0e39a9a4 small change to start/stop/pause icon style
11 years ago
orbiter 22e3524797 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter c40ba51ca6 added new suggest method which replaces more-than-one suggestions:
11 years ago
reger ad4b213145 remove unused static var from HTTPDProxyHandler
11 years ago
reger 6c6056836d fix vocabulary navigator checkbox selection (from last commit)
11 years ago
reger cb71413d19 fix page nav, to keeping modifier
11 years ago
orbiter ba5ab11cc4 less logging
11 years ago
Michael Peter Christen 322854a5f8 fix auth for forced ping
11 years ago
Michael Peter Christen fbf4f77d80 fixed missing corona in network picture
11 years ago
Michael Peter Christen d2b8f2b477 enhancements for staticIP and ipv6 handling
11 years ago
reger 91d79c1ac4 disable wrong forward to https on port change
11 years ago
reger 193b8235c2 remove double jquery-1.3.1.js and adjust header links to jquery-1.3.2
11 years ago
reger f307d65dcf prepare for a language navigator
11 years ago
orbiter 768b1306b8 Added a write-enabled checkbox for remote solr servers.
11 years ago
orbiter f7d6dd136f changed solr paths according to new default paths
11 years ago
Michael Peter Christen 8b14e92ba4 added button in host browser to re-load 404/failed documents
11 years ago
reger f47067b0ce fix search navigator not showing activated nav
11 years ago
reger 9a96a7d73f put list quick navigator buttons belowon BlackList_p editor
11 years ago
Michael Peter Christen 6ada0daae9 making latency_factor and maximum number of same hosts in loader queue
11 years ago
Michael Peter Christen be5e808236 - removed hardcoded load-test which is now handled in BusyQueues
11 years ago
sixcooler 40a4030b55 configurable max-load values for YaCy-Threads:
11 years ago
Michael Peter Christen 77531850b5 reverted crawling strategy from latest commit.
11 years ago
Michael Peter Christen c0da966dfa enhanced crawler speed
11 years ago
Michael Peter Christen 1ea17bd9f3 - removed old metadata database and all migration code
11 years ago
reger 97e84439fb adjusted ConfigHeuristic and changed QueryGoal.getOriginalQueryString to .getQueryString
11 years ago
orbiter fd4abc0565 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter d5b8e473c8 added load limit for DHT transfer: RWI acceptance only if local load is
11 years ago
reger 41c126978b fix bug: Crawl Start (Expert) crawls "?-URLs" even if told not to do so
11 years ago
Michael Peter Christen a9ed28c0b5 no commit if no action is requested
11 years ago
reger 0c754dd794 implemented DIGEST authentication, which is for remote login more secure
11 years ago
Michael Peter Christen f8ce7040ab remote search peer selection schema change:
11 years ago
reger 6932aa4d7a use configured admin-username for api calls
11 years ago
reger c656e67c97 fix: display proper error msg on admin user change
11 years ago
orbiter 2ead4e44d9 introduced a new storage path ARCHIVE inside of DATA which will be used
11 years ago
reger 30d925a96e reimplemented server access restriction
11 years ago
orbiter 3cb6c7861f fixed shutdown authentication problem
11 years ago
Michael Peter Christen 7005ecdabd cleanup
11 years ago
Michael Peter Christen 2939b47986 removed non-working realm setting in http client (auth for localhost was
11 years ago
Michael Peter Christen 9bd71fdbb4 made the access tracker class static because it shall be used by the
11 years ago
Michael Peter Christen 7d6fc79eb8 refactoring (usage of constant names for attributes of authentication
11 years ago
reger cabe0943cd fix opensearch resultcount in yacysearch.rss
11 years ago
reger eaf596a257 adding proxy status to (private) status box
11 years ago
reger e3d8459906 extend ssl enabled msg on status page
11 years ago
reger 58ecf5e4dd add to blacklist button in CrawlResults
11 years ago
reger 17b454f957 fix external link (open in new tab)
11 years ago
reger dd8ea0cdd6 fix "add to blacklist" button style in IndexControlRWIs_p
11 years ago
orbiter 2861183359 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter 4035e20f0b unescaping the path
11 years ago
orbiter 7e21d1ff70 "inaccessible" better describes the state of a server which cannot be
11 years ago
reger 7f9b9315fe Merge origin/master
11 years ago
reger 8eaabb9600 remove dependency from old serverCore.java
11 years ago
orbiter 2018e55f8b switched back on index deletion (was accidentally off because new jetty
11 years ago
orbiter d4942ad5e0 startRecord fix; this is not according to SRU definition because this
11 years ago
reger 3d913558ab display configured adminUserName in ConfigAccounts_p
11 years ago
reger fbdd89e198 Merge origin/master
11 years ago
reger 65a2f3d5e7 tweak Jetty credentials to work with YaCy UserDB
11 years ago
Michael Peter Christen ee17bd0b69 added option to attach remote solr servers in read-only mode
11 years ago
Michael Peter Christen 25f9c35033 add patch which shall prevent that naive search mistakes like usage of
11 years ago
reger e05320b776 upd: to open more external links in new browser-tab
11 years ago
reger cbb5dc01e4 remove obsolete htroot/solr htroot/gsa YaCy-servlets
11 years ago
reger 71cac1a278 added SSL/HTTPS connector to support SSL/https connection on port 8443
11 years ago
reger f681ce15ae remove obsolete HTTPServer input field
11 years ago
Michael Peter Christen 20b48f894f refactoring: moving all servlets to the same package (the solr servlet
11 years ago
Michael Peter Christen 84167adb49 removed unused anomichttpd code after migration to jetty
11 years ago
Michael Peter Christen b461a27abb fixed the SolrServlet
11 years ago
Michael Peter Christen 7603e879dc Merge branch 'master' into HEAD
11 years ago
Michael Peter Christen 25250405f1 solr servlet preparation for join with jetty branch
11 years ago
reger c84c313fe1 Merge origin/master into jetty
11 years ago
Michael Peter Christen 74466d731a use pre-compiled patterns in ymark
11 years ago
Michael Peter Christen 09412ea3a4 counting search requests in solr interface
11 years ago
Michael Peter Christen 67e7dc0cc6 added more properties to seedlist servlet
11 years ago
Michael Peter Christen 79771c60c0 IPv6 fixes
11 years ago
reger 92d9c56f9f Merge origin/master into jetty
11 years ago
Michael Peter Christen da380343c2 perform greedy learning heuristic only if load < 1.0
11 years ago
Michael Peter Christen 81926c055d fixed bug with image search in yacyinteractive
11 years ago
Michael Peter Christen edda0699e4 changed default timeout for port scanner
11 years ago
Michael Peter Christen f1b5db2c45 - performance graph does not show peer ping in memory monitor any more
11 years ago
Michael Peter Christen 0db8e34625 enhanced webgraph processing
11 years ago
Michael Peter Christen 9d8b32c63a fixed a division by zero
11 years ago
Michael Peter Christen 957f6297fb Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger effea4bca0 Merge origin/master into jetty
11 years ago
reger b49e90d2e9 remove reference to solrServlet from YaCy servlet select
11 years ago
Michael Peter Christen 38e1e3a707 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
sixcooler 2c2ebb0d92 tried some hardening in order not letting any Solr-Searchers open
11 years ago
Michael Peter Christen cca79d12ef setting of some default values to make a client development start easy
11 years ago
Michael Peter Christen 3d4b5e66ce disallow remote robots to crawl the HostBrowser servlet
11 years ago
Michael Peter Christen 234ca720f5 only admins should be able to force a commit
11 years ago
Michael Peter Christen 2c39b65409 fixes for searches containing stopwords. The fix was done using a
11 years ago
orbiter 61409788eb less word hash computations (removing some overhead because of MD5
11 years ago
reger 5c4a3d1c01 Merge origin/master into jetty
11 years ago
Michael Peter Christen caa20d63d9 fixed seedlist (hash was missing)
11 years ago
Michael Peter Christen ccf2f4e43b refactoring of seed attributes (introduced more constants)
11 years ago
Michael Peter Christen c927b428d3 fixed json
11 years ago
Michael Peter Christen 64048ff217 fix for XSS
11 years ago
orbiter b7f1e5af51 added new servlet which generates the same file as the principal peers
11 years ago
orbiter 3e552550d1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter c2d720cdaf purge a lucene cache - possible memory leak fix
11 years ago
reger f111f30ace Merge origin/master into jetty
11 years ago
Michael Peter Christen f4172cbb3d fix for another XSS bug
11 years ago
orbiter ff86cb683f fixed some XSS bugs reported by Marius from http://ctf365.com/
11 years ago
orbiter 19a051bec8 more monitoring for postprocessing and enhanced layout in Crawler
11 years ago
Michael Peter Christen fceac8cffd more monitoring for postprocessing
11 years ago
Michael Peter Christen 9d5895f643 enhanced and fixed postprocessing
11 years ago
Michael Peter Christen 087df05e24 added option to Config_Network_p.html to enable remote search while
11 years ago
Michael Peter Christen 1a4a69c226 set more logger to 'final static'
11 years ago
Michael Peter Christen 69b8d61c47 fix for search requests in GSA interface which contain 'funny'
11 years ago
orbiter 4234b0ed6c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter 74c86a72a0 better default value for crawler user agent
11 years ago
reger 1437c45383 merge rc1/master
11 years ago
Michael Peter Christen 87a956e881 calculating and showing the number of files and the average size of a
11 years ago
Michael Peter Christen acc1f8a749 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 81bb50118e found and fixed a huge memory leak in solr caching (inside Solr). The
11 years ago
sixcooler 987f410011 URL-export:add query and fix for cast-class-exception
11 years ago
Michael Peter Christen ffe8276063 replaced referrer link masking to 'pure' links to the referring page
11 years ago
reger b38de92a16 Merge origin/master into jetty
11 years ago
Michael Peter Christen 434e13b46d in host browser also show the properties of failed documents including
11 years ago
orbiter 1ac504ae51 use html encoding for urls in metadata
11 years ago
reger f017066197 Merge origin/master into jetty
11 years ago
Michael Peter Christen 25951cee14 - fixed opensearchdescription, this delivered an url with missing
11 years ago
Michael Peter Christen f1bfe64361 integrated startpage to compare_yacy
11 years ago
Michael Peter Christen 2f57327f20 added boolean load property to CacheResource_p servlet which causes that
11 years ago
Michael Peter Christen 9bb7eab389 hacks to prevent storage of data longer than necessary during search and
11 years ago
Michael Peter Christen 5afa6e3aee Automatically flush the log cache if a short memory status is reached.
11 years ago
Michael Peter Christen 030d0776ff Enhanced crawl start for very, very large crawl lists (i.e. > 5000)
11 years ago
Michael Peter Christen 4948c39e48 added concurrency for mass crawl check
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which occurred when a document was not recognized with
11 years ago
Michael Peter Christen 16e3b357b3 replaced old tag cloud and adopted design a bit
11 years ago
Michael Peter Christen dc38d35986 added matching in url field in Table_API_p search
11 years ago
Michael Peter Christen 691d7e70fa added hint to development/commit rss feed
11 years ago
Michael Peter Christen b81859c751 Show a RSS icon in the right top corner of search results. This replaces
11 years ago
Michael Peter Christen 1a09771be8 fixed sitemap crawl start
11 years ago
orbiter b743e6d79f - prevent that crawl filter have empty (never-match) content
11 years ago
orbiter f597fdb602 make it easier to filter properties (case insensitive)
11 years ago
reger f46c723398 allow to choose used http server, YaCy-Anomic or Jetty
11 years ago
reger 1adb4b8741 merge rc1/master
11 years ago
reger 37d24f3318 make use of declared static string ACTION_LOCATION
11 years ago
reger eea504c117 update Info.plist
11 years ago
reger a44eede8b8 merge rc1/master
11 years ago
reger 54a0272338 searchpage javascript (latestinfo) causes reset of search statistic after moving to next page
11 years ago
Michael Peter Christen 91fa99e9bb added new icon/image for latest commit
11 years ago
Michael Peter Christen 9fac9249bc - replaced 'edit' link with a clone symbol in Table_API_p since that is
11 years ago
Michael Peter Christen 0f6db6ad5b Merge remote-tracking branch 'jensbees/crawlexpert-post'
11 years ago
Jens Bertram 3252c1ec39 Merge upstream/master into crawlexpert-post
11 years ago
Michael Peter Christen 90c8577840 enhanced ranking; patches to replace old ranking
11 years ago
bhoerdzn a3824dfbaa check URL on initial load, if set
11 years ago
bhoerdzn 52f49d475b add a hidden field for "crawlingstart" since jQuery omits the submit button value
11 years ago
bhoerdzn b0c0ec2dec link recorded crawl starts back to "CrawlStartExpert_p" in "Process Scheduler"
11 years ago
bhoerdzn d64d45361c use integer types for boolean values
11 years ago
bhoerdzn eda123d6fd remove debugging code intercepting post requests
11 years ago
bhoerdzn 5057f27bbd fix typo in parsing "cachePolicy" parameter
11 years ago
bhoerdzn 98f5c9018d Fixed template vars for "deleteold". Fixed parsing "deleteold" parameter. Stop "setState" overwriting "deletold" state on load.
11 years ago
bhoerdzn a6a62986d4 correct state handling for country code restriction
11 years ago
bhoerdzn 4066b85155 correctly set initial state for load filters
11 years ago
bhoerdzn 8c91c3e7cd set form boolean values to 0 & 1 instead of false & true
11 years ago
bhoerdzn c27fabc88e fixed wrong parameter check
11 years ago
bhoerdzn 2214bf5396 Remove some post parameters, if they are set to default values, as their values are already set by YaCy. Added some documentation.
11 years ago
reger 71d2655c02 downgrade to Jetty 8 to assure support of JRE 1.6
11 years ago
orbiter 705b3338ee list more fields available for search and for ranking boosts
11 years ago
bhoerdzn 405878182f Use list template for all other option lists. Fixed some template expressions.
11 years ago
bhoerdzn 8e74098cd4 Use list template for "reloadIfOlderNumber".
11 years ago
bhoerdzn 52bad7b908 Dynamic toggling of form fields, based on passed in and selected values. This will also cut down the post string by disabling not needed fields.
11 years ago
Michael Peter Christen e56aa4fe93 fixed search navigation
11 years ago
Michael Peter Christen 4fbc4740df removed warnings
11 years ago
bhoerdzn 45cf553bc3 try to guess default crawling mode, if none set
11 years ago
bhoerdzn b4f0c822f2 assign strings before checking contents
11 years ago
bhoerdzn 499abe8f91 set default values for string parameters
11 years ago
bhoerdzn 42ea56eaad made crawStartExpert_p aware of post variables; extended template where needed
11 years ago
reger c7c706fd9f merge with rc1/master
11 years ago
Michael Peter Christen 82bfd9e00a - crawl profiles shall be deleted from active and passive stacks if they
11 years ago
orbiter 8ac2e8c8c9 added location navigator which causes that the image to the map search
11 years ago
orbiter d86d2be5c3 automatically removed Places autotagging if no location library is
11 years ago
reger 5c4ba9b5db merge rc1 master
11 years ago
reger 70c51775ae Merge remote-tracking branch 'origin/master' into jetty
11 years ago
orbiter d2effd21db fix for npe during location search
11 years ago
Michael Peter Christen e40671ddb7 better and consistent deletions for error urls
11 years ago
Michael Peter Christen 2602be8d1e - removed ZURL data structure; removed also the ZURL data file
11 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
reger 13fc86c960 Merge remote-tracking branch 'origin/master' into jetty
11 years ago
reger 127adbf5cf remove references to 10_http thread (legacy http server)
11 years ago
Michael Peter Christen 3e22d05290 added option for daterange properties in GSA interface to use an left-
11 years ago
reger 36b7159282 - remove double initialization of jetty
11 years ago
reger 63ed04260a Merge remote-tracking branch 'origin/master' into jetty
11 years ago
Michael Peter Christen 35ab2cef7b added parsing of 'date', 'dc:date', 'dc.date' and 'last-modified' in
11 years ago
reger aafef72a8a merged current rc1/master into jetty branch to allow further development with latest version
11 years ago
Michael Peter Christen dbef8ccfcb forced deletion of ZURL entries for a specific host for each host that
11 years ago
Michael Peter Christen e137ff4171 refactoring (in preparation for new removeHost method)
11 years ago
Michael Peter Christen 9e12fdff23 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 049c3b3f2e added an option to exclude image search results from text search. This
11 years ago
Michael Peter Christen 5d71a4c8bc fix for dc:description field
11 years ago
reger 392174de8c remove all_words, all_strings lists from QueryGoal
11 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
11 years ago
Michael Peter Christen 6184fd9d9a fix for solr/gsa result logging
11 years ago
reger 29967102a2 optimized QueryGoal (reducing mem and computation by removing all_hashes)
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
orbiter 5b14bdfffd npe fix
11 years ago
orbiter 1ca4b9612c added special handling of the BinaryResponseWriter in the solr interface
11 years ago
Michael Peter Christen a88a62f7aa added a feature to set a collection for a crawl result based on a
11 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
11 years ago
Michael Peter Christen 47b1c81d08 - refactoring
11 years ago
Michael Peter Christen e6b423c4d9 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
reger 94bec24d14 add back menu to Surftips page (currently no menu is displayed)
11 years ago
Michael Peter Christen 1f299b0d42 removed link.gif as link button because this image is now shown
11 years ago
Michael Peter Christen 48ddd50a6c html fix
11 years ago
reger 96ae332427 revert del _blank (last commit) in template
11 years ago
reger 43348a98a9 add some href target=_blank to ext. links with external icon
11 years ago
reger 82d81a57bd info msg if no embedded Solr http://bugs.yacy.net/view.php?id=279
11 years ago
reger 02fe8b43ba Field Re-Indexing: display list of fields in reindex queue
11 years ago
sixcooler 7f501b7c38 clear some caches before reporting low Memory
11 years ago
reger 070bf85b33 css fix for IE10 showing border on all img within <a /> tag since introduction of external link icon (commit 112836dcc9)
11 years ago
sixcooler 8a96140f92 fix / workaround for
11 years ago
Michael Peter Christen 2674d28ef4 protection against self-ping (may be caused by fraud attempts)
11 years ago
orbiter f3d001c7ab more space in the about section
11 years ago
Michael Peter Christen e879b97b0a added line to enhance debugging
11 years ago
Michael Peter Christen 76afcccaaf fix for default boolean post values: the default value MUST NOT be TRUE,
11 years ago
orbiter 252c525709 fixed feed api servlet and enhanced RSSReader class
11 years ago
Marc Nause 112836dcc9 Improved external links.
11 years ago
Marc Nause d64a094f0e External links in HTML interface are marked as external with small icon.
11 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago