Commit Graph

4983 Commits (3562b5e3a42c7c440ff791637863764842db1f23)

Author SHA1 Message Date
Michael Peter Christen 69b8d61c47 fix for search requests in GSA interface which contain 'funny'
11 years ago
orbiter 4234b0ed6c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
orbiter 74c86a72a0 better default value for crawler user agent
11 years ago
reger 1437c45383 merge rc1/master
11 years ago
Michael Peter Christen 87a956e881 calculating and showing the number of files and the average size of a
11 years ago
Michael Peter Christen acc1f8a749 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
11 years ago
Michael Peter Christen 81bb50118e found and fixed a huge memory leak in solr caching (inside Solr). The
11 years ago
sixcooler 987f410011 URL-export:add query and fix for cast-class-exception
11 years ago
Michael Peter Christen ffe8276063 replaced referrer link masking to 'pure' links to the referring page
11 years ago
reger b38de92a16 Merge origin/master into jetty
11 years ago
Michael Peter Christen 434e13b46d in host browser also show the properties of failed documents including
11 years ago
orbiter 1ac504ae51 use html encoding for urls in metadata
11 years ago
reger f017066197 Merge origin/master into jetty
11 years ago
Michael Peter Christen 25951cee14 - fixed opensearchdescription, this delivered an url with missing
11 years ago
Michael Peter Christen f1bfe64361 integrated startpage to compare_yacy
11 years ago
Michael Peter Christen 2f57327f20 added boolean load property to CacheResource_p servlet which causes that
11 years ago
Michael Peter Christen 9bb7eab389 hacks to prevent storage of data longer than necessary during search and
11 years ago
Michael Peter Christen 5afa6e3aee Automatically flush the log cache if a short memory status is reached.
11 years ago
Michael Peter Christen 030d0776ff Enhanced crawl start for very, very large crawl lists (i.e. > 5000)
11 years ago
Michael Peter Christen 4948c39e48 added concurrency for mass crawl check
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which occurred when a document was not recognized with
11 years ago
Michael Peter Christen 16e3b357b3 replaced old tag cloud and adopted design a bit
11 years ago
Michael Peter Christen dc38d35986 added matching in url field in Table_API_p search
11 years ago
Michael Peter Christen 691d7e70fa added hint to development/commit rss feed
11 years ago
Michael Peter Christen b81859c751 Show a RSS icon in the right top corner of search results. This replaces
11 years ago
Michael Peter Christen 1a09771be8 fixed sitemap crawl start
11 years ago
orbiter b743e6d79f - prevent that crawl filter have empty (never-match) content
11 years ago
orbiter f597fdb602 make it easier to filter properties (case insensitive)
11 years ago
reger f46c723398 allow to choose used http server, YaCy-Anomic or Jetty
11 years ago
reger 1adb4b8741 merge rc1/master
11 years ago
reger 37d24f3318 make use of declared static string ACTION_LOCATION
11 years ago
reger eea504c117 update Info.plist
11 years ago
reger a44eede8b8 merge rc1/master
11 years ago
reger 54a0272338 searchpage javascript (latestinfo) causes reset of search statistic after moving to next page
11 years ago
Michael Peter Christen 91fa99e9bb added new icon/image for latest commit
11 years ago
Michael Peter Christen 9fac9249bc - replaced 'edit' link with a clone symbol in Table_API_p since that is
11 years ago
Michael Peter Christen 0f6db6ad5b Merge remote-tracking branch 'jensbees/crawlexpert-post'
11 years ago
Jens Bertram 3252c1ec39 Merge upstream/master into crawlexpert-post
11 years ago
Michael Peter Christen 90c8577840 enhanced ranking; patches to replace old ranking
11 years ago
bhoerdzn a3824dfbaa check URL on initial load, if set
11 years ago
bhoerdzn 52f49d475b add a hidden field for "crawlingstart" since jQuery omits the submit button value
11 years ago
bhoerdzn b0c0ec2dec link recorded crawl starts back to "CrawlStartExpert_p" in "Process Scheduler"
11 years ago
bhoerdzn d64d45361c use integer types for boolean values
11 years ago
bhoerdzn eda123d6fd remove debugging code intercepting post requests
11 years ago
bhoerdzn 5057f27bbd fix typo in parsing "cachePolicy" parameter
11 years ago
bhoerdzn 98f5c9018d Fixed template vars for "deleteold". Fixed parsing "deleteold" parameter. Stop "setState" overwriting "deleteold" state on load.
11 years ago
bhoerdzn a6a62986d4 correct state handling for country code restriction
11 years ago
bhoerdzn 4066b85155 correctly set initial state for load filters
11 years ago
bhoerdzn 8c91c3e7cd set form boolean values to 0 & 1 instead of false & true
11 years ago
bhoerdzn c27fabc88e fixed wrong parameter check
11 years ago
bhoerdzn 2214bf5396 Remove some post parameters, if they are set to default values, as their values are already set by YaCy. Added some documentation.
11 years ago
reger 71d2655c02 downgrade to Jetty 8 to assure support of JRE 1.6
11 years ago
orbiter 705b3338ee list more fields available for search and for ranking boosts
11 years ago
bhoerdzn 405878182f Use list template for all other option lists. Fixed some template expressions.
11 years ago
bhoerdzn 8e74098cd4 Use list template for "reloadIfOlderNumber".
11 years ago
bhoerdzn 52bad7b908 Dynamic toggling of form fields, based on passed in and selected values. This will also cut down the post string by disabling not needed fields.
11 years ago
Michael Peter Christen e56aa4fe93 fixed search navigation
11 years ago
Michael Peter Christen 4fbc4740df removed warnings
11 years ago
bhoerdzn 45cf553bc3 try to guess default crawling mode, if none set
11 years ago
bhoerdzn b4f0c822f2 assign strings before checking contents
11 years ago
bhoerdzn 499abe8f91 set default values for string parameters
11 years ago
bhoerdzn 42ea56eaad made CrawlStartExpert_p aware of post variables; extended template where needed
11 years ago
reger c7c706fd9f merge with rc1/master
12 years ago
Michael Peter Christen 82bfd9e00a - crawl profiles shall be deleted from active and passive stacks if they
12 years ago
orbiter 8ac2e8c8c9 added location navigator which causes that the image to the map search
12 years ago
orbiter d86d2be5c3 automatically removed Places autotagging if no location library is
12 years ago
reger 5c4ba9b5db merge rc1 master
12 years ago
reger 70c51775ae Merge remote-tracking branch 'origin/master' into jetty
12 years ago
orbiter d2effd21db fix for npe during location search
12 years ago
Michael Peter Christen e40671ddb7 better and consistent deletions for error urls
12 years ago
Michael Peter Christen 2602be8d1e - removed ZURL data structure; removed also the ZURL data file
12 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
12 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
12 years ago
reger 13fc86c960 Merge remote-tracking branch 'origin/master' into jetty
12 years ago
reger 127adbf5cf remove references to 10_http thread (legacy http server)
12 years ago
Michael Peter Christen 3e22d05290 added option for daterange properties in GSA interface to use an left-
12 years ago
reger 36b7159282 - remove double initialization of jetty
12 years ago
reger 63ed04260a Merge remote-tracking branch 'origin/master' into jetty
12 years ago
Michael Peter Christen 35ab2cef7b added parsing of 'date', 'dc:date', 'dc.date' and 'last-modified' in
12 years ago
reger aafef72a8a merged current rc1/master into jetty branch to allow further development with latest version
12 years ago
Michael Peter Christen dbef8ccfcb forced deletion of ZURL entries for a specific host for each host that
12 years ago
Michael Peter Christen e137ff4171 refactoring (in preparation for new removeHost method)
12 years ago
Michael Peter Christen 9e12fdff23 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 049c3b3f2e added an option to exclude image search results from text search. This
12 years ago
Michael Peter Christen 5d71a4c8bc fix for dc:description field
12 years ago
reger 392174de8c remove all_words, all_strings lists from QueryGoal
12 years ago
Michael Peter Christen cb85b22725 redesign of the image search process (with much better results,
12 years ago
Michael Peter Christen 6184fd9d9a fix for solr/gsa result logging
12 years ago
reger 29967102a2 optimized QueryGoal (reducing mem and computation by removing all_hashes)
12 years ago
orbiter f106345eef link strings should not be tokenized
12 years ago
orbiter 5b14bdfffd npe fix
12 years ago
orbiter 1ca4b9612c added special handling of the BinaryResponseWriter in the solr interface
12 years ago
Michael Peter Christen a88a62f7aa added a feature to set a collection for a crawl result based on a
12 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
12 years ago
Michael Peter Christen 47b1c81d08 - refactoring
12 years ago
Michael Peter Christen e6b423c4d9 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
reger 94bec24d14 add back menu to Surftips page (currently no menu is displayed)
12 years ago
Michael Peter Christen 1f299b0d42 removed link.gif as link button because this image is now shown
12 years ago
Michael Peter Christen 48ddd50a6c html fix
12 years ago
reger 96ae332427 revert del _blank (last commit) in template
12 years ago
reger 43348a98a9 add some href target=_blank to ext. links with external icon
12 years ago
reger 82d81a57bd info msg if no embedded Solr http://bugs.yacy.net/view.php?id=279
12 years ago
reger 02fe8b43ba Field Re-Indexing: display list of fields in reindex queue
12 years ago
sixcooler 7f501b7c38 clear some caches before reporting low Memory
12 years ago
reger 070bf85b33 css fix for IE10 showing border on all img within <a /> tag since introduction of external link icon (commit 112836dcc9)
12 years ago
sixcooler 8a96140f92 fix / workaround for
12 years ago
Michael Peter Christen 2674d28ef4 protection against self-ping (may be caused by fraud attempts)
12 years ago
orbiter f3d001c7ab more space in the about section
12 years ago
Michael Peter Christen e879b97b0a added line to enhance debugging
12 years ago
Michael Peter Christen 76afcccaaf fix for default boolean post values: the default value MUST NOT be TRUE,
12 years ago
orbiter 252c525709 fixed feed api servlet and enhanced RSSReader class
12 years ago
Marc Nause 112836dcc9 Improved external links.
12 years ago
Marc Nause d64a094f0e External links in HTML interface are marked as external with small icon.
12 years ago
Michael Peter Christen 58fe986cca Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
12 years ago
sixcooler 7d53ac86a3 fix for Blacklist (-Administration)
12 years ago
orbiter f425b2c61c re-try to fetch url after a soft commit
12 years ago
orbiter bf0ad04e1b apply load limitation also to dht-in
12 years ago
Roland Haeder b58ca8622d Some cleanups:
12 years ago
Roland Haeder e2ee412160 Use SwitchboardConstants.LISTS_PATH_DEFAULT instead of 'DATA/LISTS'
12 years ago
Roland Haeder ae19401af0 Removed another duplicate occurrence of Blacklist.BLACKLIST_FILENAME_FILTER
12 years ago
Roland Haeder 59225487ea Fix for blacklist export, also applied the filename filter here
12 years ago
Roland Haeder 952fc0e7bd Removed superfluous check for files ending '.black' as the previous commit already excluded all other files (e.g. .ser dumps), added logging in catch-all block
12 years ago
Roland Haeder 060fec1577 Reuse Blacklist.BLACKLIST_FILENAME_FILTER
12 years ago
Roland Haeder 29049c71f5 Possible fix for ticket http://bugs.yacy.net/view.php?id=270, the filter for only including *.black must be applied
12 years ago
Michael Peter Christen 4c242f9af9 always use a default value for boolean options to have transparency for
12 years ago
orbiter 9c681cc00d added segment sizes, postprocessing status and cpu load to crawler
12 years ago
orbiter 86b514cf46 added load info to status_p.xml
12 years ago
orbiter 056b42f5aa - added information about segment count to status_p.xml
12 years ago
orbiter 6fb2811e68 fixes for problems with remote solr and non-activated webgraph index
12 years ago
orbiter e24016e30a added the property federated.service.solr.indexing.timeout to yacy.init
12 years ago
orbiter 232100301c removed double-occurring value assignments
12 years ago
Roland Haeder aaedc0405d Fixes and avoid of catching bad exceptions (some):
12 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
12 years ago
Felix Ableitner 376f9cd9d0 Merge branch 'master' of git://gitorious.org/yacy/rc1 into blacklist_structure
12 years ago
Michael Peter Christen 89c0aa0e74 added collection_sxt to error documents
12 years ago
Michael Peter Christen 0df5195cb0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 1fd006cc56 fixes using the embedded connector
12 years ago
orbiter aba7cc5de7 added cpu load information to status page
12 years ago
Roland Haeder 59b4fdd5ad Merge remote-tracking branch 'upstream/master'
12 years ago
orbiter 5493389576 stealth mode shall only be available for authorized users, because
12 years ago
Roland Haeder ebbb3bc5c1 Fixed CHMOD on many files + added missing loggers (e.g. jena) and made some noisy loggers quiet
12 years ago
Michael Peter Christen bcc623a843 refactoring of load_delay: this is a matter of client identification
12 years ago
orbiter 2be456e7fb added a postprocessing field into api/status_p.xml to show if the
12 years ago
orbiter 575f913154 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter c4efb612e2 added list of crawls to status_p.xml
12 years ago
Lotus bb6caa346c Do not allow automatic update in case YaCy is installed to the Program
12 years ago
orbiter dac88561ae minimum access time has a tight connection to ClientIdentification,
12 years ago
Felix Ableitner a020697d64 Fixed problems with blacklist entry insertion.
12 years ago
sixcooler bff8c753c6 re-insert this file - was deleted by mistake
12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
orbiter c79f687110 enhanced the network scanner: find more hosts automatically by removal
12 years ago
orbiter b4677d1cad fix for bug #252
12 years ago
Michael Peter Christen 07261fe274 Merge remote-tracking branch 'nutomics/blacklist_structure'
12 years ago
Michael Peter Christen dea71851d2 - better concurrency for network scanner
12 years ago
orbiter 9f0cc9b401 enhanced network scanner
12 years ago
orbiter f8c28efd66 fix for rssTerminal coloring
12 years ago
Felix Ableitner 44f8fcf62e Changed class structure of Blacklist.
12 years ago
Michael Peter Christen 3054a6d4b9 added a patch from Sebastian M.B., submitted by email for coloring of
12 years ago
Michael Peter Christen 78af998f8f Merge commit 'fd90fcc4e08f80acbfd1c9a7ec62ce04cd309594'
12 years ago
Michael Peter Christen 57ffdfad4c added a crawl option to obey html-meta-robots-noindex. This is on by
12 years ago
Felix Ableitner fd90fcc4e0 Fixes #196.
12 years ago
Michael Peter Christen f1c5338210 preparation for greedy crawl profiles and refactoring
12 years ago
Michael Peter Christen e6f361f474 adding the canonical tag to crawl queues
12 years ago
Michael Peter Christen 203921006a redesign of citation index storage
12 years ago
Michael Peter Christen e92b9275ce Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 56cdcfa2fa fixed greedy learning mode - global is not a search attribute in
12 years ago
Michael Peter Christen 32aa1d4569 removed unused option for queries
12 years ago
Michael Peter Christen 0c5bed7e2c added configuration option for greedy learning function to ConfigPortal
12 years ago
sixcooler 5d1f619f07 possible helpful closing of solr-requests
12 years ago
Michael Peter Christen 9d291764d1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
sixcooler e5abccdfe4 added optimize-option
12 years ago
Michael Peter Christen 8ea6ddf636 removed attributes from ConfigPortal.html which are redundant to
12 years ago
Michael Peter Christen 64140f35cd fix for solr requests if no query part is given (prevent npe)
12 years ago
Michael Peter Christen 23fb458963 - fix to gsa searchresult answer in case that no query part is given
12 years ago
Michael Peter Christen 660a196989 refactoring
12 years ago
Michael Peter Christen 54024958ac added url_file_name_s in query for live-search of urls
12 years ago
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name
12 years ago
Michael Peter Christen f542cf7d9c fix for daterange: the to-date is inclusive
12 years ago
Michael Peter Christen c36720d45f added daterange option to gsa api
12 years ago
Michael Peter Christen 4e3007f4a0 typo
12 years ago
Michael Peter Christen 2cb6b6bc21 added target="_blank" to shutdown links
12 years ago
orbiter c8e94ad7c7 fix for citation search in case that the citation is very fresh
12 years ago
orbiter 57dcf68665 added a feed-back message inside the shutdown page
12 years ago
Michael Peter Christen 0600d510e1 show the citation report also in ViewFile
12 years ago
Michael Peter Christen 1a92b61d69 fixed usage of ViewFile which needs a commit before showing latest crawl
12 years ago
Michael Peter Christen 570511f3c8 removed fields references_internal_id_sxt and
12 years ago
Michael Peter Christen fd1776a3b0 added a new 'Citations' function: each search result item can now be
12 years ago
Michael Peter Christen 1762911f57 added synchronizations and timeouts in solr api; missing
12 years ago
Michael Peter Christen 2fd7bbb450 reduced load on solr; no seed update in Status and no exists-check in
12 years ago
Michael Peter Christen 7ee71c2354 changed administration page headline to 'admnistration'
12 years ago
Michael Peter Christen efd973d29d changed p2p/stealth mode text and links a bit
12 years ago
Michael Peter Christen 6115bef335 added a 'greedy learning' mechanism which will cause that a 'fresh'
12 years ago
Michael Peter Christen a5e328d7c5 new icons
12 years ago
Michael Peter Christen b85db72a73 added another response writer which can present search result with
12 years ago
Michael Peter Christen 5132bf719c added new buttons to search result page in p2p mode which show the
12 years ago
orbiter 2b320313d9 replaced yacydoc servlet usage by a solr result output using an html
12 years ago
orbiter 200769d0c6 show the cache link in search results only if there is actually a cache
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen fdcd4e6a6f fixes to index deletion: quoting of host name (a '-' may be part of the
12 years ago
reger 7480e87386 - fix stopword handling for RWI see example http://bugs.yacy.net/view.php?id=247
12 years ago
orbiter 5c7ddc67fe in GSA api enable usage of solr fq-attribute together with GSA
12 years ago
Michael Peter Christen eb9d0ba5b1 ranking and boost function update, small bugfixes, better default search
12 years ago
Michael Peter Christen 5f92c68f1f removed block rank ranking and all YBR files in /ranking
12 years ago
Michael Peter Christen 164603b946 cleanup
12 years ago
Michael Peter Christen 0c1a018bbd removed 'later' tactic because it used too much RAM, reduced number of
12 years ago
Michael Peter Christen 709e9b8ce7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 9e07447d47 added new link for SMW
12 years ago
Michael Peter Christen 3c04dd11de removed dead link
12 years ago
Michael Peter Christen 281959a2d7 added option to re-boot the embedded solr during run-time. Added also
12 years ago
Michael Peter Christen 80a7989e8c fixed ClassCastException: [Ljava.lang.Object; cannot be cast to
12 years ago
orbiter da621e827e prevent NPE in case RWI is disabled
12 years ago
Michael Peter Christen 7300d81f40 include API Table deletion requests to the API recorder
12 years ago
Michael Peter Christen d2ade87b49 fixed missing thisaddress in yacysearch.html which caused that the
12 years ago
Michael Peter Christen 179d032181 added a (badly formatted) delete button for process scheduler entries
12 years ago
reger c03f75ebc3 fix DHT url receive see http://bugs.yacy.net/view.php?id=242
12 years ago
Marc Nause 8fb1b1e290 *) simplified banner creation code
12 years ago
Marc Nause cd0b5f31b4 *) updated links to description of regex
12 years ago
Michael Peter Christen 8f2d3ce2f9 reduced locking situation in crawler: shifted synchronized location and
12 years ago
Michael Peter Christen f93501e6e0 nice crawl name if crawl is started with file:// (was: null)
12 years ago
Michael Peter Christen b4f0cac102 added the reindexing job servlet to the submenu structure
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen c91c67c3cd reject bad solr requests
12 years ago
Michael Peter Christen 44e363f37f refactoring of WorkflowProcessor, added process counter, update of
12 years ago
reger 79401cb938 added reindex option for documents with disabled or obsolete fields to Solr Schema Editor page (IndexSchema_p.html)
12 years ago
Michael Peter Christen b24d1d18e4 removed synchronization and concurrency in Fulltext class, concurrent
12 years ago
Michael Peter Christen f965d04496 added new peer icons for Mentor peers and Mentee peers (not used yet)
12 years ago
Michael Peter Christen b9b446bca6 - added ssl configuration sign (a lock) to network statistic/table
12 years ago
Michael Peter Christen 7095446ad3 added checkbox (near port) to switch on ssl support (https access) to
12 years ago
Michael Peter Christen e6c8b545c2 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 4baa0d4a97 Added a default keystore for ssl encryption of the YaCy web interface.
12 years ago
Michael Peter Christen 038f956821 fix for sitemap detection: the sitemap url was not visible if it
12 years ago
Michael Peter Christen e26bdd4a52 fixes to deletion methods (removed unnecessary concurrency and added
12 years ago
Michael Peter Christen f7f3e28c5e prevent that the size of the index is computed too many times.
12 years ago
Michael Peter Christen cca19d94d4 re-declared some fields to be of type string rather than text which
12 years ago
Michael Peter Christen ed1d5bace6 draw the names of other peers which receive/send dht into the network
12 years ago
Michael Peter Christen b528448332 enlarge network graph circle according to image height and reduce the
12 years ago
Michael Peter Christen f1bb54943e typo
12 years ago
Michael Peter Christen d7fd346917 - added regular-expression based deletions
12 years ago
Michael Peter Christen 3841854c97 abstraction of catchall term
12 years ago
sixcooler e145afb8d6 fix for PerformanceMemory showing UNRESOLVED_PATTERN by removing
12 years ago
Michael Peter Christen 1b102d98d8 - added index deletion to index administration submenu
12 years ago
Michael Peter Christen 0e2ee00fea added an index deletion servlet and some style changes for the
12 years ago
Michael Peter Christen e4f7e5bcfe fixed bad css change
12 years ago
Michael Peter Christen 3502b4c697 refactoring (renaming) of yacy-solr api
12 years ago
Michael Peter Christen 3a0fcfbeda Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 25499eead5 - added a new field for the regular expression in crawl start
12 years ago
reger 0a9b0992f3 RankingSolr_p: include warning if boost field not in local index
12 years ago
orbiter e1bfe9d07a - reduction of the concurrently running processes to make YaCy more
12 years ago
Michael Peter Christen c091000165 added collection attribute also to the rss feed reader
12 years ago
orbiter f7571386a3 added a 'collection' property attribute in yacysearch.html which can be
12 years ago
orbiter 3e79bd4b1f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter d571e739b6 increased row limitation for authorized users from 10000 to 100000000 in
12 years ago
Michael Peter Christen a1fffe8e86 fixed default ranking values
12 years ago
Michael Peter Christen 1d30082446 added hindi translation configuration
12 years ago
Michael Peter Christen 97775fbebc fixed ranking for add-function queries: this did not work. The option
12 years ago
Michael Peter Christen 298bf2deb5 fix to ranking configuration servlet
12 years ago
Michael Peter Christen 2db058b551 added in RankingSolr_p.html a select box to switch between different
12 years ago
Michael Peter Christen 6fbca35215 fixed api table navigation
12 years ago
Michael Peter Christen f24ac518e6 redesign of exists()-query (can now be called with query) and the
12 years ago
Michael Peter Christen 27d6222880 added new field host_extent_i which, after a crawl and postprocessing,
12 years ago
Michael Peter Christen 579eb01a49 showing now the details of references count in host browser:
12 years ago
reger 0f4237d8e5 add admin option to delete load errors from index
12 years ago
Marc Nause e99c8789ff *) fixed encoding of query in link to map (in case geolocalization is
12 years ago
Michael Peter Christen 082e3274d6 - setting the same default ranking in the solr interface as for YaCy
12 years ago
Michael Peter Christen edc0b33f6d - showing references count and clickdepth in host browser
12 years ago
orbiter 2c3b024196 if the crawl was paused (automatically), show the reason for pausing in
12 years ago
reger 566a3b0294 fix: Index Administration > Reverse Word Index (IndexControlRWIs_p) corrected use of word search to word-hash search
12 years ago
reger 40b3f2c5fe comment out dead menu link
12 years ago
reger bf1e1ddca1 fix typo in prev commit
12 years ago
reger d4d93be779 uncomment "used time" calculation for remote search log
12 years ago
reger 36202f27b0 improve remote search log, set "Returned Results" to transmitcount (instead of no value)
12 years ago
reger 254074b11d Merge branch 'master' of git://gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 870aedf3c6 fixes for better search interface integration in yaml templates
12 years ago
Michael Peter Christen 735eb70525 better search timing; prevents '0 results' for very large local
12 years ago
Michael Peter Christen 342ba1049b - callback fix
12 years ago
reger 31d16f20d7 fix invisible icon not found
12 years ago
orbiter 243b66ae6d Merge branch 'master' of git://gitorious.org/~frankensteen91/yacy/frankensteen91s-yacy
12 years ago
Frank 7763f2554f add the new PPMbar in Crawler_p for a better style and better use.
12 years ago
orbiter e4d26d1cb4 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 940c6849ee enhanced did-you-mean (a bit): can now remember previously searched
12 years ago
reger d57b221921 add: reset Solr schema field selection to default button in IndexSchema_p
12 years ago
Michael Peter Christen 9406a2e438 fixed NPE during index abstract computation
12 years ago
Michael Peter Christen d725782440 turned severe message to warning message about network failure events
12 years ago
Michael Peter Christen 2d36a7eaf5 - do not create a new query for all remote peers
12 years ago
Michael Peter Christen 2080fc7406 removed unused tag fields
12 years ago
reger 7804c12976 fix error msg in ConfigHeuristics_p
12 years ago
reger 230a12bfe2 adjust Opensearch discover function to new webgraph Solr schema
12 years ago
orbiter 47114910d5 fix for possible memory leaks
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 68e739a90b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 3d9ce9cd04 - added more selection criteria for network seed list
12 years ago
orbiter 168e8d9b4d added/fixed missing DOCTYPE line (submitted by Thomas)
12 years ago
Michael Peter Christen 25300913fa fixes to search debugging after testing with the different search
12 years ago
Michael Peter Christen 2d472a39f4 DHT-transferred metadata and crawl receipts now also use the delayed
12 years ago
Michael Peter Christen 221ed7d764 - enhanced concurrency during search without IO blocking
12 years ago
Marc Nause 2714b59f38 *) For some reason this seems to fix a ClassCastException on my system
12 years ago
orbiter 0f7ea7ad9f - enhanced solr.add procedure for mass adds
12 years ago
orbiter 7ff10bdb1b fix of page navigation for formatted totalcount numbers
12 years ago
orbiter a734fbc4a5 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
orbiter aa3c26c62e added recrawl/reload to CrawlStartSite for a timeout of 3 days
12 years ago
orbiter c1b7e61882 added option to create empty vocabularies
12 years ago
bubu e0edad689d fix link to IndexSchema_p.html
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 35fa718b77 testing to use solr for portalsearch caused some bugfixing but no full
12 years ago
Michael Peter Christen 008288719c fix for schema export to consider also automatically generated
12 years ago
Michael Peter Christen 089dee1770 - generalized SchemaConfiguration into super-class Configuration and
12 years ago
Michael Peter Christen 56d5946a59 - added flags in IndexFederated_p.html to switch on or off the webgraph
12 years ago
Michael Peter Christen 14cceb6b17 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 58e1e6fa2b fixes to schema
12 years ago
reger d31a109efe remove obsolete Solr "commit within" input field from IndexFederated
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr fields in the core 'webgraph'.
12 years ago
Michael Peter Christen 89ede0fe84 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
orbiter 594ed63f2a fixed interactive search which caused an error if pubDate is not present
12 years ago
Michael Peter Christen 98a4a4aa97 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen b6de1f42dc Full redesign of solr connection architecture. This was done to support
12 years ago
Marc Nause efb6cf7d21 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
12 years ago
Marc Nause ce5b7afab2 *) removed Skype online indicator (was not working anymore)
12 years ago
Michael Peter Christen 4111606654 removed the commitWithin attribute because that is not the way how the
12 years ago
Michael Peter Christen c20fa3640d fix to unbalanced tag and license for null objects
12 years ago
Michael Peter Christen 3a6097966d added jsonp option to yjson result writer
12 years ago
Michael Peter Christen de58043205 Added image license generation for solr image search results when
12 years ago
Michael Peter Christen d3508fa8ff fixed json search, quotes, auto-facets, urls etc. for
12 years ago
Michael Peter Christen 02fa31b5bf better filesearch layout
12 years ago
Michael Peter Christen e55ec3071d reduced number of facets in yacyinteractive (only filetype necessary)
12 years ago
Michael Peter Christen 16d90859b7 reverted put-semantics back to as-usual in serverObjects and introduced
12 years ago
Michael Peter Christen c34af7fe94 extended JSON Response Writer and Opensearch Response Writer for the
12 years ago
Michael Peter Christen 762b687e47 extended the serverObjects to be able to hold multiple values for a
12 years ago
Michael Peter Christen d70d99fab5 added more metadata fields and facets to OpensearchResponseWriter.
12 years ago
Michael Peter Christen 51e7ab4f70 moved bookmarks back to more prominent location (even if this does not
12 years ago
Michael Peter Christen dee8b24d3c better error handling for bookmarks
12 years ago
Marc Nause 27894d2c1a Merge branch 'master' of git@gitorious.org:yacy/rc1.git
12 years ago
Marc Nause 75f9568472 *) only install files from the RELEASE directory
12 years ago
Michael Peter Christen eb80405a16 added a disable function in RemoteCrawl_p servlet which prevents setting
12 years ago
Michael Peter Christen 1e3d8cc235 show a link for the host in the host browser; see
12 years ago
Michael Peter Christen 7de502f43d Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Marc Nause 3bc5ee6e3d *) added protection against CSRF in update download page
12 years ago
Michael Peter Christen 3834829b37 bugfixes and more logging for solr connector
12 years ago
Michael Peter Christen d1cb4cbc84 enhanced network scanner, is faster and more flexible now
12 years ago
Michael Peter Christen 7dfcc92b71 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen 0b6566a389 optimizations when starting large crawl requests with many start urls in
12 years ago
orbiter a2160054d7 ability to create vocabularies also without any objectspace: this
12 years ago
Michael Peter Christen be27567b53 allow more links when starting a crawl by file
12 years ago
reger 3777b338c7 bugfix: location url for migrate urldb button onclick
12 years ago
reger 8447814a31 correct headermenue in migrateurldb_p.html
12 years ago
Michael Peter Christen 99185d7048 one more fix for author_sxt
12 years ago
Michael Peter Christen b6ae6262f6 - add the copyField author_sxt only if author exists
12 years ago
Michael Peter Christen 088373b4ea catch exception if solr connection change fails
12 years ago
Michael Peter Christen e23a596c1d added a copyField for author_sxt for automated schema generation
12 years ago
Michael Peter Christen f1a4feda3e security fix for suggest (don't let users ask for too much)
12 years ago
Michael Peter Christen 244b157299 fix for external solr schema definition
12 years ago
Michael Peter Christen 0fe7b6fd3b migrated the index export methods from the old metadata to solr. Now
12 years ago
Michael Peter Christen 8eebeea533 fix for search result link in ViewFile
12 years ago
Michael Peter Christen 31e854bef6 Merge remote-tracking branch 'copro/master'
12 years ago
Michael Peter Christen 4735bd47f4 - changed solr commit call and added an optimize option. Since Solr
12 years ago
Copro 0025983993 Fix typo embedd -> embed
12 years ago
Copro 3ea8380959 Adding Vimeo tag to wiki commands to embedd Video video with id
12 years ago
Copro ee9d7fd93d Added feature to embed Youtube videos to wiki commands for usage in
12 years ago
Michael Peter Christen 9ccdd21d76 Merge remote-tracking branch 'aleksejs/fixtrans'
12 years ago
Michael Peter Christen aa067da86b set the 'all' option as option at end of the list because the all option
12 years ago
Michael Peter Christen edbc86d2b0 integrated search term into opensearch result title. this makes better
12 years ago
Michael Peter Christen 4faa07c214 added a timeout for topic computation (solr is here much slower than the
12 years ago
Michael Peter Christen d2d5be032d added a 'inlink' search option according to the suggestion in the YaCy
12 years ago
Michael Peter Christen 76e1e91b11 with strict compiler settings, IndexFederated_p does not compile without
12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administration -> Federated Solr Index)
12 years ago
reger 3b6e08b49f prevent checking of urldb if empty
12 years ago
reger 1fb452174a read defaults from yacy.init for "Set to Defaults" button
12 years ago
reger f143804382 fix configuration for search page navigators
12 years ago
Michael Peter Christen 24db2fcd9d fix for Network info
12 years ago
Michael Peter Christen fc47109608 added 'Last Hour' to network statistics
12 years ago
Michael Peter Christen 38d3feae65 added separate delete commands for the local+remote solr index, the old
12 years ago
Michael Peter Christen 6f0baaa309 added the clickdepth post-processing: some links may have 'shortcuts' to
12 years ago
Michael Peter Christen 0f5b6f38c1 enhanced root-url detection
12 years ago
Michael Peter Christen 8ae08a2cac moved HTCache, Heuristics and Parser servlet to a more appropriate menu
12 years ago
Michael Peter Christen 5c0c56cfe1 Preparations to produce a click depth attribute in the search index.
12 years ago
Michael Peter Christen 295884fd54 - Merge commit '168b1d130d9d67b5e8855a0b50c4ba7ad4a416f8'
12 years ago
reger 276e63401e small sanitary fixes
12 years ago
reger f301336adf fix: no results with configuration citation reference index switched off
12 years ago
orbiter fe50702eb0 added a filterscannerfail attribute to QueryParams which causes that a
12 years ago
reger 168b1d130d Adding heuristic to get search results from configured systems which support opensearch specification
12 years ago
reger 7761b60325 fix: Broken Link on Crawler_p.html - issue 218
12 years ago
reger e9e0d63897 Add config option to show HostBrowser link in search result
12 years ago
Michael Peter Christen 4a9182ae16 use the search configuration to default the cacheStrategy to the value
12 years ago
Michael Peter Christen e1f89efd0d - made image search in interactive search using the ViewImage servlet -
12 years ago
reger fbf84e9ff3 fix SeedUpload setting property name for include template file
12 years ago
Michael Peter Christen 9e4033f229 fix for event starter: delete start time when event is removed
12 years ago
Michael Peter Christen 99edbf6f14 fix for config basic: do not accept empty peer names
12 years ago
Michael Peter Christen 24c9bb35f7 extended the Scheduler: introduced scheduled events
12 years ago
Michael Peter Christen 433143ba40 removed protocol, tld, ext from the urlmask and created specific
12 years ago
Michael Peter Christen 84f82541e8 search process enhancements
12 years ago
Michael Peter Christen 02020b590b - removed all extension types from extension navigation which are not
12 years ago
Michael Peter Christen 01200f06cc using the author field as solr-native facet. this makes it necessary to
12 years ago
Michael Peter Christen 7ad5457db0 using the solr facets as navigation in yacyinteractive.html instead of
12 years ago
Michael Peter Christen 1052263af3 - added a new solr field references_i which stores the number of
12 years ago
Michael Peter Christen 34f8786508 removed dependency of vocabulary navigation from Jena and its
12 years ago
reger 664499bb10 PerformanceQueues: disable input for hardcoded httpd performance values
12 years ago
Michael Peter Christen 9319b90d8a - fixes for host navigation
12 years ago
Michael Peter Christen cb5cbec14d distinguishing modified query string and original query string
12 years ago
Michael Peter Christen fb0fa9a102 - fixed 'delete from subpath' during crawl start which deleted nothing;
12 years ago
orbiter 54e193a2b8 you can now search for '*' to get just ALL entries in the search index
12 years ago
orbiter 7f5526e6ef allow larger no-proxy expressions
12 years ago
reger e80dfeca23 - making blacklist path part case insensitive (solving http://bugs.yacy.net/view.php?id=171)
12 years ago
Michael Peter Christen 4491072256 - clear the search cache when altering the solr boosts
12 years ago
Michael Peter Christen 2b7d46bc1f using a filter query for the site parameter in GSA api
12 years ago
Michael Peter Christen 10527e28ae fix for wrong display of error urls in HostBrowser
12 years ago
Michael Peter Christen 5f5d66921e patch for funny symbols in url paths (like tilde)
12 years ago
Michael Peter Christen 8aa08261a7 update to Solr Boost handling
12 years ago
Michael Peter Christen 908ad2f174 Added a new servlet to configure the solr ranking using field boosts
12 years ago
Michael Peter Christen a598fb6227 renamed Ranking_p.html to RankingRWI_p.html
12 years ago
Michael Peter Christen 72f165d58b added a Boost class which stores solr query boost values. The class can
12 years ago
reger bb20691d4f fix: respect config setting of "show Nav Top-Menu" in HostBrowser.html for public users (as hostbrowser is now available in search results)
12 years ago
Michael Peter Christen 3de784c8dd replaced more split and replaceAll missing pattern pre-compilation with
12 years ago
Michael Peter Christen 8fc3679c66 using more pre-compile pattern for split methods
12 years ago
Michael Peter Christen d48e9788d2 enhanced search result processing behavior
12 years ago
Michael Peter Christen eca68fa197 added debug code to crawler monitor
12 years ago
Michael Peter Christen 205f8b222b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter c54cb85422 added link to
12 years ago
Michael Peter Christen b7004043ea - added a field cache for solr queries which call only for a single
12 years ago
Michael Peter Christen bf42179982 introduced more structure in HostBrowser, table view, better counting,
12 years ago
Michael Peter Christen 4eab3aae60 removed overhead by preventing generation of full search results when
12 years ago
Michael Peter Christen a114bb23bb - using edismax in gsa interface
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
Michael Peter Christen f5ca5cea44 - added field options to all solr queries. This can be used to restrict
12 years ago
Michael Peter Christen 46be4af5b9 Merge commit '2bb8f045cc92f31fc7e720cc30b38af417563890'
12 years ago
Michael Peter Christen 952e143580 FINALLY YaCy can now search for full strings using double- or
12 years ago
orbiter 5dfd6359cb redesign of the QueryParams class: introduced QueryGoal which holds the
12 years ago
Michael Peter Christen 5fd3b93661 added deletion of hosts during crawl start if deleteold option was given
12 years ago
Michael Peter Christen d64445c3cb because we have the inurl:<term> - searchmodifier, we don't actually
12 years ago
orbiter b55ea2197f - redesign of crawl start servlet
12 years ago
orbiter 1c66de4bd4 - removed scheduled crawling options in crawl start because it is
12 years ago
Michael Peter Christen 2e7219f9fd removed highlighting of search results within collections in GSA
12 years ago
Michael Peter Christen 074dfd297b added icons and a selection for hosts with urls pending for crawler or
12 years ago
cominch 21df1ad9e0 update and generalization of the SMW import and content control routines
12 years ago
Michael Peter Christen 4c4e0eece2 added new submenu 'Target Analysis' with three servlets which are useful
12 years ago
Michael Peter Christen 61995d508e do the commit anyway before calling a search interface
12 years ago
Michael Peter Christen 86ec199126 using a better file name
12 years ago
Michael Peter Christen 5105256927 update to search result logging (this was a remaining issue from the
12 years ago
Michael Peter Christen 570e42c4e3 fix for filetype navigator
12 years ago
Michael Peter Christen 71ed8e5e07 bugfixes for crawler
12 years ago
Michael Peter Christen 29fbbb49dc better colors for host browser and corrected document count
12 years ago
Michael Peter Christen 6244b084cd fixed wrong order of result count values
12 years ago
Michael Peter Christen 631b08e7e2 update to HostBrowser
12 years ago
Michael Peter Christen 51f420e4f5 removed location search because it is only working in special cases
12 years ago
Michael Peter Christen 15d1460b40 added information about the reason of pausing of crawls
12 years ago
Michael Peter Christen 2371ef031c added solr faceted search support to YaCy search results
12 years ago
Michael Peter Christen d481abd087 added the visualization of error-urls to host browser
12 years ago
Michael Peter Christen a15819fbec fix for some interface problems
12 years ago
Michael Peter Christen 791e1dcfdf when a new crawl is started, delete all entries about error-urls for
12 years ago
Michael Peter Christen c6a6f4c4e6 added a hack which makes the HostBrowser more performant when the given
12 years ago
Michael Peter Christen 64ac2b7b7d new submenu template
12 years ago
Michael Peter Christen 5e77801aac update to web interface structure
12 years ago
Michael Peter Christen 8fb370d9f8 renovated the way search results are counted. should be correct now...
12 years ago
orbiter 354ef8000d - added 'deleteold' option to crawler which causes that documents are
12 years ago
Michael Peter Christen 19d1f474ce host browser now shows also number of pending files per subdirectory +
12 years ago
Michael Peter Christen 75dd706e1b update to HostBrowser:
12 years ago
Michael Peter Christen e2c4c3c7d3 migration to solr 4.0.0
12 years ago
Michael Peter Christen 9330ad4838 - fixed the delete option in host browser
12 years ago
Michael Peter Christen 40df2fd193 added the host browser as link to search results. that means you can
12 years ago
Michael Peter Christen 1168d09de8 more refactoring - integrated the code of SnippetProcess into
12 years ago
Michael Peter Christen 6629e37685 tried to clean up the search process mess
12 years ago
Michael Peter Christen c5f67a5d6d fixed a problem with local search from solr results: now all results
12 years ago
Michael Peter Christen f8f05ecba7 - added a delete button in host browser to delete a complete subpath
12 years ago
Michael Peter Christen 0716a24737 added more / all new crawl profile fields into crawl profile editor
12 years ago
Michael Peter Christen 4a14122ba7 in case that a crawl profile has a collection assigned, use the
12 years ago
Michael Peter Christen 0fe8be7981 enhanced data structures for balancer and latency computation which
12 years ago
Michael Peter Christen ac9540dfb6 removed options for stopwords which are not used
12 years ago
Michael Peter Christen ce3fed8882 added the Google Search Appliance (GSA) api interface to the main menu.
12 years ago
Michael Peter Christen 0833937c1c better balancing and duetime-computation also for no-delay intranet
12 years ago
Michael Peter Christen c25d7bcb80 - added concurrency for robots.txt loading
12 years ago
Michael Peter Christen a87811bc38 more auto-commit calls when a search interface is opened, but not when a
12 years ago
Michael Peter Christen 3d3d654e88 if a network configuration is chosen which does not allow DHT and no
12 years ago
Michael Peter Christen 2d9e577ad0 replaced the custom robots.txt loader by the standard http loader
12 years ago
Michael Peter Christen 799d71bc67 enhanced solr caching:
12 years ago
orbiter 8952153ecf update to Balancer algorithm:
12 years ago
Michael Peter Christen 8e1248ffe3 force a commit in advance of a search for the administrator to get most
12 years ago
Michael Peter Christen 1baf498d59 - show more lines in online log
12 years ago
Michael Peter Christen f2d0418218 because the new PngEncoder had a problem with the PixelGrabber which is
12 years ago
Michael Peter Christen d5d64019e5 - added a method for the RasterPlotter to draw arrow endings to lines
12 years ago
Michael Peter Christen 342543a6c4 fix for host browser
12 years ago
Michael Peter Christen 85ca07b90e when a new crawl is started, an equal crawl, if still running, is
12 years ago
Michael Peter Christen 906e51214a the web structure image shows the pivot dot in a different color
12 years ago
orbiter 276dd6452b removed warnings
12 years ago
orbiter 59bf4677b6 added option to view the complete directory structure in host browser
12 years ago
Michael Peter Christen b991685782 Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1
12 years ago
Michael Peter Christen 9eaede50e7 enhanced web structure images
12 years ago
Michael Peter Christen ae6feb5610 showing the web structure graph as animation in the crawl monitor
12 years ago
Michael Peter Christen 39317a6c66 enhanced webstructure image: introduced
12 years ago
sixcooler 57ddd63888 do not hold an expensive cache of references for DHT-out, but load them
12 years ago
reger 1dc6482feb format crawler timeout output string in seconds (was days)
12 years ago
Michael Peter Christen ef937af35d more custom field usage in gsa search result
12 years ago
Michael Peter Christen ce0e5b1e17 - more refactoring / private methods
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen 7f71dfab03 added a HostBrowser.xml api file and changed a bit of attribute naming
12 years ago
Michael Peter Christen e5b3c172ff removed hack which translated Solr documents to virtual RWI entries
12 years ago
Michael Peter Christen 5d16c23a1f specified more URIMetadata as URIMetadataNode
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen cc98496ff3 enhanced the HostBrowser:
12 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects
12 years ago