4037 Commits (4a9e64caea5b0111b807c539918b465e0f2051ec)

Author SHA1 Message Date
luccioman 3ca695390c FTP crawl start URLs : applied crawl profile depth control
8 years ago
luccioman 128c8ef8d4 Fixed title rendering having non ASCII chars in QuickCrawlLink_p.html.
8 years ago
reger 8eb6fba59c activate filetype navigator plugin and restrict config (append) of navs
8 years ago
luccioman c25e48e969 Enabled displaying results after 14th page for local search queries.
8 years ago
luccioman a3886c6adb Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman feaa87005e Improved indentation for easier debugging steps.
8 years ago
reger bab4804d11 add FileTypeNavigator plugin
8 years ago
reger d35c47090c remove obsolete put of HttpServletRequest attributes to YaCy servlet
8 years ago
reger 0959038624 correct DefaultServlet resource pathinContext calculation
8 years ago
reger c50e23c495 reduce creation of empty legacy RequestHeader() in situation where null
8 years ago
reger 87f6631a2a adjust Cache getHeader to prev. changes/commit
8 years ago
reger 6be7339b1d remove the overhead of unused reverseMappingCache of HeaderFramewor / RequestHeader
8 years ago
reger c702eb6786 del dead menu link to /repository
8 years ago
reger baa5d9b9e3 adjust DomainHandler working on resolved .yacy domain
8 years ago
luccioman 1ba705c23d Use loaderDispatcher instead of HTTPClient to download releases.
8 years ago
luccioman 467650c042 Hardened system update checks.
8 years ago
luccioman b5711b8fe1 Added some Javadocs.
8 years ago
reger 0d2964cf2b expanded error message on rejected crawl url due to faile dns lookup
8 years ago
luccioman 00e81fcc15 Check HTTP status when downloading a release, and report eventual error.
8 years ago
reger 0758c868c9 add HostNavigator plugin
8 years ago
reger 60160877f5 bundle initialization of search navigation plugins in separate handler
8 years ago
reger 3151cda3a5 catch ip-format exception on wrong server access setting ip filter
8 years ago
reger b32bcdf344 list entries in outgoing cookie monitor one per line
8 years ago
reger 3f32262654 enable getCookies for HeaderFramework reusing Jetty CookieCutter
8 years ago
reger 4186ee6fc0 add other custom response header entries set by servlets to the response
8 years ago
luccioman d27adc2b92 Fixed language detector initialization and NullPointerException cases.
8 years ago
luccioman a1f922b34a Reduced locations vocabulary memory footprint.
8 years ago
reger 9c06e752e4 allow request.setAttribute w/o "not implemented" exception by default
8 years ago
reger 59ab42e7d6 add UserDB lastaccess update calls on login
8 years ago
luccioman bf8a6d9848 Reduced GeoNames locations memory footprint.
8 years ago
luccioman 3f561c1635 Fixed a NullPointerException case.
8 years ago
luccioman 6bc2bf1aa4 Small memory footprint reduction for GeonamesLocation.
8 years ago
luccioman 7f846ef674 Small complementary memory footprint improvement for synonyms.
8 years ago
luccioman 568e3dde6a Improved synonyms memory footprint.
8 years ago
reger 60b3adfb43 fix ext2mime to return given default on input=null
8 years ago
reger f7e9f9be5f move Digest auth checks from DefaultServlet to adminAuthenticated,
8 years ago
luccioman cca3417b87 Fixed image and favicon viewing for unauthenticated local requests.
8 years ago
reger 02092de3d8 remove login cookie generation for static admin ind User servlet
8 years ago
luccioman fc575fc760 Fixed a NullPointerException case.
8 years ago
reger 9a8691129f fix typing error from commit 60ba5c117c
8 years ago
reger f9328f07e2 completing the usage of CONNECTION_PROP_CLIENT_HTTPSERVLETREQUEST in
8 years ago
reger 8e3e3ed191 update the older ResponseHeader patch to handle cookies,
8 years ago
reger 866d3a1960 make RequestHeader login succeed (without throwing exception by default)
8 years ago
reger 44a6a4e795 fix authentication by hit in userdb (wrong parameter)
8 years ago
luccioman aa9ddf3c23 Added control over Robots.txt active threads maximum number.
8 years ago
luccioman 3092a8ced5 Fixed thread name consistency for improved monitoring.
8 years ago
luccioman eec5779889 Added a name prefix to pooled threads for easier monitoring.
8 years ago
reger 59130777a6 add high scored items first to YearNavigator (to make sure to be included
8 years ago
luccioman 0ba5a838f7 Added charset meta to Solr HTML writers.
8 years ago
reger 08a0acc35d make a YearNavigator availabel, useable as SearchEvent.naviator plugin.
8 years ago
reger 7742579ca4 make a LanguageNavigator availabel, useable for the SearchEvent.naviator
8 years ago
reger 0d3bef659b implement RequestHeader.setCharacterEncoding for legacy header,
8 years ago
Michael Peter Christen 5320209963 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 83f5e3d715 added+disabled a federate search experiment
8 years ago
reger 4eeb448eb3 use DigestURL in UrlProxyServlet as parameter to pass requested url to
8 years ago
reger bad8f87998 remove old/obsolete clear text "adminAccount" credential entry from init
8 years ago
reger 811cf637f8 fix Jetty9YaCySecurityHandler, length check of Basic credential,
8 years ago
reger fdcf33f08f fix Domain.stripToHostName for some IPv6 cases
8 years ago
reger ac6e198bd1 add unit test for Domains.stripToPort,
8 years ago
reger f27531f5ec fix Domains.stripToPort, make ipv6 save
8 years ago
reger 67744a8038 fix HeaderFramework.getRequestURL on host with port considering ipv6 host
8 years ago
reger 66cc0dd173 refactor: move GSA specific date formatter to GSAservlet
8 years ago
reger d525967999 refactor: move convertHeaderFromJetty to ProxHandler (only used with active proxy
8 years ago
reger 60ba5c117c fix legacy getHeaderCookies to work with cookies from original
8 years ago
reger 30f8d1e2d7 let RequestHeader.logout succeed w/o throwing exception by default
8 years ago
reger 28afd3a2f8 fix UserDB.proxyAuth from header string
8 years ago
luccioman 0806de8fdc Ensure file input stream are closed in both normal and error cases.
8 years ago
luccioman a0dfbaca6a FileUtils : added some JavaDocs and unit test cases
8 years ago
reger 59448461d3 make use of userInRole for quick login verification
8 years ago
reger 2a4d826d9e adjust servlet RequestHeader.getLocale
8 years ago
reger 9db68acb4f remove obsolete X_YACY... header declarations
8 years ago
reger 8e9aece786 more use of RequestHeader constant referer, authorization
8 years ago
reger d631fbc019 make more use of the new ServletRequest interface methodes
8 years ago
reger 395f2e8946 Make ServletRequest implement the standardized HttpServletRequest interface,
8 years ago
luccioman 74fec066f4 Converted more URLs to pure relative ones.
8 years ago
luccioman 0f0393e5e3 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman 7296e3884f Switched even more URLs to pure relative ones.
8 years ago
reger 49eae79c01 fix Tables.hasIndex check for tablename = key
8 years ago
luccioman 84b81c1af0 Switched more URLs to relative ones when possible.
8 years ago
luccioman 731684105a Improved absolute URLs rendering in OpenSearch desc and RSS feeds.
8 years ago
reger 669f60223e upd Column.toString to output encoder "{bytes}"
8 years ago
reger c9e81d2fa0 fix Column parsing from celldefinition string, without cellwidth def.
8 years ago
reger e0816ef2e5 use human readable date format in CrawlStacker error message
8 years ago
luccioman 54d879a9b3 Generate HTML relative (to each peer) links from hosted WikiCode.
8 years ago
luccioman 2da5f339f8 Fixed /News.html and /Wiki.html pages in Search Portal mode (issue #87).
8 years ago
reger 8fe28a83f2 harmonize used lastmodified date for rwi and fulltext in storeDocument
8 years ago
reger 3d1d297308 refactor namespace navigator as part of navigatorplugin map, this allows
8 years ago
reger 67f660523b Make navigators underlaying indexfield name accessible in interface
8 years ago
reger 5eb3ee4e20 Add search navigator interface to allow for additional navigators (plugins)
8 years ago
reger fd3f58fcaa improve query modifier parsing of "collection:" and possible collision
8 years ago
reger af39a76bf6 Reduce number of default max. search navigator lines (from 10000)
8 years ago
reger 20a1b29ed3 add simple test case for ReferenceContainer helpful for debugging
8 years ago
reger 3c7220bc7b Refacture rwi reference word position and word distance calculation
8 years ago
luccioman f0639d810c Customized name for Threads still using the default "Thread-n" pattern.
8 years ago
luccioman db3b9db9c2 Crawl from local file : faster task end when manually terminating crawl.
8 years ago
reger 4c67ed3f8d catch rwi ranking div by zero exception
8 years ago
luccioman 47af33a04c Advanced Crawl from local file : better processing of large files.
8 years ago
luccioman ee92082a3b Updated javadocs : warning about closing stream responsibility.
8 years ago
luccioman 6f49ece22f Fixed redirected URLs processing as crawl start point.
8 years ago
reger 68217465fe div by null in word distance calculation
8 years ago
luccioman 7263d17436 Removed mentions of deprecated LURL-db.
8 years ago
reger 8b74a6bf57 fix min/max calculation of WordReferenceVars.distance()
8 years ago
luccioman da362628fb Added fine log level for too long blacklist matching processing.
8 years ago
reger aaae7c6462 adjust ConcurrentScoreMap internal value map to interface and use parameter
8 years ago
reger 31d2a5645e remove obsolete query variable
8 years ago
luccioman a588ed7628 Applied image headers customization to the new ViewFavicon servlet.
8 years ago
luccioman 7717a3d43d Fixed license headers on files created to improve favicon management.
8 years ago
luccioman 6e1959f469 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
reger 685d8e86bf Avoid frequent data type casting (float/long) for rwi score
8 years ago
luccioman 3ccd89e274 Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
8 years ago
luccioman 4b699c469a Blacklist refactoring : extracted a function for easier unit testing
8 years ago
luccioman 54cfcc3f56 CrawlCheck_p.html : also display info about disallowed URLs.
8 years ago
luccioman 8b341e9818 Robots : properly handle URLs including non ASCII characters
8 years ago
reger e68b00678e prevent negative score on URIMetadataNode - in the special case were no
8 years ago
luccioman 242707f9b4 Fixed loadFromCache with strategy IFFRESH.
8 years ago
reger b752bcfecb adjust date in text detection to ignore some program version strings
8 years ago
reger b017e97421 optimize condenser language detection a little.
8 years ago
reger ae3717d087 adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! )
8 years ago
reger 474f0476c6 adjust Tokenizer sentence count on trailing text after last recognized sentence
8 years ago
reger 3861ac9293 upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov
8 years ago
reger 681a61dafb adjust rwi index result word position handling used for rwi ranking
8 years ago
reger 14f7577231 add support for older Word versions (Word6/Word95) to docParser
8 years ago
reger 1a79c64495 generalize DateDetection with holiday date rules readily available in icu
8 years ago
reger 6f68f08354 correct DateDetection Silvester date
8 years ago
reger 32a2e3a22a have RSSFeed.getChannel return empty message on missing channel element,
8 years ago
luccioman 8d57b5b970 Added some javadocs.
8 years ago
luccioman 60df09fff9 Fixed some HTML validation errors : Illegal character in query
8 years ago
reger 862f28eaa6 display number of documents/rss-items for label "docs" in load_rss_p servlet
8 years ago
luccioman dcdea2d02f Fixed shutdown for crawler.MaxActiveThreads value greater than 200
8 years ago
luccioman d286ba2c3e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman b8f6458152 Prevent yacy main thread from hanging on browser opening process.
8 years ago
reger 70e1eb30a5 prevent StringIndexOutOfBounds in getLocalFile()
8 years ago
luccioman 1bb0b135ac Avoid duplication of various MS Windows file URLs flavors
8 years ago
luccioman b9a8476f02 Removed unused import
8 years ago
reger e73c1eea8c remove unused rootpattern, leftover from commit
8 years ago
reger 6f8c3ccea4 improve url hash computation for file path with mixed java & windows
8 years ago
reger efcb6a1e74 fix supported mime XML -> xml for rssParser (mime normalized to lower case for comparison)
8 years ago
luccioman b3b75b0498 Accessibility : add a customizable alternative text to YaCy log
8 years ago
luccioman f2bc1b268d Updated URL fragment validation rules according to current standards
8 years ago
luccioman b1b8e69da8 Fixed NullPointerException cases
8 years ago
luccioman 3ee4f56c39 Improved ErrorCache behavior when switching networks
8 years ago
luccioman 7d5ba2afa4 Added some JavaDoc and moved crawlStacker close at the right place.
8 years ago
luccioman 8edbcd8ad4 Log eventual Solr instances close errors.
8 years ago
reger 330768c8a2 fix for solr write.lock after mode change http://mantis.tokeek.de/view.php?id=686
8 years ago
reger 585d2a6441 test case: for NewsPool to check the id modificator (for unique id)
8 years ago
luccioman de5c873e38 Removed unused JavaScript file docs.min.js
8 years ago
Michael Peter Christen df51e4ef07 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
Michael Peter Christen e063aaf97f enable fuzzy search, solr style (append a ~ to get a fuzzyness on the
8 years ago
reger ff6589fc0f test case: simulating multi word query for local rwi index
8 years ago
reger e990297d2e avoid NPE on hello message with missing "yourip" key
8 years ago
reger e51ab8c7aa hack to generate a unique message-id for messages created in the same second
8 years ago
Michael Peter Christen b82300358a removed version number check because it does not work any more if
8 years ago
Michael Peter Christen 2107674999 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 0d28f563f4 fix for java version "9-ea"
8 years ago
reger 3b694b3935 add some javadoc to rwi wordreference distance, position
8 years ago
reger a4465c97d6 as requested, disable/remove old swf parser
8 years ago
reger 7f63fc50f3 prepare a IndexSegment test case for RWI index testing
8 years ago
reger 96467c5467 remove not needed counter in Tokeninzer (completing last changes)
8 years ago
luccioman d66b0f7b7b Fixed french messages encoding in YaCy tray.
8 years ago
reger 7efb66ee10 adjust the WordReference.join wordsintext calc to take the max (instead of sum)
8 years ago
luccioman 0a9ff14d96 Fixed NullPointerException case and added Javadoc
8 years ago
luccioman 06d4f93d03 Merged master into postprocessing branch
8 years ago
Michael Peter Christen b73d2db914 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 25a3c7a6d0 catch exception and write end of object
8 years ago
reger 272cdd496a reactivate sentence counter in WordTokenizer for phrasepos ranking,
8 years ago
Michael Peter Christen 5e165a8150 removed unused imports
8 years ago
Michael Peter Christen c716648c78 enhanced json encoding of strings
8 years ago
Michael Peter Christen 6139bd85a8 fix for broken facet names
8 years ago
Michael Peter Christen 5060f9fee9 fix for too long snippets
8 years ago
Michael Peter Christen 8681cee3f3 fix for bad comma
8 years ago
Michael Peter Christen db6d8fc197 fix for bad json
8 years ago
Michael Peter Christen 8f4a341735 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 9934f546bb added default fl to solr query, removed large texts retrieval and
8 years ago
reger 120bf7e6e2 implemented RWI WordReference to return the word position value (was always left empty)
8 years ago
reger e310ec5f70 fix posInText ranking calculation to score 0 on no position info
8 years ago
luccioman 74f9927ddc Merge remote-tracking branch 'origin/master' into dist_macOS
8 years ago
reger 51c077f493 adjust the getTopics() and getTopicNavigator() to current useage
8 years ago
reger 39dd244693 fix ConcurrentScoreMap.set() calculation of totalCount()
8 years ago
reger ebf818ad95 log a error on aborted news publish (due to duplicate news.id)
8 years ago
reger cc2d9dd3f1 reactivate the use of included-in-topwords boost in postRanking
8 years ago
luccioman 39ea28adfd Merged master to dist_macOS branch.
8 years ago
luccioman 8255e91c99 Fixed serverClassLoader.findClass method
8 years ago
reger 6801673a07 apply postranking media search boost only on media queries
8 years ago
luccioman 1dc4306058 Fixed indentation for better readability.
8 years ago
luccioman 8c49a755da Postprocessing refactoring
8 years ago
luccioman 42f45760ed Refactored postprocessing
8 years ago
reger 4386e84b55 correct NewPool rentention calculation
8 years ago
reger 5e72d37f0a TransNews_p: add ad-hoc translation of target file on positive vote (additon to local translation)
8 years ago
reger 9462a32244 Added news service for easy, community driven UI translation support.
8 years ago
reger f8d6543a23 Rename class CreateTranslationMaster to TranslationManager and add
8 years ago
reger 19b4509d54 speed-up reading of xlif language file, by using xmlparser (stax) instead of jaxb
8 years ago
Michael Peter Christen e1fac86f53 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen a9316ceff6 force browser-caching of favicons from search results
8 years ago
Orbiter 503312ca43 Merge pull request #61 from luccioman/heroku_experiments
8 years ago
reger 33bf35d90f missing file for prev commint "Introduction of additional language setting browser"
8 years ago
reger 16e8ed3f01 Introduce additional language setting "browser/Browser Language" for UI internationalization.
8 years ago
reger 3b47a07dd1 change unused servletProperties entry CONNECTION_PROP_CLIENT_REQUEST_HEADER to
8 years ago
reger 036c1dc6ef fix CookieTest_p formatting (output of <br> as text),
8 years ago
Michael Peter Christen bf6709d196 fixed missing browser activation in linux
8 years ago
Michael Peter Christen d8504418b6 enhanced browser-caching of static content
8 years ago
Michael Peter Christen 079112358c Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen efeb592661 don't do solr optimization, this create high IO load. We should leave
8 years ago
luccioman 46b8836548 Copy image resources contained in donation iframe.
8 years ago
reger 4c7a77662a eleminate dependency on file-extension in storeDocument but use supported mime-type
8 years ago
reger ebde21079a refactor xlsParser to include Excel file attribute (like author) in parser result doc.
8 years ago
luccioman 744c9a2615 Opensearch desc : handle https protocol url with default port (443)
8 years ago
luccioman b9c28893ee Merged master to 'heroku' branch.
8 years ago
Michael Peter Christen 103a8348b3 fix for NPE and small performance enhancement
8 years ago
reger 2910fe35c1 add missing scheduler calc of next exec_date (call of calculateAPIScheduler)
8 years ago
reger 70d47ae38a keep scheduler selection by repeat entry from 07311020d4
8 years ago
reger 7c3f932e5d revert due to conflict with double count recording by schedulter / servlet by the commit under normal operation (no shutdown)
8 years ago
reger 07311020d4 postpone apicall exec date init until actual call
8 years ago
reger 5e335b32da fix Blacklist.contains() matching path pattern to string
8 years ago
reger 5e9e871192 fix Blacklist.remove by using pattern.toString to find pattern to remove,
8 years ago
reger 1843ea7e69 on Blacklist.add pattern to source file also update internal entry maps
8 years ago
reger bf6ce33da3 Correct use of _htDocsPath config in YaCyDefaultServlet to use servlet config variable
8 years ago
luccioman 480027ec98 Merge remote-tracking branch 'origin/master' into heroku_experiments
8 years ago
reger fcad2d0744 add uses of config constant INDEX_RECEIVE_ALLOW
8 years ago
reger 226f81cfcf declare poison pill url MultiProtocolURL() as protected to make sure not
8 years ago
reger f8632ad292 prevent string index out of bounds MultiProtocolURL.getPaths
8 years ago
reger 35a7d57260 update lucenematchversion to current (5.2.0 -> 5.5.0)
8 years ago
reger 9b07bbf955 deprecate newurl(), not used and already replaced
8 years ago
luccioman 47d486298f Merged changes from master.
8 years ago
reger 774b3906a9 fix GenericFormatter.parse ("time","timeoffset")
8 years ago
reger 27163af0e1 improve detection of referenced links by taking http and https link protocol
8 years ago
reger f89d4eb51d fix MultiProtocolURL init (assign of host) for urls with '/' in query part
8 years ago
reger 87fcfc6d78 Adjusted hash computation and toNormalform for file:// protocol to deliver
8 years ago
luccioman d6bf90803f Merged from maain master branch.
8 years ago
luccioman 9b9c112263 Handle more propertly local port configuration by system property
8 years ago
reger 3811184abd fix GSA servlet clientIP retrival
8 years ago
reger 7ab41d4ff1 use directories original lastmodified date in file- & smbloader in response
8 years ago
reger 708bcbb042 one more replacement to use cached hosthash vs. calculated
8 years ago
luccioman b57a06d88e Let Heroku decide which http port to use
8 years ago
reger 22db449f2a to prevent crawler to concurrently access and alter same crawl queue
8 years ago
luccioman 893a40995a Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Orbiter 50c5ddf1a1 Merge pull request #56 from luccioman/LibreJS
8 years ago
Michael Peter Christen 7466d390b2 small refactoring + do not accept too old peers during bootstrap
8 years ago
luccioman 6e96c7341a Merge remote-tracking branch 'origin/master'
8 years ago
reger 8d58a48029 remove wrong log line in CrawlSwitchboard
8 years ago
reger 5aaa057c65 ignore empty input lines in FileUtils.getListArray() to poka joke blacklist read.
9 years ago
reger 41c36ffd75 exclude rejected results from result count
9 years ago
reger d4da4805a8 internal wiki code, require header line to start with markup
9 years ago
reger e952e355a2 have Translator servlet adhoc apply added translation by translating a single file
9 years ago
reger b119ff65be clean out not used Switchboard variables
9 years ago
reger 223071337b Translator to take caution of word boundaries to identify text portion to
9 years ago
luccioman 009657791e Merge remote-tracking branch 'origin/master' into LibreJS
9 years ago
luccioman a73c9327a5 JavaScript License fixes for LibreJS compatibility
9 years ago
reger 0c40401d28 fix MessageBoard test for null data
9 years ago
reger 5b22c63030 Adjust TranslatorXliff to load default 1st and merge downloaded or modified local translation.
9 years ago
reger a2e0f00456 optimize Translator
9 years ago