Commit Graph

8842 Commits (0e4c93f02a3dc5cd78f69f93d22ebe09f7ec3f2a)

Author SHA1 Message Date
Andreas 0c4db9eef0 Merge pull request #3 from yacy/master
7 years ago
reger c31d94664a Update deprecated SolrInputDocument.addField() with boost value
7 years ago
luccioman 7e271f9cf5 Updated travis config : install ghostscript, required for Html2Image
7 years ago
luccioman 32c9dfa768 Added partial bzip2 stream parsing support and bzipParser Junit test
7 years ago
luccioman dd9cb06d25 Fixed RWI distance calculation on multi words search queries.
7 years ago
luccioman 6b11bf3a12 Fixed NullPointerException case on 'Browser' lang selection
7 years ago
reger ae1c675c85 fix array out of bounds in YJsonResponseWriter and OpensearchResponsWriter
7 years ago
otter 73d1d577fd prevent integer overflow in chartDot for nodes with a big index
7 years ago
otter 4e2ccdfcac prevent integer overflow in chartLine
7 years ago
luccioman 27ab733685 Ensure private search features are not lost on Digest auth timeout
7 years ago
reger ba60f65040 Adjust filetype: query modifier parameter to lower case
7 years ago
luccioman 57a33aefb0 Removed unnecessary max counts init on empty search navigators.
7 years ago
luccioman ef8aea7f8d Made the dates navigator max elements number user configurable.
7 years ago
luccioman 9e86d183b8 Disable manual search results resorting when resorting is done with JS
8 years ago
luccioman 66cb9c4ff9 Added Solr filter queries for audio, video and application domains
8 years ago
luccioman 5d3ceb31b7 Improved search navigators counters accuracy and consistency.
8 years ago
luccioman 8e4f31bdc7 Updated internal ISO 639-1 language codes with latest standards.
8 years ago
luccioman a28428047a Fixed count of filtered results from local solr.
8 years ago
Michael Peter Christen 2f71005a93 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 2314f8e358 try to fix problem
8 years ago
luccioman 3c9df6e0ce Use local solr filtered results in total search results count.
8 years ago
luccioman a1a0515312 Added a button to manually refresh sorting of p2p search results.
8 years ago
luccioman 4eba88f2ff Removed some unnecessary uses of java.lang.reflect api.
8 years ago
luccioman da3dbf9ea1 Use Javadoc style comments on SearchEvent properties.
8 years ago
luccioman c6ae87168a Added unit tests on the gzip parser.
8 years ago
luccioman 169ffdd1c7 Finer control on max links to parse in the html parser.
8 years ago
luccioman e41d046a9d Improved parsing support for OOXML spreadsheets (.xlsx)
8 years ago
reger 51a4e03c93 Allow to stop currently running warc import (stop button)
8 years ago
luccioman 6cec2cdcb5 Use unredirected robots.txt URL when adding an entry to the table.
8 years ago
luccioman 3f0446f14b Ensure proper synchronous robots entry retrieval on first check.
8 years ago
luccioman b23a563065 Prevent search result failure on incomplete images information.
8 years ago
Michael Peter Christen 30d71c6359 added usage of X-Real-IP http header
8 years ago
Michael Peter Christen f45378c11c Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 7f395ef937 added image link in search results
8 years ago
luccioman 780173008e Implemented partial stream parsing of tar archives.
8 years ago
luccioman acab6a6def Also handle text content when parsing XML within limits.
8 years ago
reger 2a07799ad1 Correction of d03e2c98ea
8 years ago
reger d03e2c98ea Fix Conjunction.addOperator to do nothing if term is empty
8 years ago
reger b6a41df4f7 Remove deprecated YaCyProxyServlet
8 years ago
luccioman 8a94fef9e0 Prevent unwanted cached bytes duplication on stream parsing.
8 years ago
reger 4979439e87 Skip public post of jre version.
8 years ago
reger e918ec199e Replace deprecated ConcurrentHashSet with recommended Java8
8 years ago
reger fb71994342 Harmonizing use of xml reader / sax parser in XMLBlacklistImporter
8 years ago
reger 275d65fffe Patch last_modified date with internal FirstSeenTime() if no date provided
8 years ago
reger d1b23afed6 Remove obsolete Protocol parameter ttl (time to live)
8 years ago
reger 15d78b1064 Replace deprecated getIP with getIPs in Protocol transferURL() and
8 years ago
reger ed36b47bec Replace one more deprecated peerDeparture in Protocol.transferIndex()
8 years ago
luccioman 0ee8c030c4 Log an error when Solr folder migration fails for some reason.
8 years ago
luccioman 5a646540cc Support parsing gzip files from servers with redundant headers.
8 years ago
luccioman 11a7f923d4 Distinguish response parsing failures from unexpected exceptions.
8 years ago
luccioman eda7b0aeb6 Merge branch 'master' of https://github.com/yacy/yacy_search_server
8 years ago
reger 3005be7349 Clean up unmaintained and unused AugmentParser trail.
8 years ago
luccioman cb4f1358e1 Added gzip parser support for max content bytes limit
8 years ago
luccioman 5216c681a9 Added HTML parser support for maximum content bytes parsing limit
8 years ago
luccioman 4aafebc014 Merge pull request #122 from Scarfmonster/patch-1
8 years ago
luccioman 651fad6da5 Added RSS parser support for maximum content bytes parsing limit
8 years ago
luccioman 452a17a8d5 Finer control on bounded input streams with custom stream implementation
8 years ago
luccioman f8f1959ebb Added parsing within bounds implementation to the generic parser.
8 years ago
luccioman e0f400a0bd Support trying multiple parsers even when streaming on large resources.
8 years ago
luccioman 1e84956721 Support loading local files with a per request specified maximum size.
8 years ago
luccioman f369679d1c Fixed read/copy on input streams reading sometimes less than expected.
8 years ago
luccioman bf55f1d6e5 Started support of partial parsing on large streamed resources.
8 years ago
luccioman 90a7c1affa HTML parser : removed unnecessary remaining recursive processing
8 years ago
reger e6e20dab52 upd to Jetty 9.4.6.v20170531
8 years ago
luccioman dcc56318bb Made remote search max system load limits configurable from UI.
8 years ago
reger ddd13b776d Add keyword constraint to rwi query result filter
8 years ago
luccioman e82eaee4b6 Apply consistent behavior on HTTP resource size exceeding limit.
8 years ago
luccioman 0b75e92ac2 Do not wrap unnecessarily loader IOExceptions in IOExceptions
8 years ago
luccioman 433bdb7c0d Respect maxFileSize limit also when streaming HTTP and when relevant.
8 years ago
luccioman 9b1bb2545e Refactored plain-text URLs detection implementation.
8 years ago
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale.
8 years ago
luccioman 286f3018bd Made mime type and extension normalization locale independent.
8 years ago
luccioman 319231a458 Added a generic XML parser, able to parse elements text and URLs.
8 years ago
Ryszard Goń 3cedbbd4ed Wrong password was removed after the SSL certificate import
8 years ago
luccioman 64cec2790d Improved character encoding detection from Content-Type header
8 years ago
luccioman 0487336ec3 Prevent integer overflow in table statistics and use strong typing
8 years ago
luccioman d2a4a27f52 Improved stream-oriented parsing entering conditions.
8 years ago
luccioman 9dd790087d Added HT Cache basic statistics (hit rate)
8 years ago
luccioman 5fdd5d16b1 Use volatile to ensure concurrent threads use up to date property value
8 years ago
luccioman 28b451a0b3 Made Cache compression level and lock timeout user configurable
8 years ago
luccioman a7394b479b Limit the synchronization blocking time on some Cache operations.
8 years ago
Michael Peter Christen c94a8c76bd re-added solr synchronization hack
8 years ago
Michael Peter Christen 6fe735945d migrated Solr 5.5 -> Solr 6.6 and from Java 1.7 -> 1.8
8 years ago
luccioman ce89492319 Ensure system resource release by closing document stream.
8 years ago
luccioman 8399275142 Properly close file output streams even on exceptions scenarios.
8 years ago
luccioman 4e4dc6c4e5 Removed unnecessary finalize implementation.
8 years ago
luccioman a04feac064 Ensure file input streams proper closing in both success and failures
8 years ago
luccioman d98c04853d Ensure proper closing of file input streams.
8 years ago
luccioman c53c58fa85 Unsure closing ChunkIterator stream in every possible use case.
8 years ago
luccioman 29e52bda39 Merge branch 'master' of https://github.com/yacy/yacy_search_server
8 years ago
luccioman a9cb083fa1 Improved consistency between loader openInputStream and load functions
8 years ago
reger a814f3d885 Introduce keyword query parameter
8 years ago
luccioman c226ded799 Fix unescape of URLs having some '%' chars but not percent-encoded
8 years ago
luccioman 306a82dd71 Fixed scraper NullPointerException cases on malformed URLs.
8 years ago
luccioman aa55d71cf5 Fixed a NullPointerException case on Digest authentication.
8 years ago
luccioman 02ec0ed13c Quoted param value in Solr query to avoid unwanted traces in logs
8 years ago
reger 1737af37cf Set request originator to own peer in warc importer
8 years ago
reger 039162fbf0 Change warc importer to use defaultsurrogate-crawl profile, as reported
8 years ago
Michael Peter Christen 3b1d640a3c enhanced debugging
8 years ago
Michael Peter Christen 7de7879f13 added a cache to prevent too many seed enumerations
8 years ago
luccioman bd7411a53a Enable p2p and cluster communication when "Protection of all pages" on
8 years ago
luccioman 31ad043bb9 Added user interface feedback on results feeding termination status.
8 years ago
sgaebel ff6392215e added closing of lst-Tag in solr-Export
8 years ago
luccioman d90b001e1b Improved previous merge "Show ranking in HTML UI".
8 years ago
luccioman 0f0f42b509 Added some JavaDoc
8 years ago
reger 077d062be3 Adjust mergeDocuments to keep youngest last-modified date of document
8 years ago
luccioman 654801523e Fixed StringIndexOutOfBoundsException case.
8 years ago
luccioman 522a268305 Improved new blacklist entries URL scheme detection.
8 years ago
luccioman 532981b363 Updated putHTML() JavaDoc
8 years ago
luccioman 58d23047dd Handle '?' and '+' chars as valid wild cards when adding to blacklist.
8 years ago
luccioman a87281b498 Added MediaWiki dump import scheduling feature.
8 years ago
luccioman edd7ccac40 Added some JavaDoc
8 years ago
luccioman 79fdf14b0a Fixed regression introduced by commit 9ad4d16
8 years ago
Michael Peter Christen 7678fd67e3 copied fix from yacy_grid_parser for wrong array type
8 years ago
Michael Peter Christen 200b100fb8 added patch to rewrite altered yacy grid schema into yacy schema
8 years ago
reger 9ad4d16829 Add a responsHeader to the solr index export with a format identifier
8 years ago
luccioman 9697209ef6 Fixed Index Export feature for compatibility with old indexed documents.
8 years ago
luccioman 88c062639b Added some JavaDoc
8 years ago
luccioman 31fff2c986 Extended WikiCode template inclusion syntax support.
8 years ago
Michael Peter Christen 973d74712f added yacy grid flatjson surrogate parser
8 years ago
luccioman b1da92648e Fixed surrogates import monitoring page (/CrawlResults.html?process=7)
8 years ago
luccioman 527d494c1a Fixed "Unchecked conversion" compilation warnings.
8 years ago
reger c77e43a391 Take out mailto collect in internal parsed document
8 years ago
Michael Peter Christen 335868edba Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
reger bec34d3546 Add url input field as source for WarcImporter
8 years ago
luccioman f66438442e Extended Mediawiki dump import to remote URLs.
8 years ago
luccioman e5c3b16748 Improved http client close time on stream processing errors.
8 years ago
luccioman 23775e76e2 Fixed endless loop case in wikicode processing.
8 years ago
luccioman 0bc868a819 Improved support for non ASCII chars in local file system URLs
8 years ago
reger 7b80189bda Activate hosts navigator plugin. This includes rwi results in the navigator
8 years ago
Michael Peter Christen f5ad29edb1 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 76e9135526 added flatjson parser (stub, unfinished)
8 years ago
reger b7417ac329 Introduce a Keyword search navigator using the index field keywords.
8 years ago
luccioman 09e72eb0a4 Set Config Portal as a private administration page.
8 years ago
reger ba339a2a45 Add servlet to import warc file from filesystem IndexImportWarc_p.html.
8 years ago
Michael Peter Christen 1d81b8f102 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 69081bce00 added export to elasticsearch. The export dump can easily be imported to
8 years ago
reger 510f11d374 Implement surrogate import from Warc archives (as first option handle
8 years ago
luccioman 4b649b0a11 Fixed NPE case and API URL link on Solr HTML output for webgraph core.
8 years ago
luccioman af28a07780 Updated API calls recording/replay with recent changes.
8 years ago
reger 81670c3484 One more use of SwitchboardConstants.SERVER_PORT constant,
8 years ago
luccioman cde237b687 Enforced access controls on some administrative actions.
8 years ago
luccioman df5970df6d Extended Apache HTTP Digest Auth. for use of YaCy encoded password
8 years ago
reger f05976c017 Display the local search word statistic in alphabetic order
8 years ago
reger 3dd23c178b Introduce the option to configure a shutdown port.
8 years ago
reger a2afb4bae0 add switchboardconstants for server ports config keys
8 years ago
reger 56d0a87a83 remove double occuance of geo:lat in rss tokens
8 years ago
reger b4fa1141b8 implement RequestHeader getRequestURI, getRequestURL for legacy request
8 years ago
reger 209a7374bd remove unused import pdfParser
8 years ago
reger de1c1c16db Improve pdf text extraction resource handling.
8 years ago
reger 9b6d1abd9e eliminate some compiler unchecked and deprecation warnings
8 years ago
reger 18c7563dbe Extend DCEntry.getLanguage convert to ISO639-1 codes for more languages
8 years ago
reger ce87025462 further avoid to set connect info properties as header value
8 years ago
reger cd4d891ea4 use pre-defined "Connection" header key, replace depreceated
8 years ago
luccioman 0173b0bc32 Added an advanced settings page for referrer policy settings.
8 years ago
reger 81963a89fe fix proxyservlet response url to respect http scheme if a relative
8 years ago
luccioman cdcd923375 Privacy enhancement : added settings to control referrer policy.
8 years ago
reger 86534a56f7 fixed ReindexSolrBusyThread new and unexpected repeat of same query with
8 years ago
reger 275c0cddd1 Adjust DefaultServlet test case to recent change,
8 years ago
reger 41e2ee0eca Fix call parameter for ConnectionInfo in MonitorHandler
8 years ago
luccioman ac766327d3 Switched a few more Solr fields from strictly mandatory to optional
8 years ago
reger f254fcfc67 fix htmlParser <script> text extraction on code containing expression
8 years ago
luccioman 2f191e0e1c Improved MultiprocotolURL non ASCII characters support.
8 years ago
luccioman 18e8b3a220 Merge branch 'escape' of https://github.com/sinkuu/yacy_search_server
8 years ago
reger 7419989de3 Correct dublincore title property text to lowercase in htmlresponsewriter,
8 years ago
Burkhard 4fdc11cae8 Update SearchEvent.java
8 years ago
luccioman cdc7f3e431 Switched some Solr fields from mandatory to optional
8 years ago
luccioman 3475d8c1a9 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman c68a8be2d9 Refactored and enforced Solr mandatory fields for proper operation
8 years ago
reger 334c70c37a correct fromDate init value on missing param in api/timeline_p servlet
8 years ago
reger cc770512d5 add hint of query syntax in AccessTracker log (qs=normal querystring,
8 years ago
luccioman e5858bc8c8 Fixed a NullPointerException case possible on Index Export
8 years ago
reger 7e53860fc7 fix NPE in HTMLResponseWriter on missing document title
8 years ago
reger 5e8879beb7 Reduce self generated content for text_t (visible text index field)
8 years ago
luccioman 6e89d125f2 Added robots.txt support for heuristics federated search.
8 years ago
sinkuu a46b232bf1 Use java.net.URLDecoder
8 years ago
luccioman bf16de29c1 Added support for HTML OpenSearch results.
8 years ago
luccioman 54405577aa Replaced absolute redirection locations by relative ones when possible.
8 years ago
luccioman 1857651988 Added a new Debug/Analysis advanced settings subsection.
8 years ago
luccioman 526f2d6a8b Fixed NPE case occurring when local solr index is disabled in search.
8 years ago
luccioman def55ec166 Improved termination of timed out remote solr requests to peers.
8 years ago
luccioman 08de58b6d3 Named a Thread without name for easier monitoring
8 years ago
luccioman 9a5a124bf2 Distinguished solr connectors thread names for easier monitoring.
8 years ago
reger 1f497ccad5 Add consistency check for related index fields upon load and save of
8 years ago
luccioman 68afe900d0 Added user-friendly controls over disk usage configuration settings.
8 years ago
reger 95d2a28599 adjust the Field-Reindex Thread to verify and update the document id
8 years ago
luccioman fc01b69eca Fixed local image search pagination regression.
8 years ago
Michael Peter Christen 02d0b3172c Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen d4f45cf05e added dc.date.modified and dc.date.created to date parser
8 years ago
reger f9180fabc4 assure that RWI Index.Segment IODispatcher is not blocking on shudown
8 years ago
reger e61ee180a7 Group all proxy settings on System Administration by adding settings of
8 years ago
luccioman 39e081ef38 Fixed display of crawler pending URLs counts in HostBrowser.html page.
8 years ago
reger df80c57842 add ukr and pol to DCEntry.getLanguage ISO639-2 3-char language code
8 years ago
luccioman e048e74072 Added an optional parameter to webstructure.xml api.
8 years ago
reger 581b00cc20 remove obsolete lastmodified calculation in WebgraphConfig
8 years ago
luccioman 5c8958bcea Updated Javadoc and Junit tests for the WebStructureGraph class.
8 years ago
luccioman d9766ca981 Fixed WatchWebStructure_p.html render to include https URLs.
8 years ago
luccioman ed3dd5e31a Fixed webstructure.xml API used with a domain name 'about' parameter.
8 years ago
luccioman 0da1e6ba16 Factored code re-implementing DigestURL.hosthash() method.
8 years ago
luccioman 86adfef30f Added automated unit tests and perfs test for WebStructureGraph class.
8 years ago
luccioman 9cea7cbb10 Detailed some Javadoc related to /api/webstructure.xml usage.
8 years ago
luccioman 6a4d51d8f9 Cleaned up some Javadoc warnings.
8 years ago
luccioman 86dc198698 Fixed some JavaDocs broken links.
8 years ago
reger 16beb551ea fix DC.Elements namespace in DublinCore vocabulary class
8 years ago
luccioman 339f005ced Blacklist import and update performance improvements.
8 years ago
luccioman e3892b0957 Added some JavaDoc.
8 years ago
reger 4c9be29a55 fix concurrency issue with htmlParser using not current scraper data
8 years ago
reger eedee6eabb fix exception on URIMetadataNote instantiation with corrected id hash on
8 years ago
luccioman c1401d821e Adjusted crawl depth control for FTP crawl start URLs.
8 years ago
reger 68d4dc5cc5 Complete harmonization RequestHeader getCookie with std ServletRequest
8 years ago
reger a1e5f7dbca fix of fulltext.remove() by id of webgraph document
8 years ago
luccioman 1df558a6c6 Fixed YaCy proper shutdown triggered by SIGTERM signal.
8 years ago
reger b522d540b9 Include itemprop latitude/longitude (see schema.org) in attribute
8 years ago
reger 083df255e4 fix html tag attribute parsing containing attribute w/o value
8 years ago
reger cb95b7339a include html5 <time> tag in content scraper,
8 years ago
reger 7bf2bcf504 fix and prevent exception on missing required cookie name
8 years ago
luccioman 3ca695390c FTP crawl start URLs : applied crawl profile depth control
8 years ago
luccioman 128c8ef8d4 Fixed title rendering having non ASCII chars in QuickCrawlLink_p.html.
8 years ago
reger 8eb6fba59c activate filetype navigator plugin and restrict config (append) of navs
8 years ago
luccioman c25e48e969 Enabled displaying results after 14th page for local search queries.
8 years ago
luccioman a3886c6adb Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman feaa87005e Improved indentation for easier debugging steps.
8 years ago
reger bab4804d11 add FileTypeNavigator plugin
8 years ago
reger d35c47090c remove obsolete put of HttpServletRequest attributes to YaCy servlet
8 years ago
reger 0959038624 correct DefaultServlet resource pathinContext calculation
8 years ago
reger c50e23c495 reduce creation of empty legacy RequestHeader() in situation where null
8 years ago
reger 87f6631a2a adjust Cache getHeader to prev. changes/commit
8 years ago
reger 6be7339b1d remove the overhead of unused reverseMappingCache of HeaderFramewor / RequestHeader
8 years ago
reger c702eb6786 del dead menu link to /repository
8 years ago
reger baa5d9b9e3 adjust DomainHandler working on resolved .yacy domain
8 years ago
luccioman 1ba705c23d Use loaderDispatcher instead of HTTPClient to download releases.
8 years ago
luccioman 467650c042 Hardened system update checks.
8 years ago
luccioman b5711b8fe1 Added some Javadocs.
8 years ago
reger 0d2964cf2b expanded error message on rejected crawl url due to faile dns lookup
8 years ago
luccioman 00e81fcc15 Check HTTP status when downloading a release, and report eventual error.
8 years ago
reger 0758c868c9 add HostNavigator plugin
8 years ago
reger 60160877f5 bundle initialization of search navigation plugins in separate handler
8 years ago
reger 3151cda3a5 catch ip-format exception on wrong server access setting ip filter
8 years ago
reger b32bcdf344 list entries in outgoing cookie monitor one per line
8 years ago
reger 3f32262654 enable getCookies for HeaderFramework reusing Jetty CookieCutter
8 years ago
reger 4186ee6fc0 add other custom response header entries set by servlets to the response
8 years ago
luccioman d27adc2b92 Fixed language detector initialization and NullPointerException cases.
8 years ago
luccioman a1f922b34a Reduced locations vocabulary memory footprint.
8 years ago
reger 9c06e752e4 allow request.setAttribute w/o "not implemented" exception by default
8 years ago
reger 59ab42e7d6 add UserDB lastaccess update calls on login
8 years ago
luccioman bf8a6d9848 Reduced GeoNames locations memory footprint.
8 years ago
luccioman 3f561c1635 Fixed a NullPointerException case.
8 years ago
luccioman 6bc2bf1aa4 Small memory footprint reduction for GeonamesLocation.
8 years ago
luccioman 7f846ef674 Small complementary memory footprint improvement for synonyms.
8 years ago
luccioman 568e3dde6a Improved synonyms memory footprint.
8 years ago
reger 60b3adfb43 fix ext2mime to return given default on input=null
8 years ago
reger f7e9f9be5f move Digest auth checks from DefaultServlet to adminAuthenticated,
8 years ago
luccioman cca3417b87 Fixed image and favicon viewing for unauthenticated local requests.
8 years ago
reger 02092de3d8 remove login cookie generation for static admin ind User servlet
8 years ago
luccioman fc575fc760 Fixed a NullPointerException case.
8 years ago
reger 9a8691129f fix typing error from commit 60ba5c117c
8 years ago
reger f9328f07e2 completing the usage of CONNECTION_PROP_CLIENT_HTTPSERVLETREQUEST in
8 years ago
reger 8e3e3ed191 update the older ResponseHeader patch to handle cookies,
8 years ago
reger 866d3a1960 make RequestHeader login succeed (without throwing exception by default)
8 years ago
reger 44a6a4e795 fix authentication by hit in userdb (wrong parameter)
8 years ago
luccioman aa9ddf3c23 Added control over Robots.txt active threads maximum number.
8 years ago
luccioman 3092a8ced5 Fixed thread name consistency for improved monitoring.
8 years ago
luccioman eec5779889 Added a name prefix to pooled threads for easier monitoring.
8 years ago
reger 59130777a6 add high scored items first to YearNavigator (to make sure to be included
8 years ago
luccioman 0ba5a838f7 Added charset meta to Solr HTML writers.
8 years ago
reger 08a0acc35d make a YearNavigator availabel, useable as SearchEvent.naviator plugin.
8 years ago
reger 7742579ca4 make a LanguageNavigator availabel, useable for the SearchEvent.naviator
8 years ago
reger 0d3bef659b implement RequestHeader.setCharacterEncoding for legacy header,
8 years ago
Michael Peter Christen 5320209963 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
Michael Peter Christen 83f5e3d715 added+disabled a federate search experiment
8 years ago
reger 4eeb448eb3 use DigestURL in UrlProxyServlet as parameter to pass requested url to
8 years ago
reger bad8f87998 remove old/obsolete clear text "adminAccount" credential entry from init
8 years ago
reger 811cf637f8 fix Jetty9YaCySecurityHandler, length check of Basic credential,
8 years ago
reger fdcf33f08f fix Domain.stripToHostName for some IPv6 cases
8 years ago
reger ac6e198bd1 add unit test for Domains.stripToPort,
8 years ago
reger f27531f5ec fix Domains.stripToPort, make ipv6 save
8 years ago
reger 67744a8038 fix HeaderFramework.getRequestURL on host with port considering ipv6 host
8 years ago
reger 66cc0dd173 refactor: move GSA specific date formatter to GSAservlet
8 years ago
reger d525967999 refactor: move convertHeaderFromJetty to ProxHandler (only used with active proxy
8 years ago
reger 60ba5c117c fix legacy getHeaderCookies to work with cookies from original
8 years ago
reger 30f8d1e2d7 let RequestHeader.logout succeed w/o throwing exception by default
8 years ago
reger 28afd3a2f8 fix UserDB.proxyAuth from header string
8 years ago
luccioman 0806de8fdc Ensure file input stream are closed in both normal and error cases.
8 years ago
luccioman a0dfbaca6a FileUtils : added some JavaDocs and unit test cases
8 years ago
reger 59448461d3 make use of userInRole for quick login verification
8 years ago
reger 2a4d826d9e adjust servlet RequestHeader.getLocale
8 years ago
reger 9db68acb4f remove obsolete X_YACY... header declarations
8 years ago
reger 8e9aece786 more use of RequestHeader constant referer, authorization
8 years ago
reger d631fbc019 make more use of the new ServletRequest interface methodes
8 years ago
reger 395f2e8946 Make ServletRequest implement the standardized HttpServletRequest interface,
8 years ago
luccioman 74fec066f4 Converted more URLs to pure relative ones.
8 years ago
luccioman 0f0393e5e3 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman 7296e3884f Switched even more URLs to pure relative ones.
8 years ago
reger 49eae79c01 fix Tables.hasIndex check for tablename = key
8 years ago
luccioman 84b81c1af0 Switched more URLs to relative ones when possible.
8 years ago
luccioman 731684105a Improved absolute URLs rendering in OpenSearch desc and RSS feeds.
8 years ago
reger 669f60223e upd Column.toString to output encoder "{bytes}"
8 years ago
reger c9e81d2fa0 fix Column parsing from celldefinition string, without cellwidth def.
8 years ago
reger e0816ef2e5 use human readable date format in CrawlStacker error message
8 years ago
luccioman 54d879a9b3 Generate HTML relative (to each peer) links from hosted WikiCode.
8 years ago
luccioman 2da5f339f8 Fixed /News.html and /Wiki.html pages in Search Portal mode (issue #87).
8 years ago
reger 8fe28a83f2 harmonize used lastmodified date for rwi and fulltext in storeDocument
8 years ago
reger 3d1d297308 refactor namespace navigator as part of navigatorplugin map, this allows
8 years ago
reger 67f660523b Make navigators underlaying indexfield name accessible in interface
8 years ago
reger 5eb3ee4e20 Add search navigator interface to allow for additional navigators (plugins)
8 years ago
reger fd3f58fcaa improve query modifier parsing of "collection:" and possible collision
8 years ago
reger af39a76bf6 Reduce number of default max. search navigator lines (from 10000)
8 years ago
reger 20a1b29ed3 add simple test case for ReferenceContainer helpful for debugging
8 years ago
reger 3c7220bc7b Refacture rwi reference word position and word distance calculation
8 years ago
luccioman f0639d810c Customized name for Threads still using the default "Thread-n" pattern.
8 years ago
luccioman db3b9db9c2 Crawl from local file : faster task end when manually terminating crawl.
8 years ago
reger 4c67ed3f8d catch rwi ranking div by zero exception
8 years ago
luccioman 47af33a04c Advanced Crawl from local file : better processing of large files.
8 years ago
luccioman ee92082a3b Updated javadocs : warning about closing stream responsibility.
8 years ago
luccioman 6f49ece22f Fixed redirected URLs processing as crawl start point.
8 years ago
reger 68217465fe div by null in word distance calculation
8 years ago
luccioman 7263d17436 Removed mentions of deprecated LURL-db.
8 years ago
reger 8b74a6bf57 fix min/max calculation of WordReferenceVars.distance()
8 years ago
luccioman da362628fb Added fine log level for too long blacklist matching processing.
8 years ago
reger aaae7c6462 adjust ConcurrentScoreMap internal value map to interface and use parameter
8 years ago
reger 31d2a5645e remove obsolete query variable
8 years ago
luccioman a588ed7628 Applied image headers customization to the new ViewFavicon servlet.
8 years ago
luccioman 7717a3d43d Fixed license headers on files created to improve favicon management.
8 years ago
luccioman 6e1959f469 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
reger 685d8e86bf Avoid frequent data type casting (float/long) for rwi score
8 years ago
luccioman 3ccd89e274 Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
8 years ago
luccioman 4b699c469a Blacklist refactoring : extracted a function for easier unit testing
8 years ago
luccioman 54cfcc3f56 CrawlCheck_p.html : also display info about disallowed URLs.
8 years ago
luccioman 8b341e9818 Robots : properly handle URLs including non ASCII characters
8 years ago
reger e68b00678e prevent negative score on URIMetadataNode - in the special case were no
8 years ago
luccioman 242707f9b4 Fixed loadFromCache with strategy IFFRESH.
8 years ago
reger b752bcfecb adjust date in text detection to ignore some program version strings
8 years ago
reger b017e97421 optimize condenser language detection a little.
8 years ago
reger ae3717d087 adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! )
8 years ago
reger 474f0476c6 adjust Tokenizer sentence count on trailing text after last recognized sentence
8 years ago
reger 3861ac9293 upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov
8 years ago
reger 681a61dafb adjust rwi index result word position handling used for rwi ranking
8 years ago
reger 14f7577231 add support for older Word versions (Word6/Word95) to docParser
8 years ago
reger 1a79c64495 generalize DateDetection with holiday date rules readily available in icu
8 years ago
reger 6f68f08354 correct DateDetection Silvester date
8 years ago
reger 32a2e3a22a have RSSFeed.getChannel return empty message on missing channel element,
8 years ago
luccioman 8d57b5b970 Added some javadocs.
8 years ago
luccioman 60df09fff9 Fixed some HTML validation errors : Illegal character in query
8 years ago
reger 862f28eaa6 display number of documents/rss-items for label "docs" in load_rss_p servlet
8 years ago
luccioman dcdea2d02f Fixed shutdown for crawler.MaxActiveThreads value greater than 200
8 years ago
luccioman d286ba2c3e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman b8f6458152 Prevent yacy main thread from hanging on browser opening process.
8 years ago
reger 70e1eb30a5 prevent StringIndexOutOfBounds in getLocalFile()
8 years ago
luccioman 1bb0b135ac Avoid duplication of various MS Windows file URLs flavors
8 years ago
luccioman b9a8476f02 Removed unused import
8 years ago
reger e73c1eea8c remove unused rootpattern, leftover from commit
8 years ago
reger 6f8c3ccea4 improve url hash computation for file path with mixed java & windows
8 years ago
reger efcb6a1e74 fix supported mime XML -> xml for rssParser (mime normalized to lower case for comparison)
8 years ago
luccioman b3b75b0498 Accessibility : add a customizable alternative text to YaCy log
8 years ago
luccioman f2bc1b268d Updated URL fragment validation rules according to current standards
8 years ago
luccioman b1b8e69da8 Fixed NullPointerException cases
8 years ago
luccioman 3ee4f56c39 Improved ErrorCache behavior when switching networks
8 years ago
luccioman 7d5ba2afa4 Added some JavaDoc and moved crawlStacker close at the right place.
8 years ago
luccioman 8edbcd8ad4 Log eventual Solr instances close errors.
8 years ago
reger 330768c8a2 fix for solr write.lock after mode change http://mantis.tokeek.de/view.php?id=686
8 years ago
reger 585d2a6441 test case: for NewsPool to check the id modificator (for unique id)
8 years ago
luccioman de5c873e38 Removed unused JavaScript file docs.min.js
8 years ago
Michael Peter Christen df51e4ef07 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen e063aaf97f enable fuzzy search, solr style (append a ~ to get a fuzzyness on the
9 years ago
reger ff6589fc0f test case: simulating multi word query for local rwi index
9 years ago
reger e990297d2e avoid NPE on hello message with missing "yourip" key
9 years ago
reger e51ab8c7aa hack to generate a unique message-id for messages created in the same second
9 years ago
Michael Peter Christen b82300358a removed version number check because it does not work any more if
9 years ago
Michael Peter Christen 2107674999 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
Michael Peter Christen 0d28f563f4 fix for java version "9-ea"
9 years ago
reger 3b694b3935 add some javadoc to rwi wordreference distance, position
9 years ago
reger a4465c97d6 as requested, disable/remove old swf parser
9 years ago
reger 7f63fc50f3 prepare a IndexSegment test case for RWI index testing
9 years ago
reger 96467c5467 remove not needed counter in Tokeninzer (completing last changes)
9 years ago
luccioman d66b0f7b7b Fixed french messages encoding in YaCy tray.
9 years ago
reger 7efb66ee10 adjust the WordReference.join wordsintext calc to take the max (instead of sum)
9 years ago
luccioman 0a9ff14d96 Fixed NullPointerException case and added Javadoc
9 years ago
luccioman 06d4f93d03 Merged master into postprocessing branch
9 years ago
Michael Peter Christen b73d2db914 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
Michael Peter Christen 25a3c7a6d0 catch exception and write end of object
9 years ago
reger 272cdd496a reactivate sentence counter in WordTokenizer for phrasepos ranking,
9 years ago
Michael Peter Christen 5e165a8150 removed unused imports
9 years ago
Michael Peter Christen c716648c78 enhanced json encoding of strings
9 years ago
Michael Peter Christen 6139bd85a8 fix for broken facet names
9 years ago
Michael Peter Christen 5060f9fee9 fix for too long snippets
9 years ago
Michael Peter Christen 8681cee3f3 fix for bad comma
9 years ago
Michael Peter Christen db6d8fc197 fix for bad json
9 years ago
Michael Peter Christen 8f4a341735 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
9 years ago
Michael Peter Christen 9934f546bb added default fl to solr query, removed large texts retrieval and
9 years ago
reger 120bf7e6e2 implemented RWI WordReference to return the word position value (was always left empty)
9 years ago
reger e310ec5f70 fix posInText ranking calculation to score 0 on no position info
9 years ago
luccioman 74f9927ddc Merge remote-tracking branch 'origin/master' into dist_macOS
9 years ago
reger 51c077f493 adjust the getTopics() and getTopicNavigator() to current useage
9 years ago
reger 39dd244693 fix ConcurrentScoreMap.set() calculation of totalCount()
9 years ago
reger ebf818ad95 log a error on aborted news publish (due to duplicate news.id)
9 years ago
reger cc2d9dd3f1 reactivate the use of included-in-topwords boost in postRanking
9 years ago
luccioman 39ea28adfd Merged master to dist_macOS branch.
9 years ago
luccioman 8255e91c99 Fixed serverClassLoader.findClass method
9 years ago
reger 6801673a07 apply postranking media search boost only on media queries
9 years ago
luccioman 1dc4306058 Fixed indentation for better readability.
9 years ago