Commit Graph

12941 Commits (b5711b8fe126805cdfb8bc3f8647713729a18394)
 

Author SHA1 Message Date
reger 685d8e86bf Avoid frequent data type casting (float/long) for rwi score
8 years ago
luccioman 3ccd89e274 Fixed MultiProtocolURL.resolveBackpath to handle remaining '..' segments
8 years ago
luccioman f1f4459f88 Added some unit tests for Blacklist.isListed()
8 years ago
luccioman 4b699c469a Blacklist refactoring : extracted a function for easier unit testing
8 years ago
luccioman 54cfcc3f56 CrawlCheck_p.html : also display info about disallowed URLs.
8 years ago
luccioman 8b341e9818 Robots : properly handle URLs including non ASCII characters
8 years ago
luccioman 75bb77f0cb Refactoring : extracted a method to handle authorized action links.
8 years ago
luccioman c996b04741 HTML validation : fixed URL encoding of search results action links.
8 years ago
luccioman 2b81703828 Refactored search result action links construction.
8 years ago
reger e68b00678e prevent negative score on URIMetadataNode - in the special case were no
8 years ago
luccioman 242707f9b4 Fixed loadFromCache with strategy IFFRESH.
8 years ago
reger c778219768 remove module for swfparser from maven parent pom
8 years ago
luccioman 094aed8664 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
luccioman c7402a2f89 Removed invalid empty form action.
8 years ago
luccioman 37df2e19fd Removed xmlns attribute which no more makes sense in HTML5 pages.
8 years ago
luccioman 94924e288f Added some accessibility improvements to the main interface.
8 years ago
luccioman dd86f7c44e Fixed HTML validation errors and grouped radios options in fieldsets
8 years ago
luccioman fc0c72c84b Switched to the short HTML Doctype
8 years ago
reger 7c81160f45 correct blacklist export as text url to blacklists_p.txt
8 years ago
reger b752bcfecb adjust date in text detection to ignore some program version strings
8 years ago
reger b017e97421 optimize condenser language detection a little.
8 years ago
reger ae3717d087 adjust Tokenizer sentence count to ignore repeated punktuation (like !!!! )
8 years ago
luccioman b5eb7a9217 Removed unnecessary crawlingDomFilterDepth hidden field.
8 years ago
luccioman f6d7c6ee1f Fixed Recorded action URLs beginning displayed in /Table_API_p.html
8 years ago
reger 474f0476c6 adjust Tokenizer sentence count on trailing text after last recognized sentence
8 years ago
luccioman 34658ddb9b Merge pull request #76 from luccioman/crawler
8 years ago
luccioman 0065c9b9ea Crawl monitoring : refresh running crawls table
8 years ago
luccioman e1e632ad84 Switched to the short HTML Doctype
8 years ago
luccioman 4d8611e5e7 Tables accessibility : added missing <thead> sections.
8 years ago
luccioman 9fb3142317 Restricted variables scope to function handleStatus() in Crawler.js
8 years ago
reger 3861ac9293 upd maven dependency-check plugin to reflect changes of https://nvd.nist.gov
8 years ago
reger 681a61dafb adjust rwi index result word position handling used for rwi ranking
8 years ago
reger 14f7577231 add support for older Word versions (Word6/Word95) to docParser
8 years ago
reger 8794e06721 upd to poi-3.15.jar
8 years ago
reger e25f2ee88b mention date search parameter in search option help (index.html)
8 years ago
reger 1a79c64495 generalize DateDetection with holiday date rules readily available in icu
8 years ago
reger 6f68f08354 correct DateDetection Silvester date
8 years ago
reger 32a2e3a22a have RSSFeed.getChannel return empty message on missing channel element,
8 years ago
reger fedb9f8151 del double entry in master.lng
8 years ago
luccioman 8d57b5b970 Added some javadocs.
8 years ago
luccioman 4585a60d7e Made use of the constant corresponding to the hard-coded value.
8 years ago
luccioman 60df09fff9 Fixed some HTML validation errors : Illegal character in query
8 years ago
luccioman a76a46a2e9 Removed invalid rel="[count]" from links in tagcloud.
8 years ago
reger 862f28eaa6 display number of documents/rss-items for label "docs" in load_rss_p servlet
8 years ago
luccioman 5027912f30 Fixed <p> spacers : blocks elements such as <div> are not allowed inside
8 years ago
luccioman abe489a0b5 Removed unnecessary ARIA "form" role on native HTML form elements.
8 years ago
luccioman cca4186044 Fixed HTML validation error : "Stray end tag div"
8 years ago
luccioman dcdea2d02f Fixed shutdown for crawler.MaxActiveThreads value greater than 200
8 years ago
reger ada473ced2 fix ConfigBasic servlet parameter name for Japanese _jp->_ja
8 years ago
luccioman d286ba2c3e Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago