Commit Graph

8585 Commits (46c9da6428cdfd1f4fdb9eadd3f14d5610570dbe)

Author SHA1 Message Date
Michael Peter Christen 25573bd5ab added a crawl filter based on <div> tag class names
7 years ago
luccioman d95b288f19 Removed use of deprecated Jetty IPAccessHandler for client filtering.
7 years ago
reger cc7a93e6b6 remove deprecated jetty continuation class from urlproxyservlet
7 years ago
Michael Peter Christen 607b39b427 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
7 years ago
Michael Peter Christen 4355de0f3c (more!) evaluation of XRealIP from nginx reverse proxy
7 years ago
luccioman a4494d6e01 Improved support for internationalized domain names on "site:" modifier
7 years ago
luccioman d07006bac4 Do locale independant case conversion on "filetype:" query modifier.
7 years ago
luccioman 8fbf25d1ed Made "site:" query modifier case insensitive.
7 years ago
luccioman 867388e05b Refactored 'site:' query modifier parsing into a dedicated function.
7 years ago
luccioman c9d80b5b77 Prefer fine URL match over approximate URL mask regex on final filtering
7 years ago
luccioman 0a120787e3 Improved accuracy of URLs search filters : protocol, tld, host, file ext
7 years ago
luccioman d1c7dfd852 Fixed URL parsing with fragment and empty path
7 years ago
luccioman e07ef1b610 Apply tld query modifier on Solr host_s mandatory field.
7 years ago
luccioman 478e92deff Fixed url mask filter generated when protocol modifier is not null
7 years ago
luccioman 29de4a65d7 Refactored url mask filter build from query modifiers
7 years ago
reger d5a75537e4 remove redundant setting of timeout for remoteinstance
7 years ago
luccioman f01aac31fd Made possible to use https for remote search on peers with SSL enabled.
7 years ago
luccioman e2f6427a63 Added a basic JUnit test for the Visio parser (vsdParser)
7 years ago
luccioman 1e9cdaabd4 Do locale neutral case conversion of HTML charset name.
7 years ago
luccioman 7206f1ed71 Do locale neutral case conversions on domain names.
7 years ago
luccioman 398c66f06c Do locale neutral case conversions in MultiProtocolURL
7 years ago
luccioman 9531b83598 Do locale neutral case conversions in Classification
7 years ago
luccioman d22fc0d0a2 Updated lists of known sponsored and country-code TLDs.
7 years ago
luccioman ac209cac2e Updated the generic top-level known domains list.
7 years ago
luccioman 938d8a9731 Added some JavaDoc
7 years ago
luccioman e0eda84c24 Remove old hard-coded holiday dates from DateDection class.
7 years ago
luccioman cb10daba92 Renamed Chinese & Greek lng files using ISO639-1 codes.
7 years ago
luccioman 46f37e38dc Customized Threads with generic name for easier monitoring.
7 years ago
luccioman 046be566e1 Updated a license header typo.
7 years ago
Apply55gx 3c905a2a5c fix typo
7 years ago
luccioman 8e732d437c Enable HTTP Digest authentication for non admin users.
7 years ago
luccioman d8eaf621cc Fixed blacklist returned location URL on empty parameters
7 years ago
luccioman af198b990b Added an optional login link/status to the search public top nav bar.
7 years ago
luccioman 1de86cf1bf Fixed JPEG snapshot resizing when running on OpenJDK.
7 years ago
luccioman a17a418e78 Fixed NullPointerException cases on snapshot images parsing.
7 years ago
luccioman 285f0d6a39 Consistently encode snapshot image with format requested on the API.
7 years ago
luccioman 34ca73d61b Fixed a NullPointerException case on images encoding errors.
7 years ago
luccioman 7c319c841e Fixed pdf2image conversion with imagemagick on PDFs having transparency
7 years ago
luccioman 6e497241f7 Properly close resources (even on error) on OS and ThreadDump classes.
7 years ago
luccioman fe75f326d8 Fixed ProfilingGraph calculation integer overflows and added test class.
7 years ago
luccioman 5d1ef8fdfc Merge branch 'master' of https://github.com/otteresk/yacy_search_server
7 years ago
luccioman 8303e15419 Reduced number of search navigators refresh requests in JS resort mode
7 years ago
luccioman dbff7b14fc Add a configurable limit to tags initially displayed in search results
7 years ago
Andreas 0c4db9eef0 Merge pull request #3 from yacy/master
7 years ago
reger c31d94664a Update deprecated SolrInputDocument.addField() with boost value
7 years ago
luccioman 7e271f9cf5 Updated travis config : install ghostscript, required for Html2Image
7 years ago
luccioman 32c9dfa768 Added partial bzip2 stream parsing support and bzipParser Junit test
7 years ago
luccioman dd9cb06d25 Fixed RWI distance calculation on multi words search queries.
7 years ago
luccioman 6b11bf3a12 Fixed NullPointerException case on 'Browser' lang selection
7 years ago
reger ae1c675c85 fix array out of bounds in YJsonResponseWriter and OpensearchResponsWriter
7 years ago
otter 73d1d577fd prevent integer overflow in chartDot for nodes with a big index
7 years ago
otter 4e2ccdfcac prevent integer overflow in chartLine
7 years ago
luccioman 27ab733685 Ensure private search features are not lost on Digest auth timeout
7 years ago
reger ba60f65040 Adjust filetype: query modifier parameter to lower case
7 years ago
luccioman 57a33aefb0 Removed unnecessary max counts init on empty search navigators.
7 years ago
luccioman ef8aea7f8d Made the dates navigator max elements number user configurable.
7 years ago
luccioman 9e86d183b8 Disable manual search results resorting when resorting is done with JS
7 years ago
luccioman 66cb9c4ff9 Added Solr filter queries for audio, video and application domains
7 years ago
luccioman 5d3ceb31b7 Improved search navigators counters accuracy and consistency.
7 years ago
luccioman 8e4f31bdc7 Updated internal ISO 639-1 language codes with latest standards.
7 years ago
luccioman a28428047a Fixed count of filtered results from local solr.
7 years ago
Michael Peter Christen 2f71005a93 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
7 years ago
Michael Peter Christen 2314f8e358 try to fix problem
7 years ago
luccioman 3c9df6e0ce Use local solr filtered results in total search results count.
7 years ago
luccioman a1a0515312 Added a button to manually refresh sorting of p2p search results.
7 years ago
luccioman 4eba88f2ff Removed some unnecessary uses of java.lang.reflect api.
7 years ago
luccioman da3dbf9ea1 Use Javadoc style comments on SearchEvent properties.
7 years ago
luccioman c6ae87168a Added unit tests on the gzip parser.
7 years ago
luccioman 169ffdd1c7 Finer control on max links to parse in the html parser.
7 years ago
luccioman e41d046a9d Improved parsing support for OOXML spreadsheets (.xlsx)
7 years ago
reger 51a4e03c93 Allow to stop currently running warc import (stop button)
7 years ago
luccioman 6cec2cdcb5 Use unredirected robots.txt URL when adding an entry to the table.
7 years ago
luccioman 3f0446f14b Ensure proper synchronous robots entry retrieval on first check.
7 years ago
luccioman b23a563065 Prevent search result failure on incomplete images information.
7 years ago
Michael Peter Christen 30d71c6359 added usage of X-Real-IP http header
7 years ago
Michael Peter Christen f45378c11c Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
7 years ago
Michael Peter Christen 7f395ef937 added image link in search results
7 years ago
luccioman 780173008e Implemented partial stream parsing of tar archives.
7 years ago
luccioman acab6a6def Also handle text content when parsing XML within limits.
7 years ago
reger 2a07799ad1 Correction of d03e2c98ea
7 years ago
reger d03e2c98ea Fix Conjunction.addOperator to do nothing if term is empty
7 years ago
reger b6a41df4f7 Remove deprecated YaCyProxyServlet
7 years ago
luccioman 8a94fef9e0 Prevent unwanted cached bytes duplication on stream parsing.
7 years ago
reger 4979439e87 Skip public post of jre version.
7 years ago
reger e918ec199e Replace deprecated ConcurrentHashSet with recommended Java8
7 years ago
reger fb71994342 Harmonizing use of xml reader / sax parser in XMLBlacklistImporter
7 years ago
reger 275d65fffe Patch last_modified date with internal FirstSeenTime() if no date provided
7 years ago
reger d1b23afed6 Remove obsolete Protocol parameter ttl (time to live)
7 years ago
reger 15d78b1064 Replace deprecated getIP with getIPs in Protocol transferURL() and
7 years ago
reger ed36b47bec Replace one more deprecated peerDeparture in Protocol.transferIndex()
7 years ago
luccioman 0ee8c030c4 Log an error when Solr folder migration fails for some reason.
7 years ago
luccioman 5a646540cc Support parsing gzip files from servers with redundant headers.
7 years ago
luccioman 11a7f923d4 Distinguish response parsing failures from unexpected exceptions.
7 years ago
luccioman eda7b0aeb6 Merge branch 'master' of https://github.com/yacy/yacy_search_server
7 years ago
reger 3005be7349 Clean up unmaintained and unused AugmentParser trail.
7 years ago
luccioman cb4f1358e1 Added gzip parser support for max content bytes limit
7 years ago
luccioman 5216c681a9 Added HTML parser support for maximum content bytes parsing limit
7 years ago
luccioman 4aafebc014 Merge pull request #122 from Scarfmonster/patch-1
7 years ago
luccioman 651fad6da5 Added RSS parser support for maximum content bytes parsing limit
7 years ago
luccioman 452a17a8d5 Finer control on bounded input streams with custom stream implementation
7 years ago