Commit Graph

1107 Commits (fc958230c455774ec1262bf29f87309ffa248edd)

Author SHA1 Message Date
reger b3c9041f79 remove with localHostNames redundant (but unused) publicIPv4HostNames and publicIPv6HostNames
9 years ago
reger 9e94989237 upd to PDFBox 2.0.1
9 years ago
reger 24b0fa2a38 extend snapshot Html2Image.pdf2image to use PDFBox image export capability
9 years ago
reger 3adb670f44 remove never used Domains.myHostNames set
9 years ago
reger ec24a0c85a add test case for optimized toTokens()
9 years ago
reger 258cd41577 reduce logging (EmbeddedSolrConnector.query)
9 years ago
reger 6d56beaed8 fix assertion exception in toString of MultiProtocolURL
9 years ago
reger 937fbb0b9f correct isHidden() for smb from last commit
9 years ago
reger 535d4bf75f respect hidden attribute for file and smb directory listing
9 years ago
reger a6617ad887 expand initRemoteCrawler() to terminate worker threads if called to deactivate
9 years ago
reger c91e712178 further refactor using standard java / (one) utf-8 charset variable
9 years ago
luc 571bc55937 Refactoring : use StandardCharsets constants instead of hard-coded
9 years ago
sixcooler 5a35f9383a bump to solr/lucene 5.4.0
9 years ago
reger a5faf73afa remove obsolete yacy.init entries interaction.*
9 years ago
reger 45b9bd8403 adjust MultiProtocolURL.protocol detection to handle mailto with "://" in parameters,
9 years ago
reger b7e8358645 make use of header.getContentType where possible (mime is normalized afterwards)
9 years ago
reger 7a8c077838 fix HeaderFramework.mime() to strip charset parameter.
9 years ago
reger dec3e6ad96 fix: adjust urlstub for mailto links
9 years ago
reger 71c416f383 show mailto links in ViewFile.html linklist
9 years ago
reger 4d2b934487 prevent mailto links getting into parser result document's in/outbound link collection
9 years ago
sixcooler 1be67d9ab6 CachedSolrConnector was replaced by ConcurrentUpdateSolrConnector years
9 years ago
reger 28b8bc290a fix use of NETWORK_SEARCHVERIFY for rwi verification
9 years ago
reger 020630efd8 remove unused network scanner parameter from queryparameter
9 years ago
luc f01d49c37a Process large or local file images dealing directly with content
9 years ago
luc 3c4c77099d If available, check content length before downloading. Check also
9 years ago
reger 2985baaa01 Exclude repetitive protocol part in tokenized url
9 years ago
Michael Peter Christen d1ae999ef9 replaced HashMap with LinkedHashMap to preserve the object order
10 years ago
reger c9937973e3 unescape MultiProtocolURL getAttributes() return values.
10 years ago
reger 43c27aa550 upd to solr/lucene 5.3.1
10 years ago
reger 688f7b2a5c allow/display svg images in image results previews
10 years ago
Michael Peter Christen 8e555d79a3 add also 1-character tokens to the token list because that could be also
10 years ago
reger bad34804fe optimize parseInt for <img> tag attribute parsing
10 years ago
reger 52e3eb4ce8 harmonize/correct assignment to Ymarkmeta.mime
10 years ago
Michael Peter Christen 87f358058e Fix for index entries which have id's not computed as hash from the url.
10 years ago
Michael Peter Christen 5f706797cb patch for a bug inside of solr since solr 5.0 when using a boost
10 years ago
reger b4cbdea1e7 adapt SolrServerConnector.add to handle error on partial update input document.
10 years ago
reger e37a4f0b3d prevent metadata records in index w/o valid url
10 years ago
reger 4cf875336c complete TODO: getFileExtension handle dot in query part
10 years ago
sixcooler 87e4abe393 fight the fieldcache by usind DocValues: in Solr-5.x the fieldcache has
10 years ago
reger c33229fc0c check mime prior to ext for metadata modification for images
10 years ago
reger 206883f80d fix: Preserve protocol in url proxy
10 years ago
Michael Peter Christen 8028410ab7 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen df3314ac1a added a new facet type based on a probabilistic classifier using
10 years ago
reger e2e73258ca remove obsolete interface SearchAccumulator
10 years ago
Michael Peter Christen dbbad23e12 removed warnings
10 years ago
Michael Peter Christen 500cfa9457 enhanced logging
10 years ago
Michael Peter Christen 203df5a750 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
reger ee77f24e52 use some more declared HeaderFramework constants
10 years ago
Michael Peter Christen 11a848da5a Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen b94bd7f20a a collection of search query enhancements:
10 years ago