Commit Graph

3351 Commits (711183bd72e290fa70069c32016385d050fe866e)

Author SHA1 Message Date
reger f91298d3b6 fix one implicit Integer/Long type conversion
9 years ago
reger 821262a179 add CommonPattern for multiple spaces
10 years ago
Michael Peter Christen 90f75c8c3d added enrichment of synonyms and vocabularies for imported documents
10 years ago
Michael Peter Christen 7829480b82 refactoring: separated condenser and tokenizer
10 years ago
Michael Peter Christen 593de05922 enhanced surrogate import process speed (dramatically!)
10 years ago
Michael Peter Christen 3c4c69adea fix for
10 years ago
Michael Peter Christen 1fec7fb3c1 suppress access to solr when doing search suggestions in case that the
10 years ago
Michael Peter Christen 694b22f165 migration to Solr 5.2: huge benefits - this is a lot faster!
10 years ago
sixcooler e427efbe54 Next Try for a fix for upload-connection staying in blocked state.
10 years ago
reger 0fab445b19 Resourceobserver log warning - deleting releases files - only on actual deletes
10 years ago
sixcooler ef6a64b2a4 Fix for upload-connection staying in blocked state.
10 years ago
reger c973f94936 add log entry on release file delete by ResourceObserver
10 years ago
reger 121972752c implement deleteOldDownloads in RexourceObserver on low diskspace
10 years ago
Michael Peter Christen 9c12555be5 added link to Snapshots in search results if the snapshot exists and
10 years ago
reger 72f6a0b0b2 enhance recrawl job
10 years ago
reger 7478338a40 remove augmented parsing activation from frontend
10 years ago
reger 11aa2edfe1 remove RDFa parser activation from frontend
10 years ago
reger 49b79987c9 remove obsolete searchfl work table
10 years ago
Michael Peter Christen d0aff91f23 fix for index import
10 years ago
Michael Peter Christen 34de1e8cbc gzip compression will perform more efficient and with better compression
10 years ago
Michael Peter Christen 98be59ce9c full solr xml exports will now be automatically compressed during
10 years ago
Michael Peter Christen a1a8edfc0a wrap HeaReader close() in a catch Throwable block to prevent that an
10 years ago
Michael Peter Christen b43811d38c added surrogate import process for exported solr dumps.
10 years ago
Michael Peter Christen b77537294d prevent disc usage when showing tray animation
10 years ago
Michael Peter Christen eec78e1b0c added intensity option to graphics
10 years ago
Michael Peter Christen a5007f345e re-licensing some of my old visualization classes under LGPL 2.1
10 years ago
Michael Peter Christen c99a665593 adding a 3-pixel font generator made some time ago..
10 years ago
Michael Peter Christen c7576d6028 added a full solr export to the IndexControlURLs_p.html servlet. The
10 years ago
Michael Peter Christen 197f7449e5 All entities of crawl profiles are now editable in the crawl profile
10 years ago
reger 1d8e1e4bac - Image search expand box, adjust javascript hs padtominsize parameter, to make sure expand box doesn't shrink on small images
10 years ago
reger 8b35656007 remove hard throw exception in makeResultEntry
10 years ago
reger af57fbefad use available mime (instead null) on imageresult from metadatanode
10 years ago
reger dd7782bac0 revert deletion of BinSearch
10 years ago
reger 000dde9511 Eleminate duplication of values for search ResultEntry
10 years ago
reger 29c4aa3991 fix compiler notification of missing serialID
10 years ago
reger 3d53da8236 refactor ResultEntry to be based on MetadataNode/SolrDocument
10 years ago
reger d882991bc5 Implement sharing of ioDispatcher for term & citation index
10 years ago
reger 370ba9da71 On imageSearch prefere mime to sort out none-image documents
10 years ago
reger cd31633369 improve MultiprotocolURL.getFileExtension()
10 years ago
reger c60ccdfbcf Increase IODspatcher dumpQueue size to 2 to reduce risk of concurrent emergency dump,
10 years ago
reger 8a9622c31c fix string OoB on getImagelinks with long alttext
10 years ago
reger 3e742d1e34 Init remote crawler on demand
10 years ago
reger 13f013f64a Limit extra sleep of BusyThread on LowMemCycle
10 years ago
reger cd7c0e0aae detail optimization of RecrawlThread
10 years ago
reger ace71a8877 Initial (experimental) implementation of index update/re-crawl job
10 years ago
reger 141cd80456 correct log msg text
10 years ago
reger f3ce99bfb8 fix extract of inboundlinks_protocol_sxt
10 years ago
reger 2bc9cb5828 fix early return in addToCrawler
10 years ago
Michael Peter Christen f5f88272e4 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen 5c67c4d460 fix for latest commit, see
10 years ago
reger c37dda8849 fix NPE on MultiProtocolURL on url with parameter value and '='
10 years ago
Michael Peter Christen f810915717 added crawl start from a clone with very, very large url: they are now
10 years ago
Michael Peter Christen 51de86c992 disabled debug thread dumps
10 years ago
Michael Peter Christen d524a9d77c Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
10 years ago
Michael Peter Christen 0710648c31 enable api calls with very long urls
10 years ago
reger 31346e873b upd library reference of missing jsch-0.1.21 in seeduploadscp.xml
10 years ago
reger 609c52e987 refactor getBookmark
10 years ago
reger 1481a8ab56 add opensearch rss results to dht collection (due to text = snippet)
10 years ago
reger f134aa7f7f persist bookmark timestamp
10 years ago
reger 752eec6697 fix NPE in addToIndex when used outside searchEvent
10 years ago
Michael Peter Christen fbf85a1561 added temporary debug output in http client
10 years ago
Michael Peter Christen ff29b0e503 added option to re-index exported xml snapshot dumps to
10 years ago
Michael Peter Christen 6f4fe4b175 revert of 8a7c68e4c7
10 years ago
Michael Peter Christen 97930a6aad added must-not-match filter to snapshot generation.
10 years ago
Michael Peter Christen 9d8f426890 adding a try-catch to link graph processing to prevent that a single
10 years ago
reger 8a5b8f8789 on bookmaring of search result, remember orig. query in separate bookmark property
10 years ago
reger 7224209486 break out of NormalizeDistributor loop on timeout
10 years ago
reger 47e61f8325 fix typo in image filter query
10 years ago
reger 4b4ab6799f fix String out of range in Collection Nav
10 years ago
reger 572cfe8fd4 improve character encoding for urlproxy servlet
10 years ago
reger 6bc8a9b11e make Quality of Service Servlet available to prioritize requests from local host
10 years ago
Ryszard Goń ca1a70aec8 fix for Accept '?' URLs column in Crawl Profile List
10 years ago
reger 5408448a56 skip redundant add. of keywords to text
10 years ago
reger 296e97c78e put https port in peers dna
10 years ago
Michael Peter Christen fed26f33a8 enhanced timezone managament for indexed data:
10 years ago
Michael Peter Christen b060ba900d added parsing of contentprop attribute in html tags for
10 years ago
Michael Peter Christen 4cb4f67f38 added parsing of dd, dt and article html fields. The parsed result is
10 years ago
reger 1395f10e95 fix typecast for css links
10 years ago
Michael Peter Christen 3288489fd2 more logging during start-up
10 years ago
Michael Peter Christen abaaaef5f1 fix for filter queries
10 years ago
Michael Peter Christen 4d00175157 <experimental> added parsing of <article> html element.
10 years ago
Michael Peter Christen 1df6492019 enhanced suggestions
10 years ago
Michael Peter Christen ae02c92fd0 logging fix
10 years ago
Michael Peter Christen 5651713134 better debugging of fq
10 years ago
Michael Peter Christen f5a032f293 split query into filter query and text query to get better ranking
10 years ago
Michael Peter Christen 2e88028c1a when selecting collections in navigation, do show the un-selected
10 years ago
Michael Peter Christen 1de9b21c65 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 5f4cd8d6f5 replace deprecated getIP with getIPs in AbstractRemoteHandler
10 years ago
Michael Peter Christen fa7edc9f7a refactoring of filter queries (several queries instead only one)
10 years ago
Michael Peter Christen 40389987ec Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
Michael Peter Christen f9ba50379d added an expansion option to search facets on result page:
10 years ago
reger 1f0f77bb77 make location facet return results
10 years ago
reger b1ec0644e5 fix NPE in location search on missing/empty PubDate in underlaying rss data
10 years ago
reger c1dcc8c456 fix display and limit of max server connections after startup
10 years ago
reger 839b962c20 correct percent encoding for '%' char
10 years ago
Michael Peter Christen 9bf0d7ecb9 added a new collection type 'dht' to all documents from the peer-to-peer
10 years ago
reger 796770e070 prevent overwrite of crawled or received full documents by (newer) metadata
10 years ago
Michael Peter Christen ee2490ab98 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 431311df42 fix get fresh_date_dt to allow returned value to be date in future
10 years ago
otter 74c7e8b686 Fixes hanging FlushThread (see
10 years ago