Commit Graph

8032 Commits (15853a5402a79de23ff1a4ae9daa827bb96968d6)

Author SHA1 Message Date
reger 7be1c7a05a fix logger name
9 years ago
reger 1d940e5a94 upd commons-compress 1.11
9 years ago
reger 7789c32c82 delete crawl queue on init exception
9 years ago
reger f781b9dd47 revert call condition f. migration.installSkins
9 years ago
reger 3adb670f44 remove never used Domains.myHostNames set
9 years ago
reger 6ecc180299 fix rwi doubledom return best (highest) ranking
9 years ago
reger 2343e3f1cd keep and update existing xlf translation master instead of create new
9 years ago
reger a1935f485f Added utility class CreateTranslationMasters to create a language independant
9 years ago
reger acaf51b296 keep ConfigLanguage_p as 1st entry in exported translation file
9 years ago
reger 61c5b6b403 fix empty drop down list in ConfigLanguage after wrong/empty download
9 years ago
reger 4eddabee42 translate Network History screen -> de
9 years ago
reger 90c79014ae remove unused translator routine which also doesn't handle rel path input
9 years ago
reger 902e79e261 Introduce a TranslatorXliff wich can read/write xliff from/to internal translation map.
9 years ago
reger d9adc2c255 load handler for Transparent Proxy on startup only if feature is activated
9 years ago
reger ec24a0c85a add test case for optimized toTokens()
9 years ago
reger cada24f918 adjust utility ListNonTranslatedFiles for path compare on windows
9 years ago
reger fb8ae14b21 make migration version safe
9 years ago
reger 258cd41577 reduce logging (EmbeddedSolrConnector.query)
9 years ago
reger 6783ef5540 move example code SearchClient out of yacycore package
9 years ago
Michael Peter Christen b89465d952 0N - basic dump upload servlet infrastructure, to share index dumps
9 years ago
Michael Peter Christen f12a900f3e harmonization of http post of files for one and several files - this had
9 years ago
Michael Peter Christen 849ab671a9 0n: modified the p2p bootstraping process - rules had been too tight and
9 years ago
reger 764f5100f0 fix delete of temp file after odt % ooxml parser
9 years ago
reger 379e9b330d use supplied url port to get robots.txt in crawlers hostqueue
9 years ago
reger 58a959403d fix mixed logfactory in UrlProxyServlet,
9 years ago
Michael Peter Christen 2494a820c7 0N - added recording of dump exports if given time frame is not negative
9 years ago
Michael Peter Christen ef2cc4f690 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen a6bf0b1649 0N - added option to generate index export files for a specific number
9 years ago
reger 6d56beaed8 fix assertion exception in toString of MultiProtocolURL
9 years ago
reger 42a7bdb2af fix SolrSelectServlet authentication to default to true
9 years ago
reger dbb28bb4f3 del unused statistic parameter (from status servlet)
9 years ago
reger 06d0e2aeb9 result heuristic (also used in greedy learning mode) to use outbound links if result is full index doc. Otherwise use default loader methode.
9 years ago
reger caf9e98f09 put metadata dc_publisher in corresponding schema field
9 years ago
reger 38e2b054d4 remove servlet classloder internal cache map (to save the resources, cache hits marginal)
9 years ago
reger 6f0b073bf3 override detected language (statistic langdetect) only with TLD determided
9 years ago
reger b65e2b527d include use of condenser's content text for language detection.
9 years ago
reger 937fbb0b9f correct isHidden() for smb from last commit
9 years ago
reger 535d4bf75f respect hidden attribute for file and smb directory listing
9 years ago
reger c28142095a add findClass() to servlet class loader (used in YaCyDefaltServlet)
9 years ago
reger a6617ad887 expand initRemoteCrawler() to terminate worker threads if called to deactivate
9 years ago
reger 2048b7e057 support scraping start-/enddate from html tag with property "datetime"
9 years ago
reger 900d4584ba complet resource cleanup of lists in contentscraper's close()
9 years ago
reger 1f18653de0 pass parsed swf content trough htmlscraper
9 years ago
reger 18ecf57792 add support of compressed swf to swfParser
9 years ago
sixcooler 5cb7ba0dc4 fix for connections not getting closed to get favicon.ico during seach
9 years ago
reger ed3e16e092 apply remote result count config value to Bookmark Autosearch
9 years ago
Ryszard Goń a98c395023 Add the Autocrawl thread
9 years ago
Ryszard Goń 1728cd30c6 Create autocrawl profiles
9 years ago
reger ff27824964 fix swfParser reading file signature
9 years ago
reger c91e712178 further refactor using standard java / (one) utf-8 charset variable
9 years ago
luc 571bc55937 Refactoring : use StandardCharsets constants instead of hard-coded
9 years ago
reger 1af0e9ef74 remove workaround for Solr bug regarding multivalued date fields
9 years ago
sixcooler 5a35f9383a bump to solr/lucene 5.4.0
9 years ago
reger a58d34a4e8 check error URL cache before adding errorDoc to index
9 years ago
reger e9539b1086 reintroduce special handling of file upload multipart/form-data from HTTPDemon.parseMultipart
9 years ago
reger cd26717ba2 fix low memory status hint (dht-in disabled)
9 years ago
reger a5faf73afa remove obsolete yacy.init entries interaction.*
9 years ago
sixcooler dce1cb65c4 Merge remote-tracking branch 'choose_remote_name/master'
9 years ago
reger 46ac0867ff fix poison mediawikiimporter output queue also after ExecutionException
9 years ago
reger a7591d3ed0 fix mediawikiimporter number format exception on coordinate parsing
9 years ago
reger 9da1712a31 increase http header EXPIRES for css and images in DefaultServlet
9 years ago
reger 6d54eb3d36 skip loading document on crawl start for YMark bookmarks
9 years ago
reger 80e2c82249 fix NPE on empty blog importfile parameter
9 years ago
reger e84d94f8ca fix mime table for ms office / open office documents
9 years ago
reger 45b9bd8403 adjust MultiProtocolURL.protocol detection to handle mailto with "://" in parameters,
9 years ago
reger d5fd031449 fix reading of ippattern config array in URLProxy
9 years ago
reger b7e8358645 make use of header.getContentType where possible (mime is normalized afterwards)
9 years ago
reger 7a8c077838 fix HeaderFramework.mime() to strip charset parameter.
9 years ago
reger b4b6910d60 fix (todo): correct doc.id of remote search result if no match with newly
9 years ago
reger dec3e6ad96 fix: adjust urlstub for mailto links
9 years ago
reger cb83e65f89 drop returning document language "en" if unknown (fix todo)
9 years ago
reger 0c5548a7ff fix (todo) remove redundant holding of email link nameproperty in parser document
9 years ago
reger 71c416f383 show mailto links in ViewFile.html linklist
9 years ago
reger 6b7c10cef8 fix dc:date in mediawikiimporter/document.writexml to use lastmodified
9 years ago
reger 14803d58cd let html scraper accept html5 <link rel="icon"> for favicon links
9 years ago
luc b4cdacee76 Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc ba0a293f5c Corrected another case of
9 years ago
reger 4d2b934487 prevent mailto links getting into parser result document's in/outbound link collection
9 years ago
luc 8c4ab9c76b Added an option to eventually limit size of remote solr documents put to
9 years ago
luc a2c08402af Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc 70595d05d0 Modified MemoryControl.main() test to properly end for better results
9 years ago
sixcooler 1be67d9ab6 CachedSolrConnector was replaced by ConcurrentUpdateSolrConnector years
9 years ago
reger 28b8bc290a fix use of NETWORK_SEARCHVERIFY for rwi verification
9 years ago
reger 020630efd8 remove unused network scanner parameter from queryparameter
9 years ago
luc ad5586f8f6 Merge branch 'master' of https://github.com/yacy/yacy_search_server
9 years ago
luc 8ebefa4233 Fixed MediaWiki import : DCEntry conversion to SolrInputDocument was
9 years ago
luc 7736ee5a42 Updated MediaWimporter main() : display usage in console and stop
9 years ago
reger cdb8f3b10d make current ranking score value avail. to search interface / api
9 years ago
luc 27d11f8671 Fixed isSolrDump function : PushBackInputStream was not unread when
9 years ago
Michael Peter Christen 135a123a77 less logging in new language detection
10 years ago
Michael Peter Christen ef8cd80593 fix for npe
10 years ago
reger 0347bfa71f Apply collection query constraint/modifiert to rwi result stack.
10 years ago
luc 2a67d2ba6f Corrected error management for unsupported image formats, parsing
10 years ago
Michael Peter Christen d6e9834040 Merge branch 'master' of
10 years ago
Michael Peter Christen d82d311995 Merge branch 'master' of https://github.com/luccioman/yacy_search_server
10 years ago
reger b5371ea8c1 read/init crawl queue in a thread
10 years ago
reger 1160b13172 remove unused md5 from ViewFile servlet params
10 years ago
reger e163ea88f6 fix vsdParser (Visio) parser return statement
10 years ago
reger b2c8bc0ae6 remove md5_s from default index fields
10 years ago
luc e40ae0943b - No max dimensions specified : render raw image data when source and
10 years ago