Commit Graph

909 Commits (b8479430b693107b32ede29d22a6a35e9a7c63f8)

Author SHA1 Message Date
luccioman 8da3174867 Ensure lower case conversion consistency with any default locale.
7 years ago
luccioman 319231a458 Added a generic XML parser, able to parse elements text and URLs.
7 years ago
luccioman 0487336ec3 Prevent integer overflow in table statistics and use strong typing
8 years ago
luccioman 5fdd5d16b1 Use volatile to ensure concurrent threads use up to date property value
8 years ago
luccioman 28b451a0b3 Made Cache compression level and lock timeout user configurable
8 years ago
luccioman a7394b479b Limit the synchronization blocking time on some Cache operations.
8 years ago
luccioman 8399275142 Properly close file output streams even on exceptions scenarios.
8 years ago
luccioman a04feac064 Ensure file input streams proper closing in both success and failures
8 years ago
luccioman c53c58fa85 Unsure closing ChunkIterator stream in every possible use case.
8 years ago
reger f9180fabc4 assure that RWI Index.Segment IODispatcher is not blocking on shudown
8 years ago
luccioman 0da1e6ba16 Factored code re-implementing DigestURL.hosthash() method.
8 years ago
reger eedee6eabb fix exception on URIMetadataNote instantiation with corrected id hash on
8 years ago
reger 083df255e4 fix html tag attribute parsing containing attribute w/o value
8 years ago
luccioman eec5779889 Added a name prefix to pooled threads for easier monitoring.
8 years ago
luccioman a0dfbaca6a FileUtils : added some JavaDocs and unit test cases
8 years ago
reger 49eae79c01 fix Tables.hasIndex check for tablename = key
8 years ago
reger 669f60223e upd Column.toString to output encoder "{bytes}"
8 years ago
reger c9e81d2fa0 fix Column parsing from celldefinition string, without cellwidth def.
8 years ago
reger 20a1b29ed3 add simple test case for ReferenceContainer helpful for debugging
8 years ago
reger 3c7220bc7b Refacture rwi reference word position and word distance calculation
8 years ago
luccioman f0639d810c Customized name for Threads still using the default "Thread-n" pattern.
8 years ago
reger 4c67ed3f8d catch rwi ranking div by zero exception
8 years ago
luccioman ee92082a3b Updated javadocs : warning about closing stream responsibility.
8 years ago
reger 68217465fe div by null in word distance calculation
8 years ago
reger 8b74a6bf57 fix min/max calculation of WordReferenceVars.distance()
8 years ago
luccioman 6e1959f469 Merge branch 'master' of https://github.com/yacy/yacy_search_server.git
8 years ago
reger 685d8e86bf Avoid frequent data type casting (float/long) for rwi score
8 years ago
reger 681a61dafb adjust rwi index result word position handling used for rwi ranking
8 years ago
reger ff6589fc0f test case: simulating multi word query for local rwi index
8 years ago
reger 3b694b3935 add some javadoc to rwi wordreference distance, position
8 years ago
reger 96467c5467 remove not needed counter in Tokeninzer (completing last changes)
8 years ago
reger 7efb66ee10 adjust the WordReference.join wordsintext calc to take the max (instead of sum)
8 years ago
reger 120bf7e6e2 implemented RWI WordReference to return the word position value (was always left empty)
8 years ago
Michael Peter Christen 103a8348b3 fix for NPE and small performance enhancement
8 years ago
luccioman 6e96c7341a Merge remote-tracking branch 'origin/master'
8 years ago
reger 5aaa057c65 ignore empty input lines in FileUtils.getListArray() to poka joke blacklist read.
9 years ago
reger 4cc38e979d add InputStream close after reading input file (Vocabulary_p servlet)
9 years ago
Burkhard 9a18e2297b Merge pull request #51 from JeremyRand/multiple-boost-query
9 years ago
reger f0d7b93372 make use and activate autodetect charset in Vocabulary input from file
9 years ago
JeremyRand 58824dfa6c Refactor escaping in config file read/write code. Now it uses Apache Commons StringUtils instead of RegEx.
9 years ago
luc 26f1ead57c Created ViewFavicon class specialized in favicon viewing.
9 years ago
luc 07222b3e1a Added favicon url transmission in RWI chunks.
9 years ago
luc 3cc5619d93 Improved HTML icons indexing and rendering in search results.
9 years ago
luc 571bc55937 Refactoring : use StandardCharsets constants instead of hard-coded
9 years ago
sixcooler dce1cb65c4 Merge remote-tracking branch 'choose_remote_name/master'
9 years ago
reger b4b6910d60 fix (todo): correct doc.id of remote search result if no match with newly
9 years ago
reger cb83e65f89 drop returning document language "en" if unknown (fix todo)
9 years ago
luc 70595d05d0 Modified MemoryControl.main() test to properly end for better results
9 years ago
reger cdb8f3b10d make current ranking score value avail. to search interface / api
9 years ago
Michael Peter Christen d82d311995 Merge branch 'master' of https://github.com/luccioman/yacy_search_server
9 years ago
reger 1160b13172 remove unused md5 from ViewFile servlet params
9 years ago
reger b2c8bc0ae6 remove md5_s from default index fields
9 years ago
luc 5bbb2e1730 Ensure resource is closed when reading a full file InputStream
9 years ago
reger 7d0d19cb8e avoid File.deleteOnExit() on temp files
9 years ago
reger 02e4489a23 set tmpfile.deleteOnExit by default,
9 years ago
reger ca3d26a401 harmonize wordsintitle & CollectionSchema.title_words_val calculation,
9 years ago
sixcooler d3b9349b6f simplification / speedup of GenerationMemoryStrategy
9 years ago
luc c38d6c1f37 Correction for mantis 535: inurl: parameter doesn't work on URLs with
9 years ago
reger 3f2b8ab5e5 optionally include mime in p2p url exchange string
9 years ago
reger e37a4f0b3d prevent metadata records in index w/o valid url
9 years ago
Michael Peter Christen c40c302748 when many crawl queues are generated, this NPE can occur; probably
9 years ago
luccioman 2f0f0180e2 Added a function to list files recursively.
9 years ago
reger 0e4ba0360b fix NPE on .yacyh result url of disconnected peer
9 years ago
Michael Peter Christen dbbad23e12 removed warnings
9 years ago
Michael Peter Christen b94bd7f20a a collection of search query enhancements:
9 years ago
Michael Peter Christen 34de1e8cbc gzip compression will perform more efficient and with better compression
10 years ago
Michael Peter Christen a1a8edfc0a wrap HeaReader close() in a catch Throwable block to prevent that an
10 years ago
reger 8b35656007 remove hard throw exception in makeResultEntry
10 years ago
reger dd7782bac0 revert deletion of BinSearch
10 years ago
reger 000dde9511 Eleminate duplication of values for search ResultEntry
10 years ago
reger d882991bc5 Implement sharing of ioDispatcher for term & citation index
10 years ago
reger c60ccdfbcf Increase IODspatcher dumpQueue size to 2 to reduce risk of concurrent emergency dump,
10 years ago
reger 13f013f64a Limit extra sleep of BusyThread on LowMemCycle
10 years ago
Michael Peter Christen fed26f33a8 enhanced timezone managament for indexed data:
10 years ago
Michael Peter Christen 9bf0d7ecb9 added a new collection type 'dht' to all documents from the peer-to-peer
10 years ago
Michael Peter Christen ee2490ab98 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 431311df42 fix get fresh_date_dt to allow returned value to be date in future
10 years ago
otter 74c7e8b686 Fixes hanging FlushThread (see
10 years ago
reger 706f75ddc2 try to fix hang on index blob merge on shutdown
10 years ago
Michael Peter Christen fd4e2c809a Show dates in the content of a document in the search result:
10 years ago
reger df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
10 years ago
Michael Peter Christen ac19690d30 refactoring with CommonPattern.COMMA
10 years ago
Michael Peter Christen 3d717b749a fix for urlmaskfilter
10 years ago
reger 24f68a4eb7 refactor opensearch heuristic
10 years ago
reger 8e751d754a - add javadoc to busythread with hint about the init parameter useage
10 years ago
Michael Peter Christen 3cd7deb3b8 do not flush non-errors to stdout because this is a concurrency issue.
10 years ago
reger 198102304b refactor size() -> filesize() of URIMetadataNode
10 years ago
reger c6f634a4f2 remove redundant caching of urlhash in URIMetadataNode
10 years ago
Michael Peter Christen 413eeefed4 added character set detection library from
10 years ago
Michael Peter Christen a304058840 added Image Events as another option to generate images with a mac if no
10 years ago
Michael Peter Christen 321840fde3 Replaced all fixed thread pools with cached thread pools. The cached
10 years ago
Michael Peter Christen 7bfab5eb9d set Busy- and Blocking-Threads to daemon mode (they will now not prevent
10 years ago
Michael Peter Christen ad0da5f246 added new web page snapshot infrastructure which will lead to the
10 years ago
Michael Peter Christen 4920ab7b76 optimize usage of size() cache
10 years ago
Michael Peter Christen 2beb6abeb6 disabled crazy sleep loop
10 years ago
Michael Peter Christen 8aee7f940e added missing class for latest changes
10 years ago
Michael Peter Christen 97039049e4 fix in key enumeration methods for cases where the enumeration is done
10 years ago
Michael Peter Christen 421ee64f33 another fix to ordering of table indexes; fixes also network stats
10 years ago
Michael Peter Christen 1db476c67e fix for bad table iteration
10 years ago
orbiter 0fcd8097a3 removed unused options from BusyThreads
10 years ago