Commit Graph

881 Commits (6b45cd579922574059e5385153b84be3ca07533b)

Author SHA1 Message Date
reger ff6589fc0f test case: simulating multi word query for local rwi index
8 years ago
reger 3b694b3935 add some javadoc to rwi wordreference distance, position
8 years ago
reger 96467c5467 remove not needed counter in Tokenizer (completing last changes)
8 years ago
reger 7efb66ee10 adjust the WordReference.join wordsintext calc to take the max (instead of sum)
8 years ago
reger 120bf7e6e2 implemented RWI WordReference to return the word position value (was always left empty)
8 years ago
Michael Peter Christen 103a8348b3 fix for NPE and small performance enhancement
8 years ago
luccioman 6e96c7341a Merge remote-tracking branch 'origin/master'
8 years ago
reger 5aaa057c65 ignore empty input lines in FileUtils.getListArray() to poka-yoke blacklist read.
9 years ago
reger 4cc38e979d add InputStream close after reading input file (Vocabulary_p servlet)
9 years ago
Burkhard 9a18e2297b Merge pull request #51 from JeremyRand/multiple-boost-query
9 years ago
reger f0d7b93372 make use and activate autodetect charset in Vocabulary input from file
9 years ago
JeremyRand 58824dfa6c Refactor escaping in config file read/write code. Now it uses Apache Commons StringUtils instead of RegEx.
9 years ago
luc 26f1ead57c Created ViewFavicon class specialized in favicon viewing.
9 years ago
luc 07222b3e1a Added favicon url transmission in RWI chunks.
9 years ago
luc 3cc5619d93 Improved HTML icons indexing and rendering in search results.
9 years ago
luc 571bc55937 Refactoring : use StandardCharsets constants instead of hard-coded
9 years ago
sixcooler dce1cb65c4 Merge remote-tracking branch 'choose_remote_name/master'
9 years ago
reger b4b6910d60 fix (todo): correct doc.id of remote search result if no match with newly
9 years ago
reger cb83e65f89 drop returning document language "en" if unknown (fix todo)
9 years ago
luc 70595d05d0 Modified MemoryControl.main() test to properly end for better results
9 years ago
reger cdb8f3b10d make current ranking score value avail. to search interface / api
9 years ago
Michael Peter Christen d82d311995 Merge branch 'master' of https://github.com/luccioman/yacy_search_server
9 years ago
reger 1160b13172 remove unused md5 from ViewFile servlet params
9 years ago
reger b2c8bc0ae6 remove md5_s from default index fields
9 years ago
luc 5bbb2e1730 Ensure resource is closed when reading a full file InputStream
9 years ago
reger 7d0d19cb8e avoid File.deleteOnExit() on temp files
9 years ago
reger 02e4489a23 set tmpfile.deleteOnExit by default,
9 years ago
reger ca3d26a401 harmonize wordsintitle & CollectionSchema.title_words_val calculation,
9 years ago
sixcooler d3b9349b6f simplification / speedup of GenerationMemoryStrategy
9 years ago
luc c38d6c1f37 Correction for mantis 535: inurl: parameter doesn't work on URLs with
9 years ago
reger 3f2b8ab5e5 optionally include mime in p2p url exchange string
9 years ago
reger e37a4f0b3d prevent metadata records in index w/o valid url
9 years ago
Michael Peter Christen c40c302748 when many crawl queues are generated, this NPE can occur; probably
9 years ago
luccioman 2f0f0180e2 Added a function to list files recursively.
9 years ago
reger 0e4ba0360b fix NPE on .yacyh result url of disconnected peer
9 years ago
Michael Peter Christen dbbad23e12 removed warnings
9 years ago
Michael Peter Christen b94bd7f20a a collection of search query enhancements:
9 years ago
Michael Peter Christen 34de1e8cbc gzip compression will perform more efficient and with better compression
10 years ago
Michael Peter Christen a1a8edfc0a wrap HeapReader close() in a catch Throwable block to prevent that an
10 years ago
reger 8b35656007 remove hard throw exception in makeResultEntry
10 years ago
reger dd7782bac0 revert deletion of BinSearch
10 years ago
reger 000dde9511 Eliminate duplication of values for search ResultEntry
10 years ago
reger d882991bc5 Implement sharing of ioDispatcher for term & citation index
10 years ago
reger c60ccdfbcf Increase IODispatcher dumpQueue size to 2 to reduce risk of concurrent emergency dump,
10 years ago
reger 13f013f64a Limit extra sleep of BusyThread on LowMemCycle
10 years ago
Michael Peter Christen fed26f33a8 enhanced timezone management for indexed data:
10 years ago
Michael Peter Christen 9bf0d7ecb9 added a new collection type 'dht' to all documents from the peer-to-peer
10 years ago
Michael Peter Christen ee2490ab98 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
10 years ago
reger 431311df42 fix get fresh_date_dt to allow returned value to be date in future
10 years ago
otter 74c7e8b686 Fixes hanging FlushThread (see
10 years ago
reger 706f75ddc2 try to fix hang on index blob merge on shutdown
10 years ago
Michael Peter Christen fd4e2c809a Show dates in the content of a document in the search result:
10 years ago
reger df83fcc4fc disable optimistic GC assumption in StandardMemoryStrategy
10 years ago
Michael Peter Christen ac19690d30 refactoring with CommonPattern.COMMA
10 years ago
Michael Peter Christen 3d717b749a fix for urlmaskfilter
10 years ago
reger 24f68a4eb7 refactor opensearch heuristic
10 years ago
reger 8e751d754a - add javadoc to busythread with hint about the init parameter usage
10 years ago
Michael Peter Christen 3cd7deb3b8 do not flush non-errors to stdout because this is a concurrency issue.
10 years ago
reger 198102304b refactor size() -> filesize() of URIMetadataNode
10 years ago
reger c6f634a4f2 remove redundant caching of urlhash in URIMetadataNode
10 years ago
Michael Peter Christen 413eeefed4 added character set detection library from
10 years ago
Michael Peter Christen a304058840 added Image Events as another option to generate images with a mac if no
10 years ago
Michael Peter Christen 321840fde3 Replaced all fixed thread pools with cached thread pools. The cached
10 years ago
Michael Peter Christen 7bfab5eb9d set Busy- and Blocking-Threads to daemon mode (they will now not prevent
10 years ago
Michael Peter Christen ad0da5f246 added new web page snapshot infrastructure which will lead to the
10 years ago
Michael Peter Christen 4920ab7b76 optimize usage of size() cache
10 years ago
Michael Peter Christen 2beb6abeb6 disabled crazy sleep loop
10 years ago
Michael Peter Christen 8aee7f940e added missing class for latest changes
10 years ago
Michael Peter Christen 97039049e4 fix in key enumeration methods for cases where the enumeration is done
10 years ago
Michael Peter Christen 421ee64f33 another fix to ordering of table indexes; fixes also network stats
10 years ago
Michael Peter Christen 1db476c67e fix for bad table iteration
10 years ago
orbiter 0fcd8097a3 removed unused options from BusyThreads
10 years ago
sixcooler 72561926aa do not overwrite yacy.conf in case of an exception
10 years ago
Michael Peter Christen bc275dca07 added network history graph image /NetworkHistory.png which can show
10 years ago
Michael Peter Christen ee27be3399 misc bugfixes (concurrency, memory protection)
10 years ago
Michael Peter Christen 7817fc50c9 added a high cpu cycle monitor to PerformanceQueues
10 years ago
orbiter 3ac31614a3 added option to reverse-sort YaCy tables (internal API change only)
10 years ago
Michael Peter Christen ec6082c872 very bad language detection hack fix hack
10 years ago
Michael Peter Christen a7dd89c4de changed method to write the citation index: do not catch up references
10 years ago
reger ea6c9e9b07 reduce mem buffer overhead for gap files during r/w
10 years ago
orbiter 487021fb0a snippet computation update
10 years ago
Michael Peter Christen 0ceeceb35e more logic on Solr queries; usage of the query terms in postprocessing,
10 years ago
reger 2b8cc5832c fix seek error for 0 file size records file
11 years ago
reger 2ba394333f fix Crawler HostQueue release of stackfile
11 years ago
Michael Peter Christen 501d55cd35 removed superfluous assert
11 years ago
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
Michael Peter Christen 6634b5b737 debug code for index distribution testing
11 years ago
orbiter 97983ba89f fixed generics warnings for generic array instantiation that appeared
11 years ago
orbiter 88f4af90da removed warnings
11 years ago
orbiter 89f76da24b Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
sixcooler b8cee9b7d8 remove tables from tabletracker on close to avoid lots of dead entries in
11 years ago
orbiter f15c832587 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
11 years ago
reger ffc5b75c73 optimize and fix lat / lon assignment
11 years ago
reger 9313447de2 reimplement tighter lat/lon calc in URIMetadataNode
11 years ago
orbiter a3542f29b4 npe fix
11 years ago
orbiter c48d2a2a02 npe fix
11 years ago
orbiter 12ba890205 removed warnings
11 years ago
reger 727dfb5875 refactor URIMetadataNode to further unify interaction with index
11 years ago
Michael Peter Christen 1aea01fe5b fix for Table in case that requested file does not exist and paths also
11 years ago
Michael Peter Christen da86f150ab - added a new Crawler Balancer: HostBalancer and HostQueues:
11 years ago