Commit Graph

23 Commits (09783ae89eb11688d86e4a3c1c539fb8566fde49)

Author SHA1 Message Date
Michael Peter Christen 0689f4f0ae Check if the character is a minus sign and is followed by a letter or a
1 year ago
Michael Peter Christen 5db97a8928 parser can now separate numbers from words also when they are not
1 year ago
Michael Peter Christen e3797de7de enhanced the word tokenizer to recognize numbers in a proper way
1 year ago
Michael Peter Christen 8285fe715a tab to spaces for classes supporting the condenser.
1 year ago
luccioman e115e57cc7 Reduced text snippet extraction processing time.
7 years ago
reger 272cdd496a reactivate sentence counter in WordTokenizer for phrasepos ranking,
8 years ago
Michael Peter Christen 90f75c8c3d added enrichment of synonyms and vocabularies for imported documents
10 years ago
reger 0c97cc2440 skip unused call parameter for hashSentence()
10 years ago
reger 5790c7242e skip to tokenize punktuation as word in WordTokenizer
10 years ago
Michael Peter Christen 9bb7eab389 hacks to prevent storage of data longer than necessary during search and
11 years ago
Michael Peter Christen 8219a445f3 refactoring
12 years ago
orbiter 63762d8f89 removed kelondro dependencies from cora
12 years ago
orbiter 78fc3cf8f8 refactoring and new usage of SentenceReader: this class appeared as one
13 years ago
Roland 'Quix0r' Haeder a093ccf5eb Now used synchronization in all close() methods to make sure all objects
13 years ago
Michael Peter Christen ef78f22ee1 performance hack
13 years ago
Al Sutton 8993cac4d8 Initial performance improvements
13 years ago
orbiter 0d858d48ec replaced String with StringBuilder in suggestion process
13 years ago
orbiter 6b22865dbc - removed some warinings
13 years ago
sixcooler eb14111200 encapsulate potential expensive objects in TextSnippet to allow GC them asap
13 years ago
orbiter 694fa3a2a5 - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion
14 years ago
low012 3b40b98256 *) set SVN properties
14 years ago
orbiter 5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a muti-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants that all robinson peers are asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the number of maximum httpc connections was also increased.
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago