Commit Graph

186 Commits (d223cf0ae4b5853841f141fec98590497798b1d5)

Author SHA1 Message Date
Michael Peter Christen 151ccd50a9 fix for image size field values (must be multi-valued)
9 years ago
reger eaf0e8ff2c start recording/indexing pixel size for image document
9 years ago
reger c33229fc0c check mime prior to ext for metadata modification for images
9 years ago
Michael Peter Christen 8028410ab7 Merge branch 'master' of git@github.com:yacy/yacy_search_server.git
9 years ago
Michael Peter Christen df3314ac1a added a new facet type based on a probabilistic classifier using
9 years ago
reger 1409cabe8b exclude more default search fields from text copy to text_t
9 years ago
Michael Peter Christen 0aa6fcf259 remove old vocabularies and synonyms before adding new
9 years ago
reger f91298d3b6 fix one implicit Integer/Long type conversion
9 years ago
reger 821262a179 add CommonPattern for multiple spaces
10 years ago
Michael Peter Christen 90f75c8c3d added enrichment of synonyms and vocabularies for imported documents
10 years ago
reger f3ce99bfb8 fix extract of inboundlinks_protocol_sxt
10 years ago
reger 5408448a56 skip redundant add. of keywords to text
10 years ago
Michael Peter Christen b060ba900d added parsing of contentprop attribute in html tags for
10 years ago
Michael Peter Christen 4cb4f67f38 added parsing of dd, dt and article html fields. The parsed result is
10 years ago
reger 1395f10e95 fix typecast for css links
10 years ago
reger 7e09bff4a1 exclude default search fields from text copy to text_t
10 years ago
Michael Peter Christen 535f1ebe3b added a new way of content browsing in search results:
10 years ago
reger 9e1ec5fec4 refactor: just some more useages of constant for term ":[* TO *]"
10 years ago
Michael Peter Christen 68c605d637 replace with CommonPattern.SPACE for split
10 years ago
Michael Peter Christen 3e6c3e2237 documents pushed over the api/push_p.html interface will have their
10 years ago
Michael Peter Christen d2792a43fd do not write iframe and embed links into webgraph, but use them anyway
10 years ago
Michael Peter Christen eb78388a98 changed prefer strategy for http unique in such a way that http is
10 years ago
Michael Peter Christen 66b5a56976 Added and integrated new date detection class which can identify date
10 years ago
Michael Peter Christen 6a1865f507 refactoring date -> lastModified
10 years ago
reger 70cf7060a4 coding fixes suggested in
10 years ago
Michael Peter Christen 70f03f7c8e do not cache search requests to Solr if the result is used for
10 years ago
Michael Peter Christen c67c5c0709 added new solr schema fields which record the occurences of vocabulary
10 years ago
Michael Peter Christen 0550b54d56 added fix to postprocessing: avoid caching of postprocessing collection
10 years ago
Michael Peter Christen 0a879c98e7 added new 'firstSeen' database table and necessary data structures which
10 years ago
Michael Peter Christen 95d87f00b3 fix for bad query generation in doublecheck in postprocessing
10 years ago
Michael Peter Christen 92007e5d2d more enhancements to posprocessing speed
10 years ago
Michael Peter Christen 9a7fe9e0d1 fix for bad timing computation in postprocessing
10 years ago
Michael Peter Christen bd16119a00 another fix for postprocessing (the query for "" on numeric field did
10 years ago
Michael Peter Christen 327e83bfe7 more fixes in postprocessing: partitioning of the complete queue to
10 years ago
orbiter 71758f0d62 enhanced postprocessing by usage of a field-list generation to prevent
10 years ago
Michael Peter Christen fe537679de fix for exact_signature_unique_b, exact_signature_copycount_i,
10 years ago
Michael Peter Christen 2e5214eb21 added field postprocessing.partialUpdate to settings which can be used
10 years ago
Michael Peter Christen 2e09da9832 npe fix
10 years ago
Michael Peter Christen d80418f1b1 added partial updates to solr during postprocessing: during
10 years ago
Michael Peter Christen b1cfbc4a04 added new solr field url_paths_count_i which can be used to enhance the
10 years ago
Michael Peter Christen 30d4402cd1 fixed location search
10 years ago
orbiter f3a12801f0 Merge branch 'master' of git@gitorious.org:yacy/rc1.git
10 years ago
orbiter d93325a578 lazy handling of process_sxt field (part of postprocessing)
10 years ago
reger b5ca20de15 preserve content_type (mime) if supplied in preference of construct in from file type.
10 years ago
reger fb1fcc2b03 handle noarchive tag, skip writing page to cache
10 years ago
Michael Peter Christen 2645dc816a added warning for not well-formed postprocessing queries
10 years ago
Michael Peter Christen 6d3d4c4ea6 changed the concurrent enumeration of query results in such a way that
10 years ago
Michael Peter Christen e87dc08c0d set the correct fail time in error docs
10 years ago
Michael Peter Christen a7dd89c4de changed method to write the citation index: do not catch up references
10 years ago
orbiter d68438c3d9 make sure that the postprocessing background thread never dies by any
10 years ago