Commit Graph

398 Commits (a725a4242f769b97bf5c1c585db3765af5372619)

Author SHA1 Message Date
Michael Peter Christen addba047e2 changes in ranking computation 12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr field in the core 'webgraph'. 12 years ago
Michael Peter Christen 6a4878940b fix in html parser and bookmark generation 12 years ago
reger 3897bb4409 added (manual) urldb migration (link on: Index Administraton -> Federated Solr Index) 12 years ago
reger 168b1d130d Adding heuristic to get search results from configured systems which support opensearch specification 12 years ago
Michael Peter Christen 95712fdc8b update to pdf parser 12 years ago
Michael Peter Christen 34f8786508 removed dependency of vocabulary navigation from Jena and it's 12 years ago
Michael Peter Christen 72f165d58b added a Boost class which stores solr query boost values. The class can 12 years ago
Michael Peter Christen b5ee88c6af added more logging to get info which url causes performance problems 12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents. 12 years ago
Michael Peter Christen f5ca5cea44 - added field options to all solr queries. This can be used to restrict 12 years ago
orbiter 5dfd6359cb redesign of the QueryParams class: introduced QueryGoal which holds the 12 years ago
Michael Peter Christen d88eb657fd Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1 13 years ago
Michael Peter Christen 6905182d41 - fix for number of words log message 13 years ago
Michael Peter Christen a33e2742cb - removed unnecessary synchronized and deadlock in crawler 13 years ago
reger 722a447b0d - optimize code of augmented parsing to enhence document tags 13 years ago
orbiter 276dd6452b removed warnings 13 years ago
Michael Peter Christen b991685782 Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1 13 years ago
Michael Peter Christen b7ac1da6a3 gsa results shall have only one title in metadata and that should be the 13 years ago
reger 87aab9aa7c - fix: with augmented parsing = on; missing metadata in index (like title) due to overwriting metadata by adding multiple result docs from augmentparser with same url 13 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow 13 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects 13 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the 13 years ago
orbiter 68d0f8de03 Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1 13 years ago
reger bfb0d4c69b - add language detection from <html lang="xx"> tag 13 years ago
Michael Peter Christen 7e3e45fd04 added Open Graph Metadata default fields, see http://ogp.me/ns# 13 years ago
Michael Peter Christen c3e5f667a7 added schema.org breadcrumb counter to parser and solr schema 13 years ago
Michael Peter Christen 4b5e0c1500 added an url rewriter which can be used to remove session ids from urls 13 years ago
Michael Peter Christen 584663ae8c - redesign of solr query construction 13 years ago
Michael Peter Christen 6ab64746d7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
sof 5cb244b79b Merge remote branch 'origin/master' 13 years ago
apfelmaennchen 88b062210c Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based 13 years ago
Michael Peter Christen 31485a963d refactoring 13 years ago
Michael Peter Christen 3d33a5bdf6 turned the synonyms_t Text field into a multi-valued String field 13 years ago
Michael Peter Christen 3b959ee002 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
orbiter 3190347814 added a synonyms_t field to solr and a process to read synonym files. 13 years ago
Michael Peter Christen 411d0e839b added an underline text field to solr to record all underlined texts 13 years ago
Michael Peter Christen 24d2ee3c52 - better date ranking 13 years ago
sixcooler 6c50d016ed pdf- and zipParser should not use forced Memory-Limits 13 years ago
Michael Peter Christen 1533bfd63b refactoring 13 years ago
Michael Peter Christen 8219a445f3 refactoring 13 years ago
Michael Peter Christen 00c1c777fa refactoring 13 years ago
orbiter 63762d8f89 removed kelondro dependencies from cora 13 years ago
Michael Peter Christen e54ac38095 - some corrections in usage of getFile() and getFileName() 13 years ago
Michael Peter Christen 528d6763fa - added new solr fields: 13 years ago
Michael Peter Christen e8acd542b5 - added faceted drill-down for host and geolocation to solr queries 13 years ago
orbiter 67f2866cd0 small fixes 13 years ago
orbiter 67edfd991c Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git 13 years ago
orbiter d9173ba7ed added more solr fields to integrate values from URIMetadataRow. All 13 years ago
Michael Peter Christen 24d9db1613 snippet retrieval loading processes may use a smaller minimum load time 13 years ago