Commit Graph

9468 Commits (70e981b3330c07d8def36f481ef396cf645f6370)
 

Author SHA1 Message Date
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects
12 years ago
Michael Peter Christen 4023d88b0b added date info in parser errors
12 years ago
Michael Peter Christen 1b02408936 use less cache
12 years ago
Michael Peter Christen e45a3235e0 default cache size was much too high; decreased solr cache size
12 years ago
Michael Peter Christen 613cf7da7f enhancement to post argument parsing - possible fix to zero-filled
12 years ago
Michael Peter Christen 36c13ed15b less solr prefetch
12 years ago
Michael Peter Christen f3fc8eac80 fixed clear scripts
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the
12 years ago
Michael Peter Christen 53789555b9 fix for crawl start filter
12 years ago
Michael Peter Christen abebb3b124 added a crawl start checker which makes a simple analysis on the list of
12 years ago
Michael Peter Christen 941873fba4 moved the index deletion functions from IndexControlRWIs to
12 years ago
orbiter ae246c30c3 fixed interpretation of directDocByURL attribute during crawl start
12 years ago
orbiter 68d0f8de03 Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1
12 years ago
reger bfb0d4c69b - add language detection from <html lang="xx"> tag
12 years ago
Michael Peter Christen 7e3e45fd04 added Open Graph Metadata default fields, see http://ogp.me/ns#
12 years ago
Michael Peter Christen c3e5f667a7 added schema.org breadcrumb counter to parser and solr schema
12 years ago
Michael Peter Christen a06930662c replaced some more .getBytes() with UTF8/ASCII.getBytes()
12 years ago
Michael Peter Christen bd769de604 since the solr index is now used for all pages that are indexed locally,
12 years ago
Michael Peter Christen 554db5608b fix for ViewFile
12 years ago
Michael Peter Christen 4b5e0c1500 added an url rewriter which can be used to remove session ids from urls
12 years ago
orbiter 9190599d21 use links in AccessTracker
12 years ago
Michael Peter Christen 877042a6b5 fix for portal mode
12 years ago
Michael Peter Christen 42e525ca9a enhanced the host browser
12 years ago
Michael Peter Christen 76d218fbef fixes to crawl profiles
12 years ago
Michael Peter Christen 2f536cb54d code cleanup: removed unised methods and made more methods and objects
12 years ago
Michael Peter Christen 584663ae8c - redesign of solr query construction
12 years ago
Michael Peter Christen 6ab64746d7 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
Michael Peter Christen a8167e6e5b clean-up: removed unused methods in kelondro
12 years ago
sof 5cb244b79b Merge remote branch 'origin/master'
12 years ago
apfelmaennchen 88b062210c Added a parser for audio file tags (e.g. ID3 tags for MP3 files) based
12 years ago
Michael Peter Christen 28bd3e62b1 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 4fed4a86d8 another fix to location search
12 years ago
orbiter 507c612015 Merge branch 'master' of git://gitorious.org/~reger/yacy/bbyacy-rc1
12 years ago
reger 5650b0333e adjusted Netbeans-IDE classpath to current jars
12 years ago
reger b58e1f6d67 - add translation for ConfigHeuristics_p.html # section search-result
12 years ago
orbiter 0f7a54452d fix for location search query encoding
12 years ago
Michael Peter Christen 679d562908 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
sixcooler 9aa21506be bump to httpcore-4.2.2 (maintenance release)
12 years ago
Michael Peter Christen 31485a963d refactoring
12 years ago
Michael Peter Christen 406e1f3e7e added an option to start indexing right from the host browser
12 years ago
Michael Peter Christen f8a3ab2d82 added the usage of synonyms to the GSA search interface
12 years ago
Michael Peter Christen 3d33a5bdf6 turned the synonyms_t Text field into a multi-valued String field
12 years ago
Michael Peter Christen 41ab2a2279 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter c8b1a693dc ups, added missing class for last commit
12 years ago
Michael Peter Christen 3b959ee002 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
orbiter 3190347814 added a synonyms_t field to solr and a process to read synonym files.
12 years ago
Michael Peter Christen 411d0e839b added an underline text field to solr to record all underlined texts
12 years ago
orbiter be4c96f3b1 The HostBrowser now offers to index files that are discovered because
12 years ago
Michael Peter Christen c4a3d8870f fixed computation of links in host browser which are not indexed but
12 years ago
Michael Peter Christen 97a47319c8 added nice links to the host browser:
12 years ago