Commit Graph

47 Commits (3c3cb7855555d5b2712bb1bc05cddb5c3175ec45)

Author SHA1 Message Date
Michael Peter Christen 1b4fa2947d - fixed a problem which occurred when a document was not recognized with
11 years ago
Michael Peter Christen d328cc4a83 fix for didyoumean, added also more asian alphabets
11 years ago
Michael Peter Christen 61c5e40687 - replaced the properties object in AnchorURL with distinct variables
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
Michael Peter Christen 57ffdfad4c added a crawl option to obey html-meta-robots-noindex. This is on by
12 years ago
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name
12 years ago
Michael Peter Christen 8f2d3ce2f9 reduced locking situation in crawler: shifted synchronized location and
12 years ago
Michael Peter Christen 1b102d98d8 - added index deletion to index administration submenu
12 years ago
Michael Peter Christen 25499eead5 - added a new field for the regular expression in crawl start
12 years ago
Michael Peter Christen 1d30082446 added hindi translation configuration
12 years ago
orbiter 940c6849ee enhanced did-you-mean (a bit): can now remember previously searched
12 years ago
Michael Peter Christen c95a84103a complete redesign of search process:
12 years ago
Michael Peter Christen 788288eb9e added the generation of 50 (!!) new solr fields in the core 'webgraph'.
12 years ago
Michael Peter Christen b6de1f42dc Full redesign of solr connection architecture. This was done to support
12 years ago
Michael Peter Christen c20fa3640d fix to unbalanced tag and license for null objects
12 years ago
Michael Peter Christen de58043205 Added image license generation for solr image search results when
12 years ago
Michael Peter Christen 762b687e47 extended the serverObjects to be able to hold multiple values for a
12 years ago
Michael Peter Christen 6a4878940b fix in html parser and bookmark generation
12 years ago
Michael Peter Christen dee8b24d3c better error handling for bookmarks
12 years ago
Michael Peter Christen ab4f74c82c fix for xml blacklist import
12 years ago
Copro 3ea8380959 Adding Vimeo tag to wiki commands to embed video with id
12 years ago
Copro ee9d7fd93d Added feature to embed YouTube videos to wiki commands for usage in
12 years ago
Michael Peter Christen 9e4033f229 fix for event starter: delete start time when event is removed
12 years ago
Michael Peter Christen 99271ffd13 copy work tables from defaults/data/work if exist there and not in
12 years ago
Michael Peter Christen 24c9bb35f7 extended the Scheduler: introduced scheduled events
12 years ago
orbiter a4a780b871 - fix for bad url conversion in bookmarks when using smb urls
12 years ago
Michael Peter Christen 3de784c8dd replaced more split and replaceAll missing pattern pre-compilation with
12 years ago
Michael Peter Christen 4eab3aae60 removed overhead by preventing generation of full search results when
12 years ago
Michael Peter Christen f5ca5cea44 - added field options to all solr queries. This can be used to restrict
12 years ago
cominch 05742b4562 remove old SMW importer which was part of the ymarks package
12 years ago
Michael Peter Christen 0fe8be7981 enhanced data structures for balancer and latency computation which
12 years ago
Michael Peter Christen ac9540dfb6 removed options for stopwords which are not used
12 years ago
Michael Peter Christen a33e2742cb - removed unnecessary synchronized and deadlock in crawler
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 21fe8339b4 - enhanced generation of url objects
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the
12 years ago
Michael Peter Christen 2f536cb54d code cleanup: removed unused methods and made more methods and objects
12 years ago
orbiter 3190347814 added a synonyms_t field to solr and a process to read synonym files.
12 years ago
Michael Peter Christen 24f4ca4d85 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
12 years ago
apfelmaennchen 116f429e35 fix for java.lang.RuntimeException: TableColumnIndex not available...
12 years ago
Michael Peter Christen 1533bfd63b refactoring
12 years ago
Michael Peter Christen 8219a445f3 refactoring
12 years ago
Michael Peter Christen 00c1c777fa refactoring
12 years ago