Commit Graph

52 Commits (8d650ca22518741b4392e8539aae90f7eeb3366d)

Author SHA1 Message Date
Michael Peter Christen f0db501630 better handling of ranking parameters and new default values for date
11 years ago
reger ffc5b75c73 optimize and fix lat / lon assignment
11 years ago
reger 9313447de2 reimplement tighter lat/lon calc in URIMetadataNode
11 years ago
orbiter 12ba890205 removed warnings
11 years ago
reger 727dfb5875 refactore URIMetadataNode to further unify interaction with index
11 years ago
reger 227c42bc96 eleminate obsolete URIMetaDataRow class
11 years ago
reger c9f92abddc fix: application link count
11 years ago
reger a3e2cca8e9 improve isOlder check to not overwrite node index with metadata on equal load date
11 years ago
Michael Peter Christen 1b4fa2947d - fixed a problem which ocurred when a document was not recognized with
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen e8e558a9b7 fix for content domain classification in URIMetadataNode
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
Michael Peter Christen cf12835f20 replaced the single-text description solr field with a multi-value
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
reger a6bf44212e bugfix: location (lat/lon) meta data retrival (Double.NaN check)
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen addba047e2 changes in ranking computation
12 years ago
Michael Peter Christen 2b6c79d347 in method exists() also use the new caching-stacks for
12 years ago
Michael Peter Christen 3b1d9dc884 made index storage from DHT search result concurrently. This prevents
12 years ago
orbiter d74472f562 corrected result counter
12 years ago
Michael Peter Christen 91a0401d59 introduced a second core named 'webgraph'. This core will hold the link
12 years ago
Michael Peter Christen 24c9bb35f7 extended the Scheduler: introduced scheduled events
12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents.
12 years ago
Michael Peter Christen ce0e5b1e17 - more refactoring / private methods
12 years ago
Michael Peter Christen ccc3760a47 Refactoring and redesign of data architecture to make URIMetadataRow
12 years ago
Michael Peter Christen e5b3c172ff removed hack which translated Solr documents to virtual RWI entries
12 years ago
Michael Peter Christen 43f3345c90 - removed dependencies from URIMetadataRow and made direct access to
12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the
12 years ago
Michael Peter Christen 2f536cb54d code cleanup: removed unised methods and made more methods and objects
12 years ago
Michael Peter Christen 24d2ee3c52 - better date ranking
12 years ago
Michael Peter Christen ca313e404f - if a "/date" modifier is used, the solr remote query applies an
12 years ago
Michael Peter Christen 1533bfd63b refactoring
12 years ago
Michael Peter Christen 872f83ebe0 refactoring
12 years ago
Michael Peter Christen 8219a445f3 refactoring
12 years ago
Michael Peter Christen 00c1c777fa refactoring
12 years ago
orbiter 563d584420 removed more dependencies in cora from kelondro
12 years ago
Michael Peter Christen d8425e6809 added collections to crawl monitor
12 years ago
Michael Peter Christen 528d6763fa - added new solr fields:
12 years ago
Michael Peter Christen 316b5fe116 - added a solr type definition verifier
12 years ago
Michael Peter Christen e8acd542b5 - added faceted drill-down for host and geolocation to solr queries
12 years ago
Michael Peter Christen 4716546ef5 - reduced memory usage in index transmission using a transformation of
12 years ago
Michael Peter Christen 06b0081fdc fix for NPE during host navigation computation
12 years ago
Michael Peter Christen 755f5e76cf removed strange assert statements and simplified code in metadata
12 years ago
Michael Peter Christen 9bece5ac5f enhanced snippet fetch - removed a bug that caused documents to be
12 years ago
Michael Peter Christen e5ef840f40 - renamed DoubleSolrConnector to MirrorSolrConnector and added a
12 years ago
Michael Peter Christen 94a334f128 another fix to the Solr metadata reading process and to the shutdown
12 years ago
Michael Peter Christen b51df6c7e8 - added coordinate storage in solr schema
12 years ago
Michael Peter Christen f9c0e6e950 - Implemented and integrated the URIMetadataNode object which is a
12 years ago
orbiter 0cbda0b2b8 - replaced all length() == 0 and size() == 0 with isEmpty()
13 years ago