Commit Graph

87 Commits (6d3d4c4ea63a9eae81ec69cf05787756245664d1)

Author SHA1 Message Date
Michael Peter Christen 6d3d4c4ea6 changed the concurrent enumeration of query results in such a way that
10 years ago
orbiter 500e0b9927 fix for browsing of file paths in Index Browser
10 years ago
orbiter 22ce4fb4dd better error handling for remote solr queries and exists-checks
10 years ago
Michael Peter Christen b5fc2b63ea removed exist() retrieval functions from error cache and replaced it
11 years ago
Michael Peter Christen 62c72360ee cleanup of checkAcceptanceInitially in CrawlStacker, should avoid
11 years ago
orbiter 4ee4ba1576 fix for NPE in IndexCreateParserErrors_p.html caused by bad handling of
11 years ago
Michael Peter Christen 10cf8215bd added crawl depth for failed documents
11 years ago
Michael Peter Christen b4b0d14c04 fix for display bug
11 years ago
Michael Peter Christen 9a5ab4e2c1 removed clickdepth_i field and related postprocessing. This information
11 years ago
Michael Peter Christen bd886054cb new structure and enhancements for link graph computation:
11 years ago
Michael Peter Christen a6bb9be97e - added d3.js for visualizations using embedded svg
11 years ago
Michael Peter Christen cca851a417 introduced new solr field crawldepth_i which records the crawl depth of
11 years ago
Michael Peter Christen ff82a80eb3 Integrated HostBrowser back to administration interface; it can appear
11 years ago
Michael Peter Christen fda591695c fixed visibility of custom icon
11 years ago
Michael Peter Christen 8b44fcf0f4 added missing @Override annotation
11 years ago
orbiter 3c8d6e1eee added adminAccount switch to ConfigAccounts_p servlet to switch on
11 years ago
Michael Peter Christen 92655c7fd9 - added bootstrap css framework
11 years ago
Michael Peter Christen b08375da33 fix for bad/missing values of size_i
11 years ago
Michael Peter Christen 51800007c4 - added concurrency to postprocessing of webgraph document
11 years ago
Michael Peter Christen fdaeac374a - enhanced postprocessing speed and memory footprint (by using HashMaps
11 years ago
Michael Peter Christen 0f6b72f24b do not use luke requests for remote solr servers if the result is
11 years ago
orbiter da5d4128bf prevent npe
11 years ago
orbiter f6e441dd77 refactoring
11 years ago
orbiter c3f6c06f2c removed host increment on stored documents from crawler (that was wrong)
11 years ago
Michael Peter Christen 69391e5d9e changed strategy to test existence of documents in Solr: using the
11 years ago
Michael Peter Christen 8b14e92ba4 added button in host browser to re-load 404/failed documents
11 years ago
Michael Peter Christen 234ca720f5 only admins should be able to force a commit
11 years ago
Michael Peter Christen 64048ff217 fir for XSS
11 years ago
Michael Peter Christen ffe8276063 replaced referrer link masking to 'pure' links to the referring page
11 years ago
Michael Peter Christen 434e13b46d in host browser also show the properties of failed documents including
11 years ago
orbiter 1ac504ae51 use html encoding for urls in metadata
11 years ago
Michael Peter Christen 2602be8d1e - removed ZURL data structure; removed also the ZURL data file
11 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not
11 years ago
Michael Peter Christen e137ff4171 refactoring (im preparation for new removeHost method)
11 years ago
orbiter f106345eef link strings should not be tokenized
11 years ago
Michael Peter Christen 76afcccaaf fix for default boolean post values: the default value MUST NOT be TRUE,
11 years ago
Michael Peter Christen 4c242f9af9 always use a default value for boolean options to have transparency for
11 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler
11 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog:
12 years ago
Michael Peter Christen 203921006a redesign of citation index storage
12 years ago
Michael Peter Christen 570511f3c8 removed fields references_internal_id_sxt and
12 years ago
Michael Peter Christen f7e77a21bf Added a citation reference computation for intra-domain link structures.
12 years ago
Michael Peter Christen 8dbc80da70 redesign of index.exist-test: this shall now not be done using a single
12 years ago
Michael Peter Christen e26bdd4a52 fixes to deletion methods (removed unnecessary concurrency and added
12 years ago
Michael Peter Christen f7f3e28c5e prevent that the size of the index is computed too many times.
12 years ago
Michael Peter Christen cca19d94d4 re-declared some fields to be of type string rather than text which
12 years ago
Michael Peter Christen 3841854c97 abstraction of catchall term
12 years ago
Michael Peter Christen 3502b4c697 refactoring (renaming) of yacy-solr api
12 years ago
Michael Peter Christen 579eb01a49 showing now the details of references count in host browser:
12 years ago
reger 0f4237d8e5 add admin option to delete load errors from index
12 years ago