Commit Graph

29 Commits (43c8defd7932d5c0ce9d9ec137d328409c62d4d7)

Author  SHA1  Message  Date
orbiter  b2263bc720  enhanced document type recognition  16 years ago
orbiter  57a88d435b  redesign of parser mime type detection and parser steering  16 years ago
orbiter  21b8704fb4  refactoring of the ParserDispatcher and ParserConfig: resulted into Idiom, Parser and Classification classes  16 years ago
orbiter  dafffd0153  refactoring of parsers and document processing  16 years ago
orbiter  154bbc3364  code cleanup: call of static methods directly to the class  16 years ago
orbiter  ce1adf9955  serialized all logging using concurrency:  16 years ago
orbiter  99bf0b8e41  refactoring of plasmaWordIndex:  16 years ago
orbiter  c2359f20dd  refactoring: better abstraction of reference and metadata prototypes.  16 years ago
orbiter  67aaffc0a2  - added Latency control to the crawler:  16 years ago
orbiter  14a1c33823  refactoring of wordIndex class  16 years ago
orbiter  efcd95dc37  simplification of (internal) query process / refactoring  16 years ago
orbiter  aa44d9bad9  more refactoring of kelondro.text / deleted de.anomic.index  16 years ago
orbiter  024da2916b  refactoring of logging  16 years ago
orbiter  7ee494fde5  more refactoring of kelondro:  16 years ago
orbiter  bf93767ec6  refactoring of kelondro database classes  16 years ago
orbiter  fc27bf8c4c  refactoring of kelondro classes:  16 years ago
orbiter  05c235de32  fix for npe  16 years ago
orbiter  d39d420b39  performance hacks  16 years ago
orbiter  1918a0173e  added more exception handling during crawling  16 years ago
orbiter  674ad2d55b  different handling of error cases that occur during loading files with http or ftp:  16 years ago
orbiter  826ca79735  refactoring and new architecture to store the files of the web cache:  17 years ago
orbiter  ce57de6cb3  - fixed re-setting of DHT Send/Receive settings  17 years ago
orbiter  77ee0765a4  - added domain statistic generation to IndexControlURLs_p.html servlet  17 years ago
orbiter  536e77e8b7  modifications towards a single database operation to read/write http header and cached file at once:  17 years ago
orbiter  7989335ed6  Preparations to replace the HTCache with a new storage data structure:  17 years ago
orbiter  c3d461d191  - removed superfluous copyright statement  17 years ago
orbiter  3ca98fee42  removed superfluous copyright statement  17 years ago
orbiter  cfe6790498  - added option to switch between yacy networks, especially between the two default networks (freeworld and intranet),  17 years ago
orbiter  1689030ee8  refactoring: moved all crawler classes into their own package  17 years ago