Commit Graph

21 Commits (607b39b427f76ec139df5d9d5479cf09a0d6fe4a)

Author SHA1 Message Date
luccioman 452a17a8d5 Finer control on bounded input streams with custom stream implementation 8 years ago
luccioman 1e84956721 Support loading local files with a per request specified maximum size. 8 years ago
luccioman a9cb083fa1 Improved consistency between loader openInputStream and load functions 8 years ago
luccioman f66438442e Extended Mediawiki dump import to remote URLs. 8 years ago
reger c50e23c495 reduce creation of empty legacy RequestHeader() in situation where null 8 years ago
reger 7ab41d4ff1 use directories original lastmodified date in file- & smbloader in response 9 years ago
luc 5bbb2e1730 Ensure resource is closed when reading a full file InputStream 9 years ago
reger 141cd80456 correct log msg text 10 years ago
orbiter 4b06adb751 fix for file urls 11 years ago
reger 6932aa4d7a use configured admin-username for api calls 11 years ago
orbiter 3cb6c7861f fixed shutdown authenticaton problem 11 years ago
Michael Peter Christen 91a875dff5 self-healing of mistakenly deactivated crawl profiles. This fixes a bug 12 years ago
Michael Peter Christen 5e31bad711 - the webgraph shall store all links which appear on a web page and not 12 years ago
Michael Peter Christen 765943a4b7 Redesign of crawler identification and robots steering. A non-p2p user 12 years ago
Roland Haeder 841a28ae76 Added 'final' for all exception blocks as this helps the Java compiler 12 years ago
Michael Peter Christen 5878c1d599 - refactoring of log to ConcurrentLog: 12 years ago
Michael Peter Christen 16d1d744fa added url_file_name_s in default collection schema for the file name 12 years ago
Michael Peter Christen d6b82840f8 added a feature to find similarities in documents. 12 years ago
Michael Peter Christen 5f0ab25382 removed the option to prevent removal of & parts inside of the 13 years ago
Michael Peter Christen a06930662c replaced some more .getBytes() with UTF8/ASCII.getBytes() 13 years ago
Michael Peter Christen 00c1c777fa refactoring 13 years ago