19 Commits (7860d5d632a21130498449872dd8ac757a30792a)

Author SHA1 Message Date
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information 17 years ago
danielr 3bb870bfcd added final where possible 17 years ago
orbiter c3d461d191 - removed superfluous copyright statement 17 years ago
orbiter efd0b8371a - added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser 17 years ago
low012 b08f877e97 *) tried to get rid of warnings when compiling parsers (http://forum.yacy-websuche.de/viewtopic.php?t=660) 17 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation: 18 years ago
orbiter 6b9eea3932 - removed differentiation between longTitle and shortTitle; this cannot be used for search results, 18 years ago
orbiter a738b57b31 added author tag to indexing content 18 years ago
theli f17ce28b6d *) plasmaHTCache: 19 years ago
theli cd5f349666 *) Better handling of large files during parsing 19 years ago
orbiter df1629b05a - code cleanup 19 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher) 19 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser) 19 years ago
theli d0a5a53789 *) changes needed for multi-language support 19 years ago
theli f3ac4dbbb9 *) better handling of server shutdown 19 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL 19 years ago
theli bdf30117c1 *) Redesign of parser configuration 20 years ago
theli 285936d778 *) trying to set document title properly 20 years ago
theli 361f05978d Multiple updates regarding the yacy seedUpload facility, 20 years ago