Commit Graph

15 Commits (6412c926bce37dca1e605d4f197685957fe89b31)

Author SHA1 Message Date
theli f17ce28b6d *) plasmaHTCache:
18 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
18 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
18 years ago
theli 74c3e7cf29 *) storing document charset into plasmaParserDocument object (is needed later by the condenser)
18 years ago
theli d0a5a53789 *) changes needed for multi-language support
18 years ago
theli b0e8ff6eda *) some TODO makers for UTF-8 problem
18 years ago
theli f3ac4dbbb9 *) better handling of server shutdown
18 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL
19 years ago
orbiter 83e0e765ec redesigned some parts of the html scanner & parser
19 years ago
orbiter b21b9df2d0 added section headlines generation to html parser
19 years ago
orbiter 3d8a5ae652 code cleanup
19 years ago
theli bdf30117c1 *) Redesign of parser configuration
19 years ago
hydrox 56b9f34411 *)removed unused imports
19 years ago
theli 361f05978d Multiple updates regarding the yacy seedUpload facility,
20 years ago
theli 351c86d5d9 *) Migration of optional Content Parser integration
20 years ago