Commit Graph

121 Commits (d9cea5ff23a4c714ea2bf7f0232b897a260a1401)

Author SHA1 Message Date
danielr 8422ee5ec4 - fixed UnsupportedEncoding (in proxy) using defaultCharset if no characterEncoding can be determined
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
orbiter 7052f2f61f - added copyright header of ResourceObserver
17 years ago
f1ori b6301a54fa * added class ListDirs to provoid generic listing of directories in systemdirectories and jar-files
17 years ago
danielr 7feae906aa - organize imports
17 years ago
orbiter 1689030ee8 refactoring: moved all crawler classes into their own package
17 years ago
orbiter d0678f7ab9 refactoring as result of
17 years ago
orbiter a3df23659c re-implementation of charset checking
17 years ago
orbiter 685794e7e7 fix for parser/encoding Exception
17 years ago
orbiter e024e3b9cf added new default profiles to distinguish snippet fetch for local and global search
17 years ago
orbiter 202a3adb3e refactoring of HttpClient Writer processes
17 years ago
orbiter c3342e1178 - removed class with only one static method
17 years ago
danielr 5c3c1fdf41 replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;)
17 years ago
orbiter 87a8747ce3 - enhanced recognition, parsing, management and double-occurrence-handling of image tags
17 years ago
orbiter 3441ec3928 - some small changes to highslide integration to get it working... (does not work yet)
17 years ago
orbiter efd0b8371a - added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser
17 years ago
orbiter f4e9ff6ce9 more generics
17 years ago
orbiter ecd7f8ba4e - added NEAR operator (must be written in UPPERCASE in search query)
17 years ago
low012 b08f877e97 *) tried to get rid of warnings when compiling parsers (http://forum.yacy-websuche.de/viewtopic.php?t=660)
17 years ago
orbiter af10f729df fixed image search and favicon loading
17 years ago
orbiter 4fefa53135 removed parser object pool, see also svn 4106
18 years ago
orbiter 842308ea97 - redesigned crawl start menu, integrated monitoring pages
18 years ago
orbiter 11b4f80bde - fixed non-closing client connections
18 years ago
orbiter 1488769e1f cleanup of unmaintained and outdated performance methods:
18 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation:
18 years ago
orbiter 57a5b6fa71 some generalization of remote proxy configuration and setting handling in httpc
18 years ago
orbiter 40b0547611 - documentaton changes (removed old forum links)
18 years ago
orbiter 557f8d80e4 - better logging
18 years ago
orbiter 26ddf797eb added bmp and ico image format to all parser/viewing methods
18 years ago
orbiter 6518bb6c08 changed release strategy:
18 years ago
theli 339153d40e *) favicons that are specified in the document content via html link-tags
18 years ago
rramthun 18a5380ee3 *) situation-dependent lock-buttons for search-page
18 years ago
karlchenofhell 0a64047081 - plasmaParserDocument can process subdocuments now (other archive-parsers may want to use this method)
18 years ago
orbiter 871ee1ce0f one step closer to automatic updates:
18 years ago
orbiter 6b9eea3932 - removed differentiation between longTitle and shortTitle; this cannot be used for search results,
18 years ago
orbiter a738b57b31 added author tag to indexing content
18 years ago
karlchenofhell 6fbe31425a - some code-cleanup (no more syntax-warnings here)
18 years ago
orbiter f25c0e98d1 - replaced String by StringBuffer in condenser
18 years ago
allo 782db9099d version independent name for commons-pool lib
18 years ago
orbiter e4570bffaf -implemented a specialized snippet-fetch for media content
18 years ago
orbiter 937ccd4e76 fix for snippet-generation
18 years ago
orbiter ad1e4aa88e added selection of audio, video, image and application resources
18 years ago
orbiter ceb9e3aa17 - enhanced parser: collection of audio, video, image and application links
18 years ago
theli 92f774edd1 *) Better charset encoding detection
19 years ago
theli decb09df6d *) Trying to be more tolerant against wrong charset names
19 years ago
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names
19 years ago
theli 7526c831a8 *) Suppressing stracktrace
19 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing
19 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
19 years ago