Commit Graph

100 Commits (efd0b8371a04d12d3b9f52532f3f3aba1f7b3790)

Author SHA1 Message Date
orbiter efd0b8371a - added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser
17 years ago
orbiter f4e9ff6ce9 more generics
17 years ago
orbiter 45339c3db5 more generics
17 years ago
orbiter df2a7a8ac8 more generics
17 years ago
orbiter b1e08d354c repaired indexing after search snippet loading
17 years ago
orbiter 4fefa53135 removed parser object pool, see also svn 4106
17 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
17 years ago
fuchsi f717beecb1 - Changed yFormatter handling to be more flexible and produce more readable code for server pages. There are serverObject.putNum() methods to allow adding of number type values in a formatted form, and put() methods for number types that add them without formatting. This reduces the need to transform them into Strings in server pages and removes the HTML encoding step which is unecessary for numbers.
17 years ago
orbiter 01e0669264 re-designed some parts of DHT position calculation (effect is the same as before)
17 years ago
orbiter 341f7cb327 steps to enhance remote search performance:
17 years ago
orbiter 6c819a6fd9 added cache to favicon display
17 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation:
17 years ago
orbiter 4779f314fe first version of next-generation search interface:
17 years ago
orbiter e332b844b2 - enhanced remote search: during waiting time for remote crawls
17 years ago
orbiter b5346141b3 made the plasmaHTCache static (there is only one internet, so we need only one cache)
18 years ago
orbiter 947fc46904 refactoring of search process:
18 years ago
orbiter 40b0547611 - documentaton changes (removed old forum links)
18 years ago
orbiter 89e1848db6 fixed problem with favicons:
18 years ago
low012 c59a7ce5c2 *) hopefully fixed a stupid bug (my fault of course) that sometimes messed up the marking of search words in the snippets (see http://www.yacy-forum.de/viewtopic.php?p=37329#37329)
18 years ago
theli 339153d40e *) favicons that are specified in the document content via html link-tags
18 years ago
orbiter 6488ec8a80 no deletions in index in case that snippet-loading fails and there is no network connection
18 years ago
orbiter 861f41e67e redesigned NURL-handling:
18 years ago
orbiter 9f929b5438 better snippet handling in case of snippet load fail
18 years ago
orbiter f25c0e98d1 - replaced String by StringBuffer in condenser
18 years ago
orbiter 0a050bc043 enhanced ranking
18 years ago
orbiter 61798f0ae6 added option to distinguish between text crawl and media crawl
18 years ago
orbiter e4570bffaf -implemented a specialized snippet-fetch for media content
18 years ago
low012 694a6e4f44 *) better text snipptes: any possible searchword (welt, linux, tag) in welt-linux-tag will be marked correctly now
18 years ago
orbiter bddc197453 reverted by-mistake removed change from low012/SVN 3068
18 years ago
orbiter 1377c53aa3 extraction of media links from search results
18 years ago
low012 586add4c6c *) Better snippets: words like GNU/Linux will not prevent Linux or GNU from being marked if they are searchword (see http://www.yacy-forum.de/viewtopic.php?t=2891)
18 years ago
orbiter 937ccd4e76 fix for snippet-generation
18 years ago
orbiter bf0d820659 - added correct flagging of word properties
18 years ago
orbiter ceb9e3aa17 - enhanced parser: collection of audio, video, image and application links
18 years ago
orbiter b5a29e9651 - fix for snippets that are too short
18 years ago
orbiter 30888e7a2f implementation of search constraints
18 years ago
orbiter 497428c8ec refactoring
18 years ago
orbiter bb7d4b5d5e refactoring to prepare new RWI entry object
18 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
18 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
18 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
18 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now
18 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
18 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
18 years ago
theli f17ce28b6d *) plasmaHTCache:
18 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
18 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
18 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
18 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
18 years ago
low012 f8ac694e51 *) fixed a bug where searchword in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998)
18 years ago