Commit Graph

381 Commits (9d366ee9d7289ec98483f0082aed94cb79fe364f)

Author SHA1 Message Date
orbiter 1110d16af9 performance hack: replaced generic row.getColBytes() call with row.getPrimaryKeyBytes() where the column is 0 14 years ago
low012 ce012e11aa *) deleted LogStatistics since the page did not work anymore and it seemed to be obsolete, tell me if you miss it and I will add it again 14 years ago
low012 c5051c4020 *) fixed bug which caused entries to not be deleted when deleting by URL on IndexCreateWWWLocalQueue_p.html (I hope this did not break anything else) 14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain 14 years ago
orbiter 10ae8d961b - cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring) 14 years ago
lotus b1484299b2 same units for memory observer configuration (MiB) 14 years ago
orbiter 0769f4caa6 added search suggestions for interactive search: is only shown if there are no search results 14 years ago
f1ori e4aabaa1c3 * fix negative filelength for files >2G 14 years ago
f1ori ee3cef91e8 * fix filesize in ftp crawls 14 years ago
low012 3d95981f7d *) cleaning up the code a little bit 14 years ago
orbiter e88c428008 fix to ftp loader 14 years ago
orbiter 9b25a33fd9 - fixed numerous bugs 14 years ago
orbiter 7bdb13bf7f more fixes to smb crawling: better file names 14 years ago
orbiter 94c48500cc several fixes 14 years ago
f1ori 9d2159582f * fix system update if urls are in blacklist (for example for very general blacklists like *.de) 14 years ago
orbiter 56264dcc17 - added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls 14 years ago
orbiter a563b05b60 enhanced crawler: 14 years ago
orbiter c36da90261 added a very fast ftp file list generator to site crawler: 14 years ago
orbiter 4e2c14efbb fixed bugs in parser and ftp client 14 years ago
orbiter fffb91447a fixed crawl queue delete function 14 years ago
orbiter b769cce433 - added a catch-all parser for all documents that cannot be parsed: they will contributed with their document url for the search index only 14 years ago
f1ori 741a87a3e9 * make .yacy-domains crawlable (.yacy-domains are local domains, so only in custom networks/peers) 14 years ago
f1ori dca9e16f51 * don't index pages, which redirect, twice 14 years ago
orbiter 09badc697b - low-memory patch for crawler 15 years ago
orbiter 93c535d111 fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113 15 years ago
orbiter 4c72885cba added a sitemap entry parser and loader for sitemaps 15 years ago
f1ori def4253555 * add option to network definition to provide a domainlist (syntax like in blacklists) 15 years ago
f1ori 7d8de34778 * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null) 15 years ago
orbiter e3e3b49d52 - enhanced main release recognition 15 years ago
orbiter ca738ac924 - added a tag cloud to search results (using the topics) 15 years ago
orbiter e4d561971e added more score cluster options and made score cluster usage more transparent 15 years ago
orbiter e8f90201a5 fix for scheduling of rss feeds 15 years ago
orbiter 6a166c2040 patches for bad proxy behaviour 15 years ago
orbiter 45b1ab3d07 custom + generic skins: 15 years ago
orbiter 0d363a94d7 more performance hacks 15 years ago
orbiter 091dd3f6ec - enhanced intranet search speed 15 years ago
orbiter aacf572a26 - enhancements for search speed 15 years ago
orbiter 2c549ae341 fixed a number of small bugs: 15 years ago
orbiter f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option 15 years ago
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed 15 years ago
orbiter 48c0d508ac fixes for crawling of smb links (file length not always available) 15 years ago
orbiter 461a2a6ec7 enhanced remote crawling: 15 years ago
orbiter 5870b13f3a - code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart 15 years ago
sixcooler 17eebd4ef8 counting crawler traffic again: 15 years ago
orbiter 348dece62f redesign of the SortStack and SortStore classes: 15 years ago
orbiter 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements 15 years ago
orbiter 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1 15 years ago
orbiter 22047ffad5 enhanced computation speed of many replaceAll string operations 15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy: 15 years ago
orbiter 65eaf30f77 redesign of crawl profiles data structure. target will be: 15 years ago