Commit Graph

4901 Commits (bc84d2bc9d96ce68a62f36385b9f416b7cfa571b)

Author SHA1 Message Date
sixcooler 3e8b72be50 update to httpclient-4.1 - sorry forgot some
14 years ago
low012 74b22dfa24 *) fixed bug which affected blacklist entries which consisted of domain _and_ path parts
14 years ago
orbiter 93732d6773 increased number of target peers
14 years ago
orbiter 70ca7cec8c fix for http://forum.yacy-websuche.de/viewtopic.php?p=21763#p21763
14 years ago
orbiter fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
14 years ago
orbiter 431f780f41 patch for bad data in url metadata
14 years ago
orbiter 5905f912c5 replaced more double types with float
14 years ago
orbiter 0cdfb82963 replaced more appearance of double values by float values
14 years ago
orbiter eb12e15738 moved all Double values to Float values because of
14 years ago
f1ori 982aa689ef * fix StringIndexOutOfBoundException in WebStructureGraph
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 6c35b68f17 - removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file
14 years ago
orbiter 786166041a - added recording of all accessed and submitted servlets
14 years ago
orbiter 28f669bf0b - fixed/enhanced move to SD/16:9 images (network, web structure)
14 years ago
orbiter 0376f73fdb extended seed list uploader: do not only upload all active peers but also some more peers that are passive but had been active in the last 24 hours
14 years ago
orbiter 991b92f4ae enhanced network graphic
14 years ago
orbiter 3ae8f40fc8 removed yacy.network.group - this feature was never used
14 years ago
orbiter efb4ca8fa8 modified auto-delete of search failure-words:
14 years ago
orbiter f1f03d8c90 more logging for strange network loading bug
14 years ago
f1ori 4e29e9712a * create cleanupjob for cached failed urls
14 years ago
f1ori a321c7673d * adminAccountForLocalhost only for localhost
14 years ago
hermens 930cb412dd Let SHORT_MILSEC_FORMATTER make a new formatted String every millisecond
14 years ago
orbiter 090c73e32e catch a OOM in HeapReader iteration
14 years ago
low012 48463c4507 *) General private License? ;-)
14 years ago
orbiter c93f4dda72 - cleaned up yacy news
14 years ago
orbiter 6c1b14c8e1 - more control in access tracker: count number of returned search results (not only info how much is in the index)
14 years ago
low012 9f38c0023d *) Minor changes, mainly cleaning up a little bit, no functional changes.
14 years ago
orbiter 54e77e6255 refactoring
14 years ago
orbiter feefe17568 npe assert fix
14 years ago
orbiter 733903f2c9 fix for http://forum.yacy-websuche.de/viewtopic.php?p=21489#p21489
14 years ago
orbiter 24e4126eee added JSON parser code from json.org (added generics to it)
14 years ago
orbiter 10ae8d961b - cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring)
14 years ago
lotus 0e54233408 UPnP: map port again if we are not reachable (e.g. when router rebooted)
14 years ago
lotus b1484299b2 same units for memory observer configuration (MiB)
14 years ago
orbiter 387db84087 maybe found bug in non-working index dumper
14 years ago
orbiter 89ae6101b9 fix for NPE and added comment in search result
14 years ago
orbiter 0769f4caa6 added search suggestions for interactive search: is only shown if there are no search results
14 years ago
orbiter a4c9d27287 - moved some variables from Stwitchboard to new class AccessTracker
14 years ago
f1ori e4aabaa1c3 * fix negative filelength for files >2G
14 years ago
orbiter cdfe8afe3f fix for really bad table iteration implementation: reduction of IO
14 years ago
low012 9eae33f886 *) Ooops...
14 years ago
low012 a001e8075c *) minor enhancements
14 years ago
low012 11ea966f9e *) added SID file (Commodore 64) sound file parser
14 years ago
f1ori ee3cef91e8 * fix filesize in ftp crawls
14 years ago
orbiter b2ed4cfaf8 more small bugfixes and light refactoring
14 years ago
orbiter 3ca06d6290 patch for http://forum.yacy-websuche.de/viewtopic.php?p=21460#p21460
14 years ago
orbiter 903c824c2c - allow only scanned resourced with granted status
14 years ago
low012 936e976c23 *) added FreeMind (http://freemind.sourceforge.net/) mindmap parser
14 years ago
low012 3d95981f7d *) cleaning up the code a little bit
14 years ago
low012 2a6499364d *) minor changes
14 years ago
low012 c0274bd123 *) minor changes
14 years ago
orbiter fe46536f6e enhanced network scanner (less name resolving during scanning and no name resolving during search)
14 years ago
orbiter e753027c43 fix for http://forum.yacy-websuche.de/viewtopic.php?p=21439#p21439
14 years ago
orbiter bf4ef1513e - fix for map view
14 years ago
orbiter 6b70393d1d - new java version 1.6
14 years ago
orbiter e88c428008 fix to ftp loader
14 years ago
orbiter 59b70a5a92 another fix to the ftp crawler: now correct directory listings according to rfc2640 (path with spaces) and better title names for such files
14 years ago
orbiter 9b25a33fd9 - fixed numerous bugs
14 years ago
orbiter 7bdb13bf7f more fixes to smb crawling: better file names
14 years ago
orbiter 94c48500cc several fixes
14 years ago
orbiter 0ac7311a62 fix for token parser
14 years ago
orbiter 58b59f9bc8 - a collection of bug fixes and some redesign of the Scanner class
14 years ago
orbiter c54170421a fix for npe
14 years ago
orbiter c288fcf634 redesigned CrawlStartScanner user interface and added more features:
14 years ago
low012 6f4f957e50 *) cleaning up the code a little bit
14 years ago
f1ori 2521677a45 * deny adminForLocalhost and intranet network setup also on bootup and not only on network switch
14 years ago
f1ori 9d2159582f * fix system update if urls are in blacklist (for example for very general blacklists like *.de)
14 years ago
orbiter 56264dcc17 - added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls
14 years ago
orbiter 99a7fe87f9 - removed old intranet scanner (the generic scanner now completely subsumes the old one)
14 years ago
orbiter acab6801d9 added new network scanner
14 years ago
orbiter 14e4fae8e9 fixes to ftp client
14 years ago
orbiter a563b05b60 enhanced crawler:
14 years ago
orbiter c36da90261 added a very fast ftp file list generator to site crawler:
14 years ago
orbiter 4565b2f2c0 removed the display option from index.html, yacysearch.html and yacyinteractive.html
14 years ago
orbiter fc2e41e691 added a forwarder for the default page. The forwarder forwards a browser to a different page if the root file index.html is accessed. This can be done by setting the name of the forwarder page to the field
14 years ago
orbiter db99db4be9 some redesign of the search-fail-response mechanism:
14 years ago
f1ori 4915d1781a * use local backup-file, if remote network-definition is not availible
14 years ago
orbiter 18d33b5c6d fixed several search result navigation bugs
14 years ago
orbiter 49b5a206cd - better caclculation of search result size
14 years ago
orbiter 4e2c14efbb fixed bugs in parser and ftp client
14 years ago
orbiter d78e322e84 added a directory-structure reader to ftp client
14 years ago
orbiter f0651e5f2f added image search to yacyinteractive.html
14 years ago
orbiter fffb91447a fixed crawl queue delete function
14 years ago
orbiter b769cce433 - added a catch-all parser for all documents that cannot be parsed: they will contributed with their document url for the search index only
14 years ago
orbiter 21e84539e8 one more fix to Domains
14 years ago
orbiter e192d61972 fix for latest commit
14 years ago
orbiter 22453b13ad implemented local host address discovery as posted in
14 years ago
orbiter cc6499bf8d - added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
orbiter 3d945bb442 fix for ftp client: suppress bad directory listing time-out
14 years ago
orbiter d4a1a1850b removed warnings
14 years ago
low012 3b5830b7d4 *) Fixed typo.
14 years ago
low012 9b3fae9496 *) cleaning up the code a little bit
14 years ago
orbiter 7bb4b001ed - view image files from cache
14 years ago
low012 e7552bd719 *) cleaning up the code a little bit
14 years ago
orbiter 321eb012fe removed two warnings and reverted one change
14 years ago
apfelmaennchen 737aaf6952 various small changes to ymarks
14 years ago
apfelmaennchen 8a50670546 some code clean up for the last post
14 years ago
apfelmaennchen 442497868d another step towards an auto tagging function for YMarks
14 years ago
f1ori 741a87a3e9 * make .yacy-domains crawlable (.yacy-domains are local domains, so only in custom networks/peers)
14 years ago
f1ori fd74bc388c * fix small bug in sessionid-removal
14 years ago
f1ori dca9e16f51 * don't index pages, which redirect, twice
14 years ago
low012 eb79b952ef *) cleaner code
14 years ago
low012 38fdf43587 *) renamed classes according to standard Java coding conventions
14 years ago
low012 025e3f4790 *) renamed classes according to standard Java coding conventions
14 years ago
low012 3b9aa0504e *) removed unsused code
14 years ago
low012 db3db0fdb9 *) trying to make this class less confusing (probably failing)
14 years ago
apfelmaennchen 54e63b556e intermediate step for a YMark auto-tagging function based on word frequencies.
14 years ago
apfelmaennchen 403ee9c014 added a drill-down for metadata and word count to /api/ymarks/test_treeview.html
14 years ago
f1ori a025b1da89 * fix bug when browsing local filesystem (e. g. repository) with yacy
14 years ago
apfelmaennchen 11ae5b108e enabled rebuildIndex for /Table_YMark_p.html (rebuilds the tags and folders index)
14 years ago
sixcooler b87bf88ac8 using less memory on merging and rewriting blobs
14 years ago
apfelmaennchen 94a9be18a4 added a ymark table administration: /Table_YMark_p.html
14 years ago
apfelmaennchen 25339f93c7 more updates to ymarks
14 years ago
f1ori d62e449a11 * fix FilterEngine, forgot comparision-operator
14 years ago
apfelmaennchen cdd65aca71 update to ymarks
14 years ago
apfelmaennchen 808edffaf6 ymarks
14 years ago
f1ori 2c539b514a * add domaincheck (local/global/domainlist) to urlcleaner
14 years ago
orbiter 117fc86b3d fix for http://forum.yacy-websuche.de/viewtopic.php?p=21199#p21199
14 years ago
orbiter 441fbc26e2 security patch for WeakPriorityBlockingQueue (produced a deadlock)
14 years ago
orbiter 5dcb838293 - removed thread overhead when calling dns services
14 years ago
orbiter 4c50d3428e smaller file size for array stacks to support smaller deletion sizes
14 years ago
orbiter 09badc697b - low-memory patch for crawler
14 years ago
orbiter becc463d8a enhanced did-you-mean
14 years ago
apfelmaennchen 43586a2ace a update to ymarks (please test if you wish):
14 years ago
orbiter 93c535d111 fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113
14 years ago
orbiter 04932dc268 added rdf data structure for rss feeds
14 years ago
orbiter 84f2953cd8 fix for rss loader / rss type recognition
14 years ago
orbiter 4c72885cba added a sitemap entry parser and loader for sitemaps
14 years ago
orbiter 790e0b1894 - enhanced index deletion in IndexControlRWIs_p: delete also robots.txt database and cache if demanded
14 years ago
apfelmaennchen f5324b27f2 more updates to the new bookmarks (ymarks)....
14 years ago
orbiter 445619f3ec added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration.
14 years ago
sixcooler 85c65475fa smal but important correction of last commit @ HTTPClient
14 years ago
f1ori acd93b1b31 * add failsafe mechanisme to domainlist retrieval
14 years ago
orbiter 70c95608d4 Added CORS Access header for yacysearch.rss output
14 years ago
lotus 18729351e7 upnp: hint for wrongly detected local ip address
14 years ago
f1ori def4253555 * add option to network definition to provide a domainlist (syntax like in blacklists)
14 years ago
orbiter ac6b503adf untar files without gzip decompression even if the file has gz extension. this is done when the decompression fails.
14 years ago
apfelmaennchen efe0667fdd more new bookmark (ymarks) code with experimental html and xbel import
14 years ago
mikeworks caabebf9be Fixed spelling mistake omiting -> omitting in debug messages in ConfigUpdate_p.java and Switchboard.java
14 years ago
orbiter fb92f9ae8e added mime type image/jpeg (image/jpg is wrong but it is left here because it does not harm and this error also exists in configuration of web servers)
14 years ago
orbiter 155d556568 - better memory protection
14 years ago
f1ori 7d8de34778 * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null)
14 years ago
orbiter 25a8e55bc9 more logging about bad seeds
14 years ago
orbiter 959b8c6fa0 - allow greater seed size
14 years ago
orbiter e103419a56 - removed <3 peers barrier for peer ping feedback
14 years ago
apfelmaennchen d0e6c03b51 some updates to the new bookmark code...
14 years ago
orbiter facfd204e9 added a parent configuration option.
14 years ago
orbiter e3964f2c31 better catch of network definition load error; continue with secondary network load definition location
14 years ago
low012 65a0381f76 *) cleaning up code (still not done)
14 years ago
orbiter e3e3b49d52 - enhanced main release recognition
14 years ago
apfelmaennchen 9c94ebdee4 small changes to new bookmark code...
14 years ago
apfelmaennchen 244b56e9d3 an update to the new bookmark code...
14 years ago
low012 dc40f51b8d *) added headlines as proposed by Vega
14 years ago
apfelmaennchen f035f257da added some more bookmark code...
14 years ago
low012 22ed9c380c *) fixed bug which was introduced in r7226 (shame on me) which made wiki unusable (all entries were stored with empty subject as key -> edits were lost)
14 years ago
f1ori 60fd2e549d * log failures when writing config file
14 years ago
orbiter 58e74282af added a word counter statistic in condenser which is used by the did-you-mean to calculate best matches for given search words.
14 years ago
orbiter 863065abc4 added user agent logging to access tracker
14 years ago
apfelmaennchen a79728b97d some updates to experimental bookmark code...
14 years ago
apfelmaennchen ef782cd026 and even more experimental bookmark code...
14 years ago
orbiter ed4371dcf3 enhanced navigation implementation and enhanced tag cloud computation
14 years ago
orbiter ca738ac924 - added a tag cloud to search results (using the topics)
14 years ago
apfelmaennchen 7aca763ca8 Some more experimental bookmark code...
14 years ago
apfelmaennchen 4270ed696c Experimental code (I need to transfer the code to my macbook, sorry) for the new bookmarks API based on the Tables concept (same as for crawl starts). Currently you can add a bookmark by api/ymarks/add_ymark.xml?url=http://www.yacy.net&title=YaCy and watch the result via the standard view Tables_p.html.
14 years ago
orbiter e4d561971e added more score cluster options and made score cluster usage more transparent
14 years ago
orbiter e8f90201a5 fix for scheduling of rss feeds
14 years ago
orbiter 7cd9d9d22a - enhanced DidYouMean computation using a faster count on index entries; this causes that results can be ranked better
14 years ago
orbiter de722090b5 enhancements in did-you-mean guessing
14 years ago
orbiter a59c885ee0 autocomplete and did-you-mean can now understand _all_ languages and can generate suggestions in all languages and character types
14 years ago
orbiter b7acd92ce4 Auto-Suggestions for YaCy Search:
14 years ago
orbiter 24f1cba7b2 performance hacks:
14 years ago
orbiter 6a166c2040 patches for bad proxy behaviour
14 years ago
orbiter d607b30b6a performance enhancements for search and code review for database functions
14 years ago
orbiter 45b1ab3d07 custom + generic skins:
14 years ago
orbiter fcd40cd30f - disabled domZones (buggy, must think about better solution)
14 years ago
orbiter ec38eca278 fix for new URI equal method
14 years ago
orbiter 0d363a94d7 more performance hacks
14 years ago
orbiter b8aee6d402 performance hacks for better search performance
14 years ago
orbiter 091dd3f6ec - enhanced intranet search speed
14 years ago
low012 b9f405d1e8 *) added comments
14 years ago
orbiter 6e6994e328 latest bugfixes to search and indexing function after test of demo presentation
14 years ago
orbiter aacf572a26 - enhancements for search speed
14 years ago
sixcooler 61c82f3105 gzip-compresson @ transferRWI & transferURL back again
14 years ago
orbiter 2c549ae341 fixed a number of small bugs:
14 years ago
orbiter f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option
14 years ago
orbiter 3057a0b939 - intranet scanner now produces urls with host names, not ips if possible
14 years ago
orbiter c60aed4435 no caching in browser of dynamic web pages sent by YaCy http
14 years ago
orbiter e63896f2a8 added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once.
14 years ago
orbiter e54cb7fb0c more bugfixes (also for latest commit)
14 years ago
orbiter be6b48311c misc bugfixes
14 years ago
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed
14 years ago
low012 afa708d552 *) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent
14 years ago
orbiter a83186ac7d fix for bug in cytrails
14 years ago
orbiter 48c0d508ac fixes for crawling of smb links (file length not always available)
14 years ago
orbiter 0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts
14 years ago
f1ori e670e1ef8e add charset auto-detection for htmlParser
14 years ago
f1ori ddcd5ae78c fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2989
14 years ago
f1ori 8fe1102452 fix http://forum.yacy-websuche.de/viewtopic.php?p=20889#p18426
14 years ago
orbiter 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes
14 years ago