Commit Graph

4813 Commits (a196f24f60f362c6c7022ec2e24487b23f6999e0)

Author SHA1 Message Date
orbiter 694fa3a2a5 - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion
14 years ago
orbiter 30aed9824a moved getBytes() to UTF8.getBytes() to use a default String encoding
14 years ago
orbiter 1214615185 fix for 'invisible entry', see http://forum.yacy-websuche.de/viewtopic.php?p=22133#p22133
14 years ago
orbiter 3820525464 more memory protection: auto-flush of caches in case of memory shortage
14 years ago
orbiter 7962d35425 - removed file upload function in crawl start and replaced it with an input field for a file path where the crawl start file is loaded. This was necessary to support the API steering for file crawl starts, for two reasons:
14 years ago
orbiter e1b6916423 always try to guess the size of a StringBuilder to prevent too many memory re-allocations
14 years ago
low012 3b40b98256 *) set SVN properties
14 years ago
orbiter 2af8e33773 better performance computing search targets with index abstracts
14 years ago
orbiter 619b561a4a enhanced secondary search: index abstracts decompression is now much faster and does not cause strong CPU load after several searches with more than one word
14 years ago
orbiter 27ecdb5444 use less peers for remote search
14 years ago
orbiter cb1f49d0f2 replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'.
14 years ago
orbiter 7138f4036b less synchronization, better thread dump tool
14 years ago
orbiter 8d14916c74 more patches for a better out-of-memory management
14 years ago
orbiter c2c5b12882 - even less memory for circle tool
14 years ago
orbiter 799c534935 one more patch against OOM during secondary remote search
14 years ago
orbiter 77b1e921a9 this assert prevents a network operation in case of sabotage and must therefore be removed
14 years ago
orbiter bed79402be introduction of a new remote search load control: the remote search has taken 10 results per peer with a time-out of 3 seconds so far. The attributes of number of results per peer and time-out time can now be configured.
14 years ago
orbiter 6dfaf6fef7 fix for bug in deletion of old seeds
14 years ago
orbiter 993b9bc1a8 memory/performance hacks, less synchronization, better concurrency
14 years ago
orbiter 42d90664f3 - fixed a memory leak in the httpc.post method (no finish)
14 years ago
orbiter 38dce547c0 better concurrency (less locking on date formatting) more logging and minor bug fixes
14 years ago
f1ori 59dea3a284 * implement url proxy, a proxy via the url http://peer:port/proxy.html?url=http://domain.tld/path
14 years ago
mikeworks 8b7b783c49 Tray.java: Broke the build with a non-UTF-8 encoded file and French umlauts (unmappable character for encoding UTF8)
14 years ago
mikeworks db65ada467 Tray.java: Added localization for French tray icon command - although this can probably also be done better than with if statements. (preferably also from the locales file)
14 years ago
orbiter 89d337841c more logging for OOMs
14 years ago
orbiter b1781d7aae some more performance hacks
14 years ago
orbiter 5e186e0122 continuing the fight against deadlocks during time formatting: better caching.
14 years ago
orbiter dec24244cf added convenience class to generate UTF StringBody objects with a default UTF8 charset.
14 years ago
orbiter 1110d16af9 performance hack: replaced generic row.getColBytes() call with row.getPrimaryKeyBytes() where the column is 0
14 years ago
orbiter 19b2a50578 - enhanced date formatter cache
14 years ago
orbiter f2e8ffd768 enhancement in synchronisation
14 years ago
orbiter ad7fcb9d61 Enhanced Base64Order transformation: less overhead (transformation between StringBuilder and byte[])
14 years ago
orbiter 0ce17d823a - fixed bug in ordering
14 years ago
orbiter dec4f36700 - fix for missing favicons in search widgets
14 years ago
orbiter e3ef4e3021 - increased default peer ping time from 2 minutes to 1 minute
14 years ago
orbiter cd19d0517e added dns resolve to HTTPClient POST using a dns cache to prevent that the not-thread-safe built-in dns cache inside apache http client is used
14 years ago
orbiter d28f8040e0 removed unnecessary recording function that also caused a performance problem after serving too many files
14 years ago
orbiter af87af0d4c - removed synchronization in serverSwitch which should improve speed
14 years ago
orbiter 4bd65532da initialization of libraries concurrently (faster start-up)
14 years ago
orbiter 57e6728cb7 - removed usage of /etc/alternatives/www-browser because of problems with lynx, see:
14 years ago
orbiter d84b4a072e healing for some OOM problems
14 years ago
orbiter 82f262f685 - enhanced circle drawing speed
14 years ago
orbiter 29dc416ac6 more animations in graphics. See network and access picture.
14 years ago
orbiter a80ee9a03d THE GRID is coming to YaCy .. see new animated graphics on http://localhost:8090/AccessGrid_p.html
14 years ago
low012 ce012e11aa *) deleted LogStatistics since the page did not work anymore and it seemed to be obsolete, tell me if you miss it and I will add it again
14 years ago
low012 c5051c4020 *) fixed bug which caused entries to not be deleted when deleting by URL on IndexCreateWWWLocalQueue_p.html (I hope this did not break anything else)
14 years ago
orbiter d58071947a maybe terminateOldSessions is too slow, removed sleep
14 years ago
orbiter 6c52e31993 new methods to open a browser
14 years ago
orbiter 5892fff51f introduction of dht-burst modes: this can expand the number of target peers in some cases where a better heuristic is needed. The problematic cases are either when a multi-word search is made (still a hard case for our term-oriented DHT) or when a network operator wants all robinson peers to be asked. We therefore introduced two new network steering values that switch on more peers during the peer selection. Because the number of peers can now be very large, the maximum number of httpc connections was also increased.
14 years ago
orbiter 4588b5a291 - fixed document number limitation for crawls that restrict the number of documents per domain
14 years ago
low012 64f32e8f00 *) replaced all IPs in IP filters for proxy with the proper regular expression
14 years ago
orbiter 93732d6773 increased number of target peers
14 years ago
orbiter 70ca7cec8c fix for http://forum.yacy-websuche.de/viewtopic.php?p=21763#p21763
14 years ago
orbiter fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
14 years ago
orbiter 5905f912c5 replaced more double types with float
14 years ago
orbiter 0cdfb82963 replaced more appearance of double values by float values
14 years ago
orbiter eb12e15738 moved all Double values to Float values because of
14 years ago
f1ori 982aa689ef * fix StringIndexOutOfBoundException in WebStructureGraph
14 years ago
orbiter 88773e4daa changed the default port from 8080 to 8090
14 years ago
orbiter 6c35b68f17 - removed 'peerName' property from the yacy settings file because this information is stored in the yacy seed file
14 years ago
orbiter 786166041a - added recording of all accessed and submitted servlets
14 years ago
orbiter 28f669bf0b - fixed/enhanced move to SD/16:9 images (network, web structure)
14 years ago
orbiter 0376f73fdb extended seed list uploader: do not only upload all active peers but also some more peers that are passive but had been active in the last 24 hours
14 years ago
orbiter 991b92f4ae enhanced network graphic
14 years ago
orbiter 3ae8f40fc8 removed yacy.network.group - this feature was never used
14 years ago
orbiter efb4ca8fa8 modified auto-delete of search failure-words:
14 years ago
orbiter f1f03d8c90 more logging for strange network loading bug
14 years ago
f1ori 4e29e9712a * create cleanupjob for cached failed urls
14 years ago
f1ori a321c7673d * adminAccountForLocalhost only for localhost
14 years ago
low012 48463c4507 *) General private License? ;-)
14 years ago
orbiter c93f4dda72 - cleaned up yacy news
14 years ago
orbiter 6c1b14c8e1 - more control in access tracker: count number of returned search results (not only info how much is in the index)
14 years ago
low012 9f38c0023d *) Minor changes, mainly cleaning up a little bit, no functional changes.
14 years ago
orbiter 54e77e6255 refactoring
14 years ago
orbiter 10ae8d961b - cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring)
14 years ago
lotus 0e54233408 UPnP: map port again if we are not reachable (e.g. when router rebooted)
14 years ago
lotus b1484299b2 same units for memory observer configuration (MiB)
14 years ago
orbiter 89ae6101b9 fix for NPE and added comment in search result
14 years ago
orbiter 0769f4caa6 added search suggestions for interactive search: is only shown if there are no search results
14 years ago
orbiter a4c9d27287 - moved some variables from Switchboard to new class AccessTracker
14 years ago
f1ori e4aabaa1c3 * fix negative filelength for files >2G
14 years ago
orbiter cdfe8afe3f fix for really bad table iteration implementation: reduction of IO
14 years ago
f1ori ee3cef91e8 * fix filesize in ftp crawls
14 years ago
orbiter b2ed4cfaf8 more small bugfixes and light refactoring
14 years ago
low012 3d95981f7d *) cleaning up the code a little bit
14 years ago
orbiter 6b70393d1d - new java version 1.6
14 years ago
orbiter e88c428008 fix to ftp loader
14 years ago
orbiter 9b25a33fd9 - fixed numerous bugs
14 years ago
orbiter 7bdb13bf7f more fixes to smb crawling: better file names
14 years ago
orbiter 94c48500cc several fixes
14 years ago
orbiter 58b59f9bc8 - a collection of bug fixes and some redesign of the Scanner class
14 years ago
orbiter c54170421a fix for npe
14 years ago
low012 6f4f957e50 *) cleaning up the code a little bit
14 years ago
f1ori 2521677a45 * deny adminForLocalhost and intranet network setup also on bootup and not only on network switch
14 years ago
f1ori 9d2159582f * fix system update if urls are in blacklist (for example for very general blacklists like *.de)
14 years ago
orbiter 56264dcc17 - added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls
14 years ago
orbiter acab6801d9 added new network scanner
14 years ago
orbiter a563b05b60 enhanced crawler:
14 years ago
orbiter c36da90261 added a very fast ftp file list generator to site crawler:
14 years ago
orbiter 4565b2f2c0 removed the display option from index.html, yacysearch.html and yacyinteractive.html
14 years ago
orbiter fc2e41e691 added a forwarder for the default page. The forwarder forwards a browser to a different page if the root file index.html is accessed. This can be done by setting the name of the forwarder page to the field
14 years ago
orbiter db99db4be9 some redesign of the search-fail-response mechanism:
14 years ago
f1ori 4915d1781a * use local backup-file, if remote network-definition is not available
14 years ago
orbiter 18d33b5c6d fixed several search result navigation bugs
14 years ago
orbiter 49b5a206cd - better calculation of search result size
14 years ago
orbiter 4e2c14efbb fixed bugs in parser and ftp client
14 years ago
orbiter f0651e5f2f added image search to yacyinteractive.html
14 years ago
orbiter fffb91447a fixed crawl queue delete function
14 years ago
orbiter b769cce433 - added a catch-all parser for all documents that cannot be parsed: they will contribute only their document url to the search index
14 years ago
orbiter 22453b13ad implemented local host address discovery as posted in
14 years ago
orbiter cc6499bf8d - added http://blekko.com as search heuristic (like scroogle). This was easy since they deliver their search results also as rss feed
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
orbiter d4a1a1850b removed warnings
14 years ago
low012 3b5830b7d4 *) Fixed typo.
14 years ago
low012 9b3fae9496 *) cleaning up the code a little bit
14 years ago
orbiter 7bb4b001ed - view image files from cache
14 years ago
low012 e7552bd719 *) cleaning up the code a little bit
14 years ago
apfelmaennchen 737aaf6952 various small changes to ymarks
14 years ago
apfelmaennchen 8a50670546 some code clean up for the last post
14 years ago
apfelmaennchen 442497868d another step towards an auto tagging function for YMarks
14 years ago
f1ori 741a87a3e9 * make .yacy-domains crawlable (.yacy-domains are local domains, so only in custom networks/peers)
14 years ago
f1ori dca9e16f51 * don't index pages, which redirect, twice
14 years ago
low012 eb79b952ef *) cleaner code
14 years ago
low012 38fdf43587 *) renamed classes according to standard Java coding conventions
14 years ago
low012 025e3f4790 *) renamed classes according to standard Java coding conventions
14 years ago
low012 3b9aa0504e *) removed unused code
14 years ago
low012 db3db0fdb9 *) trying to make this class less confusing (probably failing)
14 years ago
apfelmaennchen 54e63b556e intermediate step for a YMark auto-tagging function based on word frequencies.
14 years ago
apfelmaennchen 403ee9c014 added a drill-down for metadata and word count to /api/ymarks/test_treeview.html
14 years ago
apfelmaennchen 11ae5b108e enabled rebuildIndex for /Table_YMark_p.html (rebuilds the tags and folders index)
14 years ago
apfelmaennchen 94a9be18a4 added a ymark table administration: /Table_YMark_p.html
14 years ago
apfelmaennchen 25339f93c7 more updates to ymarks
14 years ago
apfelmaennchen cdd65aca71 update to ymarks
14 years ago
apfelmaennchen 808edffaf6 ymarks
14 years ago
f1ori 2c539b514a * add domaincheck (local/global/domainlist) to urlcleaner
14 years ago
orbiter 117fc86b3d fix for http://forum.yacy-websuche.de/viewtopic.php?p=21199#p21199
14 years ago
orbiter 09badc697b - low-memory patch for crawler
14 years ago
orbiter becc463d8a enhanced did-you-mean
14 years ago
apfelmaennchen 43586a2ace an update to ymarks (please test if you wish):
14 years ago
orbiter 93c535d111 fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113
14 years ago
orbiter 4c72885cba added a sitemap entry parser and loader for sitemaps
14 years ago
orbiter 790e0b1894 - enhanced index deletion in IndexControlRWIs_p: delete also robots.txt database and cache if demanded
14 years ago
apfelmaennchen f5324b27f2 more updates to the new bookmarks (ymarks)....
14 years ago
orbiter 445619f3ec added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration.
14 years ago
f1ori acd93b1b31 * add failsafe mechanism to domainlist retrieval
14 years ago
orbiter 70c95608d4 Added CORS Access header for yacysearch.rss output
14 years ago
lotus 18729351e7 upnp: hint for wrongly detected local ip address
14 years ago
f1ori def4253555 * add option to network definition to provide a domainlist (syntax like in blacklists)
14 years ago
orbiter ac6b503adf untar files without gzip decompression even if the file has gz extension. this is done when the decompression fails.
14 years ago
apfelmaennchen efe0667fdd more new bookmark (ymarks) code with experimental html and xbel import
14 years ago
mikeworks caabebf9be Fixed spelling mistake omiting -> omitting in debug messages in ConfigUpdate_p.java and Switchboard.java
14 years ago
orbiter 155d556568 - better memory protection
14 years ago
f1ori 7d8de34778 * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null)
14 years ago
orbiter 25a8e55bc9 more logging about bad seeds
14 years ago
orbiter 959b8c6fa0 - allow greater seed size
14 years ago
orbiter e103419a56 - removed <3 peers barrier for peer ping feedback
14 years ago
apfelmaennchen d0e6c03b51 some updates to the new bookmark code...
14 years ago
orbiter facfd204e9 added a parent configuration option.
14 years ago
orbiter e3964f2c31 better catch of network definition load error; continue with secondary network load definition location
14 years ago
low012 65a0381f76 *) cleaning up code (still not done)
14 years ago
orbiter e3e3b49d52 - enhanced main release recognition
14 years ago
apfelmaennchen 9c94ebdee4 small changes to new bookmark code...
14 years ago
apfelmaennchen 244b56e9d3 an update to the new bookmark code...
14 years ago
low012 dc40f51b8d *) added headlines as proposed by Vega
14 years ago
apfelmaennchen f035f257da added some more bookmark code...
14 years ago
low012 22ed9c380c *) fixed bug which was introduced in r7226 (shame on me) which made wiki unusable (all entries were stored with empty subject as key -> edits were lost)
14 years ago
f1ori 60fd2e549d * log failures when writing config file
14 years ago
orbiter 58e74282af added a word counter statistic in condenser which is used by the did-you-mean to calculate best matches for given search words.
14 years ago
orbiter 863065abc4 added user agent logging to access tracker
14 years ago
apfelmaennchen a79728b97d some updates to experimental bookmark code...
15 years ago
apfelmaennchen ef782cd026 and even more experimental bookmark code...
15 years ago
orbiter ed4371dcf3 enhanced navigation implementation and enhanced tag cloud computation
15 years ago
orbiter ca738ac924 - added a tag cloud to search results (using the topics)
15 years ago
apfelmaennchen 7aca763ca8 Some more experimental bookmark code...
15 years ago
apfelmaennchen 4270ed696c Experimental code (I need to transfer the code to my macbook, sorry) for the new bookmarks API based on the Tables concept (same as for crawl starts). Currently you can add a bookmark by api/ymarks/add_ymark.xml?url=http://www.yacy.net&title=YaCy and watch the result via the standard view Tables_p.html.
15 years ago
orbiter e4d561971e added more score cluster options and made score cluster usage more transparent
15 years ago
orbiter e8f90201a5 fix for scheduling of rss feeds
15 years ago
orbiter 7cd9d9d22a - enhanced DidYouMean computation using a faster count on index entries; this causes that results can be ranked better
15 years ago
orbiter de722090b5 enhancements in did-you-mean guessing
15 years ago
orbiter a59c885ee0 autocomplete and did-you-mean can now understand _all_ languages and can generate suggestions in all languages and character types
15 years ago
orbiter b7acd92ce4 Auto-Suggestions for YaCy Search:
15 years ago
orbiter 6a166c2040 patches for bad proxy behaviour
15 years ago
orbiter d607b30b6a performance enhancements for search and code review for database functions
15 years ago
orbiter 45b1ab3d07 custom + generic skins:
15 years ago
orbiter fcd40cd30f - disabled domZones (buggy, must think about better solution)
15 years ago
orbiter 0d363a94d7 more performance hacks
15 years ago
orbiter b8aee6d402 performance hacks for better search performance
15 years ago
orbiter 091dd3f6ec - enhanced intranet search speed
15 years ago
low012 b9f405d1e8 *) added comments
15 years ago
orbiter 6e6994e328 latest bugfixes to search and indexing function after test of demo presentation
15 years ago
orbiter aacf572a26 - enhancements for search speed
15 years ago
sixcooler 61c82f3105 gzip-compression @ transferRWI & transferURL back again
15 years ago
orbiter 2c549ae341 fixed a number of small bugs:
15 years ago
orbiter f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option
15 years ago
orbiter c60aed4435 no caching in browser of dynamic web pages sent by YaCy http
15 years ago
orbiter e63896f2a8 added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once.
15 years ago
orbiter e54cb7fb0c more bugfixes (also for latest commit)
15 years ago
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed
15 years ago
low012 afa708d552 *) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent
15 years ago
orbiter a83186ac7d fix for bug in cytrails
15 years ago
orbiter 48c0d508ac fixes for crawling of smb links (file length not always available)
15 years ago
orbiter 0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts
15 years ago
orbiter 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes
15 years ago
lotus 4450c240b7 npe fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2982
15 years ago
orbiter 84a023cbc8 fixed several search bugs
15 years ago
orbiter 97ee278931 enhanced search speed:
15 years ago
orbiter ee3820c9cc more logging for strange "java.lang.NoClassDefFoundError: de/anomic/http/server/RequestHeader" error
15 years ago
orbiter 377f001e0d sorting of crawl profile names in crawl profile editor, see
15 years ago
orbiter 3552476fbe terminated migration from apache httpclient-3.1 to 4.1:
15 years ago
orbiter a2f9974745 some redesign in the access tracker to realize sixcoolers question about "smartes way for deleting the first Object":
15 years ago
sixcooler 03f0414025 some minor correction of my last commit
15 years ago
sixcooler 42fa0eadb1 fix endless loop:
15 years ago
low012 5a9ea0308f *) further simplification of wiki code parser (less redundancy in code, less magic numbers), still not done with it...
15 years ago
orbiter 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification
15 years ago
orbiter 29fe401f93 - some layout and text enhancement for site crawl start
15 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
15 years ago
orbiter 670ba4d52b - removed the remote crawl option from the network configuration submenu and
15 years ago
orbiter 89c2d8b81e better initial hash computation
15 years ago
orbiter 34e2f7f487 enhanced snippet fetch strategy: concurrent snippet fetch even for offline-snippet searches. This improves speed since it is now possible to fetch snippets offline and parsing of source files from the htcache can be enhanced using concurrency. This improves local and remote search.
15 years ago
orbiter 0cf006865e refactoring and enhanced concurrency
15 years ago
orbiter 83ac07874f - corrected return value of put() methods (not used anywhere, so it did not harm before)
15 years ago
orbiter 5702419194 fixed a bug in HTTPClient: keep-alive must be set to false, otherwise servers hold connections 2 seconds open until response.
15 years ago
orbiter 5870b13f3a - code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart
15 years ago
orbiter ac1c08924e more performance hacks
15 years ago
orbiter 14c843d364 more performance hacks
15 years ago
orbiter 39f409a7bb performance hacks
15 years ago
orbiter 7ebef56add - redesign of a part of the remote search client to make it possible to have a test environment for remote search performance tests
15 years ago
orbiter 3c0e07ba72 removed all delays in shutdown process
15 years ago
orbiter 64860dc1bb enhanced search event logging (to be used for further improvements)
15 years ago
sixcooler 17eebd4ef8 counting crawler traffic again:
15 years ago
orbiter 32f73d1aaa added copy for Info.plist for Mac application release updates (this file contains class paths and start parameters)
15 years ago
orbiter 4c21d8dc9d - changed default values for online caution (the pausing may not be necessary any more)
15 years ago
orbiter 570ca577c6 performance hacks
15 years ago
orbiter 348dece62f redesign of the SortStack and SortStore classes:
15 years ago
orbiter 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements
15 years ago
lotus 6a09f1f7e5 fix dedicated upnp testing
15 years ago
orbiter 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
15 years ago
orbiter c757a4aa9f - corrected lifetime computation for search events
15 years ago
orbiter fb828f3767 - performance enhancements in search response time using faster query ID computation and an ID cache
15 years ago
orbiter 22047ffad5 enhanced computation speed of many replaceAll string operations
15 years ago
orbiter e8228fba09 less locking in time format computation, caching and during secondary (remote) search evaluation
15 years ago
orbiter 9c0c94683c because of a bug in search result caching count search results had not been generated as fast as possible.
15 years ago
orbiter fa2eb9676e removed unused class
15 years ago
low012 5f391fcfa9 *) cleaned up in wikiCode parser (more to be done)
15 years ago
orbiter b3f0d06444 fixed a problem with restarts in YaCy mac applications: the DATA directory path was not submitted when doing a restart. This solves the problem by:
15 years ago
orbiter d4e4967e19 cleaned up code in yacyRelease (there will be work to do there)
15 years ago
orbiter 1da5241c2d do not block server session if maximum number of sessions is reached, just try to clean up once
15 years ago
orbiter 5de70c3d7c changed way of storage for search requests:
15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
15 years ago
orbiter 65eaf30f77 redesign of crawl profiles data structure. target will be:
15 years ago
f1ori 55da979291 disable revision detection for git
15 years ago
orbiter 104318d58a - added nice colors to feed indexing state messages
15 years ago
orbiter 4f22e2df41 bugfixes for
15 years ago
orbiter 42414a6ae3 added two more tables in rss reader interface:
15 years ago
orbiter 0010cd9db1 Support for indexing of RSS feeds!
15 years ago
orbiter 0f276dd63f - MapHeap now implements Map<byte[], Map<String, String>>
15 years ago
orbiter c60d0282fd more abstraction for tables stored in heaps:
15 years ago
orbiter 3197ca42ed preparations to move the HTCache into cora:
15 years ago
orbiter 844f158686 - removed dependencies in header framework:
15 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
15 years ago
orbiter caece04f26 removed System.err and System.out usage from FTPClient; changed logging to log4j (preferred in yacy.cora)
15 years ago
orbiter 90531f78ff refactoring of the cora package to get subpackages for http and ftp (smb to come)
15 years ago
orbiter d0fb6bc2bc cleaned up superfluous classes after sixcoolers migration to HttpComponents-Client-4.x
15 years ago
sixcooler 661867923a ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 7aa860c505 - more logging
15 years ago
orbiter 4d5446d641 code cleanup
15 years ago
orbiter 70dd26ec95 added the new crawl scheduling function to the crawl start menu:
15 years ago
orbiter 5a994c9796 added a scheduler based on API actions
15 years ago
orbiter 189a986ebd - modified api-call interface to record api calls with references to api-call database (carries pk)
15 years ago
orbiter 86d7f8a989 - the web visualization can now be generated in custom color
15 years ago
orbiter 64d4204f44 fix for NPE in network image computation
15 years ago
orbiter 7fdb17bb96 redirect uncaught exceptions to logging + small other changes
15 years ago
f1ori 92df768c39 * fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2929&hilit=
15 years ago
orbiter eb97bed1df patch for http://forum.yacy-websuche.de/viewtopic.php?p=20576#p20576
15 years ago
orbiter 87b1684211 additional double-check in balancer
15 years ago
orbiter 0d81731e88 fixed crawler bug caused by NPE in logging
15 years ago
orbiter a82a93f2fc - better url double check in crawler
15 years ago
sixcooler a6ed6e8cb9 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 171f2bd84e - removed unused network oanet
15 years ago
orbiter b480b7a4d0 fix for bug in last commit
15 years ago
orbiter b12bfe1f91 better usage of OSM tile cache and YaCy cache by usage of better tile server computation based on a coordinate hash
15 years ago
orbiter 388aa021c2 - concurrent loading of OSM tiles
15 years ago
orbiter 301a59e07f moved browser access method from kelondro/util/OS to gui/framework/Browser
15 years ago
orbiter ec72387165 added a very early test version of a YaCy gui component.
15 years ago
sixcooler d88b9606d1 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2923
15 years ago
orbiter 6388a58fc7 better memory management and slightly less (in total and temporary) RAM allocation:
15 years ago
orbiter 5924a0d851 - enhanced concurrency in database index access for multicore
15 years ago
orbiter 610855e362 do not use network graph cache if called from authorized account
15 years ago
sixcooler 39d96abbb5 fix yacyRelease download
15 years ago
sixcooler c29f24a519 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter e7ea3b3cc5 added a buffer for network images to reduce load on yacy.net network image server
15 years ago
orbiter d5c65b17a6 added another network activity visualization: show strong query activity as radiation around peer
15 years ago
sixcooler 15e8c13526 ... migrating to HttpComponents-Client-4.x ...
15 years ago
mikeworks aa663cda4d ConfigUpdate_p.html and ConfigUpdate_p.java: Added check for downloaded releases and disabled buttons in case no new releases available
15 years ago
sixcooler b7102eff92 ... migrating to HttpComponents-Client-4.x ...
15 years ago
lotus 965aa97993 including sbbi upnplib as source again
15 years ago
lotus 74f6fd229e some comments + debug code
15 years ago
sixcooler 52718e6dcb ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 89b0f5bce8 fix for exception in http://forum.yacy-websuche.de/viewtopic.php?p=20418#p20418
15 years ago
sixcooler 5fa8038f10 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 22dbbcfa56 better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures.
15 years ago
low012 0e6fed1fb6 *) less HTML errors (according to https://addons.mozilla.org/de/firefox/addon/249/)
15 years ago
sixcooler 0e56d29335 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler e1316d12d0 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler c5c67f0504 start migrating to HttpComponents-Client-4.x
15 years ago
orbiter 7188c54ddb patch to get dht access to developer peers
15 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
orbiter 150cf42a1b migrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2
15 years ago
orbiter 11b7853940 added a configuration page for search heuristics. currently you can switch on there:
15 years ago
orbiter 5d00888c95 - added animated visualization for DHT-in and DHT-out in network graphic
15 years ago
orbiter bf25407fdd added peer hash to internal RSSFeed. The hash will be used to display news activities in the network graphic.
15 years ago
orbiter 1557e0f2d0 - some refactoring for internal RSSFeed (protocol of all actions as seen on status page)
15 years ago
orbiter 5a4684f21f allow words with length >= 2 (you can't search for 'wm' with 3-letter words...)
15 years ago
orbiter 37b8827a7a - removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection.
15 years ago
orbiter dcd01698b4 added a 'transition feature' that shall lower the barrier to move from g**gle to yacy (yes!):
15 years ago
orbiter 3a9dc52ac2 added a fascinating new way to search _and_ start a web crawl at the same time:
15 years ago
orbiter 2b4f8f6c06 animated network graphic!
15 years ago
orbiter 777195e8d1 more abstraction for access of LoaderDispatcher and cache
15 years ago
orbiter 7bcfa033c9 more abstraction of the htcache when using the LoaderDispatcher:
15 years ago
orbiter 2ddb952a5c added the (fixed and enhanced) secondary search process. The process had been disabled for some time.
15 years ago
orbiter 58035ef784 fix in snippet loading
15 years ago
orbiter 986d4f34d9 added a consistency check for new queues
15 years ago
orbiter 73f03e05ee fixed a bug in snippet fetch strategy: cache only does not help if resource can only be found in web
15 years ago
orbiter fbf021bb50 redesign of index abstract processing - currently disabled until enough peers have fix in SVN 6928
15 years ago
orbiter 87087f12fe - scanned remote search process and enhanced some data structure and synchronizations here and there
15 years ago
orbiter b62fb38344 fix for case where no release provider responds during auto-update (caused NPE)
15 years ago
orbiter 3a1cebb598 bugfixes
15 years ago
orbiter 989819a28c - reduced peer-ping time-out from 30 to 10 seconds
15 years ago
orbiter b03caaa57a better handling of OOM situations
15 years ago
orbiter 56ff9d5fd4 - extended news size from 512 to 1024 characters
15 years ago
orbiter c71d829bb5 more time-out properties for http connection manager
15 years ago
orbiter 60e71876ad - more abstraction (HashMap -> Map)
15 years ago
orbiter a83772c71b fixes and enhancements for balancer:
15 years ago
orbiter 9cde05418f fixed url crawl list display
15 years ago
orbiter 30b337fa9f fixes to balancer when crawling filesystem (problem was: host == null)
15 years ago
orbiter 844853243a fixed balancer time guessing
15 years ago
orbiter 3f93a0cc8f redesign of remote proxy settings
15 years ago
orbiter 11639aef35 - added new protocol loader for 'file'-type URLs
15 years ago
orbiter 6950d8a33d fixes to SMB crawler
15 years ago
orbiter bfdb9f4e06 extended statistics on Network servlet page
15 years ago
orbiter 9842fab6e4 - fixes to query parameter
15 years ago
orbiter 6ec9ced4cd - fix for multi-word search for locations
15 years ago
orbiter 1defd580bc - added option to localization search to distinguish between a search for a location according to the search word only or for the relation between web search results and locations found in the metadata fields
15 years ago
low012 ad823a4716 *) minor changes (only cosmetics, no functional changes)
15 years ago
low012 dcac90d2f9 *) removed unnecessary import
15 years ago
orbiter e43e61e502 added another geolocalization data source: GeoNames
15 years ago
orbiter 118d589eff replaced the very very old data structure 'Records' with a simple table to fix the problem from
15 years ago
orbiter 2a8f70f0ca - fix for caching of OSM tiles. if you want this fix to apply to your peer, please delete the crawl profiles
15 years ago
orbiter 2126c03a62 - removed download-limit that can be given for the crawler for non-crawler download tasks. This was necessary because the same procedure was used for other downloads like for the download of dictionary files where a limit is not useful. The limit still stays for the indexer
15 years ago
orbiter 3661cb692c added dictionary loader servlet that can be used to get the geolocalization file:
15 years ago
orbiter 90fa8fd4d4 - support gpx file extension
15 years ago
orbiter 7b880d73d0 adjustments to granted query size
15 years ago
orbiter 789c6b26ce added a location search service: using the following servlet/example:
15 years ago
orbiter cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data
15 years ago
orbiter c45117f81f fixed dates in metadata
15 years ago
orbiter a7d038bb7a The oai ListFriends source list becomes configurable: just write them into defaults/oaiListFriendsSource.xml
15 years ago
orbiter 7ab207d93a better presentation of search result metadata and fixes to htcache loading
15 years ago
orbiter 5fbf866cae - fixed resumption token generation for oai-pmh import
15 years ago
orbiter fc5efcc05a enhanced and fixed OAI-PMH import
15 years ago
sixcooler c2098f9399 close unused connections if there are too many for DHT
15 years ago
orbiter 40a8d132d9 tried to fix 100% CPU when calling Balancer.top()
15 years ago
orbiter 90c3e5d6f6 - cleanup, removed unused imports
15 years ago
orbiter 3aad50d38e :-(
15 years ago
orbiter 9edd38fbc5 connectionCount limit too low?
15 years ago
orbiter 7a05db0fcb fix to prevent too many open connections
15 years ago
orbiter a9b9bf667b fix for http://forum.yacy-websuche.de/viewtopic.php?p=19910#p19910
15 years ago
orbiter b18a7606a0 some performance hacks and fixed after reading dump in
15 years ago
orbiter 2bc3cba6f1 - fix for 'do not write to cache' rule.
15 years ago
orbiter 4cd5418963 removed finalize methods because of a hint in
15 years ago
orbiter bfa35d6d20 possible fix for ZURL.list counter
15 years ago
orbiter 65f383e70b some adjustments to the httpc after testing with a very slow httpd
15 years ago
orbiter 8c40f1cb8e self-healing for broken table files (may cause other problems, but better than nothing)
15 years ago
sixcooler 13f5b8e7ba fix for storing/getting bookmark-folders
15 years ago
orbiter 7b69d79727 enhanced remove() operation: in many cases it is not necessary to return the removed object to the caller.
15 years ago
orbiter 93ea0a4789 enhanced remove operation in search consequences (which are triggered when the snippet fetch proves that the word has disappeared from the page that was stored in the index)
15 years ago
orbiter 64f29f990e a collection of performance hacks and code cleanup:
15 years ago
orbiter 8b8107b2a3 reduced IO-load and synchronization/blocking
15 years ago
orbiter 3a50b5aa04 enhanced object hash computation
15 years ago
orbiter 1a8a134e0c continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775 and continued in SVN 6790
15 years ago
orbiter dde394a977 - shifted some computation out of synchronization to allow more concurrency
15 years ago
orbiter f204076d25 removed usage of temporary files: causes too much IO
15 years ago
orbiter 48b9371735 changed balancer re-load counter. causes less blocking here doing intranet indexing.
15 years ago
orbiter 0d04ab1422 new access tracking data type strategy; previous data types may have caused deadlocks of httpd when performing statistic cleanups
15 years ago
low012 fc43f3028e *) hopefully fixing NPE issue introduced in r6797
15 years ago
orbiter 55d8e686ea performance hacks
15 years ago
orbiter 2e26744f4e more concurrency when normalizing RWI entries + cleanup
15 years ago
orbiter 555b333041 fix for wrong count of server processes. may fix non-access problems in some cases
15 years ago
orbiter 4917f96729 fixes for some changes in SVN 6797 that caused NPEs when the bookmarks initialized
15 years ago
low012 dff660441a *) changes for better code readability
15 years ago
low012 15d9ea8375 *) changes for better code readability
15 years ago
low012 2bc459252e *) changes for better code readability
15 years ago
orbiter 67ec58d8e7 search performance enhancement
15 years ago
hermens 4ec0092677 more null == proxy fixes
15 years ago
hermens 2f90f0ad56 Remove asserts blocking proxy use cases
15 years ago
sixcooler eb2a4bb555 workaround(?) for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2770&start=0&st=0&sk=t&sd=a&hilit=DefaultCharsetStringPart
15 years ago
orbiter 25aef069a6 continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775
15 years ago
low012 b97ad0f380 *) some minor changes for better code readability
15 years ago
orbiter ba51d140e1 added more info in assert in balancer
15 years ago
orbiter a85c5bb8a7 added support for multiple (fail-over) network definition locations when http-locations are given. multiple locations can be given with a comma-separated list of urls pointing to the network definition file
15 years ago
orbiter 9b3840cb66 performance hacks for the template engine + cleanup
15 years ago