4692 Commits (14e45e90fd8c30f3d3a878acbcf37b1845bb8f3d)

Author SHA1 Message Date
orbiter ed4371dcf3 enhanced navigation implementation and enhanced tag cloud computation
15 years ago
orbiter ca738ac924 - added a tag cloud to search results (using the topics)
15 years ago
apfelmaennchen 7aca763ca8 Some more experimental bookmark code...
15 years ago
apfelmaennchen 4270ed696c Experimental code (I need to transfer the code to my macbook, sorry) for the new bookmarks API based on the Tables concept (same as for crawl starts). Currently you can add a bookmark by api/ymarks/add_ymark.xml?url=http://www.yacy.net&title=YaCy and watch the result via the standard view Tables_p.html.
15 years ago
orbiter e4d561971e added more score cluster options and made score cluster usage more transparent
15 years ago
orbiter e8f90201a5 fix for scheduling of rss feeds
15 years ago
orbiter 7cd9d9d22a - enhanced DidYouMean computation using a faster count on index entries; this allows results to be ranked better
15 years ago
orbiter de722090b5 enhancements in did-you-mean guessing
15 years ago
orbiter a59c885ee0 autocomplete and did-you-mean can now understand _all_ languages and can generate suggestions in all languages and character types
15 years ago
orbiter b7acd92ce4 Auto-Suggestions for YaCy Search:
15 years ago
orbiter 6a166c2040 patches for bad proxy behaviour
15 years ago
orbiter d607b30b6a performance enhancements for search and code review for database functions
15 years ago
orbiter 45b1ab3d07 custom + generic skins:
15 years ago
orbiter fcd40cd30f - disabled domZones (buggy, must think about better solution)
15 years ago
orbiter 0d363a94d7 more performance hacks
15 years ago
orbiter b8aee6d402 performance hacks for better search performance
15 years ago
orbiter 091dd3f6ec - enhanced intranet search speed
15 years ago
low012 b9f405d1e8 *) added comments
15 years ago
orbiter 6e6994e328 latest bugfixes to search and indexing function after test of demo presentation
15 years ago
orbiter aacf572a26 - enhancements for search speed
15 years ago
sixcooler 61c82f3105 gzip-compression @ transferRWI & transferURL back again
15 years ago
orbiter 2c549ae341 fixed a number of small bugs:
15 years ago
orbiter f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option
15 years ago
orbiter c60aed4435 no caching in browser of dynamic web pages sent by YaCy http
15 years ago
orbiter e63896f2a8 added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once.
15 years ago
orbiter e54cb7fb0c more bugfixes (also for latest commit)
15 years ago
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed
15 years ago
low012 afa708d552 *) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent
15 years ago
orbiter a83186ac7d fix for bug in cytrails
15 years ago
orbiter 48c0d508ac fixes for crawling of smb links (file length not always available)
15 years ago
orbiter 0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts
15 years ago
orbiter 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes
15 years ago
lotus 4450c240b7 npe fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2982
15 years ago
orbiter 84a023cbc8 fixed several search bugs
15 years ago
orbiter 97ee278931 enhanced search speed:
15 years ago
orbiter ee3820c9cc more logging for strange "java.lang.NoClassDefFoundError: de/anomic/http/server/RequestHeader" error
15 years ago
orbiter 377f001e0d sorting of crawl profile names in crawl profile editor, see
15 years ago
orbiter 3552476fbe terminated migration from apache httpclient-3.1 to 4.1:
15 years ago
orbiter a2f9974745 some redesign in the access tracker to realize sixcoolers question about "smartes way for deleting the first Object":
15 years ago
sixcooler 03f0414025 some minor correction of my last commit
15 years ago
sixcooler 42fa0eadb1 fix endless loop:
15 years ago
low012 5a9ea0308f *) further simplification of wiki code parser (less redundancy in code, less magic numbers), still not done with it...
15 years ago
orbiter 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification
15 years ago
orbiter 29fe401f93 - some layout and text enhancement for site crawl start
15 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
15 years ago
orbiter 670ba4d52b - removed the remote crawl option from the network configuration submenu and
15 years ago
orbiter 89c2d8b81e better initial hash computation
15 years ago
orbiter 34e2f7f487 enhanced snippet fetch strategy: concurrent snippet fetch even for offline-snippet searches. This improves speed since it is now possible to fetch snippets offline and parsing of source files from the htcache can be enhanced using concurrency. This improves local and remote search.
15 years ago
orbiter 0cf006865e refactoring and enhanced concurrency
15 years ago
orbiter 83ac07874f - corrected return value of put() methods (not used anywhere, so it did not harm before)
15 years ago
orbiter 5702419194 fixed a bug in HTTPClient: keep-alive must be set to false, otherwise servers hold connections 2 seconds open until response.
15 years ago
orbiter 5870b13f3a - code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart
15 years ago
orbiter ac1c08924e more performance hacks
15 years ago
orbiter 14c843d364 more performance hacks
15 years ago
orbiter 39f409a7bb performance hacks
15 years ago
orbiter 7ebef56add - redesign of a part of the remote search client to make it possible to have a test environment for remote search performance tests
15 years ago
orbiter 3c0e07ba72 removed all delays in shutdown process
15 years ago
orbiter 64860dc1bb enhanced search event logging (to be used for further improvements)
15 years ago
sixcooler 17eebd4ef8 counting crawler traffic again:
15 years ago
orbiter 32f73d1aaa added copy for Info.plist for Mac application release updates (this file contains class paths and start parameters)
15 years ago
orbiter 4c21d8dc9d - changed default values for online caution (the pausing may not be necessary any more)
15 years ago
orbiter 570ca577c6 performance hacks
15 years ago
orbiter 348dece62f redesign of the SortStack and SortStore classes:
15 years ago
orbiter 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements
15 years ago
lotus 6a09f1f7e5 fix dedicated upnp testing
15 years ago
orbiter 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
15 years ago
orbiter c757a4aa9f - corrected lifetime computation for search events
15 years ago
orbiter fb828f3767 - performance enhancements in search response time using faster query ID computation and an ID cache
15 years ago
orbiter 22047ffad5 enhanced computation speed of many replaceAll string operations
15 years ago
orbiter e8228fba09 less locking in time format computation, caching and during secondary (remote) search evaluation
15 years ago
orbiter 9c0c94683c because of a bug in the search result caching count, search results had not been generated as fast as possible.
15 years ago
orbiter fa2eb9676e removed unused class
15 years ago
low012 5f391fcfa9 *) cleaned up in wikiCode parser (more to be done)
15 years ago
orbiter b3f0d06444 fixed a problem with restarts in YaCy mac applications: the DATA directory path was not submitted when doing a restart. This solves the problem by:
15 years ago
orbiter d4e4967e19 cleaned up code in yacyRelease (there will be work to do there)
15 years ago
orbiter 1da5241c2d do not block server session if maximum number of sessions is reached, just try to clean up once
15 years ago
orbiter 5de70c3d7c changed way of storage for search requests:
15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
15 years ago
orbiter 65eaf30f77 redesign of crawl profiles data structure. target will be:
15 years ago
f1ori 55da979291 disable revision detection for git
15 years ago
orbiter 104318d58a - added nice colors to feed indexing state messages
15 years ago
orbiter 4f22e2df41 bugfixes for
15 years ago
orbiter 42414a6ae3 added two more tables in rss reader interface:
15 years ago
orbiter 0010cd9db1 Support for indexing of RSS feeds!
15 years ago
orbiter 0f276dd63f - MapHeap now implements Map<byte[], Map<String, String>>
15 years ago
orbiter c60d0282fd more abstraction for tables stored in heaps:
15 years ago
orbiter 3197ca42ed preparations to move the HTCache into cora:
15 years ago
orbiter 844f158686 - removed dependencies in header framework:
15 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
15 years ago
orbiter caece04f26 removed System.err and System.out usage from FTPClient; changed logging to log4j (preferred in yacy.cora)
15 years ago
orbiter 90531f78ff refactoring of the cora package to get subpackages for http and ftp (smb to come)
15 years ago
orbiter d0fb6bc2bc cleaned up superfluous classes after sixcooler's migration to HttpComponents-Client-4.x
15 years ago
sixcooler 661867923a ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 7aa860c505 - more logging
15 years ago
orbiter 4d5446d641 code cleanup
15 years ago
orbiter 70dd26ec95 added the new crawl scheduling function to the crawl start menu:
15 years ago
orbiter 5a994c9796 added a scheduler based on API actions
15 years ago
orbiter 189a986ebd - modified api-call interface to record api calls with references to api-call database (carries pk)
15 years ago
orbiter 86d7f8a989 - the web visualization can now be generated in custom color
15 years ago
orbiter 64d4204f44 fix for NPE in network image computation
15 years ago
orbiter 7fdb17bb96 redirect uncaught exceptions to logging + small other changes
15 years ago
f1ori 92df768c39 * fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2929&hilit=
15 years ago
orbiter eb97bed1df patch for http://forum.yacy-websuche.de/viewtopic.php?p=20576#p20576
15 years ago
orbiter 87b1684211 additional double-check in balancer
15 years ago
orbiter 0d81731e88 fixed crawler bug caused by NPE in logging
15 years ago
orbiter a82a93f2fc - better url double check in crawler
15 years ago
sixcooler a6ed6e8cb9 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 171f2bd84e - removed unused network oanet
15 years ago
orbiter b480b7a4d0 fix for bug in last commit
15 years ago
orbiter b12bfe1f91 better usage of OSM tile cache and YaCy cache by usage of better tile server computation based on a coordinate hash
15 years ago
orbiter 388aa021c2 - concurrent loading of OSM tiles
15 years ago
orbiter 301a59e07f moved browser access method from kelondro/util/OS to gui/framework/Browser
15 years ago
orbiter ec72387165 added a very early test version of a YaCy gui component.
15 years ago
sixcooler d88b9606d1 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2923
15 years ago
orbiter 6388a58fc7 better memory management and slightly less (in total and temporary) RAM allocation:
15 years ago
orbiter 5924a0d851 - enhanced concurrency in database index access for multicore
15 years ago
orbiter 610855e362 do not use network graph cache if called from authorized account
15 years ago
sixcooler 39d96abbb5 fix yacyRelease download
15 years ago
sixcooler c29f24a519 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter e7ea3b3cc5 added a buffer for network images to reduce load on yacy.net network image server
15 years ago
orbiter d5c65b17a6 added another network activity visualization: show strong query activity as radiation around peer
15 years ago
sixcooler 15e8c13526 ... migrating to HttpComponents-Client-4.x ...
15 years ago
mikeworks aa663cda4d ConfigUpdate_p.html and ConfigUpdate_p.java: Added check for downloaded releases and disabled buttons in case no new releases available
15 years ago
sixcooler b7102eff92 ... migrating to HttpComponents-Client-4.x ...
15 years ago
lotus 965aa97993 including sbbi upnplib as source again
15 years ago
lotus 74f6fd229e some comments + debug code
15 years ago
sixcooler 52718e6dcb ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 89b0f5bce8 fix for exception in http://forum.yacy-websuche.de/viewtopic.php?p=20418#p20418
15 years ago
sixcooler 5fa8038f10 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 22dbbcfa56 better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures.
15 years ago
low012 0e6fed1fb6 *) less HTML errors (according to https://addons.mozilla.org/de/firefox/addon/249/)
15 years ago
sixcooler 0e56d29335 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler e1316d12d0 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler c5c67f0504 start migrating to HttpComponents-Client-4.x
15 years ago
orbiter 7188c54ddb patch to get dht access to developer peers
15 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
orbiter 150cf42a1b migrated all my LGPL 3-licensed files to the LGPL 2.1 because LGPL 3 is not compatible with the GPL 2
15 years ago
orbiter 11b7853940 added a configuration page for search heuristics. currently you can switch on there:
15 years ago
orbiter 5d00888c95 - added animated visualization for DHT-in and DHT-out in network graphic
15 years ago
orbiter bf25407fdd added peer hash to internal RSSFeed. The hash will be used to display news activities in the network graphic.
15 years ago
orbiter 1557e0f2d0 - some refactoring for internal RSSFeed (protocol of all actions as seen on status page)
15 years ago
orbiter 5a4684f21f allow words with length >= 2 (you can't search for 'wm' with 3-letter words...)
15 years ago
orbiter 37b8827a7a - removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection.
15 years ago
orbiter dcd01698b4 added a 'transition feature' that shall lower the barrier to move from g**gle to yacy (yes!):
15 years ago
orbiter 3a9dc52ac2 added a fascinating new way to search _and_ start a web crawl at the same time:
15 years ago
orbiter 2b4f8f6c06 animated network graphic!
15 years ago
orbiter 777195e8d1 more abstraction for access of LoaderDispatcher and cache
15 years ago
orbiter 7bcfa033c9 more abstraction of the htcache when using the LoaderDispatcher:
15 years ago
orbiter 2ddb952a5c added the (fixed and enhanced) secondary search process. The process had been disabled for some time.
15 years ago
orbiter 58035ef784 fix in snippet loading
15 years ago
orbiter 986d4f34d9 added a consistency check for new queues
15 years ago
orbiter 73f03e05ee fixed a bug in snippet fetch strategy: cache only does not help if resource can only be found in web
15 years ago
orbiter fbf021bb50 redesign of index abstract processing - currently disabled until enough peers have fix in SVN 6928
15 years ago
orbiter 87087f12fe - scanned remote search process and enhanced some data structure and synchronizations here and there
15 years ago
orbiter b62fb38344 fix for case where no release provider responds during auto-update (caused NPE)
15 years ago
orbiter 3a1cebb598 bugfixes
15 years ago
orbiter 989819a28c - reduced peer-ping time-out from 30 to 10 seconds
15 years ago
orbiter b03caaa57a better handling of OOM situations
15 years ago
orbiter 56ff9d5fd4 - extended news size from 512 to 1024 characters
15 years ago
orbiter c71d829bb5 more time-out properties for http connection manager
15 years ago
orbiter 60e71876ad - more abstraction (HashMap -> Map)
15 years ago
orbiter a83772c71b fixes and enhancements for balancer:
15 years ago
orbiter 9cde05418f fixed url crawl list display
15 years ago
orbiter 30b337fa9f fixes to balancer when crawling filesystem (problem was: host == null)
15 years ago
orbiter 844853243a fixed balancer time guessing
15 years ago
orbiter 3f93a0cc8f redesign of remote proxy settings
15 years ago
orbiter 11639aef35 - added new protocol loader for 'file'-type URLs
15 years ago
orbiter 6950d8a33d fixes to SMB crawler
15 years ago
orbiter bfdb9f4e06 extended statistics on Network servlet page
15 years ago
orbiter 9842fab6e4 - fixes to query parameter
15 years ago
orbiter 6ec9ced4cd - fix for multi-word search for locations
15 years ago
orbiter 1defd580bc - added option to localization search to distinguish between a search for a location according to the search word only or for the relation between web search results and locations found in the metadata fields
15 years ago
low012 ad823a4716 *) minor changes (only cosmetics, no functional changes)
15 years ago
low012 dcac90d2f9 *) removed unnecessary import
15 years ago
orbiter e43e61e502 added another geolocalization data source: GeoNames
15 years ago
orbiter 118d589eff replaced the very very old data structure 'Records' with a simple table to fix the problem from
15 years ago
orbiter 2a8f70f0ca - fix for caching of OSM tiles. if you want this fix to apply to your peer, please delete the crawl profiles
15 years ago
orbiter 2126c03a62 - removed download-limit that can be given for the crawler for non-crawler download tasks. This was necessary because the same procedure was used for other downloads like for the download of dictionary files where a limit is not useful. The limit still stays for the indexer
15 years ago
orbiter 3661cb692c added dictionary loader servlet that can be used to get the geolocalization file:
15 years ago
orbiter 90fa8fd4d4 - support gpx file extension
15 years ago
orbiter 7b880d73d0 adjustments to granted query size
15 years ago
orbiter 789c6b26ce added a location search service: using the following servlet/example:
15 years ago
orbiter cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data
15 years ago
orbiter c45117f81f fixed dates in metadata
15 years ago
orbiter a7d038bb7a The oai ListFriends source list becomes configurable: just write them into defaults/oaiListFriendsSource.xml
15 years ago
orbiter 7ab207d93a better presentation of search result metadata and fixes to htcache loading
15 years ago
orbiter 5fbf866cae - fixed resumption token generation for oai-pmh import
15 years ago
orbiter fc5efcc05a enhanced and fixed OAI-PMH import
15 years ago
sixcooler c2098f9399 close unused connections if there are too many for DHT
15 years ago
orbiter 40a8d132d9 tried to fix 100% CPU when calling Balancer.top()
15 years ago
orbiter 90c3e5d6f6 - cleanup, removed unused imports
15 years ago
orbiter 3aad50d38e :-(
15 years ago
orbiter 9edd38fbc5 connectionCount limit too low?
15 years ago
orbiter 7a05db0fcb fix to prevent too many open connections from existing
15 years ago
orbiter a9b9bf667b fix for http://forum.yacy-websuche.de/viewtopic.php?p=19910#p19910
15 years ago
orbiter b18a7606a0 some performance hacks and fixed after reading dump in
15 years ago
orbiter 2bc3cba6f1 - fix for 'do not write to cache' rule.
15 years ago
orbiter 4cd5418963 removed finalize methods because of a hint in
15 years ago
orbiter bfa35d6d20 possible fix for ZURL.list counter
15 years ago
orbiter 65f383e70b some adjustments to the httpc after testing with a very slow httpd
15 years ago
orbiter 8c40f1cb8e self-healing for broken table files (may cause other problems, but better than nothing)
15 years ago
sixcooler 13f5b8e7ba fix for storing/getting bookmark-folders
15 years ago
orbiter 7b69d79727 enhanced remove() operation: in many cases it is not necessary to return the removed object to the caller.
15 years ago
orbiter 93ea0a4789 enhanced remove operation in search consequences (which are triggered when the snippet fetch proves that the word has disappeared from the page that was stored in the index)
15 years ago
orbiter 64f29f990e a collection of performance hacks and code cleanup:
15 years ago
orbiter 8b8107b2a3 reduced IO-load and synchronization/blocking
15 years ago
orbiter 3a50b5aa04 enhanced object hash computation
15 years ago
orbiter 1a8a134e0c continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775 and continued in SVN 6790
15 years ago
orbiter dde394a977 - shifted some computation out of synchronization to allow more concurrency
15 years ago
orbiter f204076d25 removed usage of temporary files: causes too much IO
15 years ago
orbiter 48b9371735 changed balancer re-load counter. causes less blocking here doing intranet indexing.
15 years ago
orbiter 0d04ab1422 new access tracking data type strategy; previous data types may have caused deadlocks of httpd when performing statistic cleanups
15 years ago
low012 fc43f3028e *) hopefully fixing NPE issue introduced in r6797
15 years ago
orbiter 55d8e686ea performance hacks
15 years ago
orbiter 2e26744f4e more concurrency when normalizing RWI entries + cleanup
15 years ago
orbiter 555b333041 fix for wrong count of server processes. may fix non-access problems in some cases
15 years ago
orbiter 4917f96729 fixes for some changes in SVN 6797 that caused NPEs when the bookmarks initialized
15 years ago
low012 dff660441a *) changes for better code readability
15 years ago
low012 15d9ea8375 *) changes for better code readability
15 years ago
low012 2bc459252e *) changes for better code readability
15 years ago
orbiter 67ec58d8e7 search performance enhancement
15 years ago
hermens 4ec0092677 more null == proxy fixes
15 years ago
hermens 2f90f0ad56 Remove asserts blocking proxy use cases
15 years ago
sixcooler eb2a4bb555 workaround(?) for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2770&start=0&st=0&sk=t&sd=a&hilit=DefaultCharsetStringPart
15 years ago
orbiter 25aef069a6 continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775
15 years ago
low012 b97ad0f380 *) some minor changes for better code readability
15 years ago
orbiter ba51d140e1 added more info in assert in balancer
15 years ago
orbiter a85c5bb8a7 added support for multiple (fail-over) network definition locations when http-locations are given. multiple locations can be given with a comma-separated list of urls pointing to the network definition file
15 years ago
orbiter 9b3840cb66 performance hacks for the template engine + cleanup
15 years ago
orbiter 5c10f8bc5f enhanced latest hack
15 years ago
orbiter b3238bec83 performance hack for httpd
15 years ago
orbiter 1e8e79b9ef redesign of reference hash (URL-hash) parameter hand-over:
15 years ago
orbiter 749ffbd642 - added another catch case for the index dump and index merge process that should cause non-blocking behavior in case that index dump and/or index merge caused any unexpected exception.
15 years ago
orbiter 9ddb8e4a43 set an option for the java-internal image parser that prevents the image from being cached in a temporary file on the file system. This should speed up image parsing during image indexing dramatically and should also cause better performance when showing the yacy banner and OSM tiles.
15 years ago
orbiter e12f1fd821 - added setting of access rights for executable scripts after auto-installation
15 years ago
orbiter 95f31da8da increase dump cache queue length from 1 to 2
15 years ago
orbiter 6c093d6aed - enhanced domain navigator computation
15 years ago
orbiter bb63c5d075 using a Pattern object with precompiled regular expressions to apply must-match constraints to search results: should speed up pre-sorting of search results and should cause richer search result sets
15 years ago
orbiter 90dd197ae7 - no latency for local crawls
15 years ago
orbiter bfb518cd47 some refactoring to get the LoaderDispatcher a little bit more independent from the switchboard
15 years ago
orbiter c855fc48c6 only load robots.txt for http and https protocol
15 years ago
orbiter 748abfcffa added patches to prevent yacy-protocol DoS settings
15 years ago
orbiter e820ed061a avoiding excessive DNS lookups to determine localhost
15 years ago
orbiter de88200e11 - added Byte Order Mark recognition to serverObjects
15 years ago
orbiter 3300930fc5 - (almost) fixed FTP crawler
15 years ago
orbiter 9623d9e6d2 added a smb loader component for the YaCy crawler
15 years ago
orbiter 48995e71c4 added soft-auth to general authentication scheme
15 years ago
orbiter 72f00dee59 removed never-used server access account function
15 years ago
orbiter 57e1eae95e longer time-out for url fetching .. may help to show all the links that the statistics report for a search result
15 years ago
orbiter 9e639603e3 after frequent occurrences of 100% CPU usages and permanent blockings I try to disable a function in a method that may cause the problem when calling an external library (apache http client 3.x). The thread dump that shows the problem is attached here.
15 years ago
orbiter 4144927d94 show less errors
15 years ago
orbiter b88f5fbb4b slightly changed crawling policy
15 years ago
orbiter 7684a575c4 fix for deletion of error database each time when YaCy starts up
15 years ago
orbiter f561e340c6 show more results of single domains when not authorized fully (up to 100)
15 years ago
orbiter c4bdb1e7f2 added one more option in ViewFile to show an iframe like for the original web page content but using the cache rather than the direct link to the content in the web. Upgraded the very old and previously no longer used CacheResource_p servlet to a new and working version.
15 years ago
orbiter c09a995930 better logging of double occurrences of urls in the crawler
15 years ago
orbiter 884b262130 - added a new Wiki Namespace Navigator
15 years ago
orbiter 617dfbbd06 allow 'authorization by encoded password' also if requesting client is not from localhost but from the same host as yacy is running on.
15 years ago
orbiter 599c3766c4 added authentication to automated API call
15 years ago
orbiter 727dd9b193 - fixed a bug in robots.txt parser
15 years ago
orbiter 54af9e6b49 - added parsing of robots meta-tag in html headers to detect a noindexing request
15 years ago
orbiter 46c4f8b68a better look-ahead into the crawl queue: show more on crawl monitor
15 years ago
lotus 7b546415dc added svn6695 for windows
15 years ago
orbiter f175f9a2d3 changed way how number of search requests are counted:
15 years ago
orbiter 84222e3b4f fix for auto-updater: delete old libraries before copy of new one
15 years ago
orbiter 93b7ddc27d fix for http://forum.yacy-websuche.de/viewtopic.php?p=19376#p19376
15 years ago
orbiter 8030ed3319 self-healing for lost crawl profile handles
15 years ago
orbiter e3e5e05ec2 fix for problem in ranking setting which was caused by the introduction of a toString() method in serverObjects
15 years ago
orbiter e3ccfb54aa fix for display problem in Firefox on MacOS X
15 years ago
orbiter 564927ce72 redesign of CrawlResult data structures because of OOM occurrences during URL deletion processes.
15 years ago
orbiter ef62d017e5 integrated session id filtering for crawler
15 years ago
orbiter d8d9984913 added framework for session id filtering (not ready yet)
15 years ago
orbiter 2bc36de336 - fix for bug in svn 6669
15 years ago
orbiter d378ca4604 better handling of concurrency in seed
15 years ago
orbiter 6538043d89 fix for http://forum.yacy-websuche.de/viewtopic.php?p=19189#p19189
15 years ago
lotus 945e0ba5a5 allow global search if res. observer disabled index transmission
15 years ago
lotus 8faeedd99a not a fix! for:
15 years ago
lotus 11188cd7eb resource observer now uses the Java 6 method to check for free space. thus, disk observing now needs Java 6 installed.
15 years ago
orbiter be18b5d8cd fix for 'cannot switch back to default language'-bug
15 years ago
orbiter 74e736c903 missing file for last commit
15 years ago
orbiter 308a973503 refactoring of tables data organisation
15 years ago
orbiter 8a76f38d26 Added a new steering servlet that can be used to repeat actions that had been made on the yacy interface. This can be used to:
15 years ago
orbiter 840527689b more simplification of bookmark class
15 years ago
orbiter d77782a8d5 removed bookmark tags file, tags are now stored only in RAM
15 years ago
orbiter ada0ce9de3 refactoring of bookmarks: there is a big performance problem in the bookmarks code and furthermore the bookmarks
15 years ago
orbiter a131ebbcb5 one more fix for NPE, see
15 years ago
orbiter 24060885b6 - added Tables abstraction in data.Tables.java
15 years ago
orbiter 7fdf59a77f misc NPE check
15 years ago
orbiter a512aef6ad fix for http://forum.yacy-websuche.de/viewtopic.php?p=18918#p18918
15 years ago
orbiter 3889438db6 fix for bookmarks
15 years ago
orbiter 23bcca07a3 removed directly linked servlets that had been there to test memory failures that appeared in those servlets
15 years ago
orbiter 69c29acb6e no exception thread dump if parser cannot parse because that mime-type/extension is in the deny-set
15 years ago
orbiter 8ce936bcdd added an api recording function: it shall be possible to record
15 years ago
orbiter e80e060ca6 - increased thread priority for server threads
15 years ago
orbiter 234f733a3d - relocation of seed db is better for network switch than re-initialization because of the embedding of the peers object in other objects
15 years ago
orbiter 473b11033d fixed network switch process - crawling did not work after a switch before this fix
15 years ago
orbiter fd7b348973 some fixes for the network switch
15 years ago
orbiter f6731c6240 more logging etc.
15 years ago
orbiter d9169cc6c3 increased proxy load time-out from 30000 to 60000 milliseconds
15 years ago
orbiter 938e806182 tried to fix date problem that may have prevented foreign peers from staying in the network
15 years ago
orbiter bd05e57d3b fix for http://forum.yacy-websuche.de/viewtopic.php?p=18563#p18563
15 years ago
orbiter 5df628a2a4 - added BEncoder class
15 years ago
orbiter 82f57f79e5 more PMD enhancements
15 years ago
orbiter 5d930c96f0 more fixes to search result page navigation
15 years ago
orbiter 8c520f128d reverted a change in ranking process committed this afternoon
15 years ago
orbiter a06f7ddb33 more PMD recommendations
15 years ago
orbiter 18172451a0 better search computation:
15 years ago
orbiter d126d6c1b5 renamed the servlet WatchCrawler_p to Crawler_p
15 years ago
orbiter 66c0a8e849 more PMD recommendations
15 years ago
orbiter 909a4f91c7 added a logging output for crawl starts that shows the URL that can be used to start the crawl again
15 years ago
orbiter bc96d74813 - clean-up of robots.txt parser
15 years ago
orbiter dd459281c8 applied code changes that are recommended by PMD
15 years ago
lotus eac2daf2e8 * reenable DHT if yet enough memory is available
15 years ago
orbiter d77a8f3b3e added some modifications recommended by PMD for better performance
15 years ago
orbiter d1973bae2a code cleanup: removed unused code and unused methods
15 years ago
orbiter a3b8b7b5c5 some redesign of the main menu structure:
15 years ago
lotus ab3cf60dbe fix for npe
15 years ago
orbiter 7f20963b41 add-on to last commit
15 years ago
orbiter eeca2ded92 fix for http://forum.yacy-websuche.de/viewtopic.php?p=18500#p18500
15 years ago
orbiter bb2e03761c - fix for deadlock with 100% CPU during search
15 years ago
orbiter dff4f95c78 some patches to get the torrent parser working
15 years ago
low012 82198acc06 *) minor changes
15 years ago
low012 b75547fc60 *) minor changes
15 years ago
orbiter 57d729e377 fix for negative numbers in network statistic
15 years ago
orbiter 4ac4fe952c patch for npe in bookmarks
15 years ago
orbiter d548bd41ad fix for a npe during search
15 years ago
orbiter 37245430c3 fix for NPE during DHT RWI selection
15 years ago
orbiter a37878b7d5 url parser regex performance hack
15 years ago
orbiter b527d2ebfa fix for media search
15 years ago
orbiter 362b7a929b added extensive memory protection logic to avoid out of memory errors that may be caused by the RowCollection memory allocation function
15 years ago
orbiter 8281e29963 - more configuration for profiling graph (number of events)
15 years ago
f1ori 5f0f6b71b4 * revert last commit, something is more broken than before
15 years ago
f1ori 8c8b642eba * fix timezone problem
15 years ago
orbiter 4782d2c438 fix for search bug that appeared when looking at page 3 of results or further
15 years ago
orbiter 29fde9ed49 better control of ranking order in sort stack
15 years ago
orbiter 66923ebc6c - modified method in RequestHeader that delivers the host name of requester: no more reverse domain lookup (may have killed interface performance in some cases)
15 years ago
orbiter e34e63a039 preset of proper HashMap dimensions: should prevent re-hashing and increase performance
15 years ago
orbiter 4a5100789f replaced _all_ size() == 0 with isEmpty() and all size() > 0 with !isEmpty(). The isEmpty() method is much faster in some cases, especially when used to access badly balanced hashtables where a size() operation becomes a large iteration.
15 years ago
orbiter f4946eaf27 - better thread dump
15 years ago
orbiter 9743b70d1c disabled keep-alive of server, not really needed for speed but a cause for much trouble and memory occupancy
15 years ago
orbiter 491ba6a1ba - some refactoring in workflow
15 years ago
orbiter 969123385b added json and rss output for image search
15 years ago
orbiter d183f8d980 refactoring (moved code from ContentTransformer to TemplateEngine)
15 years ago
orbiter 23aef43786 - better synchronization in SortStack
15 years ago
orbiter 7b1f5b0430 - better media search ranking
15 years ago
orbiter 4df88a4e7a - fixes for missing or bad hashCode computation
15 years ago
orbiter dbdf2570ba added comparator and more fixes for SortStack/SortStore
15 years ago
orbiter 1dff620181 Better implementation of SortStack and SortStore and adaptations in all using classes to implement the necessary Comparable interface and hash code computation.
15 years ago
orbiter fe41a84330 some enhancements in web caching: avoid double loading of response metadata and/or content
15 years ago
orbiter 06d0dcde20 more enhancements to image search
15 years ago
orbiter 4c6312d103 enhanced image search
15 years ago
orbiter 2d8f3ee301 some performance hacks
15 years ago
orbiter 94b2a664f3 - use a static DiskFileItemFactory (one instantiation is enough)
15 years ago
orbiter 013f337d3f - avoid unnecessary host name lookups for localhost
15 years ago
orbiter 20c5d78a5c fix for a ConcurrentModificationException
15 years ago
orbiter 7144d2df6e added crawlReceipt servlet as individual class to examine OOM problem as documented in
15 years ago
orbiter 29fe436e36 - fixed post-ranking including prefer mask
15 years ago
orbiter 5399d1e2bc refactoring (reason: get more abstraction to use the blacklist class; for integration in other servlets)
15 years ago
orbiter 1fa0ac26e9 better protection against NPEs during search/ranking
15 years ago
orbiter 4c99d4683d possible fix for lost crawl profile handles: clean-up job did wrong measurement to see if crawl is still running.
15 years ago
orbiter 18b21eaffe small fixes to search default values and server logging
15 years ago
lotus 6edc168cfe option to disable dht by memory limit:
15 years ago
orbiter 4431b9767e added about 450 replacements for printStackTrace() methods to pipe such traces into the log at DATA/LOG/
15 years ago
orbiter e3025ee691 - new icon for OAI-PMH loading action
15 years ago
orbiter f0b8db93f0 - more abstraction of serverCore thread access
15 years ago
orbiter 2889b9426e missing code for last commit
15 years ago
orbiter b6a8887ff5 better handling of running sessions without explicit hashtable
15 years ago
orbiter 1dc7ea986a added a dynamic keep-alive time-out for http server sessions:
15 years ago
orbiter b0b7a4f9a5 - added function to OAI-PMH reader that can pull all records from a server using an evaluation of the resumption token to get URL to retrieve remaining records
15 years ago
lotus 79251e6f60 configurable disk space hardlimit for dht
15 years ago
orbiter a0e891c63d - some redesign in UI menu structure to make room for new 'Content Integration' main menu containing import servlets for Wikimedia Dumps, phpbb3 forum imports and OAI-PMH imports
15 years ago
orbiter 30f108f97d added stub of oai-pmh importer (not working yet)
15 years ago
orbiter 77c99e500f added more control over memory allocation
15 years ago
orbiter 52470d0de4 - fix for xls parser
16 years ago
orbiter 5e8038ac4d - refactoring of blacklists
16 years ago
orbiter 26fafd85a5 - more refactoring
16 years ago
orbiter 3528b970d6 - refactoring
16 years ago
orbiter a8ce192f63 - shifted main classes to new package net.yacy
16 years ago
orbiter b79f4f062f refactoring of yacy documents and parsers: they depend now only on the kelondro classes
16 years ago
hermens 0fd9540866 Configuration of HTTPDProxyHandler logging
16 years ago
orbiter cee7a05ff2 - de-serialized the pdf parser
16 years ago
orbiter 9db928ce53 replaced fontbox 0.7.3 with fontbox 0.8.0
16 years ago
orbiter c2272785c7 - fix for xlsx and pptx parsing
16 years ago
orbiter c864901087 - moved httpd.mime to defaults path
16 years ago
low012 8829ec5f18 *) made sure that &nbsp; is replaced with a space and not just deleted in CharacterCoding.java
16 years ago
orbiter 6c347a37eb more options for DocumentIndex
16 years ago
orbiter e7f18ba24b refactoring
16 years ago
orbiter ce8dc575ca refactoring
16 years ago
orbiter bea3b99aff moved table and util classes
16 years ago
orbiter bd876eb4b7 moved io classes
16 years ago
orbiter c0e0e1f422 moved blob classes
16 years ago
orbiter 1e4f8b56ed accumulated classes from different packages into the new rwi package
16 years ago
orbiter 194da25a2f moved kelondro index
16 years ago
orbiter 4446acc8cd moved kelondro order
16 years ago
orbiter f677d534b1 start of a really extensive refactoring which will produce a hierarchical package structure with the domain yacy.net as package root
16 years ago
orbiter ea473e32b8 refactoring
16 years ago
orbiter 735e2737e3 * added index segments
16 years ago
orbiter 09de5da74a once again a performance hack
16 years ago
orbiter 2f6d88403e
16 years ago
orbiter d2615ea5a8 increased memory for scraper buffer to enhance parsing speed
16 years ago
orbiter 4bbbb74ec4 removed unnecessary synchronization
16 years ago
hermens 67e5464cc2 Fix for SVN6380: x[] Arrays are unsuitable Keys for Maps without using a proper Comparator.
16 years ago
hermens aeab8c7917 Prevent failed DHT attempts from overwriting newer peer info
16 years ago
hermens 9324b5b6c5 Enhancements to DHT
16 years ago
hermens e49e2d75fe Limit the time Transmission.Chunks stay in the transmissionCloud by using a Map that iterates entries in insertion order.
16 years ago
orbiter 92db7c5d07 increased timeout for index retrieval
16 years ago
lotus 386b9f35f6 activated resource observer for windows 7
16 years ago
orbiter 6e0dc39a7d - some fixes to prevent blocking situations
16 years ago
orbiter 51f2bbf04b possible fix for problem in http://forum.yacy-websuche.de/viewtopic.php?p=17655#p17655
16 years ago
orbiter f8371707e5 - possibly better termination for SplitTable
16 years ago
orbiter 87780f2562 produce did-you-mean also for queries with more than one word
16 years ago
orbiter 04a548a1e3 - temporarily integrated the transferURL servlet as a static class instead of a class that is called using reflection, to investigate the OOM problems in that class
16 years ago
orbiter ea427df944 fixed a worst case situation of the condenser which may cause a temporary full CPU load because of a bad data structure usage
16 years ago
orbiter 3e38035389 fix for interrupted thread during has() property check
16 years ago
orbiter 5bd1c1d205 just added some comments that had been produced to learn about OAI-PMH
16 years ago
orbiter 6aa474f529 - better logging for web cache access and fail reasons
16 years ago
orbiter 3671c37989 added experimental oai-pmh reader and integrated it with the existing dublin core parser
16 years ago
orbiter 58a00205d5 re-activated the emergency close when too many server connections exist
16 years ago
orbiter c57d2070e6 more logging
16 years ago
orbiter a995b95367 tried a fix for the httpd access bug (too many unclosed sessions)
16 years ago
orbiter e1fba41cad better logging
16 years ago
orbiter 2275f885a8 possible fix for concurrency problem
16 years ago
low012 a6a3090c3d *) blacklist cleaner supports usage of regular expressions now
16 years ago
orbiter 5a93807781 improved web cache speed:
16 years ago
orbiter 2e8b2867ff double performance of store method because it avoids one 'has'
16 years ago
orbiter afda5b1adc new join method for indexes (not yet used)
16 years ago
orbiter 65b66c2c18 better handling of array files of length 0
16 years ago
orbiter 1957b5797a fix for seed generation
16 years ago
orbiter 432154f725 new strategy for concurrent database index key retrieval
16 years ago
orbiter a11cd9f80f - removed reverse name lookup for http access logging (grr..)
16 years ago
orbiter 2e6bdce086 - added more logging to balancer
16 years ago
orbiter 1171a72006 fix for deadlock as seen in http://forum.yacy-websuche.de/viewtopic.php?p=17521#p17521
16 years ago
orbiter 031e6eefbd some updates to dublin core, metadata browsing, file indexing and parser stability
16 years ago
hermens 62a7341c4d Fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2204
16 years ago
low012 f65bfaa9af *) Removed base tag from error page. I added it a long time ago as a workaround for some weird behavior of my router, but as it turns out, it does more harm than good in general: if HTTPS is used for communication with YaCy, entering a wrong password led to an error page with a form that would send username and password unencrypted, possibly without the user being aware of it.
16 years ago
orbiter e4797ebcde fix for http://forum.yacy-websuche.de/viewtopic.php?p=17509#p17509
16 years ago
orbiter efa7fb34f0 better oom-awareness of miss-cache in cache
16 years ago
orbiter 3e9dcfc204 fix for http://forum.yacy-websuche.de/viewtopic.php?p=17504#p17504
16 years ago
orbiter c3a4aee255 some redesign with a possible fix for the ReferenceContainerCache.
16 years ago
orbiter aca8a78eb8 fix for shutdown of DocumentIndex objects
16 years ago
orbiter 23ab6fbca4 - navigation appears at the correct position when opengeodb results are also presented after a search
16 years ago
orbiter 4db34eea73 fix for OOM problem in kelondro Cache
16 years ago
orbiter 8ea1d7ab59 fix for wrong assert condition in search abstract generation
16 years ago
orbiter fbd77bd77c git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6328 6c8d7289-2bf4-0310-a012-ef5d649a1542
16 years ago
orbiter 54c7cbf1d9 - fast result for local search in case fewer than 10 hits exist
16 years ago
orbiter 28d4b921b6 different approach for file search
16 years ago
orbiter f99f86c5c5 added concurrency to file indexing class
16 years ago
orbiter 902d16cf6c fixes to parser
16 years ago
orbiter 4a1c852435 fix in usage of RAM copy for Table objects and some cosmetics in asserts.
16 years ago
lotus dce450e2e0 possible fix for "hung" doc-documents
16 years ago