Commit Graph

434 Commits (d78e322e844d944e3810d8cc0e39ee6566cc27dd)

Author SHA1 Message Date
orbiter 22047ffad5 enhanced computation speed of many replaceAll string operations
14 years ago
orbiter e8228fba09 less locking in time format computation, caching and during secondary (remote) search evaluation
14 years ago
orbiter 9c0c94683c because of a bug in search result caching count search results had not been generated as fast as possible.
14 years ago
orbiter b3f0d06444 fixed a problem with restarts in YaCy mac applications: the DATA directory path was not submitted when doing a restart. This solves the problem by:
14 years ago
sixcooler ca0a03e9ea ... migrating to HttpComponents-Client-4.x ...
14 years ago
orbiter 3988a95fb5 added ability in rss reader to parse atom feeds
14 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
14 years ago
orbiter 65eaf30f77 redesign of crawl profiles data structure. target will be:
14 years ago
f1ori 938676265f fix shutdown command, close HttpClient connection pool
14 years ago
orbiter 4f22e2df41 bugfixes for
14 years ago
orbiter 42414a6ae3 added two more tables in rss reader interface:
14 years ago
orbiter 0010cd9db1 Support for indexing of RSS feeds!
14 years ago
orbiter 0f276dd63f - MapHeap now implements Map<byte[], Map<String, String>>
14 years ago
orbiter cf07b34c2d implemented the Map interface in the ARC classes so it will be possible to instantiate ARCs as
14 years ago
orbiter c60d0282fd more abstraction for tables stored in heaps:
14 years ago
orbiter d1be64d491 removed wrong assert
14 years ago
orbiter 3197ca42ed preparations to move the HTCache into cora:
14 years ago
orbiter 844f158686 - removed dependencies in header framework:
14 years ago
orbiter 80ba543d4c svn fix for uppercase problem
14 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
14 years ago
orbiter caece04f26 removed System.err and System.out usage from FTPClient; changed logging to log4j (preferred in yacy.cora)
14 years ago
orbiter 90531f78ff refactoring of the cora package to get subpackages for http and ftp (smb to come)
14 years ago
sixcooler 661867923a ... migrating to HttpComponents-Client-4.x ...
14 years ago
orbiter 7aa860c505 - more logging
14 years ago
orbiter 4d5446d641 code cleanup
14 years ago
orbiter 66ac3a7d9d corrected database row iteration
14 years ago
orbiter dfd416e3fb removed a mysterious image buffer
14 years ago
orbiter e10cd115a9 - added a new RSS reader interface. This is not finished but you can now load and look at RSS feeds. It will be used to index RSS feeds in a way that is appropriate for such kind of data.
14 years ago
orbiter 933dc1a600 removed old rss parser (will be replaced with parser from cora package)
14 years ago
orbiter 70dd26ec95 added the new crawl scheduling function to the crawl start menu:
14 years ago
orbiter 5a994c9796 added a scheduler based on API actions
14 years ago
orbiter 189a986ebd - modified api-call interface to record api calls with references to api-call database (carries pk)
14 years ago
orbiter 054c22e2c6 added TLDs from http://www.opennicproject.org
14 years ago
orbiter 86d7f8a989 - the web visualization can now be generated in custom color
14 years ago
orbiter 7fdb17bb96 redirect uncaught exceptions to logging + small other changes
14 years ago
orbiter a82a93f2fc - better url double check in crawler
14 years ago
sixcooler a6ed6e8cb9 ... migrating to HttpComponents-Client-4.x ...
14 years ago
orbiter 171f2bd84e - removed unused network oanet
14 years ago
sixcooler 1802c54317 LGPL-Header
14 years ago
orbiter a835a22b32 fixed isLocal() property (better recognition of intranet hosts)
14 years ago
orbiter 670c746dc5 dual-licensed HttpConnectionInfo for LGPL
14 years ago
orbiter 301a59e07f moved browser access method from kelondro/util/OS to gui/framework/Browser
14 years ago
orbiter ec72387165 added a very early test version of a YaCy gui component.
14 years ago
sixcooler d88b9606d1 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2923
14 years ago
orbiter 6388a58fc7 better memory management and slightly less (in total and temporary) RAM allocation:
14 years ago
orbiter 5924a0d851 - enhanced concurrency in database index access for multicore
14 years ago
orbiter 55a2536bcf enhancement in drawing speed and reduction of object allocation during drawing
14 years ago
orbiter 9ab06bc333 enhancement in sorting efficiency (database root operation): less object allocation
14 years ago
sixcooler 39d96abbb5 fix yacyRelease download
14 years ago
sixcooler 349e4dee9d ... migrating to HttpComponents-Client-4.x ...
14 years ago
sixcooler c29f24a519 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter d5c65b17a6 added another network activity visualization: show strong query activity as radiation around peer
15 years ago
orbiter 989948e1a9 fixed generic image parser
15 years ago
orbiter e1015ead2c static access to constants
15 years ago
orbiter 27d8a8b53e removed wrong com.sun.codec class access in generic image parser
15 years ago
orbiter bbf887d879 added generics to UPnP classes
15 years ago
sixcooler 15e8c13526 ... migrating to HttpComponents-Client-4.x ...
15 years ago
mikeworks b12db14b9f Added Generics to new net.yacy.upnp.* classes to eliminate compiler warnings
15 years ago
sixcooler b7102eff92 ... migrating to HttpComponents-Client-4.x ...
15 years ago
mikeworks 572e429eff - fixes UPnP not working discussion on forum: http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2881
15 years ago
mikeworks 2a20282505 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@6987 6c8d7289-2bf4-0310-a012-ef5d649a1542
15 years ago
lotus 965aa97993 including sbbi upnplib as source again
15 years ago
orbiter 60caade056 removed debug output
15 years ago
sixcooler 52718e6dcb ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler 5fa8038f10 ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter dec1419bc3 ;-)
15 years ago
orbiter 22dbbcfa56 better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures.
15 years ago
orbiter 8674a65488 removed override directive which caused a compile error in eclipse helios
15 years ago
low012 dc5f0e357c *) fixed SVN properties
15 years ago
low012 01d6b952f0 *) minor changes for easier to read code, no functional changes
15 years ago
sixcooler 0e56d29335 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler 2ad5829b26 correct timeout parameter for HttpComponents-Client-4.x
15 years ago
sixcooler e1316d12d0 ... migrating to HttpComponents-Client-4.x ...
15 years ago
sixcooler c5c67f0504 start migrating to HttpComponents-Client-4.x
15 years ago
orbiter 25024d6ab2 fix for problem when accessing the metadata index. The index was not available for all peers with no RAM table copy.
15 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
low012 d4851441b0 *) Added Android packages to parser in order to be able to create a decentralized search for direct downloads of Android apps.
15 years ago
orbiter 150cf42a1b migrated all my LGPL 3-licensed files to the LGPL 2.1 because LGPL 3 is not compatible with the GPL 2
15 years ago
orbiter 5d00888c95 - added animated visualization for DHT-in and DHT-out in network graphic
15 years ago
orbiter bf25407fdd added peer hash to internal RSSFeed. The hash will be used to display news activities in the network graphic.
15 years ago
orbiter 1557e0f2d0 - some refactoring for internal RSSFeed (protocol of all actions as seen on status page)
15 years ago
orbiter 5a4684f21f allow words with length >= 2 (you can't search for 'wm' with 3-letter words...)
15 years ago
orbiter 37b8827a7a - removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection.
15 years ago
orbiter 103c848af8 enhancements in image drawing speed
15 years ago
orbiter 777195e8d1 more abstraction for access of LoaderDispatcher and cache
15 years ago
orbiter 7bcfa033c9 more abstraction of the htcache when using the LoaderDispatcher:
15 years ago
orbiter 7e2d6fac12 patch for bad values during local search join
15 years ago
orbiter 986d4f34d9 added a consistency check for new queues
15 years ago
orbiter 73f03e05ee fixed a bug in snippet fetch strategy: cache only does not help if resource can only be found in web
15 years ago
orbiter fbf021bb50 redesign of index abstract processing - currently disabled until enough peers have fix in SVN 6928
15 years ago
orbiter 87087f12fe - scanned remote search process and enhanced some data structure and synchronizations here and there
15 years ago
orbiter 7ddb70e7c6 new license for ai.greedy component: LGPL (nobody else than me modified that code)
15 years ago
orbiter de4f30bb2e UTF-8 fix
15 years ago
orbiter 3a1cebb598 bugfixes
15 years ago
orbiter 51332b787d reverted SVN 6869 as discussed with dulcedo in car after LinuxTag:
15 years ago
orbiter b03caaa57a better handling of OOM situations
15 years ago
orbiter 60e71876ad - more abstraction (HashMap -> Map)
15 years ago
orbiter a83772c71b fixes and enhancements for balancer:
15 years ago
orbiter 9cde05418f fixed url crawl list display
15 years ago
orbiter 2eea806005 fewer errors in image parser
15 years ago
orbiter 3f93a0cc8f redesign of remote proxy settings
15 years ago
orbiter 11639aef35 - added new protocol loader for 'file'-type URLs
15 years ago
orbiter 6950d8a33d fixes to SMB crawler
15 years ago
orbiter 98c1d65415 - show up to 10 locations (maps) after search (instead of a max of 5)
15 years ago
orbiter 9842fab6e4 - fixes to query parameter
15 years ago
orbiter 1defd580bc - added option to localization search to distinguish between a search for a location according to the search word only or for the relation between a web search results and locations found in the metadata fields
15 years ago
orbiter bd0a9df895 fix for bad location double check
15 years ago
orbiter e43e61e502 added another geolocalization data source: GeoNames
15 years ago
orbiter 118d589eff replaced the very very old data structure 'Records' with a simple table to fix the problem from
15 years ago
orbiter 2a8f70f0ca - fix for caching of OSM tiles. if you want that this fix applies to your peer, please delete the crawl profiles
15 years ago
orbiter 2126c03a62 - removed download-limit that can be given for the crawler for non-crawler download tasks. This was necessary because the same procedure was used for other downloads like for the download of dictionary files where a limit is not useful. The limit still stays for the indexer
15 years ago
orbiter 439b44be9e removed exit from computation in ReferenceContainerArray.get merge method
15 years ago
orbiter 789c6b26ce added a location search service: using the following servlet/example:
15 years ago
orbiter f23cbd2dab more bugfixes to date parser
15 years ago
orbiter cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data
15 years ago
orbiter 6eba2cb96b fix in bmp parser
15 years ago
orbiter c45117f81f fixed dates in metadata
15 years ago
orbiter 0a5fd15703 :-(
15 years ago
orbiter ac16f582aa fix for http://forum.yacy-websuche.de/viewtopic.php?p=20017#p20017
15 years ago
orbiter a7d038bb7a The oai ListFriends source list becomes configurable: just write them into defaults/oaiListFriendsSource.xml
15 years ago
orbiter 5fbf866cae - fixed resumption token generation for oai-pmh import
15 years ago
orbiter fc5efcc05a enhanced and fixed OAI-PMH import
15 years ago
orbiter 455a763d7c performance hacks
15 years ago
orbiter b6cce08019 fixed a bug in rwi storage data size allocation
15 years ago
orbiter 90c3e5d6f6 - cleanup, removed unused imports
15 years ago
orbiter b18a7606a0 some performance hacks and fixed after reading dump in
15 years ago
orbiter 2bc3cba6f1 - fix for 'do not write to cache' rule.
15 years ago
orbiter 4cd5418963 removed finalize methods because of a hint in
15 years ago
orbiter cff8ed134f added index check to prevent blocking in synchronization
15 years ago
orbiter 5ab5ac80fe fix for NPE in TextParser
15 years ago
orbiter b95ae2518b fix for assert
15 years ago
orbiter 3247f0e901 fix for deadlocks caused by self-blocking access to TreeMap in concurrent environments. The TreeMap was replaced by a ConcurrentHashMap, with additional care taken that all strings are compared in lowercase
15 years ago
orbiter 027b971bde fix for concurrent quicksort: catch jobs from ThreadPoolExecutor that had been rejected because of full processing queues.
15 years ago
orbiter 8c40f1cb8e self-healing for broken table files (may cause other problems, but better than nothing)
15 years ago
orbiter 7b69d79727 enhanced remove() operation: in many cases it is not necessary to return the removed object to the caller.
15 years ago
orbiter 93ea0a4789 enhanced remove operation in search consequences (which are triggered when the snippet fetch proves that the word has disappeared from the page that was stored in the index)
15 years ago
orbiter 7a59012632 fix for NPE
15 years ago
orbiter 1a6c2f77b4 fix for NPE in statistic servlet
15 years ago
orbiter 64f29f990e a collection of performance hacks and code cleanup:
15 years ago
orbiter 8b8107b2a3 reduced IO-load and synchronization/blocking
15 years ago
orbiter ed07046870 flush only when > 3000 RWIs present + code cleanup
15 years ago
orbiter 3a50b5aa04 enhanced object hash computation
15 years ago
orbiter 1a8a134e0c continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775 and continued in SVN 6790
15 years ago
orbiter dde394a977 - shifted some computation out of synchronization to allow more concurrency
15 years ago
orbiter f204076d25 removed usage of temporary files: causes too much IO
15 years ago
orbiter 650be3599f added a time-out to the RWI cache to flush the cache if it has not been written for ten minutes. This additional dump criterion is necessary because some data sources repeat their vocabulary, so the number of words in an RWI may not increase while the number of references in the RWI set does. The RWI buffer is now flushed every 10 minutes, or later if a dump is already ongoing at that time.
15 years ago
orbiter ff6cf24b80 replaced RowSetArray in ObjectIndexCache with RowSet to reduce complexity in MergeIterator. This complexity caused too much computing overhead when the RowSetArray had become very large.
15 years ago
orbiter 55d8e686ea performance hacks
15 years ago
orbiter 2f181d0027 introduced concurrency in HTCACHE storage compression
15 years ago
orbiter 2e26744f4e more concurrency when normalizing RWI entries + cleanup
15 years ago