Commit Graph

823 Commits (5e165a8150df2d1a603732f451aa61392054dde5)

Author SHA1 Message Date
orbiter 7598a9e26b fix for thread dump
14 years ago
orbiter 8eef8722d1 update to ThreadDump analysis: freerunner and thread state recognition
14 years ago
orbiter 1df43b137d another performance hack
14 years ago
orbiter 7df0643f0e performance hacks
14 years ago
orbiter 813f297a95 another performance hack: re-use of known host addresses for isLocal property; avoids look-up in local hash
14 years ago
orbiter 035ebfbf3b - performance hacks (should affect the crawl balancer and reduce CPU load during crawl stack re-fill)
14 years ago
orbiter b250e6466d implemented crawl restrictions for IP pattern and country lists
14 years ago
orbiter 57d5529a01 performance hacks
14 years ago
orbiter d2ea250d99 refactoring:
14 years ago
orbiter 0c6d95e57b - more tolerance against failure of table opening
14 years ago
orbiter ce2a76d603 performance hack for search process
14 years ago
orbiter 2c4a672fe2 bugfixes and performance hacks for tabe index
14 years ago
orbiter dad5b586a4 added a concurrent warmin-up of Table data structures. that should speed-up the start-up process but may also cause stronger CPU load at that time.
14 years ago
orbiter 734059d33e performance hacks
14 years ago
orbiter dd4635e323 patches
14 years ago
orbiter 2842ce30d6 added synchronization in ReferenceContainer and logging for shrinking
14 years ago
orbiter cec3836e73 added reference limitation to IndexControlRWIs_p.html servlet
14 years ago
sixcooler ecb4986b38 refactored stuff from last commit to ReferenceContainer
14 years ago
sixcooler f7c4abfdd7 limit references per blob & term to the 100.000 youngest
14 years ago
orbiter 28f5b79deb added a fast mass-deletion method
14 years ago
orbiter 44d6416e2d ensure termination of shrink()
14 years ago
orbiter 52230a6864 replaced catching of Exception with Throwable, which catches also Errors
14 years ago
orbiter e1a3d609aa moved merger object from Segment to IndexCell to enable a correct shutdown sequence. This solves a bug where yacy cannot be shut down during an index merge that appears during the shutdown phase.
14 years ago
sixcooler d40a177c05 Generation Memory Strategy fine tuning
14 years ago
sixcooler 839f407fe4 Generation Memory Strategy fine tuning:
14 years ago
orbiter a5541751a8 - added memory computation to termlist_p.xml
14 years ago
orbiter 45e497a9bd fix for term iteration
14 years ago
orbiter 2c595a6a47 added new methods to count the number of objects in RWIs. lots of refactoring was necessary to introduce new Rating class and to unify naming of methods
14 years ago
orbiter 75df87832c refactoring/better naming of methods and classes
14 years ago
sixcooler 5f8a5ca32d - not doing merge-jobs while short on Memory
14 years ago
orbiter 965fabfb87 enhanced sorting speed (affects all DB operations)
14 years ago
orbiter 22d69a6368 refactoring in cora: added sorting package
14 years ago
orbiter 51cf697acd refactoring: moved all score-related classes to new ranking package
14 years ago
sixcooler 4fec99115b Implementation of strategies for controlling memory resources.
14 years ago
orbiter 2c58af6874 - added a short memory status simulation mode
14 years ago
orbiter c64faf41e2 addon to svn 7880
14 years ago
sixcooler 411ed159f8 do some extra sleep while running low on memory
14 years ago
sixcooler 07f5954570 try better handling of corrupt blobs
14 years ago
orbiter 0a3ab7da1b do not sort concrrently the same array
14 years ago
orbiter 44d74f8f89 performance hacks for seed generation (because thread dumps showed multiple occurrences at these code points)
14 years ago
sixcooler 5cd07d7f84 early freeing resources on deleting index reference if search-verification fails (aka Switchboard.cleanupJob)
14 years ago
sixcooler 9170a434ed throwing an exception again in FileUtils.copy(reader, writer)
14 years ago
sixcooler 916d79111e Runtime.maxMemory() DOES change @ runtime:
14 years ago
orbiter 1f300217f8 more protection for the cleanup thread
14 years ago
orbiter d13103a0a7 changed the way how the index cache is flushed: do not flush when a put was made because that could cause that many put calls synchronize for a long time when the dump or a merge is performed. Instead a watchdog thread is doing the dump and therefore puts cannot block any more which is good when a put happens during a search result preparation.
14 years ago
orbiter 6a6f27eaf3 do not sort arrays again if arrays are already sorted
14 years ago
orbiter 3d043ce9d6 - refactoring
14 years ago
orbiter 48b78e9ff4 disabling concurrency in new sort since that is not working yet correctly
14 years ago
orbiter 62ac73a108 fixed bugs and deadlocks in core database indexing structures:
14 years ago
orbiter 1912d0cccc changed handling of RowSet element retrieval: until today all elements had been copied from the underlying byte[] arrays into a new Entry object that again had a copy of a portion of that byte[] in its own bye[]. There was an option to just refer to the underlying byte[] with a pointer but that was almost never used. This commit now changes an interface to the Row class where it is now necessary to tell if a copy is always required. Fortunately the copy is only needed in very rare cases. That means that this change should cause much less memory allocation; it is expected that this happens especially during search situations.
14 years ago
orbiter bb8e3f8523 code cleanup
14 years ago
orbiter 11dc653de3 added a visualization of peer pings to the performance graphic
14 years ago
orbiter 6d2e252bcf fix for:
14 years ago
orbiter b666a929e7 fixed Semaphore handling in case of interruptions
14 years ago
orbiter 267290a821 removed the semaphores from the cache dump process because I believe some of the semaphores may be lost somewhere which then causes that the cache is never flushed and then the peer dies from a OOM. The re-introduced synchronization may not be the best solution but should ensure that the caches are flushed.
14 years ago
orbiter f803da8aae code cleanup
14 years ago
orbiter 31283ecd07 - added a search option to filter only specific network protocols. i.e. get only results from ftp servers. Just add '/ftp' to your search.
14 years ago
orbiter 7db208c992 performance hacks: more pre-allocated StringBuilder
14 years ago
orbiter 996f0a8764 disabled assert in Base64Order which eats away too much performance during testing with -l
14 years ago
orbiter f30d36b101 enhanced template engine
14 years ago
orbiter 0c1b29f3c9 - applied many small performance hacks
14 years ago
orbiter fe0c08455b more concurrency (enhancement) hacks
14 years ago
orbiter 87082f407e less String object creation during search
14 years ago
orbiter a36fda991e hack to increase speed of url hash computation
14 years ago
orbiter dbea40d536 - changed snippet fetch strategy logic: do not check if entry is in cache. This should reduce IO load on the HTCACHE which is a showstopper during large number of search requests
14 years ago
orbiter 4bea3f9714 hack to reduce resource contention caused by massive UTF8 decodings which use java.nio resources:
14 years ago
orbiter 746e3c3b06 Replaced a widely-used Property Object in the httpd with HashMap<String, Object> which is not synchronized like Properties
14 years ago
orbiter e28bd0d038 fix for some possible causes of memory leaks
14 years ago
orbiter 09ba6814c0 - non-blocking word hash computation with dynamic digest object generation (this was important!)
14 years ago
orbiter 10e2f588f8 - enhanced ybr ranking computation
14 years ago
orbiter bd55dcee50 - commented out experimental distributed ranking loading
14 years ago
orbiter 3ed4a09368 small features, some bug fixes and performance hacks
14 years ago
orbiter b45701d20f this is a re-implementation of the YaCy Block Rank feature
14 years ago
orbiter d27a0a67ff fix in log initialization according to hint from Dominic
14 years ago
orbiter 123375bfba added a new yacy protocol servlet 'idx'. This returns an index to one of the data entities that is stored in YaCy.
14 years ago
orbiter 5b579e21a3 code cleanup
14 years ago
orbiter 039126cfaf better handling of on/off switched solr indexing
14 years ago
orbiter dc54915df4 fix for very bad compare
14 years ago
orbiter deda54d684 - relaxed matching of string-search (this is now case-insensitive)
14 years ago
orbiter b77b8cac0c - enhanced html parser: recognized much more details in the content
14 years ago
orbiter 17530ca7b5 fix for bug http://bugs.yacy.net/view.php?id=10
14 years ago
orbiter 0430a94eaa the location search shows now not re-evaluated locations but only such locations that are attached as metadata to web pages
14 years ago
orbiter 8412f8787d fix for http://bugs.yacy.net/view.php?id=8
14 years ago
orbiter 9b25d07295 - added geo information parsing to html parser
14 years ago
orbiter b1a8d0c020 enhancements to web cache and less strict caching rules
14 years ago
f1ori df71776929 * fix bug #7
14 years ago
orbiter 78d4c45d09 enhancement during search process: fast fail of search in case that all index feeder have terminated.
14 years ago
orbiter 2b5f8585bf performance hack for Balancer and ip address parsing
14 years ago
orbiter b1d133b69f another anhancement to the ThreadDump function: better multiple dumps and filtering out of not interesting dump parts
14 years ago
orbiter a35d513bd8 fix for not-deleted .gap and .idx files
14 years ago
orbiter 859c99886c fix for multiple thread dump
14 years ago
orbiter 61acf55da4 avoided using a synchronized(this) for the hash computation to prevent that the lock on the object is (accidently) stolen by another thread and replaced this synchronization using the protocol object. Made also the protocol object final.
14 years ago
orbiter c2a968c23f fix for bug in formatting in ThreadDump
14 years ago
orbiter 078ecacf61 avoid synchronization in DigestURI hash requests
14 years ago
orbiter 1989ebc24b removed more warnings
14 years ago
orbiter 0324de1467 removed debug line
14 years ago
orbiter 1aba7869bf patch for Windows: do not use the thread lock feature from previous commit if used on Windows
14 years ago
orbiter 0a11727374 added new feature for Thread dump:
14 years ago
orbiter a07a1a8b1e removed type cast warnings
14 years ago
orbiter e6c3507b17 disabled some of the previous changes (did not work in openjdk)
14 years ago
orbiter f9e5c21083 update to thread dump logs
14 years ago
orbiter 8f11d3a5bb redesigned the ScoreMap classes:
14 years ago
orbiter a564230c48 more enhancements against blocked threads occurred in seed age evaluation (blocks httpd in some cases)
14 years ago
orbiter dc0db3550e avoid string conversion
14 years ago
orbiter 694fa3a2a5 - replaced more direct string-based UTF-8 conversions by predefined UTF-8 conversion
14 years ago
orbiter 30aed9824a moved getBytes() to UTF8.getBytes() to use a default String encoding
14 years ago
orbiter 3820525464 more memory protection: auto-flush of caches in case of memory shortage
14 years ago
orbiter 96bb33ed9b added default size to StringBuffer in logger (and it is not possible to replace the StringBuffer with a StringBuilder...)
14 years ago
orbiter e1b6916423 always try to guess the size of a StringBuilder to prevent too many memory re-allocations
14 years ago
low012 3b40b98256 *) set SVN properties
14 years ago
orbiter 619b561a4a enhanced secondary search: index abstracts decompression is now much faster and does not cause strong CPU load after several searches with more than one word
14 years ago
low012 bf27a72d53 *) set SVN properties
14 years ago
low012 b649ce2dd7 *) minor changes
14 years ago
orbiter 70a996a06c reverted SVN 7557 because these classes are called using reflection. The class declaration is in the log configuration. Without these classes you get errors during runtime and a non-formatted log output, i.e.:
14 years ago
orbiter cb1f49d0f2 replaced all 'new String' with default encoding (missing) or UTF-8 encoding with a String generation method that uses a pre-defined Charset constant for UTF-8. This avoids a cache-lookup for the Charset object using String hashing of the String 'UTF-8'.
14 years ago
low012 9d366ee9d7 *) removed unused code (I assume that most of the code was really dead, but if you need any of the classes, tell me and I will put it back in.)
14 years ago
orbiter 7138f4036b less synchronization, better thread dump tool
14 years ago
orbiter 8d14916c74 more patches for a better out-of-memory management
14 years ago
orbiter ce0c8247fc removed (most probably!?!) superfluos System.err output
14 years ago
orbiter 799c534935 one more patch again OOM during secondary remote search
14 years ago
orbiter f8d0454c53 small bug fixes and experiments with search speed enhancement
14 years ago
orbiter 993b9bc1a8 memory/performance hacks, less synchronization, better concurrency
14 years ago
orbiter 42d90664f3 - fixed a memory leak in the httpc.post method (no finish)
14 years ago
orbiter 38dce547c0 better concurrency (less locking on date formatting) more logging and minor bug fixes
14 years ago
orbiter b1781d7aae some more performance hacks
14 years ago
orbiter b2f147d28e performance hack: excluded map encoding in many cases from synchronization block, especially when doing an iteration
14 years ago
orbiter 5e186e0122 continuing the fight against deadlocks during time formatting: better caching.
14 years ago
orbiter 1110d16af9 performance hack: replaced generic row.getColBytes() call with row.getPrimaryKeyBytes() where the column is 0
14 years ago
orbiter 19b2a50578 - enhanced date formatter cache
14 years ago
orbiter 48a61c39a3 speed hacks in BLOB ArrayStack:
14 years ago
orbiter ad7fcb9d61 Enhanced Base64Order transformation: less overhead (transformation between StringBuilder and byte[])
14 years ago
orbiter 0ce17d823a - fixed bug in ordering
14 years ago
orbiter dec4f36700 - fix for missing favicons in search widgets
14 years ago
orbiter 804ae2275b - do not delete idx and gap files if the heap is not modified
14 years ago
orbiter 5e45ded8e2 - removed locks from WordReference
14 years ago
orbiter cd19d0517e added dns resolve to HTTPClient POST using a dns cache to prevent that that not-thread-safe built-in dns cache inside apache http client is used
14 years ago
orbiter af87af0d4c - removed synchronization in serverSwitch which should improve speed
14 years ago
orbiter d84b4a072e healing for some OOM problems
14 years ago
orbiter 6083f2f171 fix for (false) oom
14 years ago
orbiter fe93caac5a added flags and administration options to show advanced search and to show search result attributes (for each search result)
14 years ago
orbiter 431f780f41 patch for bad data in url metadata
14 years ago
orbiter 0cdfb82963 replaced more appearance of double values by float values
14 years ago
orbiter eb12e15738 moved all Double values to Float values because of
14 years ago
f1ori 982aa689ef * fix StringIndexOutOfBoundException in WebStructureGraph
14 years ago
orbiter 090c73e32e catch a OOM in HeapReader iteration
14 years ago
orbiter feefe17568 npe assert fix
14 years ago
orbiter 733903f2c9 fix for http://forum.yacy-websuche.de/viewtopic.php?p=21489#p21489
14 years ago
orbiter 10ae8d961b - cora package has now no dependencies to other yacy packages and becomes a 'base' package (refactoring)
14 years ago
lotus b1484299b2 same units for memory observer configuration (MiB)
14 years ago
orbiter 387db84087 maybe found bug in non-working index dumper
14 years ago
orbiter a4c9d27287 - moved some variables from Stwitchboard to new class AccessTracker
14 years ago
orbiter cdfe8afe3f fix for really bad table iteration implementation: reduction of IO
14 years ago
orbiter b2ed4cfaf8 more small bugfixes and light refactoring
14 years ago
orbiter e753027c43 fix for http://forum.yacy-websuche.de/viewtopic.php?p=21439#p21439
14 years ago
orbiter bf4ef1513e - fix for map view
14 years ago
orbiter 56264dcc17 - added CamelCase parser to MultiProtocolURI: generate better to-be-indexed words from urls
14 years ago
orbiter 99a7fe87f9 - removed old intranet scanner (the generic scanner now completely subsumes the old one)
14 years ago
orbiter a563b05b60 enhanced crawler:
14 years ago
orbiter db99db4be9 some redesign of the search-fail-response mechanism:
14 years ago
f1ori 4915d1781a * use local backup-file, if remote network-definition is not availible
14 years ago
orbiter f0651e5f2f added image search to yacyinteractive.html
14 years ago
orbiter a9f754c45f removed unused CR accumulation and distribution process
14 years ago
low012 9b3fae9496 *) cleaning up the code a little bit
14 years ago
orbiter 321eb012fe removed two warnings and reverted one change
14 years ago
low012 eb79b952ef *) cleaner code
14 years ago
sixcooler b87bf88ac8 using less memory on merging and rewriting blobs
14 years ago
orbiter 4c50d3428e smaller file size for array stacks to support smaller deletion sizes
14 years ago
orbiter becc463d8a enhanced did-you-mean
14 years ago
orbiter 93c535d111 fixed http://forum.yacy-websuche.de/viewtopic.php?p=21113#p21113
14 years ago
orbiter 445619f3ec added a submenu ConfigHTCache_p.html to set the size of the HTCache separately from the proxy configuration.
14 years ago
f1ori 7d8de34778 * add a bit documentation to DigestURI, use DigestURI(string) instead of DigestURI(string, null)
14 years ago
orbiter ed4371dcf3 enhanced navigation implementation and enhanced tag cloud computation
15 years ago
orbiter ca738ac924 - added a tag cloud to search results (using the topics)
15 years ago
orbiter e4d561971e added more score cluster options and made score cluster usage more transparent
15 years ago
orbiter 7cd9d9d22a - enhanced DidYouMean computation using a faster count on index entries; this causes that results can be ranked better
15 years ago
orbiter 24f1cba7b2 performance hacks:
15 years ago
orbiter d607b30b6a performance enhancements for search and code review for database functions
15 years ago
orbiter 0d363a94d7 more performance hacks
15 years ago
orbiter 091dd3f6ec - enhanced intranet search speed
15 years ago
orbiter aacf572a26 - enhancements for search speed
15 years ago
orbiter 2c549ae341 fixed a number of small bugs:
15 years ago
orbiter e54cb7fb0c more bugfixes (also for latest commit)
15 years ago
orbiter be6b48311c misc bugfixes
15 years ago
orbiter 48c0d508ac fixes for crawling of smb links (file length not always available)
15 years ago
orbiter 09c208a3ab patch for corrupted database files (just work on and forget key)
15 years ago
orbiter 97ee278931 enhanced search speed:
15 years ago
orbiter 8da4eb5de6 addition to patch in SVN 7111
15 years ago
orbiter 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification
15 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
15 years ago
orbiter 0cf006865e refactoring and enhanced concurrency
15 years ago
orbiter 83ac07874f - corrected return value of put() methods (not used anywhere, so it did not harm before)
15 years ago
orbiter 14c843d364 more performance hacks
15 years ago
orbiter 39f409a7bb performance hacks
15 years ago
orbiter 906c572621 - enhanced index create menu structure
15 years ago
orbiter 64860dc1bb enhanced search event logging (to be used for further improvements)
15 years ago
orbiter 7dbc357593 patch to identify corrupted database files
15 years ago
sixcooler 17eebd4ef8 counting crawler traffic again:
15 years ago
orbiter 32f73d1aaa added copy for Info.plist for Mac application release updates (this file contains class paths and start parameters)
15 years ago
orbiter 570ca577c6 performance hacks
15 years ago
orbiter 348dece62f redesign of the SortStack and SortStore classes:
15 years ago
orbiter 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
15 years ago
orbiter 24502fe3de performance hacks
15 years ago
orbiter d865ef77a8 removed re-read of index in case of a bad index. This may not solve the problem but it applies a 100% CPU problem on the peer. I'm afraid bad index files must be abandoned, and cannot be fixed this way.
15 years ago
orbiter b2c9db48ea Performance enhancement
15 years ago
orbiter e8228fba09 less locking in time format computation, caching and during secondary (remote) search evaluation
15 years ago
orbiter 9c0c94683c because of a bug in search result caching count search results had not been generated as fast as possible.
15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
15 years ago
orbiter 65eaf30f77 redesign of crawl profiles data structure. target will be:
15 years ago
orbiter 4f22e2df41 bugfixes for
15 years ago
orbiter 42414a6ae3 added two more tables in rss reader interface:
15 years ago
orbiter 0010cd9db1 Support for indexing of RSS feeds!
15 years ago
orbiter 0f276dd63f - MapHeap now implements Map<byte[], Map<String, String>>
15 years ago
orbiter cf07b34c2d implemented the Map interface in the ARC classes so it will be possible to instantiate ARCs as
15 years ago
orbiter c60d0282fd more abstraction for tables stored in heaps:
15 years ago
orbiter d1be64d491 removed wrong assert
15 years ago
orbiter 3197ca42ed preparations to move the HTCache into cora:
15 years ago
orbiter 844f158686 - removed dependencies in header framework:
15 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
15 years ago
orbiter 7aa860c505 - more logging
15 years ago
orbiter 66ac3a7d9d corrected database row iteration
15 years ago
orbiter 70dd26ec95 added the new crawl scheduling function to the crawl start menu:
15 years ago
orbiter 5a994c9796 added a scheduler based on API actions
15 years ago
orbiter 189a986ebd - modified api-call interface to record api calls with references to api-call database (carries pk)
15 years ago
orbiter 054c22e2c6 added TLDs from http://www.opennicproject.org
15 years ago
orbiter 7fdb17bb96 redirect uncaught exceptions to logging + small other changes
15 years ago
orbiter a82a93f2fc - better url double check in crawler
15 years ago
orbiter a835a22b32 fixed isLocal() property (better recognition of intranet hosts)
15 years ago
orbiter 301a59e07f moved browser access method from kelondro/util/OS to gui/framework/Browser
15 years ago
orbiter 6388a58fc7 better memory management and slightly less (in total and temporary) RAM allocation:
15 years ago
orbiter 5924a0d851 - enhanced concurrency in database index access for multicore
15 years ago
orbiter 9ab06bc333 enhancement in sorting efficiency (database root operation): less object allocation
15 years ago
mikeworks b12db14b9f Added Generics to new net.yacy.upnp.* classes to eliminate compiler warnings
15 years ago
orbiter 60caade056 removed debug output
15 years ago
orbiter dec1419bc3 ;-)
15 years ago
orbiter 22dbbcfa56 better (and corrected) recognition of intranet and internet-addresses. This corrects the isLocal property that is used by network definitions to restrict index ranges to local and global addresses. Address locations (intranet or internet) had been partly identified by the top level domain of the host address. Since intranet addresses can also be addressed using a host name that is in a country domain it is necessary to do a dns resolving for each check. The check is supported by a local dns cache so the intranet/internet check should not affect network traffic too much. To ensure that the cache works properly the cache class was upgraded to better concurrency data structures.
15 years ago
low012 dc5f0e357c *) fixed SVN properties
15 years ago
low012 01d6b952f0 *) minor changes for easier to read code, no functional changes
15 years ago
orbiter 25024d6ab2 fix for problen when accessing the metadata index. The index was not available for all peers with no RAM table copy.
15 years ago
orbiter b6fb239e74 redesign of parser interface:
15 years ago
orbiter 150cf42a1b migrated all my LGPL 3 -licensed files to the LGPL 2.1 because LGPL 3 is not compatible to the GPL 2
15 years ago
orbiter 37b8827a7a - removed the UPnP library sources from sbbi and added the jar library again. The library was included to get support for fedora releases, but after this time the fact that the sbbi cannot be part of fedora should be re-discussed. If this will still not be possible, then we may integrate the sbbi UPnP package using reflection.
15 years ago
orbiter 777195e8d1 more abstraction for access of LoaderDispatcher and cache
15 years ago
orbiter 7e2d6fac12 patch for bad values during local search join
15 years ago
orbiter 986d4f34d9 added a consistency check for new queues
15 years ago
orbiter fbf021bb50 redesign of index abstract processing - currently disabled until enough peers have fix in SVN 6928
15 years ago
orbiter 87087f12fe - scanned remote search process and enhanced some data structure and synchronizations here and there
15 years ago
orbiter de4f30bb2e UTF-8 fix
15 years ago
orbiter 3a1cebb598 bugfixes
15 years ago
orbiter 51332b787d reverted SVN 6869 as discussed with dulcedo in car after LinuxTag:
15 years ago
orbiter b03caaa57a better handling of OOM situations
15 years ago
orbiter 60e71876ad - more abstraction (HashMap -> Map)
15 years ago
orbiter a83772c71b fixes and enhancements for balancer:
15 years ago
orbiter 9cde05418f fixed url crawl list display
15 years ago
orbiter 11639aef35 - added new protocol loader for 'file'-type URLs
15 years ago
orbiter 6950d8a33d fixes to SMB crawler
15 years ago
orbiter 1defd580bc - added option to localization search to distinguish between a search for a location according to the search word only or for the relation between a web search results and locations found in the metadata fields
15 years ago
orbiter 118d589eff replaced the very very old data structure 'Records' with a simple table to fix the problem from
15 years ago
orbiter 2a8f70f0ca - fix for caching of OSM tiles. if you want that this fix applies to your peer, please delete the crawl profiles
15 years ago
orbiter 439b44be9e removed exit from computation in ReferenceContainerArray.get merge method
15 years ago
orbiter 789c6b26ce added a location search service: using the following servlet/example:
15 years ago
orbiter f23cbd2dab more bugfixes to date parser
15 years ago
orbiter cf43bdc87e This is a large bugfix and enhancement commit to support a better location detection for data
15 years ago
orbiter c45117f81f fixed dates in metadata
15 years ago
orbiter 0a5fd15703 :-(
15 years ago
orbiter ac16f582aa fix for http://forum.yacy-websuche.de/viewtopic.php?p=20017#p20017
15 years ago
orbiter fc5efcc05a enhanced and fixed OAI-PMH import
15 years ago
orbiter 455a763d7c performance hacks
15 years ago
orbiter b6cce08019 fixed a bug in rwi storage data size allocation
15 years ago
orbiter 90c3e5d6f6 - cleanup, removed unused imports
15 years ago
orbiter b18a7606a0 some performance hacks and fixed after reading dump in
15 years ago
orbiter 4cd5418963 removed finalize methods because of a hint in
15 years ago
orbiter cff8ed134f added index check to prevent blocking in synchronization
15 years ago
orbiter b95ae2518b fix for assert
15 years ago
orbiter 027b971bde fix for concurrent quicksort: catch jobs from ThreadPoolExecutor that had been rejected because of full processing queues.
15 years ago
orbiter 8c40f1cb8e self-healing for broken table files (may cause other problems, but better than nothing)
15 years ago
orbiter 7b69d79727 enhanced remove() operation: in many cases it is not necessary to return the removed object to the called.
15 years ago
orbiter 93ea0a4789 enhanced remove operation in search consequences (which are triggered when the snippet fetch proves that the word has disappeared from the page that was stored in the index)
15 years ago
orbiter 7a59012632 fix for NPE
15 years ago
orbiter 1a6c2f77b4 fix for NPE in statistic servlet
15 years ago
orbiter 64f29f990e a collection of performance hacks and code cleanup:
15 years ago
orbiter 8b8107b2a3 reduced IO-load and synchronization/blocking
15 years ago
orbiter ed07046870 flush only when > 3000 RWIs present + code cleanup
15 years ago
orbiter 3a50b5aa04 enhanced object hash computation
15 years ago
orbiter 1a8a134e0c continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775 and continued in SVN 6790
15 years ago
orbiter dde394a977 - shifted some computation out of synchronization to allow more concurrency
15 years ago
orbiter f204076d25 removed usage of temporary files: causes too much IO
15 years ago
orbiter 650be3599f added a time-out to the RWI cache to flush the cache if it has not been written for ten minutes. This additional dump criteria is necessary because some data sources repeat their vocabulary and may cause that the number of words in a RWI does not increase while the number of references in the RWI set increases. Now the RWI Buffer is flushed every 10 minutes or later if at that time already a dump is ongoing.
15 years ago
orbiter ff6cf24b80 replaced RowSetArray in ObjectIndexCache with RowSet to reduce complexity in MergeIterator. This complexity caused too much computing overhead when the RowSetArray had become very large.
15 years ago
orbiter 55d8e686ea performance hacks
15 years ago
orbiter 2f181d0027 introduced concurrency in HTCACHE storage compression
15 years ago
orbiter 2e26744f4e more concurrency when normalizing RWI entries + cleanup
15 years ago
orbiter aa083fc45c try to get a fix for OOM problem in case that there is no real problem with missing memory.
15 years ago
orbiter 70e6222978 more concurrency during search requests
15 years ago
low012 dc93cec3a8 *) Java 1.5 compatibility (see http://forum.yacy-websuche.de/viewtopic.php?f=8&t=2764)
15 years ago
orbiter 67ec58d8e7 search performance enhancement
15 years ago
hermens ef467a0303 Another workaround for the second part of http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2770
15 years ago
orbiter 25aef069a6 continuing String-hash - to - byte[]-hash redesign that was started in SVN 6775
15 years ago
orbiter 1e8e79b9ef redesign of reference hash (URL-hash) parameter hand-over:
15 years ago
orbiter 72d8e9897b removed unnecessary cache flush call in backend of BufferedRecords
15 years ago
orbiter 749ffbd642 - added another catch case for the index dump and index merge process that should cause non-blocking behavior in case that index dump and/or index merge caused any unexpected exception.
15 years ago
orbiter 312ca5d917 removed flush at end of every rwi entry since this reduces the write performance.
15 years ago
orbiter 0018163c07 moved table row/column matching method from front-end to back-end
15 years ago
orbiter 31e29a8831 - removed synchronization during index dump and index cleaning
15 years ago
orbiter bb63c5d075 using a Pattern object with precompiled regular expressions to apply must-match constraints to search results: should speed up pre-sorting of search results and should cause richer search result sets
15 years ago
orbiter 90dd197ae7 - no latency for local crawls
15 years ago
orbiter 36bd843ece for for RFC5322 comformance as suggested by Quix0r in http://forum.yacy-websuche.de/viewtopic.php?p=19585#p19585
15 years ago
orbiter 748abfcffa added patches to prevent yacy-protocol DoS settings
15 years ago
orbiter e820ed061a avoiding excessive DNS lookups to determine localhost
15 years ago
orbiter 0f8004f9da enhanced html parser to recognize a href tags inside header tags
15 years ago
orbiter 1198b9989d bugfixes, more sorttable
15 years ago
orbiter ae2f3f000f better handling of table copy abandon .. prevent memory leak
15 years ago
orbiter 0769517129 added a robots.txt monitor in the crawler monitor submenu
15 years ago
orbiter de01fe0e6d fix for bug in url parser
15 years ago
orbiter 1bbe14d23f SVN 6716 unfortunately contained parts of the unfinished SMB integration. To fix compile errors the remaining parts of the SMB implementation stub is added with this commit.
15 years ago
orbiter 884b262130 - added a new Wiki Namespace Navigator
15 years ago
orbiter 270fb38674 - fixed some bugs in Table viewer
15 years ago
orbiter 727dd9b193 - fixed a bug in robots.txt parser
15 years ago
sixcooler cd6de83905 next try for for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2703
15 years ago
sixcooler bfe4693e9a fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2703
15 years ago
orbiter 564927ce72 redesign of CrawlResult data structures because of OOM occurrences during URL deletion processes.
15 years ago
orbiter 30c8185139 fix for sid check
15 years ago
orbiter ef62d017e5 integrated session id filtering for crawler
15 years ago
orbiter d8d9984913 added framework for session id filtering (not ready yet)
15 years ago
orbiter 2bc36de336 - fix for bug in svn 6669
15 years ago
orbiter d378ca4604 better handling of concurrency in seed
15 years ago
orbiter 6538043d89 fix for http://forum.yacy-websuche.de/viewtopic.php?p=19189#p19189
15 years ago
sixcooler e071d71f19 fix for yacy-banner-network-values
15 years ago
sixcooler 787b588c33 reverted a part of svn6636:
15 years ago
lotus 11188cd7eb resource observer now uses the Java 6 method to check for free space. thus, disk observing now needs Java 6 installed.
15 years ago
sixcooler 089877f32c my first commit - hopefully fix for merge problem
15 years ago
orbiter d6391f2537 better handling of rewrite cases where the resulting rewrite blob entry is equal in size
15 years ago
orbiter ef9473d92c added another sixcooler suggestion: recycle corrupted records
15 years ago
orbiter fe78edac32 - view API calls in correct date-order
15 years ago
orbiter 308a973503 refactoring of tables data organisation
15 years ago
orbiter ada0ce9de3 refactoring of bookmarks: there is a big performance problem in the bookmarks code and furthermore the bookmarks
15 years ago
orbiter 3751ab4ae2 added sixcoolers patch and more checks/removed unnecessary code
15 years ago
orbiter d8d8562c59 fill key with zeros during normalization
15 years ago
orbiter 24060885b6 - added Tables abstraction in data.Tables.java
15 years ago
orbiter 7fdf59a77f misc NPE check
15 years ago
orbiter 4403304957 bugfix for list()
15 years ago
orbiter 0098e6e859 bugfix for heap iterator
15 years ago
orbiter db19a941cf added new image index storage classes (not integrated yet)
15 years ago
orbiter 8ce936bcdd added an api recording function: it shall be possible to record
15 years ago
orbiter e80e060ca6 - increased thread priority for server threads
15 years ago
orbiter f6731c6240 more logging etc.
15 years ago
orbiter 4f1f4863c4 fix for deadlock when initializing a SplitTable with a file of size 0, see also:
15 years ago
orbiter cc5dcf69ff missing change for last commit
15 years ago
orbiter ca1ef9a079 fix for http://forum.yacy-websuche.de/viewtopic.php?p=18584#p18584
15 years ago
orbiter 938e806182 tried to fix date problem that may have prevented that foreign peers stay in the network
15 years ago
orbiter 5df628a2a4 - added BEncoder class
15 years ago
orbiter 82f57f79e5 more PMD enhancements
15 years ago
orbiter a06f7ddb33 more PMD recommendations
15 years ago
orbiter eb79ceb3ff update to kelondro data structures
15 years ago
orbiter 18172451a0 better search computation:
15 years ago
orbiter 2113fcd7e5 - fixed usage of isEmpty() which is not available in java 1.5
15 years ago
orbiter dd459281c8 applied code changes that are recommended by PMD
15 years ago
lotus eac2daf2e8 * reenable DHT if yet enough memory is available
15 years ago
orbiter d77a8f3b3e added some modifications recommended by PMD for better performance
15 years ago
orbiter 7f20963b41 add-on to last commit
15 years ago
orbiter eeca2ded92 fix for http://forum.yacy-websuche.de/viewtopic.php?p=18500#p18500
15 years ago
lotus 32972139af added nice configuration for the resource observer
15 years ago
hermens 574f49903e Prevent blob merge from possibly losing the last container
15 years ago
orbiter 83d05e9176 added sixcoolers hack with some modifications:
15 years ago
orbiter d0b7bf9ca2 added a decoder class for Bencoding
15 years ago
low012 028657f019 *) adding more SVN properties
15 years ago
low012 82d740050f *) adding more SVN properties
15 years ago
low012 e04cb8cef0 *) adding more SVN properties
15 years ago
low012 dcb1096fb0 *) adding more SVN properties
15 years ago
low012 7d610e0063 *) minor changes
15 years ago
lotus 9bee0ac780 more logging for DHTrule
15 years ago
orbiter c14233a933 fix for a OOM in MapView that can cause unavailability of
15 years ago
orbiter 37245430c3 fix for NPE during DHT RWI selection
15 years ago
orbiter 959b38b61b fix for memory tracker
15 years ago
orbiter a37878b7d5 url parser regex performance hack
15 years ago
orbiter 362b7a929b added extensive memory protection logic to avoid out of memory errors that may be caused by the RowCollection memory allocation function
15 years ago
orbiter 8281e29963 - more configuration for profiling graph (number of events)
15 years ago
lotus 713cb26a27 update for memory observer algorithm
15 years ago
orbiter 29fde9ed49 better control of ranking order in sort stack
15 years ago
orbiter 93caa38d55 fix for bug in SortStack (did not appear to shrink according to required size) - caused bad and unsufficient search results
15 years ago
orbiter e34e63a039 preset of proper HashMap dimensions: should prevent re-hashing and increase performance
15 years ago
orbiter 4a5100789f replaced _all_ size() == 0 with isEmpty() and all size() > 0 with !isEmpty(). The isEmpty() method is much faster in some cases, especially when used to access badly balanced hashtables where an size() operation becomes a large iteration.
15 years ago
orbiter 491ba6a1ba - some refactoring in workflow
15 years ago
orbiter 23aef43786 - better synchronization in SortStack
15 years ago
orbiter 7b1f5b0430 - better media search ranking
15 years ago
orbiter 4df88a4e7a - fixes for missing or bad hashCode computation
15 years ago
orbiter dbdf2570ba added comparator and more fixes for SortStack/SortStore
15 years ago
orbiter 1dff620181 Better implementation of SortStack and SortStore and adoptions in all using classes to implement the necessary Comparable interface and hash code computation.
15 years ago
orbiter 4c6312d103 enhanced image search
15 years ago
orbiter fd0658ce7c avoid forced execution of InetAddress.getLocalHost() at startup, because that hangs at some strangely declared linux configurations. The Domains.localHostAddresses object is first instantiated with a more simple logic and enriched with more host addresses using a concurrent thread that will not block a startup process.
15 years ago
orbiter 013f337d3f - avoid unnecessary host name lookups for localhost
15 years ago
orbiter 5afd9f7a91 fix for crlf writing
15 years ago
orbiter 2d3c98b742 less computation within synchronized blocks
15 years ago
orbiter 29fe436e36 - fixed post-ranking including prefer mask
15 years ago
orbiter 5399d1e2bc refactoring (reason: get more abstraction to use the blacklist class; for integration in other servlets)
15 years ago
orbiter 534182559c removed concurrency hacks from SplitTable because it showed deadlock-like situation.
15 years ago
orbiter 54c54fb144 get a handle for grep: 'StackTrace'
15 years ago
lotus 6edc168cfe option to disable dht by memory limit:
15 years ago
orbiter 4431b9767e added about 450 replacements for printStackTrace() methods to pipe such traces into the log at DATA/LOG/
15 years ago
low012 e77c906673 *) minor changes mainly in comments
15 years ago
low012 f1740edbf8 *) added skript to change memory settings, password and port (experimental, don't blame me if it messes up your configuration)
15 years ago
orbiter b0b7a4f9a5 - added function to OAI-PMH reader that can pull all records from a server using an evaluation of the resumption token to get URL to retrieve remaining records
15 years ago
lotus 58616d99e4 patch for yacy disk usage detection on lvm host
15 years ago
orbiter a0e891c63d - some redesign in UI menu structure to make room for new 'Content Integration' main menu containing import servlets for Wikimedia Dumps, phpbb3 forum imports and OAI-PMH imports
15 years ago
orbiter 77c99e500f added more control over memory allocation
15 years ago
orbiter 3528b970d6 - refactoring
16 years ago
orbiter b79f4f062f refactoring of yacy documents and parsers: they depend now only on the kelondro classes
16 years ago
orbiter c864901087 - moved httpd.mime to defaults path
16 years ago
orbiter 6192205533 more final modifier
16 years ago
orbiter 0f6b011e1a fix for new index location and better way to use own classes by reflection
16 years ago
orbiter 7a3bbd950f :-(
16 years ago
orbiter b953f04f90 one more reflection fix
16 years ago
orbiter 77d6604856 fix for npe, see http://forum.yacy-websuche.de/viewtopic.php?p=17727#p17727
16 years ago
orbiter 2a7fe35f92 performance tuning using more final modifiers in the kelondro core
16 years ago
orbiter cb4de9ceee fixed a bug in table iterator (did not recognize elements in write buffer)
16 years ago
orbiter e7f18ba24b refactoring
16 years ago
orbiter ce8dc575ca refactoring
16 years ago
orbiter bea3b99aff moved table and util classes
16 years ago
orbiter bd876eb4b7 moved io classes
16 years ago
orbiter c0e0e1f422 moved blob classes
16 years ago
orbiter 1e4f8b56ed accumulated classes from different packages into the new rwi package
16 years ago
orbiter 194da25a2f moved kelondro index
16 years ago
orbiter 4446acc8cd moved kelondro order
16 years ago
orbiter f677d534b1 start of a really extensive refactoring which will produce a hierarchical package structure with the domain yacy.net as package root
16 years ago