Commit Graph

666 Commits (fd7c17e62429e5672f205127803451eae6e980a0)

Author SHA1 Message Date
orbiter d98418390b - introduced rankingProfile Class
19 years ago
orbiter eab1805bca refactoring: plasmaSearchProfile -> plasmaSearchTimingProfile
19 years ago
orbiter 6eef848954 re-design of post-ranking process
19 years ago
orbiter be77fe1a88 code clean-up
19 years ago
orbiter 0bc2aaeb42 added normalization to search attributes
19 years ago
theli 008bcb7fb8 *) simplifying code by moving closeTransferIndexes into final block
19 years ago
theli 50d85657b8 *) new import function for IndexImport_p.html
19 years ago
theli 214302284e *) undoing last commit because of problems with getUpdateTime
19 years ago
theli 408de3beee *) avoiding to search in the treemap two times for the same key
19 years ago
borg-0300 139ba4e0c8 Bugfix for getCachePath(URL url)
19 years ago
theli 442807cb29 *) Bugfix for last commit
19 years ago
theli 22fd1ca9aa *) minor changes
19 years ago
theli 6a99304b2b *) Redesign of db import functionality
19 years ago
orbiter 3834675084 fixed bug that caused wrong behavior of search result preparation
19 years ago
hermens 31c8476b5d plasmaWordIndexCache.getContainer:
19 years ago
orbiter 3419b3bcdd fix for bug that caused the peer-counter problem.
19 years ago
hermens 4f43816ec0 *) Fix wrong class cast in indexSize()
19 years ago
orbiter a7f0adf6fa bugfix in entity iterator
19 years ago
orbiter fa90c3ca7a - removed some usage of indexEntity
19 years ago
orbiter aea3e00864 cleanup: removed unused temporary index management in indexEntity.
19 years ago
orbiter 03c65742ba changes towards the new index storage scheme:
19 years ago
theli ab7a911bb3 *) Trying to solve pool not open problem
19 years ago
hydrox d665f3c39c *) fixed Threadnames for stackCrawl-Threads
19 years ago
theli 3d5347bc8e *) changing loglevel for some messages
19 years ago
theli 0fcd113c42 *) last bugfix part. Seems to work now for the stackCrawler
19 years ago
theli b9c9eaeb44 *) next try todo a bugfix :-((
19 years ago
theli 4b4b93c413 *) next try todo a bugfix :-(
19 years ago
theli d9fbad71b9 *) next try todo a bugfix
19 years ago
theli 6da97bd2e4 *) next bugfix for threadpool problem
19 years ago
theli bea2b9edee *) further redesign of threadpools to solve too many thread problem
19 years ago
theli 784fd50437 *) more verbose thread names
19 years ago
theli 56e4dbeb71 *) displaying current active + current idle threads in PerformanceQueues_p.html now
19 years ago
theli 859c6a88f5 *) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem
19 years ago
orbiter f2b18cede9 AND-bugfix
19 years ago
orbiter b946e28e61 some ranking enhancements
19 years ago
rramthun 6c02f889f7 Cosmetic changes.
19 years ago
theli b191f06d16 *) Adding additional logging message to locate problems with stackcrawl threads
19 years ago
theli d9bcd73d93 *) Bugfix for exception
19 years ago
theli f5abfe8d57 *) more failsafe threadpools
19 years ago
orbiter a56fefe0d3 added missing forced-flush for index cache
19 years ago
hermens 78bcb8014a *) Limit range for selection of indexes for distribution to a DHTDistance of 0.2
19 years ago
hermens 861aae678d *) cleanup cacheAge database when cleaning up the HTCache
19 years ago
theli b4e2efef10 *) first test of new iteration function
19 years ago
orbiter eabf4a0386 fix for null pointer exception during shut-down
19 years ago
orbiter 47843e69e2 auto-reset for switchboard queue stack
19 years ago
orbiter d6581c445b added content iterator for corrupted database files
19 years ago
theli ecdc1f7547 *) Bugfix for crawling URLs with query parameters
19 years ago
orbiter fc4ae899f7 added word-position to ranking (this is only a first step)
19 years ago
orbiter bb2095fe39 assortment files are now not deleted, but shifted to a backup directory.
19 years ago
orbiter 7366e39dd3 tried to fix 100% CPU bug.
19 years ago
orbiter f14d49fae9 enhancements, bugfixes and additions to word index attribute storage
19 years ago
allo 4d33020f56 Migration to WORK
19 years ago
rramthun 1e5feedf0e Fix for http://www.yacy-forum.de/viewtopic.php?p=15547#15547
19 years ago
orbiter f4ffa9aee5 - implemented more attributes to index entries
19 years ago
orbiter 90b940e90e fixed position storage problem.
19 years ago
orbiter 0371494010 tried to add word position to index
19 years ago
orbiter f1cfee7703 removed tabs from condenser
19 years ago
hermens 37791fd529 *) Close indexEntities when "found not enough peers for distribution"
19 years ago
borg-0300 c5b6154136 added CRDistOn = true/false
19 years ago
orbiter 71d5c2b2ca better control for target peer selection for RWI transfer
19 years ago
hermens ca7407b7e1 *) Don't change maxTime if zero or negative
19 years ago
orbiter 3d7c8aaeae removed confusing method
19 years ago
orbiter 4cd0c45a77 code cleanup
19 years ago
hermens 971247b78f - rotate merged indexes after merging
19 years ago
orbiter e2ff1767b5 fix for last DHT distribution bug-fix
19 years ago
orbiter 060e5a0df0 fixed problem with DHT target peer selection:
19 years ago
theli 7c22afe3de *) Bugfix for NullpointerException in deleteOldHTCache
19 years ago
orbiter b21b9df2d0 added section headlines generation to html parser
19 years ago
rramthun c4487deba9 Minor changes collected over some time.
19 years ago
allo 6822dce57b Using Orbiters function for auth
19 years ago
orbiter 2028403670 - consolidated different orderings to kelondroNaturalOrder
19 years ago
orbiter 9544c47684 added some UTF-8 handling.
19 years ago
borg-0300 9d8dca750e BUGFIX for my last commit
19 years ago
borg-0300 5449193167 bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706 (i hope)
19 years ago
borg-0300 2a23f5d419 F..., Sorry, no time, later
19 years ago
borg-0300 3a2d13786e bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
19 years ago
borg-0300 dc0999ec9c adapted to new HTCache structure
19 years ago
orbiter 9086261476 refactoring of base64 encoding:
19 years ago
borg-0300 b24fcc8ca4 oom
19 years ago
borg-0300 7da232b5b9 HTCache Reset if necessary
19 years ago
borg-0300 4f18f24d81 small change
19 years ago
borg-0300 c652527620 YaCy removes now the old HTCACHE data
19 years ago
borg-0300 69f65210e2 ".yacy" has its own directory;
19 years ago
allo 351fffc129 DATA/WORK for user-created content
19 years ago
allo a81cc9d969 no DATA/DATA to avoid confusion.
19 years ago
borg-0300 b95c5d5781 BUGFIX for URLs how "/../" ...;
19 years ago
allo 9cce3c5709 dates Table for bookmarksdb(needed for del.icio.us api)
19 years ago
hermens 11fe95832e avoid division by zero when index transfer is extremely fast
19 years ago
allo 4ac0fd328a First Version of the Bookmarksmanager
19 years ago
theli d7b6dcbe2e *) Bugfix for MalformedURL problem if Location header is empty.
19 years ago
hermens 5b3e01bd3c avoid division by zero when importing very small indexes (<100 entries)
19 years ago
borg-0300 b7f9adc2c9 new filters added
19 years ago
theli 79667a172e *) Bugfix for additional parser problem
19 years ago
theli 8c594841a8 *) Bugfix for incorrectly indexing of URLs that were requested with Cookies in the
19 years ago
orbiter b5d02d649a fixed bug caused strange search result behaviour
19 years ago
orbiter 4500506735 fixed some bugs concerning url entry retrieval and intexControl interface
19 years ago
orbiter 83a34b838d * added Object allocation monitor on performanceMemory page
19 years ago
orbiter 4ff3d219e8 increased delay for cacheScan start and slowed down scan process
19 years ago
orbiter 3031903d50 re-design of RAM cache flush into assortment cluster
19 years ago
orbiter 0c762daf4b better startup failure handling
19 years ago
orbiter f27f9ecf15 * activated write buffer for databases.
19 years ago
orbiter c59d1b2f5e - Tests with write buffer (new class kelondroBufferedIOChunks, not yet active)
19 years ago
orbiter bb79fb5d91 - changed handling of error cases retrieving urls from database
19 years ago
theli e7d16ef831 *) Corrections in jMimeMagic MagicRule-file to detect some special rss feeds
19 years ago
theli 386d9e45d8 *) Bugfix for code cleanup
19 years ago
theli 5a1d45715d *) Bugfix for parser configuration bug
19 years ago
rramthun a1061495d4 Fixed some spelling mistakes and added some text which (should) make it easier to understand the options.
19 years ago
orbiter 0cdc58aaea fixed indexing of local domains.
19 years ago
theli e1c2d8ec5f *) Speedup "removed from queue"
19 years ago
hydrox 96930f0d2b *)added function to removed malformed URLs from urlHash.db
19 years ago
theli 8862b6ba4b *) Corrections for code cleanup 1175
19 years ago
orbiter 13fdebc50d added authentication for link deletion in search result
19 years ago
orbiter 37f88b4017 code cleanup
19 years ago
orbiter ec2b39c1ce code cleanup
19 years ago
orbiter 8f1f2daa5e implemented interactive link deletion of search results.
19 years ago
theli 6d0f7e6988 *) Adding missing file
19 years ago
theli 44fa94ac52 *) Modifications for dbImport functionality
19 years ago
orbiter dc778659fb fixed problem with time-out during result joint which caused OR behavior instead of AND beahvior
19 years ago
orbiter 3d8a5ae652 code cleanup
19 years ago
theli 64478b1f02 *) Adding possibility to delete crawler queue entries using regular expressions
19 years ago
orbiter a04930f025 code cleanup
19 years ago
low012 90b0eb144e just a typo...
19 years ago
theli 129b15f3e1 *) Correcting logging output of db importer thread
19 years ago
orbiter 420d56ce79 extended db-testing
19 years ago
orbiter ecf765ec33 temporary fix to make jrpm extension compilable with my netbeans environment
19 years ago
theli 8ed0aaae8d *) Adding content Parser for RPM Files
19 years ago
theli 818d37ce44 *) Removing getSimpleName
19 years ago
theli b35c5a48bf *) First version of urlRedirector.pl script
19 years ago
theli bdf30117c1 *) Redesign of parser configuration
19 years ago
theli d4ac3e25b1 *) Bugfix for file system link bug during detection of invalid URLs
19 years ago
orbiter adf75bc9fa better logging for invalid file path detection
19 years ago
orbiter 40621a5663 anhancements in ranking preparation and fixed problem with parser/mime recognition
19 years ago
theli c650b112ea *) Bugfix for relative URL Bug in Crawler
19 years ago
theli 4e73035aef *) Bugfix for "too many open files" during index distribution
19 years ago
orbiter f57e2d67f5 shortened network overview (less columns fit easier on page)
19 years ago
orbiter 85282b1d98 enhanced YBR recognition and search result heuristics
19 years ago
orbiter b9cc9029e3 added ybr selection for remote search
19 years ago
orbiter 0e25020f51 added first generation and usage of YBR index-files. Enhanced overall ranking of search results.
19 years ago
theli 90d6c6223b *) Adding color codes to network graphic legend
19 years ago
orbiter bfe51c7228 added generation of domain-list
19 years ago
orbiter 0ec54d9c5f enhanced CR-file handling and added first RCI-evaluation tests
19 years ago
theli c2fe3a1670 *) Updating jMimeMagic Ruleset
19 years ago
orbiter 88e3234393 fine-tuning of rci-generation
19 years ago
orbiter a12759c1bf first try to implement a rci-computation from cr-files
19 years ago
orbiter 4a8e8f269e refactoring of cr-processing; new kelondro class to handle the attribute file format
19 years ago
orbiter 24dc0e0760 implemented cr-file processing and further transmission steps
19 years ago
orbiter 9d9a87f445 limited htcache storage length
19 years ago
theli d0dfccdb77 *) Making CrawlStacker pool configurable via GUI and config file
19 years ago
theli 3631cb1f6d *) deleting empty entities during index selection
19 years ago
theli ca26aab9b1 *) More debugging output for migrateWords
19 years ago
theli 9b35ae9027 *) Correcting wrong % values on IndexTransfer_p page
19 years ago
theli e6bf9d90a5 *) Fixing Problems with MalformedURLs during Word Selection
19 years ago
theli 86a9210264 *) indexing queue slots are now configurable via config file
19 years ago
theli 3c11d7b81c *) Bugfix for minimizeUrlDB
19 years ago
orbiter 9913049009 fixed outOfMemory bug caused by loops in kelondroTree during enumeration
19 years ago
theli bbb936b9ea *) Bugfix for not human readable content of PDFs while viewing the URL Content via GUI
19 years ago
theli 445e3a620f *) Avoid rejecting of html content by the crawler when the file extension is not set properly
19 years ago
theli 444a5a9368 *) Bugfix for Entries with null url in GlobalQueue
19 years ago
borg-0300 ebac51df52 restore defaultRemoteProfile
19 years ago
borg-0300 5778428455 move cutUrlText to nxTools,
19 years ago
borg-0300 9158845c3b bugfix for snippet text null bytes
19 years ago
orbiter f763923e0a added missing files for last commit
19 years ago
orbiter 79818a320f introduced citation-rank transmission protocol and activate transport for anonymisation
19 years ago
theli 7e0647f692 *) Bugfix for userDB usage during authentication
19 years ago
orbiter 02f8013013 auto-delete of corrupted word files during word-migration
19 years ago
orbiter d2731418bf added creation of global ranking files and changed url normal form usage
19 years ago
theli 6f9f8ed8f8 *) Automatic Reset of Stack Crawler DB on startup errors
19 years ago
theli fb766413d1 *) Changes on httpc dns caching
19 years ago
orbiter bc420c62f6 fixed htcache path generation (never change a running system)
19 years ago
theli dd24f0252f *) Searchword highlighting for info page
19 years ago
borg-0300 72cde1d894 getCachePath: no logging
19 years ago
borg-0300 1fbd72f9e0 rename "index.html" to "ndx"
19 years ago
borg-0300 cd1107d85e added support for URLs with '?&'
19 years ago
borg-0300 5fb2b017cb small change
19 years ago
borg-0300 544e4ea90e small change
19 years ago
borg-0300 00ab4d8723 cleaned, small change, Properties
19 years ago
theli b8ceb1ffde *) Adding better https support for crawler
19 years ago
borg-0300 e3179a6394 added getOwnSeedFile()
19 years ago
borg-0300 a803a509ae bugfix: port handling in HTCache
19 years ago
hydrox cb69047b91 *)cleanup access static methods and fields
19 years ago
hydrox 56b9f34411 *)removed unused imports
19 years ago
orbiter 5f68b6886b introduced new url-hashes for better ranking computation
19 years ago
orbiter aadace1285 fixed network image in search performance monitor
19 years ago
orbiter bb369c98de fixed search result ordering by date
19 years ago
orbiter b058ecf0bc refactoring of image-generation; added experimental PNG encoder (not active now)
19 years ago
orbiter d42531e1b2 added auto-reset for NURL-DBs
19 years ago
allo 92c49b406b adminAuth with userDB and adminAuthenticated (fix for statuspage)
19 years ago
rramthun 27f180f24b Update of YaWoStat to 0.2.
19 years ago
orbiter d656e2b433 added a memory-profile chart generation to database performance testing
19 years ago
theli ec3af327f7 *) Bugfix for Proxy-Authentication against remote proxy
19 years ago
orbiter 5b0911d7ea added new performance menu for search sequence configuration and monitoring
19 years ago
allo ada06b0674 bugfix for Networkimage from Hydrox
19 years ago
orbiter 1aa4ba8b62 added post-search filtering of redundant urls (longer than existing cited)
19 years ago
orbiter 8d827cdb30 tried to fix problems with order of network list by last-seen (which could also improve the network picture)
19 years ago
orbiter 097009d910 experimental visualization of DHT access during global search (temporary)
19 years ago
orbiter 4dcbc26ef1 introduction of search profiles; very experimental
19 years ago
theli 6c48c3ce39 *) Bugfix for ArithmeticException during IndexTransfer
19 years ago
theli 525c8dcbd4 *) Adding Traffic Statistic for Crawler
19 years ago
theli 9a5ab62928 *) Adding yacy specific X-YACY-Index-Control header which can be used by clients
19 years ago
theli 02d9af1a70 *) Restructuring and extending of Remote Proxy Support
19 years ago