orbiter
1e4578aab6
VERY EXPERIMENTAL removal of index ram cache flushing thread.
...
The cache will fill up and flushed explicitely when it is full.
This shall remove double-access of assortments (indexing and flush)
during indexing process. Hopefully this should reduce IO.
The main idea is: the cache shall mainly be flushed by DHT transfer, and
only indexes that shall be hosted by the own peer are flushed to the
assortments. This needs further work.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1617 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
954f02d22e
*) Bugfix: Prevent wordIndex.getContainer() from returning and even manipulating
...
the containers from the ram cache. Return a new container instead.
*) Speedup flushFromMem by reducing the number of searches in the TreeMap
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1604 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fe39493145
changed default ranking parameters
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1582 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
365a3fff8e
fixings for ranking attributes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1569 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
8e55098b74
fixed detailed search
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1562 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0cb940a8e5
added detailed search.
...
ranking profiles do not work properly yet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1551 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c695928f7c
adopted search page to new detailed search (to be commited later)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1550 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
45323e7b76
fixed null pointer exception during search
...
see http://www.yacy-forum.de/viewtopic.php?p=16429#16429
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1547 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fb7411d7bb
re-structuring of ranking application:
...
concentration of all ranking attributes in the
plasmaSearchRankingProfile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1541 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
d98418390b
- introduced rankingProfile Class
...
- selection of ranking and timing profiles for each search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1539 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
eab1805bca
refactoring: plasmaSearchProfile -> plasmaSearchTimingProfile
...
This was made to distiguish this profile from the
(to-be-implemented) plasmaSeachOrderProfile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1538 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
6eef848954
re-design of post-ranking process
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1537 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
be77fe1a88
code clean-up
...
@Martin: bitte schaue mal warum die Variablenzuweisung
in plasmaCrawlNURLImporter war. So wie sie waren, waren sie überflüssig.
Das hattest du dir bestimmt nicht so gedacht.
Sollten es ggf. globale Variablen sein?
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1529 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0bc2aaeb42
added normalization to search attributes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1528 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
008bcb7fb8
*) simplifying code by moving closeTransferIndexes into final block
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1522 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
50d85657b8
*) new import function for IndexImport_p.html
...
- can be used to import the crawling queue (noticeUrlDB + stacks)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1518 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
214302284e
*) undoing last commit because of problems with getUpdateTime
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1514 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
408de3beee
*) avoiding to search in the treemap two times for the same key
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1513 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
139ba4e0c8
Bugfix for getCachePath(URL url)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1510 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
442807cb29
*) Bugfix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1506 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
22fd1ca9aa
*) minor changes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1505 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
6a99304b2b
*) Redesign of db import functionality
...
- restructuring to allow different import tasks to be controlled via one gui
- adding possibility to import a single assortment file
- adding possibility to set the cache size that should be used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1504 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3834675084
fixed bug that caused wrong behavior of search result preparation
...
(second search on same topic resulted in less links)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1502 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
31c8476b5d
plasmaWordIndexCache.getContainer:
...
*) Also get entries from cache
*) calculate available remaining time for backend.getContainer correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1501 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3419b3bcdd
fix for bug that caused the peer-counter problem.
...
See http://www.yacy-forum.de/viewtopic.php?p=16016#16016
The kelondroDyn now uses a generic fill character.
kelondroDyn-Tables containing peer/word/url-hashes must not use '_'
as fill character.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1498 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
4f43816ec0
*) Fix wrong class cast in indexSize()
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a7f0adf6fa
bugfix in entity iterator
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1490 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fa90c3ca7a
- removed some usage of indexEntity
...
- changed index collection process: indexes are not first flushed to indexEntity,
but now collected directly from ram cache and assortments
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1489 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
aea3e00864
cleanup: removed unused temporary index management in indexEntity.
...
This is replaced by indexContainers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1486 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
03c65742ba
changes towards the new index storage scheme:
...
- replaced usage of temporary IndexEntity by EntryContainer
- added more attributes to word index
- added exact-string search (using quotes in query)
- disabled writing into WORDS during search; EntryContainers are used instead
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1485 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
ab7a911bb3
*) Trying to solve pool not open problem
...
See: http://www.yacy-forum.de/viewtopic.php?t=1798
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1482 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
d665f3c39c
*) fixed Threadnames for stackCrawl-Threads
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1480 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
3d5347bc8e
*) changing loglevel for some messages
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1479 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
0fcd113c42
*) last bugfix part. Seems to work now for the stackCrawler
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1478 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
b9c9eaeb44
*) next try todo a bugfix :-((
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1477 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
4b4b93c413
*) next try todo a bugfix :-(
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1476 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
d9fbad71b9
*) next try todo a bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1475 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
6da97bd2e4
*) next bugfix for threadpool problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1474 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
bea2b9edee
*) further redesign of threadpools to solve too many thread problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1473 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
784fd50437
*) more verbose thread names
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1471 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
56e4dbeb71
*) displaying current active + current idle threads in PerformanceQueues_p.html now
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1470 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
859c6a88f5
*) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1467 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f2b18cede9
AND-bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1461 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
b946e28e61
some ranking enhancements
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1460 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
rramthun
6c02f889f7
Cosmetic changes.
...
Corrected version numbering as described in http://www.yacy-websuche.de/wiki/index.php/De:Versionsnummern
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1453 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
b191f06d16
*) Adding additional logging message to locate problems with stackcrawl threads
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1452 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
d9bcd73d93
*) Bugfix for exception
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1448 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
f5abfe8d57
*) more failsafe threadpools
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1446 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a56fefe0d3
added missing forced-flush for index cache
...
see http://www.yacy-forum.de/viewtopic.php?p=15732#15732
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1434 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
78bcb8014a
*) Limit range for selection of indexes for distribution to a DHTDistance of 0.2
...
(For wider ranges enough suitable targets are not probable)
*) Migrate Indexes from ClassicDB back to AssortmentCluster if transfer fails
*) Remove class iterateFiles from plasmaWordIndex
(The class iterateFiles from plasmaWordIndexClassicDB is used instead)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1430 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago