orbiter
91dca2cd8d
fixed a bug in last commit: LURL entries cannot be written,
...
because a stored property was not set to false (but true)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1906 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3286b1f498
re-organisation of lurl-creation and -stacking
...
this was necessary to prevent useless write to the database
in case of blacklist appearance of the url
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1905 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0b903c5317
removed usage of kelondroNaturalOrder from plasmaCondenser to experimental
...
exclude cause of a 100% bug.
see http://www.yacy-forum.de/viewtopic.php?p=19076#19076
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1900 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
4239db0d1c
fixed new ordering for backup iterator TreeSet
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1899 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
33eba5ecb8
temporary disabling last change, does not work (cannot debug right now)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1896 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f0464042fc
fix for latest iterator-replacement-fix:
...
iterator generated TreeSet which did not resprect rotations
this has now be implemented using kelondroOrder Objects
and by adding this rotation-rules to the ording
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1895 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
ec21c585cb
try to fix path too long
...
see http://www.yacy-forum.de/viewtopic.php?p=19079
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1893 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a6a3f4b694
fix for svn 1888
...
this is a redesign of the no-iterator solution
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1892 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
8da13088e9
*)removed multiple DHT_Distribution_Threads
...
*)boosted DHT_Distribution sending chunk parallel to multiple peers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1890 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
283a7181c6
try to fix new 100% cpu bug, possibly caused by iterator method
...
see http://www.yacy-forum.de/viewtopic.php?p=18900#18900
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1888 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f588c0724f
removed cache flush in case of DHT receive
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1885 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
e94b374d56
update to cache flush method
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1884 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
bcd99fe83e
introduced a second RAM cache for DHT transfer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1880 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
360a460da8
*)URL-Cleaner: moved logging-statement to correct position
...
*)plasmaURLPattern: host is now added to the hashset in lowercase
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1879 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
02f9765013
quickfix for time problem during cache restore
...
see http://www.yacy-forum.de/viewtopic.php?p=18810#18810
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1878 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
ad119f06af
*) Don't overwrite new entries with older ones
...
see: http://www.yacy-forum.de/viewtopic.php?t=2015
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1874 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
be88687d8c
fixed some problems with new cache flush karenz
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1873 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
d3da7c9a08
*) Adding support for robots Allow directive
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1872 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
f046e1814a
*fix or last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1869 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
c55c51e2a8
*)added keywords to IndexCleaner_p.java
...
*)updated Logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1868 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
ddbeda738e
added minimum age of word in cache to performance menu
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1866 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f188611fc6
apply blacklist on rwis during dht receive
...
very experimental!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1865 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0ec28b8f8e
added DBCleaner from Hydrox
...
see http://www.yacy-forum.de/viewtopic.php?p=18093#18093
The servlet is now named IndexCleaner_p.
See http://localhost:8080/IndexCleaner_p.html
The Servlet was adopted to fit in the overall architecture
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1863 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
fb4100d47b
*) undoing last commit.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1856 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
a84cc71218
*) removing getTotalRuntime
...
- not needed anymore
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1855 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
auron_x
dce08771d1
*) Fix for wrong estimated and elapsed times when import was paused
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1850 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
b34713324a
DBImport: remove words from source index even if nothing has been added to home index
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1849 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
520b60f15b
fix for http://www.yacy-forum.de/viewtopic.php?p=18610#18610
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1841 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
bae3783d38
added a snippet marking
...
(search words are now bold in snippets)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1823 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f0a38873eb
* added yacysearch page with better view on search results
...
the old search page is obsolete and will be removed
* ConfigBasic.html is now the default page instead of index.html
as long as no password is set
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1815 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f0041d504d
remove of several results from a single domain is stopped if the result set is smaller than the wanted number of results
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1811 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
89286478e7
*) removing thread pool eviction for now. Not needed at the moment
...
See: http://www.yacy-forum.de/viewtopic.php?p=18290#18290
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1801 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
759800f543
*) Bugfix for storeHTCache problem
...
- content was not indexed if storeHTCache was off
See: http://www.yacy-forum.de/viewtopic.php?p=18269
See: http://www.yacy-forum.de/viewtopic.php?t=1882
See: http://www.yacy-forum.de/viewtopic.php?t=241
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1800 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a8548c0484
* several bugfixes regarding basic configuration
...
* extended number of search target peers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1794 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
1b9b8922d9
* fixed problems with new basic 1-2-3 configuration (now authentication required)
...
* fixed graphics problem
* fixed some other problems with default values
* 1-2-3 config now appears automatically on start-up if no password is set
* added new config menu
* moved profile to new config menu
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1792 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
auron_x
8c6f38fe70
*) added Blog to YaCy (atm not reachable through interface) -> Blog.html
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1790 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
ce5274c194
yacybot user agent
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1786 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
351bd0a678
*) dbImport: convert cacheSize to kb when creating plasma* objects
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1773 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
eaffcfefe2
* added more ranking attributes (without function; this will be added later)
...
* added ranking coefficient transmission to remote peer (without evaluation on server side, will be added later)
* changed ranking coefficients slightly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1770 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
87e90b9d8c
refinements in ram cache flush procedure and default timing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1768 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
d31a4e0b4f
some small enhancements with cache flushing parameters and data structures
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1767 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3703f76866
- fixed re-search bug: after a search with several words, a second search could not
...
find the same words as before. This was caused because indexContaines stored the url references
with a hashtable. A tree was needed to work with the index conjunction-by-numeration
- added permanent ram cache flush (again)
- removed direct flush of ram cache after a large container is added.
this happens especially during DHT transmission and therefore this fix should
speed up DHT transmission on server side.
- removed unused and out-dated methods
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1765 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
fbbbf5f411
*) remote trigger for proxy-crawl
...
- remote crawling can now be enabled for the proxy crawling profile
See: http://www.yacy-forum.de/viewtopic.php?p=17753#17753
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1758 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
dc9174c809
*) Implementing snippet fetching via ajax
...
Snippets that are not available on page load time will be fetched using ajax requests.
see: http://www.yacy-forum.de/viewtopic.php?p=16479
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1748 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
1d8ca6e082
serialized dhtChunk deletion with indexing
...
The dht selection, transmission and deletion is now completely serialized with indexing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1731 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
2336f0f013
*) allow pausing/resuming of crawlJob Threads separately
...
- pausing/resuming localCrawls
- pausing/resuming remoteTriggeredCrawls
- pausing/resuming globalCrawlTrigger
See: http://www.yacy-forum.de/viewtopic.php?t=1591
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1723 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
60dac4325e
serialized indexing with dht selection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1719 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a840755964
moved parts of index transfer logic back to switchboard
...
this is needed to merge the dht selection with the indexing thread
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1718 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
134253a603
fixed bug with cache flush
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1717 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c2d863855d
different flush limit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1713 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
64441b1f78
ADDED: yacy.badwords list to filter the topwords
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1711 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f9063e2040
added some synchronization to avoid that several tasks can trigger a cache flush simultanously
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1708 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
2c4e4ae6a2
further refactoring of dht selection, transfer and flushing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1707 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
73dad68cf1
outsourced thelis DHT flush class into own file
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1706 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
aa4b04e3dd
reverted last change
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1705 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
4b0dae8fcf
added a possiblity to get the ranking values for an url.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1703 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
85ac7d8386
* moved DHT transfer thread to own class file, needed for further modularization
...
* changed status handling
* added forced cache flush when cache has containers with too high number of index entries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1702 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
7df2e6e571
bugfix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1700 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
cd41e9a0eb
moved DHT index selection to new object that holds indexes to be send away to other peer.
...
This was made to make it possible that RWI selections can be serialized with indexing.
Serialization will be implemented in another step.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1698 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
42a5f56723
*) Bugfix for broken dht thread configuration
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1695 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
f95d98142f
*) displaying amount of items in the existsIndex caches
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1679 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
e2af2a3f45
*) it's now possible to run more then one indexDistribution-Thread
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1673 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
40dd6ec4fd
*) experimental restructuring of db import function
...
- trying to reduce IO load by avoiding unnecessary db access
- trying to presort url list
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1671 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
2da18ab359
*) correcting logging output
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1667 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
8ffc6e35ad
*) correcting logging output
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1665 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
980e986b64
*) Re enabling short cycle for already removed nurl entries
...
See: http://www.yacy-forum.de/viewtopic.php?p=17147#17147
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1660 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
3b6328ad02
*) Consistent use of minCount for index transfer
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1645 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
0b60b9bf51
*) Remove entries from AssortmentCluster before reinserting the rest into the ramCache
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1640 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
8ab1d6ff4b
*) fixed NullPointerException in plasmaWordIndexEntity
...
See: http://www.yacy-forum.de/viewtopic.php?t=1921
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1638 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
a26574c894
Migration from tagName as key to wordhash(tagName) as key for bookmarkTags.db
...
(just deleting the old db, rebuildTags does the rest)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1637 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
7eb10675b3
re-organization of index management
...
this was done to be prepared for new storage algorithms
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1635 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
1e4578aab6
VERY EXPERIMENTAL removal of index ram cache flushing thread.
...
The cache will fill up and flushed explicitely when it is full.
This shall remove double-access of assortments (indexing and flush)
during indexing process. Hopefully this should reduce IO.
The main idea is: the cache shall mainly be flushed by DHT transfer, and
only indexes that shall be hosted by the own peer are flushed to the
assortments. This needs further work.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1617 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
954f02d22e
*) Bugfix: Prevent wordIndex.getContainer() from returning and even manipulating
...
the containers from the ram cache. Return a new container instead.
*) Speedup flushFromMem by reducing the number of searches in the TreeMap
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1604 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fe39493145
changed default ranking parameters
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1582 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
365a3fff8e
fixings for ranking attributes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1569 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
8e55098b74
fixed detailed search
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1562 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0cb940a8e5
added detailed search.
...
ranking profiles do not work properly yet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1551 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c695928f7c
adopted search page to new detailed search (to be commited later)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1550 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
45323e7b76
fixed null pointer exception during search
...
see http://www.yacy-forum.de/viewtopic.php?p=16429#16429
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1547 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fb7411d7bb
re-structuring of ranking application:
...
concentration of all ranking attributes in the
plasmaSearchRankingProfile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1541 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
d98418390b
- introduced rankingProfile Class
...
- selection of ranking and timing profiles for each search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1539 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
eab1805bca
refactoring: plasmaSearchProfile -> plasmaSearchTimingProfile
...
This was made to distiguish this profile from the
(to-be-implemented) plasmaSeachOrderProfile
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1538 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
6eef848954
re-design of post-ranking process
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1537 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
be77fe1a88
code clean-up
...
@Martin: bitte schaue mal warum die Variablenzuweisung
in plasmaCrawlNURLImporter war. So wie sie waren, waren sie überflüssig.
Das hattest du dir bestimmt nicht so gedacht.
Sollten es ggf. globale Variablen sein?
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1529 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0bc2aaeb42
added normalization to search attributes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1528 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
008bcb7fb8
*) simplifying code by moving closeTransferIndexes into final block
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1522 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
50d85657b8
*) new import function for IndexImport_p.html
...
- can be used to import the crawling queue (noticeUrlDB + stacks)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1518 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
214302284e
*) undoing last commit because of problems with getUpdateTime
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1514 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
408de3beee
*) avoiding to search in the treemap two times for the same key
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1513 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
139ba4e0c8
Bugfix for getCachePath(URL url)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1510 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
442807cb29
*) Bugfix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1506 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
22fd1ca9aa
*) minor changes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1505 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
6a99304b2b
*) Redesign of db import functionality
...
- restructuring to allow different import tasks to be controlled via one gui
- adding possibility to import a single assortment file
- adding possibility to set the cache size that should be used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1504 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3834675084
fixed bug that caused wrong behavior of search result preparation
...
(second search on same topic resulted in less links)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1502 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
31c8476b5d
plasmaWordIndexCache.getContainer:
...
*) Also get entries from cache
*) calculate available remaining time for backend.getContainer correctly
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1501 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3419b3bcdd
fix for bug that caused the peer-counter problem.
...
See http://www.yacy-forum.de/viewtopic.php?p=16016#16016
The kelondroDyn now uses a generic fill character.
kelondroDyn-Tables containing peer/word/url-hashes must not use '_'
as fill character.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1498 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
4f43816ec0
*) Fix wrong class cast in indexSize()
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1495 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a7f0adf6fa
bugfix in entity iterator
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1490 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fa90c3ca7a
- removed some usage of indexEntity
...
- changed index collection process: indexes are not first flushed to indexEntity,
but now collected directly from ram cache and assortments
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1489 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
aea3e00864
cleanup: removed unused temporary index management in indexEntity.
...
This is replaced by indexContainers
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1486 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
03c65742ba
changes towards the new index storage scheme:
...
- replaced usage of temporary IndexEntity by EntryContainer
- added more attributes to word index
- added exact-string search (using quotes in query)
- disabled writing into WORDS during search; EntryContainers are used instead
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1485 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
ab7a911bb3
*) Trying to solve pool not open problem
...
See: http://www.yacy-forum.de/viewtopic.php?t=1798
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1482 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
d665f3c39c
*) fixed Threadnames for stackCrawl-Threads
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1480 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
3d5347bc8e
*) changing loglevel for some messages
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1479 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
0fcd113c42
*) last bugfix part. Seems to work now for the stackCrawler
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1478 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
b9c9eaeb44
*) next try todo a bugfix :-((
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1477 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
4b4b93c413
*) next try todo a bugfix :-(
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1476 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
d9fbad71b9
*) next try todo a bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1475 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
6da97bd2e4
*) next bugfix for threadpool problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1474 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
bea2b9edee
*) further redesign of threadpools to solve too many thread problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1473 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
784fd50437
*) more verbose thread names
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1471 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
56e4dbeb71
*) displaying current active + current idle threads in PerformanceQueues_p.html now
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1470 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
859c6a88f5
*) testing various thread pool eviction settings to avoid outOfMemory - Thread creation problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1467 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f2b18cede9
AND-bugfix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1461 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
b946e28e61
some ranking enhancements
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1460 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
rramthun
6c02f889f7
Cosmetic changes.
...
Corrected version numbering as described in http://www.yacy-websuche.de/wiki/index.php/De:Versionsnummern
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1453 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
b191f06d16
*) Adding additional logging message to locate problems with stackcrawl threads
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1452 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
d9bcd73d93
*) Bugfix for exception
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1448 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
f5abfe8d57
*) more failsafe threadpools
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1446 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
a56fefe0d3
added missing forced-flush for index cache
...
see http://www.yacy-forum.de/viewtopic.php?p=15732#15732
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1434 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
78bcb8014a
*) Limit range for selection of indexes for distribution to a DHTDistance of 0.2
...
(For wider ranges enough suitable targets are not probable)
*) Migrate Indexes from ClassicDB back to AssortmentCluster if transfer fails
*) Remove class iterateFiles from plasmaWordIndex
(The class iterateFiles from plasmaWordIndexClassicDB is used instead)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1430 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
861aae678d
*) cleanup cacheAge database when cleaning up the HTCache
...
*) Log directory deletes with level Fine
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1427 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
b4e2efef10
*) first test of new iteration function
...
ATTENTION: please don't use it at the moment
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1418 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
eabf4a0386
fix for null pointer exception during shut-down
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1415 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
47843e69e2
auto-reset for switchboard queue stack
...
bugfix for http://www.yacy-forum.de/viewtopic.php?p=15684#15684
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1414 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
d6581c445b
added content iterator for corrupted database files
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1406 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
ecdc1f7547
*) Bugfix for crawling URLs with query parameters
...
See: http://www.yacy-forum.de/viewtopic.php?p=14065
*) Preparation for http://www.yacy-forum.de/viewtopic.php?t=1719
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1405 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
fc4ae899f7
added word-position to ranking (this is only a first step)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1395 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
bb2095fe39
assortment files are now not deleted, but shifted to a backup directory.
...
See also: http://www.yacy-forum.de/viewtopic.php?p=15458#15458
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1394 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
7366e39dd3
tried to fix 100% CPU bug.
...
See http://www.yacy-forum.de/viewtopic.php?p=15569#15569
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1393 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f14d49fae9
enhancements, bugfixes and additions to word index attribute storage
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1392 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
4d33020f56
Migration to WORK
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1389 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
rramthun
1e5feedf0e
Fix for http://www.yacy-forum.de/viewtopic.php?p=15547#15547
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1388 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f4ffa9aee5
- implemented more attributes to index entries
...
- implemented hand-over of new word index attributes during remote search
- implemented word-distance computation during search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1382 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
90b940e90e
fixed position storage problem.
...
Now the word position is properly stored.
No use of that now, but can be used for better ranking.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1378 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0371494010
tried to add word position to index
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1377 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f1cfee7703
removed tabs from condenser
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1376 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
37791fd529
*) Close indexEntities when "found not enough peers for distribution"
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1375 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
c5b6154136
added CRDistOn = true/false
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1372 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
71d5c2b2ca
better control for target peer selection for RWI transfer
...
see also http://www.yacy-forum.de/viewtopic.php?p=15343#15343
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1370 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
ca7407b7e1
*) Don't change maxTime if zero or negative
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1363 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3d7c8aaeae
removed confusing method
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1339 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
4cd0c45a77
code cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1337 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
971247b78f
- rotate merged indexes after merging
...
see: http://www.yacy-forum.de/viewtopic.php?t=1717
- fix -rwihashlist to correctly shutdown
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1336 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
e2ff1767b5
fix for last DHT distribution bug-fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1330 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
060e5a0df0
fixed problem with DHT target peer selection:
...
- shifted selection in front of distribution
see http://www.yacy-forum.de/viewtopic.php?p=15131#15131
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1327 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
7c22afe3de
*) Bugfix for NullpointerException in deleteOldHTCache
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1326 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
b21b9df2d0
added section headlines generation to html parser
...
can be viewed in cache control, but is not yet included to indexing
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1320 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
rramthun
c4487deba9
Minor changes collected over some time.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1319 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
6822dce57b
Using Orbiters function for auth
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1315 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
2028403670
- consolidated different orderings to kelondroNaturalOrder
...
- added another iteration method to rwihash-enumeration
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1309 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
9544c47684
added some UTF-8 handling.
...
hope this will help somehow.. for shure not THE solution to our UTF-8 problem
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1308 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
9d8dca750e
BUGFIX for my last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1306 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
5449193167
bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706 (i hope)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1304 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
2a23f5d419
F..., Sorry, no time, later
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1303 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
3a2d13786e
bugfix for http://www.yacy-forum.de/viewtopic.php?t=1706
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1302 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
dc0999ec9c
adapted to new HTCache structure
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1290 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
9086261476
refactoring of base64 encoding:
...
the kelondro database needs specific information about the order of
base64-encoded keys. Since no other package depends on base64
(only the httpd uses base64 for encryption, but does not need to encode these strings)
it is good to move base64 encoding to the new ordering classes in kelondro.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1284 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
b24fcc8ca4
oom
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1281 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
7da232b5b9
HTCache Reset if necessary
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1280 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
4f18f24d81
small change
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1278 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
c652527620
YaCy removes now the old HTCACHE data
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1277 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
69f65210e2
".yacy" has its own directory;
...
glad new year :)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1275 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
351fffc129
DATA/WORK for user-created content
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1274 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
a81cc9d969
no DATA/DATA to avoid confusion.
...
increasing version number
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1273 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
b95c5d5781
BUGFIX for URLs how "/../" ...;
...
new port handling;
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1271 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
9cce3c5709
dates Table for bookmarksdb(needed for del.icio.us api)
...
Files in DATA/DATA
Migration: move bookmarks.db from SETTINGS in DATA
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1270 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
11fe95832e
avoid division by zero when index transfer is extremely fast
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1269 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
allo
4ac0fd328a
First Version of the Bookmarksmanager
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1248 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
d7b6dcbe2e
*) Bugfix for MalformedURL problem if Location header is empty.
...
See: http://www.yacy-forum.de/viewtopic.php?p=14325#14325
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1247 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hermens
5b3e01bd3c
avoid division by zero when importing very small indexes (<100 entries)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1238 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
borg-0300
b7f9adc2c9
new filters added
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1231 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
79667a172e
*) Bugfix for additional parser problem
...
See: http://www.yacy-forum.de/viewtopic.php?p=14146#14146
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1221 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
8c594841a8
*) Bugfix for incorrectly indexing of URLs that were requested with Cookies in the
...
Request header
See: http://www.yacy-forum.de/viewtopic.php?p=14077
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1214 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
b5d02d649a
fixed bug caused strange search result behaviour
...
(results from remote peers had not been saved propery after search)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1213 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
4500506735
fixed some bugs concerning url entry retrieval and intexControl interface
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1212 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
83a34b838d
* added Object allocation monitor on performanceMemory page
...
* added some final statements
* changed shutdown sequence order
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1211 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
4ff3d219e8
increased delay for cacheScan start and slowed down scan process
...
to provide more time to other tasks
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1210 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3031903d50
re-design of RAM cache flush into assortment cluster
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1209 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0c762daf4b
better startup failure handling
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1205 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
f27f9ecf15
* activated write buffer for databases.
...
This should increase IO performance and reduce HD activity
* bugfixes for new exception-on-failure policy
* bugfixes for new IOChunks
* new Object pool for database write-buffer
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1204 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
c59d1b2f5e
- Tests with write buffer (new class kelondroBufferedIOChunks, not yet active)
...
- minor bugfixes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1203 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
bb79fb5d91
- changed handling of error cases retrieving urls from database
...
(no more NULL values are returned, instead, an IOException is thrown)
- removed ugly damagedURLS implementation from plasmaCrawlLURL.java
(this inserted a static value into the Object which is not really a good style)
- re-coded damagedURLS collection in yacy.java by catching an exception and evaluating the exception message
to do:
- the urldbcleanup feature must be re-tested
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1200 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
e7d16ef831
*) Corrections in jMimeMagic MagicRule-file to detect some special rss feeds
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1196 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
386d9e45d8
*) Bugfix for code cleanup
...
- Code must be in finally block, otherwise it does not work if an error occurs!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1193 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
5a1d45715d
*) Bugfix for parser configuration bug
...
- it was not possible to disable all parsers
See: http://www.yacy-forum.de/viewtopic.php?t=1579
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1191 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
rramthun
a1061495d4
Fixed some spelling mistakes and added some text which (should) make it easier to understand the options.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1187 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
0cdc58aaea
fixed indexing of local domains.
...
see http://www.yacy-forum.de/viewtopic.php?p=13680#13680
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1186 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
e1c2d8ec5f
*) Speedup "removed from queue"
...
See: http://www.yacy-forum.de/viewtopic.php?p=13442#12188
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1183 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
hydrox
96930f0d2b
*)added function to removed malformed URLs from urlHash.db
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1182 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
8862b6ba4b
*) Corrections for code cleanup 1175
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1179 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
13fdebc50d
added authentication for link deletion in search result
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1177 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
37f88b4017
code cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1176 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
ec2b39c1ce
code cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1175 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
8f1f2daa5e
implemented interactive link deletion of search results.
...
next steps: attach voting and restrict to administrator
to see the deletion button, move the mouse pointer to the left of a search result
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1172 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
6d0f7e6988
*) Adding missing file
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1171 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
44fa94ac52
*) Modifications for dbImport functionality
...
- dbImporter threads are now shutdown by the switchboard on server shutdown
- adding possibility to pause a importer thread via GUI
- Bugfix for abort function
See: http://www.yacy-forum.de/viewtopic.php?p=13363#13363
*) Modification of content parser configuration
- now it's possible to configure which parsers should be enabled for the proxy,
crawler, icap, etc. separately
-
*) htmlFilterContentScraper.java
- adding regular expression to normalize URLs containing /../ and /./ parts
*) httpc.java
- adding functionality to unzip gzipped content
- requested by roland: should be used later to allow gzipped seed lists
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1170 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
dc778659fb
fixed problem with time-out during result joint which caused OR behavior instead of AND beahvior
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1167 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
orbiter
3d8a5ae652
code cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1166 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago
theli
64478b1f02
*) Adding possibility to delete crawler queue entries using regular expressions
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@1160 6c8d7289-2bf4-0310-a012-ef5d649a1542
19 years ago