orbiter
8dd35f74c8
fixed redirect problem (does not work for POST)
...
see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1068&hilit=
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4687 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
8313d58ae7
- integrated the collage into the Web Visualization menu
...
- added a counter for the public and private queue on the page (testing..)
- fixed wrong public/private categorization
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4686 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
2617f4dcdb
Connections_p.html: better formatting and remove very old entries
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4684 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
82bf9ac1c8
- added Collage servlet from datengrab and modified it:
...
* all images are queued
* private/public is respected
* inserted into switchboard
* added collageQueue class that stores all the queued images
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4683 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
959f448e5f
- disabled redirects in proxy (so client sees real path)
...
- added connection stats (only connections currently in use)
- remove "old" connections (closed or idle for some time)
- synchronized shared parts of proxyHandler
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4682 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
8fe39ebd74
-fixed file transmission with POST. The only usage was in ranking transmission, therefore:
...
-fixed ranking transmission
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4681 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
82a9861779
fix for last commit
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4680 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
5d1fbb25e7
fix for bad deploy:
...
- the name of downloaded release files is adopted if the httpc delivers uncompressed tar.gz files (the .gz is removed from the file name)
- the deploy method is able to handle tar-file (not tar.gz-files)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4679 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
202a3adb3e
refactoring of HttpClient Writer processes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4678 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
8aa9fd8f24
HTTPC with only 1 retry
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4677 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
444dce7e81
more performance hacks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4676 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2c2dcd12a2
- enhanced performance of Eco-Tables: less time-consuming size() - operations
...
- will increase speed of indexing and collection.index creation
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4675 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
e356625b22
- refacotring of stream copy handling to support time-consuming operations
...
- made usage of BufferedStreams explizit to distinct different copy method in serverFileUtils (byte-by-byte and using an own buffer)
- introduced another timeout setting (java internal property)
- more restrictions to clients accessing a single host (a security setting to prevent DoS by mistake)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4674 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
f01c50cf8d
Proxy logging error (first step to resolution!?)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4673 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
c3342e1178
- removed class with only one static method
...
- removed connection method with too long time-out
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4672 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f97971b63b
fixed NPE problems doing a shutdown from command-line
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4671 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
7a35126e91
http timeouts von alten httpc wieder gesetzt
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4670 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2c1c3bb6eb
- some refactoring (sorry Daniel, hab in deinem Code rumgewütet)
...
- fixed broken downloads (flush was missing)
- different problem handling when download is corrupted
- different default values in yacy.init
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4669 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
d96e2badc7
- fixed POST in proxy
...
- prepared http connection tracking
- refactoring (mainly moving StreamTools to serverFileUtils)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4668 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
14404d31a8
- enhanced performance graph (more info)
...
- added conditions for rarely used logging lines to prevent unnecessary CPU usage for non-printed info
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4667 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
696b8ee3f5
fix for http://forum.yacy-websuche.de/viewtopic.php?p=6806#p6806
...
- removed all InputStream.available() because this does not work for files > 2GB
- iterator terminate when a IOException occurs
- added handling of non-executing index.add methods to enhance assert usage
- added index for file indexes > 2GB, to be used in new indexHeap
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4666 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
94d3d3a86f
fixed Proxy (for GET, POST still does not work!)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4665 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
081ed1d3ec
HTTPLoader: reduced stackTraces
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4664 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
8b2efb6f8c
fixed garbage in HTCACHE
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4663 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
225f9fd429
various fixes
...
- shutdown behavior (killing of client sessions)
- EcoFS reading better
- another synchronization in balancer.size()
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4662 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
6e36c156e8
added more logging to EcoFS
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4661 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
fb541f9162
HTTPC: default timeout half-hour
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4660 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
a94f6cdca4
HTTPC: allowed self-signed certs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4659 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
ab330cfdca
Network.html: removed ; from location
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4658 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
319144f4b2
fix for outofbounds-excception in EcoFS chunk iterator
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4657 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a9cf6cf2f4
generalization of index container-heap class.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4654 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f099061944
protection against bad dht-flush word selection
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4653 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
5e4fddc1e6
more logging for new EcoFS.ChunkIterator to find bug for
...
http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1024&hilit=&p=6806#p6806
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4652 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
117ae78001
speed enhancement for reading of eco-table indexes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4647 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
7c149a4ee8
- undo less 'binary data found'
...
- removed duplicate stackTrace
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4643 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
96cce8bed9
reduced 'Binary data found' errors
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4642 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
2aef1414f5
removed test (in yacy.init)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4641 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
5c3c1fdf41
replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4640 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
daa04f5db9
added additional check in file handler to prevent that url attacks are hidden in url path encodings
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4637 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
783a4c9edb
strong speed enhancements for the index cache dump and restore:
...
storage and loading is 30 times faster! a cache of 100000 RWIs needed 180 seconds
to store and 100 seconds to restore; now the same cache needs only 6 seconds to store and
3 seconds to restore. The cache size has decreased now by 30% (95 MB instead of 150 MB).
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4634 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
442204a1c8
fix for concurrentModificationException
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4633 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d2f4926951
- more logging for balancer to get a hint where the problem is
...
- fix for new concurrency method in kelondroSplitTable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4631 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
20dadba426
- added a deadlock prevention function in cache flushing
...
- removed unused methods in collection index
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4630 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
764a40e37d
speed enhancements for crawler and url retrieval (affects also search speed)
...
- concurrency for LURL-fetching: this can be done using a concurrent lookup into the separated url databases. Concurrency is possible because there is no IO during lookup. The more LURL-Tables are present, the better is the speedup. More CPUs will increase speed
- because a large number of LURL-lookups are made during crawling (for double-check), the LURL-Lookup speed enhancements enhances also crawling speed
- search speed also profits from LURL-lookup enhancement
- changed some flushing parameters in word index caching which should make better use of large word index caches and should speed up indexing
- removed flush chunksize parameter, because this was only useful for IO path enhancement feature which was removed some weeks ago to prevent blocking and deadlocks during search requests
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4628 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
3ce3a4a3a1
added stub for new index container heap data structure (purpose: index folding)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4627 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2c34038912
addition/correction to last commit: usage of concurrent-classes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4626 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
b2150057d2
removed unnecessary cleanup method
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4625 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
c4c0d54b22
* added regex extended blacklistengine
...
* removed my own engines
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4618 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
368593e449
enhanced the concurrency handling of indexing process (better queue size control, better data concept, better shutdown behavior)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4617 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
be58135b3e
possible fix for deadlock in search execution
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4612 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
0241d070bc
added concurrency to indexing process:
...
- the methods {parsing, semantic analysis (condensing), structure analysis (web structure)} in the serialized indexing path had been made concurrent.
- four BlockingQueues handle concurrency and hand-over of the indexing objects, the last object in the queue is stored into a blockingQueue of maximum size 1 to serialize the process for storage (which uses IO and therefore here should not be deserialized)
- a concurrency of (CPUs + 1) is default. Single-CPU users will profil from the change because large files cannot block the indexing process any more.
- removed the secondary indexing thread, which is superfluous now. Concurrency is default for all users.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4609 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
9fb5d661f2
added my Blacklistengines
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4608 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
bca87f1e38
- refactoring of serverThreads: renaming to distinguish busy-threads and blocking-threads
...
- added blockingThreads which are threads that are not driven by pause times but by BlockingQueue lookup
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4606 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
968c775025
- preparation of parsing/indexing queue for concurrent execution
...
- remote crawl receipts are now transmitted concurrently in separate threads (makes remove crawls much faster!)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4605 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9b0e20fb06
next refactoring step in document indexing to prepare concurrency environment for document parsing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4604 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7f9f639d20
- refactoring and abstraction of index reference (urls) handling: blacklisting is part of reference filtering
...
- refactoring of word/phrase handling: word abstraction from condenser becomes part of index element handling
- removed unused code parts from condenser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4603 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d6050b9ffb
- separated the LURL data storage and Crawl result stack for process supervision.
...
this is another step to enable multiple, concurrent fulltext-indexes
- another try to make the yacy-httpc more stable
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4602 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
8d6a13bc07
refactoring of parsing-condensing-indexing process:
...
- separated parts
- removed storagePeer function
next step will be parallelization of processes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4600 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d3b06913ec
protection against seed-db failure during enumeration
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4598 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
5aa96dbc36
fix for shutdown configuration
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4596 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
93633abed8
- removed some debugging code from search process - should speed up now
...
- added some profiling code to search event - more time details in PerformanceSearch_p.html
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4594 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
fba46c51d7
fixed non-termination bug in qsort
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4593 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
541b817502
refactoring of switchboard queueing
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4591 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
fc94fbe224
another improvement to the collection sorting
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4589 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
11270d450e
better quicksort-pivot computation: 30% faster (measured with test program)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4588 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
3e44293f07
- fixed a problem with thread pools in row collection
...
- added a line-viewing feature in threaddump
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4587 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
e43051b125
- fixed Threaddump output (html-escaped ie. <init>)
...
- in EcoFS converted comments to javadoc
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4586 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
433ff855f7
- fixed another concurrency problem in collection sorting
...
- fixed a typing problem that was introduced in svn 4579 and caused the crawl monitor to fail
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4585 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
19286fa2d1
tried to fix seed2.old.db-problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4584 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f3996e63b8
tried to fix more deadlocks:
...
- changed connection modes in ftpc
- replaced sort tread pool in row collections by new one using util.concurrent. the old pool had caused blockings
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4582 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
7008a218b3
avoid ConcurrentModificationException in plasmaCrawlerQueues
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4579 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7150b463ff
changed handling of default values and database paths:
...
- the default files yacy.init and for the network definition is now moved to the path defaults
- the httpProxy.conf is renamed to yacy.conf
- the DATA/INDEX/PUBLIC is renamed to the actual network nickname, which should be freeworld or sciencenet
more menu entries
- added apfelmaennchens alternative search page to the menu
- added the new thread dump page to the server log menu point as submenu
modifications
- modified the thread dump page: sorting by thread type
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4575 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
25f5035f23
typo
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4571 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7fd094fcbe
small bug in ftpc: did cot compile in Java 1.5
...
Please set compiler to Java 1.5-compliance
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4570 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
f51bad8ae5
FTP:
...
- report connection status (to break if no connection possible)
- fixed isFolder()
- additional error output
- fixed paths with encoded symbols (ie. a%20file.txt)
- refactoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4567 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
820641938e
ftpc: fixed date parsing, some refactoring
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4566 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4c584dff87
disabled soLinger to prevent that too many connections stay open (it's a TEST!)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4565 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9c989fe5f7
fixed deadlock
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4562 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
c565906050
FTP:
...
- added maxFileSize-check
- added timeout for download
- fixed dirlist (when all filenames have spaces, change to absolute links)
- enhanced isFolder()
- make sure data connection is closed, so a new can be opened
- refactoring
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4561 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
1a7870df0d
FTP: source cleanup (added finals, indention for easier diffs)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4559 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
fa1090113d
- next try to fix the networking problem:
...
set the maximum transfer size to less than MTU=1500-52: buffer size <= 1448
- some refactoring of transfer methods (naming)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4558 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d87d295c68
one more try to fix the connection problem
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4556 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a3dadcd89b
preventing that peer which return a false search result are disconnected
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4555 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
ba622bb240
addendum to svn 4553
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4554 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
5530b8e1ca
reverted changes to yacy protocol classes: they caused the sciencenet to loose connections
...
a comparisment with the main release 0.57 had been made: this showed a stable network
This is an emergency operation to ensure availability of the sciencenet network.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4553 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
b664a53553
fix for NPE during search
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4552 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
b4ed937f1e
- modified zone navigation (does still not work correctly)
...
- added dht switch in network definition
- 0.574
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4550 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
8d0470a5c6
new method to compute search history IDs
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4549 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
65785da8f2
new method for best hash computation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4548 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9eddc1506b
- one try to fix the httpd problem
...
- fix for handling of collection index that appears when removing elements
- added another navigation method (stub, not working yet)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4543 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7cc4ff05c9
some code enhancements and bugfixes
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4542 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
6788f8f7c1
fixed error 'FTPC cannot change directory'
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4531 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7ce76c8ff8
added missing file
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4530 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
bfed9c2da6
- some refactoring in search process
...
- separated sidebars in new search interface and placed them in their own files
which can be put in into the search page like plug-ins
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4529 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
3445b1e10b
*better logging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4526 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
4b0339fec0
*fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=927
...
*remove some cast
*Properties added
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4525 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
275a226cc5
refactoring
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4524 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
apfelmaennchen
bc3d3b4c97
fixed rebuildTags() to correctly rebuild folders...
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4523 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
danielr
fbe335db73
consistent use of de.anomic.server.serverMemory to get information about memory statistics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4522 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
8c06436c4a
removing the error-db upon each time a start-up is made.
...
This is necessary because the table uses a lot of RAM and the content is never re-used after Start-Up.
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4520 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4fdf695064
- fixed a bug in remote search that prevented that any results had been generated (!)
...
- added a great number of printStackTrace and new exceptions that shall be used to find the cause
for a bug in yacy client-server communication which causes the interruption of data transfer
which then causes the parser bug for the seed strings.
- tried to fix the communication bug on server-side (copy functions)
Be aware that the log may be full of errors and bugs - there should not be more bugs but there is more to see
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4519 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
0ddbed9451
Less memory consumption at start
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4518 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
1dce2f1079
more multithreading support:
...
- replaced some synchronized classes by classes from util.concurrent
- used a util.concurrent.SynchronousQueue to implement a persistent sorting thread in
the very basic kelondroRowCollection which supports sorting with a second thread
in case that a double-core processing CPU is used
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4517 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
6779b455d7
another fix for the punycode parser/generator (should work now!)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4516 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
1b127406d0
update to punycode encoding (still not working)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4515 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
83860507c9
- added punycode class from gnu idn library
...
- added parser for international domains in yacyURL
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4514 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
253a453413
removed possible synchronization deadlock
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4511 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
3f321ece7d
added a search history to the new search page
...
the history distinguishes between different users and identifies them by their ip
a history is only shown to the user who submitted the search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4510 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
c48e25d784
- fixed selection box for topwords
...
- fixed parser detail in condenser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4509 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
87a8747ce3
- enhanced recognition, parsing, management and double-occurrence-handling of image tags
...
- enhanced text parser (condenser): found and eliminated bad code parts; increase of speed
- added handling of image preview using the image cache from HTCACHE
- some other minor changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4507 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
low012
652086159a
*) Replaced System.err.println() by logging function. Left System.err.println()s as comments to be able to quickly revert changes since gzip is an application with it's own main method and Orbiter maybe wants to keep it this way.
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4505 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
677ee2ea04
added remove operation to collection index (re-activation)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4503 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d477483373
stronger criteria to use RAM copy to use table copy
...
(should use less RAM)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4502 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a7abee3578
- fixed some data types in new search stack
...
- added image domain presentation to image preview
- added new search page to menu
- added automatic re-search when an old search profile is requested and a crawl is ongoing,
to fetch newly crawled entries
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4501 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
81687b6bd5
added missing hachCode computation for previous feature
...
this solves also the missing image double-check fetaure!
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4500 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
bedd8dfbe2
- added image sorting by image size. This is the default now.
...
This is performed using a 3-stage sorting process:
- sort by relevance, then do snippet-fetch
- sort snippets by relevance then do image link extraction
- sort image links by image size; unknown sizes are handled like small sizes
- only the exact amount of images as requested are shown
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4499 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
727feb4358
- fixed some bugs in ranking computation
...
- introduced generalized method to organize ranked results (2 new classes)
- added a post-ranking after snippet-fetch (before: only listed) using the new ranking data structures
- fixed some missing data fields in RWI ranking attributes and correct hand-over between data structures
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4498 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f4c73d8c68
- fixed highslide usage
...
- some enhancement to index management, better types
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4497 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2327451653
- changed order of database initialisation (index first)
...
- removed mainly unused init-time for databases (was only used for tree tables, which are not used any more)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4496 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
3441ec3928
- some small changes to highslide integration to get it working... (does not work yet)
...
- performance enhancement for url list parser
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4495 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
6c3cd2b4f2
- added new way to watch images from the image search:
...
they appear as separate, floating window above the search results,
not in a new window
- added highslide javascript library for feature mentioned above
- removed dir servlet. This thing was not used as it was supposed to be (as an example applet)
and was a major problem for intranet-indexing when files are hosted on the same peer.
- added yacy-httpd-internal directory listing. Because YaCy is a search engine,
directory listings are similar to search result listings. Intranet indexing from the same peer
will get nice index pages for document collections.
- removed unused test applet
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4494 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
61a81820e3
- refactoring of search tracker
...
- added link to search history to repeat the search
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4493 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
9ecc17baef
fixed double Blog entrys
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4492 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
36b898ca7a
- tested successfully z-presentation of yacy seed encoding
...
- added alternative switch that takes shortest representation as yacy seed string encoding
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4491 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
066c88140f
quickfix for OOM, see http://forum.yacy-websuche.de/viewtopic.php?f=5&t=875&hilit=&p=5686#p5686
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4488 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4079c38ce0
- probably slightly better default ranking
...
- added experimental right column to new search page (no function, only container)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4487 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
8fd5e52f04
added basket icons and experimental gif animation class
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4485 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
94e256e13b
* removed single Blogview, now links direct to BlogComments.html
...
* some other small changes
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4483 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
ff5969901c
modified dir servlet to cooperate with intranet indexing from the own HTDOCS repository:
...
- removed md5 file generation (spoils the won repository)
- removed comments in file share (was never used)
- moved dir list comparator to other place (maybe solves problem, lets see)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4481 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
00f5f917de
- more refactoring to blog
...
- fixed moderate comment bug. see http://forum.yacy-websuche.de/viewtopic.php?f=9&t=860
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4478 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f890b039ee
experiments wit openstreetmaps
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4477 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7f445f34a6
bitte die Java 5 - typischen Warnings einschalten!
...
(unboxed-Fehler wies auf Programmfehler hin und Typangabe fehlte)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4476 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
c1b9a03304
* some refactoring to Blog
...
* changed default sort order to reverse (newest first)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4475 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
766a04bc06
fixed sort problem in Blog. see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=639
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4474 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
bfe171e693
Small change (generics)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4473 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
2589290ded
better ping
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4472 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
dae9053b21
BUGFIX
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4464 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
77ba446332
seedDB helpers update/cleanup
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4461 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
dd215e7f6b
NPE fix
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4460 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
bd63999801
- faster search: using different data structures that avoid multiplr calculations
...
- no more table copy for error-eco table
- optional table copy for lurl-entries
- more abstractions (less single constant strings)
- better logging (using host names instead of ips)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4459 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
8358652fa9
some small changes to blog
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4457 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
159aaf8889
re-introduced global search limitation when index receive is switched off
...
this was necessary because othervise robinson peers did also global searches, which cannot be a wanted effect
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4456 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
a9c4e9c309
Small change (ping)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4453 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
borg-0300
9ab6ad8b73
more seedDB helpers
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4452 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
6a85764e1a
Second bugfix for numberbug in Blog.
...
This update fix automatic existing blogentrys.
A backup is not needed but almost a good idea ;)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4451 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
efd5807a7c
- some renaming of variables to support DC
...
- initial 120mb RAM for fresh peers
- release 0.57
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4445 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
lulabad
40a0591942
Fixed numberbug in Blog, see http://forum.yacy-websuche.de/viewtopic.php?f=6&t=639 . This wont fix existing Blogentrys (comes later).
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4443 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
141db7ba48
there is less RAM needed for eco table (its just a security-plus for RAM check)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4442 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
249d61759a
fix for false RAM table activation in EcoTables
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4441 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
ff6b69b37e
fix for NPE in access tracker
...
fix for NPE in word index
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4439 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
3c7b94c119
- fix for online caution delay settings, see
...
http://forum.yacy-websuche.de/viewtopic.php?f=6&t=738&p=4723#p4723
- removed remote search limitation for non-dht-peers according to discussion in
http://forum.yacy-websuche.de/viewtopic.php?f=15&t=793&hilit=&p=5277#p5277
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4438 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f35a3794e0
auto-healing (deletion) of bad peer addresses during start-up
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4437 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
42c1e11f2b
added another link double-check
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4434 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a5d388bfff
fix for HTCache organisation that may have caused unlimited grow of the cache
...
appeared only for tree-caches
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4433 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
96c5e6acc7
added a double-check for search results
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4432 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a1e9e6e2e6
fix for search result page navigation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4431 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7404256997
- no more search time-out!
...
- fixed a bug with last commit
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4430 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
cd3e0d6f03
tried to fix another eco bug
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4429 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
08a12e9bb5
- removed dashed line from default skin (looks much better!)
...
- better timing when displaying results
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4428 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
89169d54fd
fixed search result preparation
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4427 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
acf771d5e1
- fixed bug with too much RAM in crawler queue
...
- fixed dir bug
- better calculation of TF for join
- better waiting-on-result logic
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4424 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
a8a5df4a51
- more dublin core naming of page metadata
...
- better presentation of result counters in search results
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4420 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
fa3b8f0ae1
fixed bug in remote search
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4419 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
7d875290b2
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4417 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9d693ee635
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4415 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
0f5c4abaca
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4414 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
974fea7933
added term-frequency ranking
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4413 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
1a296af6ff
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4412 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4a80902081
- added ViewProfile as rdf in foaf syntax
...
- added link to rdf and vCard version on html page
- can be seen on http://localhost:8080/ViewProfile.html?hash=localhash
- more generics
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4411 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
da8c850a25
disabled IO path optimization (seems to block other methods)
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4405 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
hermens
d177ceb3b3
Fix for growing responseHeader[12].db when using proxyCacheLayout = hash
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4404 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
apfelmaennchen
b1fae9b5af
fixed import Netscape Bookmarks
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4401 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2485681002
added termination control for RotateIterator
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4399 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
e2e7f065e9
minor fixes, some generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4398 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
15397298dc
- refactoring of indexControlRWIs: moved statics to own class; better Dublin Core naming
...
- fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=759&hilit=&p=4866#p4866
- some bugfixes in EcoTable according remove method
- switched more tables to Eco: crawl Profiles, htcache, seeddb, newsdb
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4397 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
apfelmaennchen
f3a9e9c542
added getFolderList() to bookmarksDB
...
added cleanTagsString() to bookmarksDB
added getFoldersString() to Bookmark
modified getTagsString() to exclude folderTags
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4383 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
db25425893
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4382 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9e7cd4fdbb
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4380 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4e70dff8cf
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4379 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
6dc679785f
- fixed bad sort behavior of kelondroRowSet, in this case: no sort at all!
...
see http://forum.yacy-websuche.de/viewtopic.php?p=4841#p4841
- some memory calculation enhancements in kelondroFlex and a little bit more logging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4378 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
0b4205eb5a
- fix double-deletion in eco tables
...
- changed behaviour of sort moment (not during a get)
- added some asserts in snippet cache for debugging
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4375 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4ce6fab428
added special handling for doubles in eco tables after initialization
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4370 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
002a109c4d
patch for http://forum.yacy-websuche.de/viewtopic.php?p=4597#p4597
...
(urls that have no protocol but start with www will be treated as http://www ...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4369 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
634430c48a
- more logging
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4368 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d372a78aef
some fixes to bring back lulabads peer..
...
see also: http://forum.yacy-websuche.de/viewtopic.php?p=4772#p4772
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4366 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
low012
f4799c2334
*) removed since I decided to turn this into a project of it's own using Perl to gather n-gram data which YaCy will be able to use
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4365 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
4ffbcd54a4
fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=754
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4358 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
apfelmaennchen
e81bced2bd
reorganized the code and adjusted getTagIterator() to suit folders
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4357 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
85dc62c16f
refactoring: more dublin core - compliant naming
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4354 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
efd0b8371a
- added parsing of Dublin Core - compliant metadata (see RFC 5013 and ISO 15836) to html parser
...
- refactoring of plasmaParserDocument to use Dublin Core - compatible property names
- redesign of url handling in parser and condenser (less String-to-yacyURL conversion)
- more generics
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4352 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
low012
cfd4fecd12
*) blanks in paths for restart and update script are replaced by backslash+blank now (see http://forum.yacy-websuche.de/viewtopic.php?t=745 )
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4351 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f945ee21d2
some security additions, keep maximum byte[] size to 2^27
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4350 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
2f3b2f3481
- extended dbtest for comparisment tests
...
- added initial space option for eco tables
- used initial space value in initialization of collectionIndex, this should avoid OOM failures" /Volumes/Magneto/dev/workspace/trunk/source/dbtest.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/kelondro/kelondroCollectionIndex.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/kelondro/kelondroDyn.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/kelondro/kelondroEcoTable.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/kelondro/kelondroRow.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/kelondro/kelondroSplitTable.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/plasma/plasmaCrawlBalancer.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/plasma/plasmaCrawlStacker.java /Volumes/Magneto/dev/workspace/trunk/source/de/anomic/plasma/plasmaCrawlZURL.java
- added index consistency check (checks for double-occurrences of primary keys in file)
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4349 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9eb746863d
interface enhancements for eco records memory statistics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4348 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
9abc927645
to fix inconsistencies in collection index, a double reference reporting mechanism has been implemented
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4347 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
58a1f518f8
fixed some problems with eco tables
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4346 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
d4d07802ac
better RAM protection using eco tables
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4345 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
f4e9ff6ce9
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4343 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
cbefc651ac
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4342 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago
orbiter
45339c3db5
more generics
...
git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@4341 6c8d7289-2bf4-0310-a012-ef5d649a1542
17 years ago