yacy_search_server

Commit Graph

Author	SHA1	Message	Date
orbiter	d755a8026d	- better OOM protection - better memory allocation for FlexTable indexes - splitting between static index and dynamic index (only the dynamic part must grow) - to enable a merge-iteration of new splittet index, a huge number of classes needed to be adopted for new iterator classes - added new iterator classes that support cloneable iterators - adopted all iterator classes to implement cloneable itarators git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3453 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	2be405e1e1	- fix for last two commits git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3452 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	de1b4a1731	- don't publish news if empty or equal page is submitted in wiki git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3451 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	dcc13abd59	- fixed small bug at home page, button "peer's console" - fixed <fieldset><dl> for safari on many pages - added Blog-link to Network page git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3450 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	6596167277	*) bugfix for wrong RSS feed pubDate formats git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3449 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	0d178d00a5	*) adding RSS feed for peer messages git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3448 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	4f2e6ef47b	- WatchCrawler_p shows max. 80 characters of URLs now (maybe dynamically adjustable based on browser width?) - typo in BlacklistCleaner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3445 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	70cd391ea1	fix for dl/fieldset problem in Safari git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3444 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	5741701b59	moved crawl start up, personal web pages down in main menu git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3443 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	b627c77df6	- workaround for safari bug with definition lists inside fieldsets in ConfigBasic - alternative can be seen in PerformanceMemory, where a dl is simulated with a table layout git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3442 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	03c5906ae7	- minor bugfixes for url-fetcher & http://www.yacy-forum.de/viewtopic.php?t=3646 - PerformanceMemory_p.html is valid XHTML again git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3440 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	8e9bee12fc	*) adding guid to yacysearch.rss git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3435 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	1cba31de43	redesigned ram organization for database caches - each cache can now allocate as much memory as is available - no more fixed limits - replaced old performance memory monitor by new one - added supervision methods as static functions into the classes that provide cache functionality - steering of ram allocation is done with two simple limits that are ram availability-relative git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3434 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	e934c5b09b	*) wrong blog rss feed titel git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3433 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
theli	ceed0364e2	) Blog RSS: Image added ) RSS Feed for YaCy Bookmarks added git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3432 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
low012	ce360ef43e	) no more HTML in plasmaCrawlProfile.java anymore ) <br> will not be displayed in items in Auto Filter Content on WatchCrawler_p.html anymore *) removed unnecessary replaceHTML() git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3425 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	93e1ad2bca	- fix for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3424 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	88245e44d8	- improved version of robots.txt (delete your old htroot/robots.txt before updating): - robots.txt is a servlet now - no need to rewrite the whole file each time a section is added or removed - user-defined disallows, added manually, won't be overwritten anymore - new config-setting: httpd.robots.txt, holding names of the disallowed sections git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3423 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	9623bf7bbe	- removed call of java 1.5 method - added config servlet for local robots.txt - removed YPStats_p as it is of no use anymore - supertemplates use XHTML now - quick-fix for http://www.yacy-forum.de/viewtopic.php?p=32296#32296 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3422 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
daburna	f4c13b422c	*updated translation git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3421 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	1fe505f0b0	- adapted User_p to general web-interface style (and removed status-only page on changes) - beautified WikiHelp.html + typos - IP hasn't been set correctly in Blog.xml git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3418 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	92b6bc0ad2	- fixed wrongly applied replacement of "<" and ">" in Blog and simplified the code a bit - added check, whether active blacklist engine is supported by blacklist cleaner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3417 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	a1d68fe092	- use .class rather than Class.forName for classes in class-path - added Bost's patch for Diff.findDiagonale() from: http://www.yacy-forum.de//files/patch_685.txt - fixed minor bugs in Blog git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3416 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f4cfd19835	second Generation of collection R/W head path optimization: - permanent cache flush is switched off. The optimized cache flush works better if it is a large number of collections that is flushed together - the flush size can be configured instead the flush divisor. There is only one size for all flushes - collection records that shall be removed during collection transition (jump from one collection file to another) are now not really removed but only marked in RAM. add-operations to the collection use these marked collection spaces - index bulk write operations are now separated for each file of a kelondroFlex git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3414 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	e92e8b2ae3	*) added RSS-Feed for blog git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3413 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	a107961099	) fixed blog-comment-deletion without admin-rights is no longer possible ) fixed no empty blog-comments anymore git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3412 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
(no author)	cf47075855	CSS corrects git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3410 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	116fc016d0	*) fix for Blogcomment-Preview git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3408 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	54fef3574f	*) missing files for last commit git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3406 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	cb89c74d52	) added blog-comments ) removed debug-output when deleting news git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3405 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	6fbe31425a	- some code-cleanup (no more syntax-warnings here) - added deletion from loadedURLs of URLs to be blacklisted in IndexControl_p git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3404 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	31ad42535a	- added buttons to add complete domain or single URL to blacklist to IndexControl_p git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3400 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e0decf4653	- added support for changing invalid entries in blacklist cleaner git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3397 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	c58ef48e1c	- increased size of subject text-field git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3396 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
auron_x	9cbf94222f	*) added seedurl to network.xml as requested by lulabad git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3394 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	26f5757b40	- added support for multiple paths per domain to default-blacklist warning: an interface-change had been neccessary: - remove(String, String) has been renamed to removeAll(String, String), because it removes all path-entries for the specified host - remove(String, String, String) has been added to delete only a path-entry - geBlacklistType(String) has been renamed to getBlacklistType(String) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3391 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	3d6ab19f7e	- remove double entries in blacklist as well git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3390 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	bf7a69197d	- fix for possible NPE in queues_p - WatchCrawler_p: - display crawler traffic - pause/resume local- and global crawler git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3389 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
allo	9702d3abba	further supertemplate test git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3388 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	0c7b8cf632	- added first version of new wiki-parser - added blacklist support to manual URLFetcher stack fill - fix for NPE: http://www.yacy-forum.de/viewtopic.php?t=3559 git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3385 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	f7803a6ce4	enhanced crawl balancer - new domains now get a chance to get crawled early - less IO operations - new balancing method - better dump order at shutdown time - bugfixes regarding not found url hashes (no more superfluous cache kill) - domain access time is now shared over all balancer stacks - viewing the stack does no more disturbish the balancing algorithm that much - intelligent selection of best next domain using domain access times - extra double-check (to double-check the double-check) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3384 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	39b0658839	Redesign of Webinterface menu structure git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3381 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
orbiter	dc0c06e43d	PLEASE MAKE A BACK-UP OF YOUR COMPLETE DATA DIRECTORY BEFORE USING THIS redesign for better IO performance enhanced database seek-time by avoiding write operations at distant positions of a database file. until now, a USEDC counter was written at the head-section of a kelondroRecords database file (which is the basic data structure of all kelondro database files) to store the actual number of records that are contained in the database. Now, this value is computed from the database file size. This is either done only once at start-time, or continuously when run in asserts enabled. The counter is then updated only in RAM, and written at close of the file. If the close fails, the correct number can be computed from the file size, and if this is not equal to the stored number it is a strong evidence that YaCY was not shut down properly. To preserve consistency, the complete storage-routine had to be re-written. Another change enhances read of nodes in some cases, where the data-tail can be read together with the data-head. This saves another IO lookup during each DB node fetch. Includes also many small bugfixes. IF ANYTHING GOES WRONG, ALL YOUR DATA IS LOST: PLEASE MAKE A BACK-UP git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3375 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
hydrox	5af76fccd7	*) peer-search on Network.html now is case-insensitive git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3374 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	c016fcb10f	- added streaming-support to CrawlURLFetchStack_p servlet - bug for NPE in list.java - use more constants git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3373 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	65af9d3215	- continue shifting even in the case the stacked URL could not be found git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3372 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	d114a0136e	- crawl profile: don't add null-values - added some settings and statistics for url-fetcher 'server'-mode - added own stack for fetchable URLs - added possibility to fill stack via shift from peer's queues, via POST (addurls=$count and url$num=$url) or via file-upload - added "htroot" to classpath of linux start-script git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3370 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	a46dc43f45	- added lock symbol for restart- and stutdown-buttons on Status-page (see http://www.yacy-forum.de/viewtopic.php?p=31444#31444 ) git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3369 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	b2a9d32f29	why do I always forget some lines? sorry... git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3368 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago
karlchenofhell	e6ddf135bb	- enabled fetching new crawls via /yacy/list.html?list=queueUrls for testing purposes - sent URLs are taken off the limit-stack (of the global crawl trigger) (may be moved somewhere else in future versions) - added option to set the requested chunk-size git-svn-id: https://svn.berlios.de/svnroot/repos/yacy/trunk@3367 6c8d7289-2bf4-0310-a012-ef5d649a1542	18 years ago

1 2 3 4 5 ...

1344 Commits (d755a8026d00f7c18ea6be7be4b8889fe8d27189)