Commit Graph

2177 Commits (1d31ebbeec74b7b928ea1c49157448ecf857a074)

Author SHA1 Message Date
orbiter 0a0c3edeb6 fixed a bug in index transfer
18 years ago
orbiter 8e7215475b - extended ViewFile to use is as debugging-tool: you can now use the
18 years ago
orbiter 30888e7a2f implementation of search constraints
18 years ago
orbiter 49a83f99d9 - fix for wrong DHT ordering in DHT selection
18 years ago
orbiter e55ef0df28 - automatic migration of old RWI entries to new format during remote search
18 years ago
orbiter f4b547dc13 limited index transfer to peer with version 0.486
18 years ago
orbiter 10a4ab5195 disabled some (more) write caches
18 years ago
orbiter 09bcc10344 bugfix for some problems of last change with assortments
18 years ago
orbiter e3d75f42bd final version of collection entry type definition
18 years ago
theli 58d79a9ac1 *) more error logging
18 years ago
theli 73552db482 *) bugfix adding missing lib names to copy and zip commands
18 years ago
theli 5c0669429e *) soap: adding function to query the peer list
18 years ago
orbiter c9364246cc introduced new RWI-Object.
18 years ago
hydrox f442af956c *) first version of build-in logalizer
18 years ago
theli 9ea9a9f157 *) bugfix for bookmark service: get didn't work when using date=null
18 years ago
orbiter b94ee53958 some enhancements to url retrieval
18 years ago
theli 203f2bde9a *) adding function to query the pause/resume state of the crawling queues
18 years ago
orbiter d34f10c63d some tests with reverse dns lookup
18 years ago
(no author) 24ac4e8860 Bugfix to "-UNRESOLVED_PATTERN- bei Hostname-Änderung" (http://www.yacy-forum.de/viewtopic.php?t=3093)
18 years ago
orbiter e628d34e16 patches for bad data
18 years ago
orbiter 497428c8ec refactoring
18 years ago
(no author) a51417d86b Bugfix: language of ConfigLanguage_p.html was not changed properly when a different language was choosen here
18 years ago
theli 4147c9f490 *) better error logging on soap engine initialization
18 years ago
orbiter 76fceb9997 refactoring
18 years ago
theli f77d624b94 *) bugfix for persistent connection support on transfer-encoded requests
18 years ago
orbiter eeda881553 bugfix for last commit
18 years ago
orbiter bb7d4b5d5e refactoring to prepare new RWI entry object
18 years ago
orbiter 6412c926bc new url entry interface
18 years ago
orbiter bdc9216366 - more asserts
18 years ago
low012 b18273bbb2 *) YaCy will extract URLs from <embed> tags now.
18 years ago
orbiter 1751a799ac - deactivated all write buffers
18 years ago
orbiter ba967c4875 - bugfixes and debug code
18 years ago
orbiter eaad91d84f fixed wrong RAM calculation
18 years ago
orbiter ee4715a21c - more asserts
18 years ago
orbiter 114a76a86e - added flag to urlhash that shows that domain is a local domain
18 years ago
orbiter eafb5ecd22 - better usage of memory resources for kelondroFlexSplit
18 years ago
orbiter b2d51be33c bugfix for latest changes to entry generalization
18 years ago
(no author) e59ff8b657 Bugfix: language of ConfigBasic.html was not changed properly when a different language was choosen here. Note: there's a similair bug on ConfigLanguage_p.html
18 years ago
theli 892b9f2fc4 *) additional soap function to query peer status
18 years ago
theli bd3710a974 *) new xml template to view peer profile as xml
18 years ago
theli d1afe1ce6b *) adding xml template to get the message list as xml
18 years ago
orbiter d454ca44ee update of cache logging
18 years ago
hermens 8385557672 Small fix for the Cache Monitor when using proxyCacheLayout=hash
18 years ago
theli f37e2041e8 *) adding soap function to import yacy bookmarks from xml or html (transfered via soap attachments)
18 years ago
orbiter f1ed55a5fc bugfix for last commit
18 years ago
orbiter 8fdefd5c68 generalization of payload definition of index storage
18 years ago
theli 29a1f132ec *) some strings replaced by constants
18 years ago
theli 4a3ec63e34 *) new soap service to manage yacy bookmarks
18 years ago
(no author) 9b3fd2b9e5 *) removing doctype definition to avoid problems with xml parser
18 years ago
(no author) c64d5018b4 *) Bugfix. Problem in XML Parser
18 years ago
theli 5e57e0814d *) new soap function to display log
18 years ago
theli ad248d61ca *) more verbose exception
18 years ago
hydrox 7e8669b15c *) added possibility to "recycle" a DHTChunk that failed to transfer.
18 years ago
low012 4feaa91890 *) Added additional MIME-Type.
18 years ago
low012 89af433879 *) Deleted parts of WebCat that were not needed for parsing SWFs.
18 years ago
orbiter 46a712e195 - more asserts
18 years ago
low012 8c9bc7e341 *) extracting urls works now
18 years ago
orbiter fc2936d500 bugfix for internal index entry generation
18 years ago
low012 493391e42d *) new flash parser, still experimental
18 years ago
orbiter 215c4e65f1 code cleanup
18 years ago
orbiter bd4f43cd66 - fixed a null pointer exception bug
18 years ago
auron_x 194d42b6a7 *) changed PPM-calculation to be more accurate
18 years ago
orbiter fe8afaf426 switched off usage of write cache for imprortant databases
18 years ago
orbiter 985fd807cc bugfixing in collection methods
18 years ago
theli c7bea4addb *) soap api
18 years ago
theli ee4d4e8567 *) Soap-handler: bugfix. wrong content-length was send when using content-encoding
18 years ago
orbiter d3431433b0 more anonymization in logging
18 years ago
orbiter e6044e5198 bugfix for
18 years ago
theli 4d19d94348 *) bugfix for nullpointerexception
18 years ago
theli 532c23b5c7 *) soap handler
18 years ago
(no author) 5141fa5942 combinedVersionString2PrettyString(..) renamd to combined2prettyVersion(..), new parameter "computerName" added to indentify the source of problems
18 years ago
orbiter 78b7f6f7fd bugfix for index remove bug,
18 years ago
(no author) 0e79f2fd7e name of the file to tranlate apears ahead its translation
18 years ago
orbiter ebd2d629d8 added missing file for last commit
18 years ago
orbiter 147d88cf23 re-design of database caching
18 years ago
orbiter 4e363108e1 - removed bad debug code that caused a large and unnecessary delay during global search
18 years ago
orbiter f21ede312e bugfixes for internals of database organization
18 years ago
orbiter eb4bfb0e9d fixed problem with cache.profile()
18 years ago
orbiter 2a9d868f6d - removed object cache from kelondroTree
18 years ago
theli 7299dc30e3 *) new soap service to manage the yacy file-share
18 years ago
theli 777e39cea0 *) new template to display the dir-listing in xml format.
18 years ago
theli 9e8942a064 *) adding method to implement blacklist from file
18 years ago
theli 4d1f933ea1 *) avoid reading of content body into memory
18 years ago
theli 88cfdecd38 *) Bugfix: calling close must not close the wrapped input stream, otherwise
18 years ago
theli d38ef0493d *) be more tolerant against missing ports in url
18 years ago
theli cfe54fedc7 *) Bugfix for resolveBackpath problem with tailing /..
18 years ago
orbiter dc056fabf3 small bugfix
18 years ago
orbiter 278d8c3c7e - more asserts
18 years ago
allo 5a6488256d catch the "username too short" exception
18 years ago
orbiter 2d3f1a53fd handling of Missing byte-order mark exception
18 years ago
theli ac13fa763a *) bugfix for blacklist remove (blacklist was not informed about remove)
18 years ago
karlchenofhell c5a5a9eb1c - patch for NullPointerException by Fuchs: see http://www.yacy-forum.de/viewtopic.php?p=27033#27033
18 years ago
allo 8a5c2d0a19 fix for supertemplates, too.
18 years ago
allo c35793fb46 fix for last commit
18 years ago
theli 3e0516446b *) new soap function to get the current queue status
18 years ago
allo a831c83025 create servletProperties, with the servlet specific funktions from serverObjects
18 years ago
orbiter 1825540020 another fix for url-db migration
18 years ago
orbiter 83a0efc65a better assert statements and fixes
18 years ago
karlchenofhell d13b381f83 - added mint-green skin
18 years ago
orbiter 2025e885d6 a fix for problems with remove situations in kelondroFlexSplitTable
18 years ago
theli b12da510f3 *) adding optional libraries for needed for soap attachments
18 years ago
orbiter 11843bba7f fix for Malformed URL Exception in url migration
18 years ago
theli 9eecc9a888 *) libs added to classpath
18 years ago
theli a1acc9c389 *) new function to configure distributed crawling
18 years ago
theli 0996e550e7 *) deploy soap peer admin service
18 years ago
orbiter 3ffc5b8793 fixed problem with serverCharBuffer.append(char)
18 years ago
orbiter 8b56887676 removed unused code
18 years ago
orbiter 06854988da - full integration of new LURL database in INDEX
18 years ago
(no author) 02c66c04f2 *) Missing file from last commit
18 years ago
octoate e4a3574b77 StringBuffer now resets every time the parser is called
18 years ago
theli ef912811f1 *) adding new soap service for peer administration
18 years ago
karlchenofhell ce237aefad - assortment-sizes table from PerformanceQueues_p.html is not shown if not used
18 years ago
theli 68204ff729 *) Suppressing for bad client requests.
18 years ago
theli c1dff41f99 *) adding possibility to deploy custom SOAP services
18 years ago
theli df49724f28 *) better error handling for seed upload - test download - problems
18 years ago
theli a5b9b514c1 *) retry crawling without content-encoding if the content-encoding header was not correct
18 years ago
theli 52466067d8 *) Bugfix for ArrayIndexOutOfBoundsExceptions which occure because SimpleDateFormat is not thread-safe
18 years ago
theli b357a13e9a *) adding synchronization block because SimpleDateFormat is not thread-safe
18 years ago
theli 92f774edd1 *) Better charset encoding detection
18 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
18 years ago
octoate cc24dde5e0 First version of a MS Excel parser based on Apache POI
18 years ago
karlchenofhell 4c63129136 - stupid mistake...
18 years ago
karlchenofhell b14a500b88 - removed debug output from PerformanceMemory_p
18 years ago
karlchenofhell ebf0da2a45 - now the fix http://www.yacy-forum.de/viewtopic.php?t=2974 works
18 years ago
theli 09337c9751 *) Bugfix wrong chars in soap search result document
18 years ago
theli 3d152bfe43 *) Logging message added
18 years ago
karlchenofhell b5e40e2fa2 - fix for http://www.yacy-forum.de/viewtopic.php?t=2974 (no cache-sizes for new db)
18 years ago
theli 96f45e9b15 *) Bugfix wrong chars in soap search result document
18 years ago
theli da2ac6fa23 *) adding new ant target to allow generation of client stub classes for yacy soap api
18 years ago
theli a9cc6df21b *) adding wsdl files to generate client stub classes with ant
18 years ago
orbiter 77a59a115d refactoring of indexing methods
18 years ago
orbiter 14490f0a83 added missing flush statement
18 years ago
orbiter 688cbfb776 - bugfixing for flextable bug
18 years ago
allo a29b4d4fb5 extended Supertemplates for Headerincludes.
18 years ago
theli a7e11ada50 *) suppressing stacktrace for "server has closed connection"
18 years ago
allo 72cc082ebe created password generator for scripts.
18 years ago
theli 5b114249ce *) Bugfix for ViewLog problem with multiline logging messages
18 years ago
theli de5e233766 *) Bugfix for GuiHandler sorting problem
18 years ago
theli fd94aa4bef *) Bugfix for IndexOutOfBound in GuiHandler
18 years ago
orbiter 29a1318ef9 bugfixes for wrong database access that do not consider deleted entries
18 years ago
theli cbb1e710b9 *) removing old class
18 years ago
orbiter c6d46f7ebd null pointer bugfix
18 years ago
theli decb09df6d *) Trying to be more tolerant against wrong charset names
18 years ago
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names
18 years ago
theli 7526c831a8 *) Suppressing stracktrace
18 years ago
orbiter 50f2578c55 - some bugfixing and code cleanup
18 years ago
orbiter bdf4c7c51e added missing files for last commit
18 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
18 years ago
orbiter 130cc76927 loop detection and termination in deletedHandles method
18 years ago
octoate 1c4076da8a First version of the MS Powerpoint parser based on Apache POI
18 years ago
theli 5b75d64d7d *) bugfix for last commit
18 years ago
theli 71ed104bc7 *) adding additional rpm mimetype (used by packman)
18 years ago
borg-0300 76d959122b new constants, finals, Stringbuffer, cleanup
18 years ago
rramthun 581dd2ec72 *)Proper arrow-function on Network.html, but ordering is still broken. Perhaps someone could fix that?
18 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db
18 years ago
hermens 48f81acc0e reverse SVN 2744, it is not needed
18 years ago
hermens 1da9aece12 Repair DNS prefetch during cacheScan
18 years ago
orbiter 918b59dc5e - bugfix for snippet profile (no delete button)
18 years ago
orbiter 2bb529cedb added peer tags for peers in robinson mode
18 years ago
orbiter afbb547f3d extended options for abstracts generation in remote search interface
18 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing
18 years ago
theli a9c7e3f061 *) Bugfix for NoSuchElementException
18 years ago
orbiter f25f61d9d3 documentation of compile problem. See
18 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
18 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now
18 years ago
orbiter b062847797 fix for
18 years ago
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories
18 years ago
orbiter 661f005214 fix for seed upload build script
18 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
18 years ago
orbiter ddf8f220f6 fix for build fail
18 years ago
orbiter 25ae3d3161 generalized definition of hexhash
18 years ago
orbiter 86047f439d removed very bad bug that prevented production of any remote search result
18 years ago
orbiter f0d747c723 removed deprecated method
18 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method
18 years ago
orbiter 0f10bdde22 more generic cache methods
18 years ago
orbiter 72482b1426 fixed scraper
18 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout
18 years ago
hermens 440c6ee657 Implement alternative htcache layout
18 years ago
allo 226f2c5b2c first version, of the Serverlet Debugger
18 years ago
orbiter adf1f74ab2 bugfix for java 1.5 compile problem with serverCharBuffer.append(char)
18 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot.
18 years ago
allo 1d0c0edda3 first version of posts/get from the del.icio.us api
18 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
18 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
18 years ago
orbiter 1dfab1abe3 more control for seed receive
18 years ago
theli 1c0e65f55f *) Bugfix for problems with charset detection
18 years ago
orbiter db294687ea enhanced logging
18 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
18 years ago
theli ce7ee74316 *) better errorhandling in filehandler (try catch block now starts before argument parsing)
18 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
18 years ago
theli 1586d57187 *) odtParser: better handling of large files
18 years ago
theli f17ce28b6d *) plasmaHTCache:
18 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
18 years ago
orbiter bcf2b800b4 applied UTF-8 encoding parameter to yacy-internal protocol communication
18 years ago
orbiter c40fca08a2 fixed bad handling of string separation
18 years ago
orbiter 5a40ea7866 refactoring of wget string list generation
18 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
18 years ago
orbiter d4c239e4be - fixed problem in collection index with deletion of single url references
18 years ago
orbiter 00746ca232 identified and fixed search performance problem caused by
18 years ago
orbiter b033a80750 better control of failure in node seek of kelondroTree
18 years ago
orbiter 310f1c41cd added option to see ranking scores in surftipps
18 years ago
theli a2e3095044 *) Bugfix. Add missing plasmaParserDocument.close() calls
18 years ago
theli cd5f349666 *) Better handling of large files during parsing
18 years ago
theli 8b2ceddb91 *) Displaying servere and warning logging messages in different colors on ViewLog_p.html
18 years ago
low012 f8ac694e51 *) fixed a bug where searchword in snippets were not displayed bold in front of a punctuation mark (see http://www.yacy-forum.de/viewtopic.php?p=25998)
18 years ago
orbiter df1629b05a - code cleanup
18 years ago
theli c665f6cddb *) handling of quotes in charset string
18 years ago
theli b73efd5565 *) missing changes needed because of last commit
18 years ago
theli 140ddba93f *) adding soap functions to pause and resume the crawler
18 years ago
orbiter 2463e5624a 'quick' release 0.47
18 years ago
theli 49fbb688df *) SOAP: old urlInfo renamed to urlInfoByHash, new urlInfo Function added.
18 years ago
theli 8f143d516b *) make snippet fetcher accessible via soap api
18 years ago
theli 97615af406 *) Restructuring of YaCy SOAP services
18 years ago
theli 241b881560 *) Redesign of YaCy SOAP handler
18 years ago
theli 009a33170b *) Content-Location header added
18 years ago
theli 1aa07a52cd *) Bugfix for UnsupportedEncodingException if the media type contains multiple parameters
18 years ago
theli 625c2ce6b1 *) bugfix for snippet fetching problem if content but not http header is available in cache
18 years ago
theli 813a8a8179 *) migration of mimeTypeParser to jmimemagic 0.1
18 years ago
hermens 3f5a4153a0 Make Peers more receptible to transferred indexes
18 years ago
theli 57415b6889 *) Bugfix for surftipps UTF-8 problem
18 years ago
allo b0a4fcce8c fix from theli
18 years ago
theli b6c7b91582 *) Parser now throws an ParserException instead of returning null on parsing errors (e.g. needed by snippet fetcher)
18 years ago
theli 64b2ef5aae *) Trying to bugfix shutdown problem
18 years ago
orbiter e03427871e enhanced surftipps:
18 years ago
theli 1dc12d6659 *) Bugfix for shutdown problem caused by cacheScan thread
18 years ago
borg-0300 42173462f5 rename cutUrlText to shortenURLString;
18 years ago
borg-0300 af1d89e381 check url == null added;
18 years ago
theli cc667b0aa5 *) htmlFilterContentScraper.java: adding support for link tag
18 years ago
theli 26dfbb7499 *) Bugfix for UTF-8: url names are now stored properly in stackcrawl, crawler, indexing queue and should be displayed correct on the gui
18 years ago
theli cf6acff2c2 *) Bugfix. htmlFilterInputStream document analysis did not work properly for documents smaller than the
18 years ago
borg-0300 f18304ddd3 unused/not needed imports removes;
18 years ago
orbiter ec031eb993 first version of surftipps
18 years ago
borg-0300 b174fbd0ca "import ...*" removed;
18 years ago
orbiter 807756150e patch for strange bug reported by email
18 years ago
theli 5c6251bced *) some improvements for extended html document charset support
18 years ago
theli 33f0f703c0 *) reinserting type cast again
18 years ago
orbiter 8c11a543dc fixed line ending coding
18 years ago
theli b690597275 *) adding casts to avoid compatibility problems between java 1.4 and java 1.5 writer class usage
18 years ago
theli 5afb0cbce8 *) setting default charset (for unkown documents) to iso-8859-1
18 years ago
orbiter f453c14b5d removed unreacheable catch blocks and unused imports
18 years ago
theli ad7f600f25 *) Bugfix. re-enabling inheritance of serverCharBuffer from writer class
18 years ago
theli 97d2a08ef1 *) restructuring needed to support parsing of documents using various charsets
18 years ago
theli fc594e8eda *) adding httpContentLengthInputStream.java class to allow reading of http response bodies
18 years ago
low012 cd636eb00e *) Fix for the fix...
18 years ago
low012 f9a5b55a9e *) Fixed bug described in http://www.yacy-forum.de/viewtopic.php?p=25448#25448
18 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
18 years ago
low012 8a30c5343d *) Fixed bug where exclamation marks could get lost between [=...=] and <pre>...</pre>
18 years ago
low012 d8f4b17e31 *) Hopefully fixed bug described in http://www.yacy-forum.de/viewtopic.php?t=2825.
18 years ago
theli 0e84a969d6 *) Bugfix for serverCharBuffer read from file operation
18 years ago
theli 90ef19d778 *) first version of a serverCharBuffer
18 years ago