Commit Graph

1834 Commits (e9afe39cbb26230f47fc6768cde309fc3863240b)

Author SHA1 Message Date
theli e9afe39cbb *) Trying to be more tolerant against wrong charset names
18 years ago
theli 7526c831a8 *) Suppressing stracktrace
18 years ago
orbiter 50f2578c55 - some bugfixing and code cleanup
18 years ago
orbiter bdf4c7c51e added missing files for last commit
18 years ago
orbiter a5dd0d41af - refactoring of plasmaCrawlLURL.Entry to prepare new Entry format
18 years ago
orbiter 130cc76927 loop detection and termination in deletedHandles method
18 years ago
octoate 1c4076da8a First version of the MS Powerpoint parser based on Apache POI
18 years ago
theli 5b75d64d7d *) bugfix for last commit
18 years ago
theli 71ed104bc7 *) adding additional rpm mimetype (used by packman)
18 years ago
borg-0300 76d959122b new constants, finals, Stringbuffer, cleanup
18 years ago
rramthun 581dd2ec72 *)Proper arrow-function on Network.html, but ordering is still broken. Perhaps someone could fix that?
18 years ago
orbiter 6396f5971e bugfixes and migration attempt toward new kelondroFlex db
18 years ago
hermens 48f81acc0e reverse SVN 2744, it is not needed
18 years ago
hermens 1da9aece12 Repair DNS prefetch during cacheScan
18 years ago
orbiter 918b59dc5e - bugfix for snippet profile (no delete button)
18 years ago
orbiter 2bb529cedb added peer tags for peers in robinson mode
18 years ago
orbiter afbb547f3d extended options for abstracts generation in remote search interface
18 years ago
theli 22649408ad *) Better errorhandling for charset encoding problem during content parsing
18 years ago
theli a9c7e3f061 *) Bugfix for NoSuchElementException
18 years ago
orbiter f25f61d9d3 documentation of compile problem. See
18 years ago
orbiter c8f3a7d363 added snippet-url re-indexing
18 years ago
low012 2cfd4633ac *) even better handling of searchwords in snippets, words can consist of letters and numbers now
18 years ago
orbiter b062847797 fix for
18 years ago
orbiter e17fea7015 files in htcache are now stored in different hash/tree subdirectories
18 years ago
orbiter 661f005214 fix for seed upload build script
18 years ago
low012 2d3b7251a4 *) better handling of searchwords in snippets (see http://www.yacy-forum.de/viewtopic.php?t=2891 for details)
18 years ago
orbiter ddf8f220f6 fix for build fail
18 years ago
orbiter 25ae3d3161 generalized definition of hexhash
18 years ago
orbiter 86047f439d removed very bad bug that prevented production of any remote search result
18 years ago
orbiter f0d747c723 removed deprecated method
18 years ago
orbiter 5ff77612ac bugfix for old WORDS storage method
18 years ago
orbiter 0f10bdde22 more generic cache methods
18 years ago
orbiter 72482b1426 fixed scraper
18 years ago
hermens 6557112d8f small fix for plasmaURLPool.getURL() needed for new alternative htcache layout
18 years ago
hermens 440c6ee657 Implement alternative htcache layout
18 years ago
allo 226f2c5b2c first version, of the Serverlet Debugger
18 years ago
orbiter adf1f74ab2 bugfix for java 1.5 compile problem with serverCharBuffer.append(char)
18 years ago
orbiter fd61209797 lines inside tags without punctuation are extended by a single dot.
18 years ago
allo 1d0c0edda3 first version of posts/get from the del.icio.us api
18 years ago
orbiter 1969522dc1 removed lowercase of snippets (and other things):
18 years ago
orbiter 43614f1b36 bugfix in collection index. the index for collections was not created correctly
18 years ago
orbiter 1dfab1abe3 more control for seed receive
18 years ago
theli 1c0e65f55f *) Bugfix for problems with charset detection
18 years ago
orbiter db294687ea enhanced logging
18 years ago
theli a9a0f51303 *) suppressing InterruptedException errormessage
18 years ago
theli ce7ee74316 *) better errorhandling in filehandler (try catch block now starts before argument parsing)
18 years ago
theli 1d4fb680ce *) CrawlWorker.java: only keep content in memory if size is equal or less than 5MB
18 years ago
theli 1586d57187 *) odtParser: better handling of large files
18 years ago
theli f17ce28b6d *) plasmaHTCache:
18 years ago
orbiter 630a955674 read snippets from cache in case they are not provided in RAM
18 years ago