Commit Graph

3450 Commits (453f3aaa942e17704d549b345c85d11be7590ba0)

Author SHA1 Message Date
orbiter d399444e49 added debug information to class loader
16 years ago
orbiter 5080fc33bf fix for http://forum.yacy-websuche.de/viewtopic.php?p=12247#p12247
16 years ago
orbiter 335d6ce8fc fix for class loading problem
16 years ago
low012 78778df464 *) this should adjust the Dev/Main detection of the updater to the new version numbers (0.7x is Dev, if x != 0)
16 years ago
orbiter b423d0a036 moved all servlets from htroot/xml to htroot/api
16 years ago
orbiter 91af105373 last changes before release
16 years ago
f1ori 7eade3f181 * fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1728
16 years ago
orbiter d1bace5e4d enhanced cleanup function
16 years ago
orbiter cb76d9e0e4 more synchronized in BLOBHeap (will not fix problem with Runtime-Error as reported in forum)
16 years ago
orbiter ff41da613e removed exception printout during load of snippets
16 years ago
orbiter 814a28775f removed thread dump writing in case of invocation target exception in httpd (looked bad, not serious)
16 years ago
orbiter bed38a5f8c fix for uncaught exception in RSSReader
16 years ago
orbiter 05c235de32 fix for npe
16 years ago
low012 7608944081 *) bugfix for REMOTE_HOST environment variable in CGI code (shows hostname of client instead of hostname of YaCy peer now)
16 years ago
orbiter a6b29cf72c reverted change of search event processing in SVN 5460. The new code did not work properly,
16 years ago
orbiter 9ef77d57f5 added an access control to the search interface using white/blacklists:
16 years ago
orbiter efe801173c better dht-in cache flush. see also:
16 years ago
orbiter 941ab78d9b better termination for blocking threads
16 years ago
apfelmaennchen 3dc208fad0 bugfix: bookmarks can now handle folder names like /news and /newspaper without getting confused...
16 years ago
orbiter e948df68ac longer timeout for queues during shutdown
16 years ago
orbiter 2b32248079 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1516&p=10545#p10545
16 years ago
low012 c1330f5743 *) added environment variable DOCUMENT_ROOT
16 years ago
low012 f26b8fcb1b *) comment mode is 'moderated' instead of 'activated' by default now (to avoid spam being visible)
16 years ago
orbiter b2a8c653ee small fixes
16 years ago
orbiter f675d47f86 better protection against database failures
16 years ago
orbiter 4f45605f04 small update for timing in search result processing
16 years ago
orbiter 9d119c6b61 migration of auto-update rules to new release strategy:
16 years ago
orbiter 4d5b401f00 try to fix some performance problems with the internal index management:
16 years ago
borg-0300 a0605325bb fixed a NullPointer Exception
16 years ago
orbiter b2b7edae18 fixed interactive search
16 years ago
lotus 2be119f0df adjusted big peer to 28M links
16 years ago
orbiter c6880ce28b removed the permanent cache flush and replaced it with a periodic cache flush
16 years ago
orbiter ef7fe537c5 fixed a cache-bug in cachedFileRA
16 years ago
orbiter 6c7e83909b - refactoring of data access methods to be prepared for new cell data structure
16 years ago
lotus c8451614f3 fix for overflow
16 years ago
orbiter c4c4c223b9 fixed a problem with attribute flags on RWI entries that prevented proper selection of index-of constraint
16 years ago
orbiter 6072831235 no cr transmission for robinson peers
16 years ago
low012 afe98bc11c *) added changes as proposed by Halborinda in http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1674
16 years ago
orbiter 07fc115e90 removed active profiling in kelondroRowSet
16 years ago
orbiter be4c458951 refactoring (implemented Iterable in kelondroRowCollection)
16 years ago
low012 bb5c2cd12e *) ISINDEX parameters will not be put on commandline anymore to prevent possible security hazards (better safe than sorry). Parmeters will have to be read from QUERY_STRING in ISINDEX case too which does not seem to be uncommon behaviour for web servers: http://vms.pdv-systeme.de/users/martinv/cgi_basics/cgi_basics.html#Datenuebergabe
16 years ago
orbiter b6bba18c37 replaced the storing procedure for the index ram cache with a method that generates BLOBHeap-compatible dumps
16 years ago
low012 db1cfae3e7 *) cleaning up after myself
16 years ago
low012 f547f9a78c *) added CGI capabilities (run Perl scripts and other software via HTTP GET and POST)
16 years ago
f1ori bdc380cd84 * add lastModified to templateCache
16 years ago
f1ori 025094675f * remove empty directory
16 years ago
f1ori c5691180cb * skip style-tags in HTML-files
16 years ago
orbiter 3567c58b18 added another filed information for BLOBHeap dumps: the gaps
16 years ago
orbiter abdd4aa414 added a index dump for blob heaps:
16 years ago
orbiter 8c3205b62e fix for OOB Exception
16 years ago
orbiter 78c568331e added test channel to /xml/feed.rss
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
f1ori 2d2ce24011 * remove all encoding-stuff from proxy
16 years ago
f1ori 73c8a0839c * abort download, when proxy connection is closed
16 years ago
orbiter bb935fdbb0 less organization overhead for DNS caching and prefetching
16 years ago
f1ori 4907697cfa * make fileuploads through proxy bigger than 65500 bytes possible
16 years ago
orbiter fc8189f3fb better self-healing of corrupted databases
16 years ago
f1ori 963da8c3f9 * updated tm-extractors to new version 1.0
16 years ago
orbiter e34ac22fbd - added new monitoring servlet at
16 years ago
lotus 449e697436 fix for null-seed in seedfile
16 years ago
orbiter d376d81fc4 replaced busy thread control of crawl stacker by blocking threads
16 years ago
orbiter f29b48d9ff patch for IndexOutOfBoundsException
16 years ago
f1ori 0881190b19 * Robots.txt: don't interpret Crawl-Delays for other robots
16 years ago
orbiter 243e73f53b removed unnecessary usage of kelondroBLOBTree
16 years ago
orbiter 8cb7170b75 - set status of kelondroTree, kelondroBLOBTree and kelondroFlexTable to deprecated
16 years ago
orbiter 7535fd7447 - refactoring of CrawlEntry and CrawlStacker
16 years ago
lotus 18513e2ee2 npe fix: http://forum.yacy-websuche.de/viewtopic.php?t=1646
16 years ago
orbiter 2802138787 - refactoring of CrawlStacker (to prepare it for new multi-Threading to remove DNS lookup bottleneck)
16 years ago
orbiter db6b3bf5a3 speed enhancement for integrated http server:
16 years ago
orbiter 7cd08bd5fb fix for NPE in BLOBCompressor
16 years ago
orbiter 5b94498643 fine-tuning of cache usage from SVN 5386 and a bug fix for overflow in available() method
16 years ago
orbiter 1779c3c507 - added a read cache to the RAFile interface to RandomAccessFile
16 years ago
orbiter e1acdb952c fix for problem with userDB and bookmarksDB which was caused by changes in kelondroRA in SVN 5376
16 years ago
orbiter 4a2dac659e more speed hacks:
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
orbiter 759cef23dd fix for bug in kelondroAbstractRA.readFully
16 years ago
orbiter d39d420b39 performance hacks
16 years ago
orbiter 0b4808ba3d added new interactive search feature:
16 years ago
orbiter 74a3d86114 fixed a error response that might present classified information
16 years ago
orbiter c6525ab75f fix for NPE in seed handling
16 years ago
lotus 1951d30a62 addendum to last commit
16 years ago
lotus 325ba7bfb8 only query words with length > 2
16 years ago
low012 e423fa9846 *) added method to only get file names in directory listing which match a filter
16 years ago
orbiter 513179f404 changed interface to colletctionIndex and adopted all implementing classes:
16 years ago
orbiter 9d64693cfb reverting again the changes to new concurrent chunkIterator
16 years ago
orbiter 45ad1c3dd5 - re-activated concurrent iterator for EcoFiles
16 years ago
orbiter 2e2120046f speed enhancement for BLOBHeap opening process
16 years ago
orbiter fa26a8f25a fix for deadlock-like behavior in balancer
16 years ago
orbiter 1918a0173e added more exception handling during crawling
16 years ago
orbiter 10f5ec1040 reverted last commit (more testing needed)
16 years ago
f1ori 5af8923f37 * distribute forgotten jar-file in parser
16 years ago
orbiter b0f2003792 fast database initialization and fast start.up of yacy:
16 years ago
orbiter 0ca4bc7b79 - added reader and visualization for mediawiki-export files:
16 years ago
danielr 2e63f03ca5 copy&paste vergessen :/
16 years ago
danielr cd8082b4e3 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1111#p11166
16 years ago
lotus 4f996a7651 fix for logparser pattern
16 years ago
f1ori d18c18971e * dirlisting in UTF-8 encoding
16 years ago
orbiter 867d0f2f56 removed some unnecessary pause delays
16 years ago
f1ori d49ffcd818 * files distributed by yacy are utf-8, files from repository use the system default charset
16 years ago
orbiter 8c96bc2ac1 do not use proxy caching rules for crawling
16 years ago
orbiter dba7ef5144 extended crawling constraints:
16 years ago
orbiter 96174b2b56 more debugging / better result status logging for parser/caching errors
16 years ago
f1ori 90e78b2cf6 * improve encoding detection of http service
16 years ago
orbiter ef66438662 - more space in error db to store larger error messages
16 years ago
orbiter 674ad2d55b different handling of error cases that occur during loading files with http or ftp:
16 years ago
danielr 538359a0ff simple fix to get DHT working again (maybe something more has to be done ;)
16 years ago
f1ori 7e1fe05e3c * added utf8-encoding to many getBytes-calls
16 years ago
lotus fad044fb54 update to snippet marker:
16 years ago
lotus 16723d0fa6 ask another peer if crawljob loading fails
16 years ago
orbiter 1b18d4bcf3 enhancement to crawling and remote crawling:
16 years ago
orbiter 3f746be5d4 - consolidation and refactoring of many DHT target - computing methods
16 years ago
orbiter d014b2728a Design-check, Extension and Refactoring of DHT target position computation:
16 years ago
orbiter dd27ce7216 added control logic to ECO tables that deletes ram copies of the tables if they get too large
16 years ago
orbiter 38e6ba5d00 forgot to re-rename commonsPath
16 years ago
orbiter 22989d0d8a added property index.storeCommons to switch commons storage on or off
16 years ago
f1ori 4b4ce75396 * http-server: submit charset from html metatags
16 years ago
f1ori 69e695bd4b * detect charset for directory index
16 years ago
f1ori 340ecd919d * include non ascii characters in visible characters
16 years ago
lotus 5cf0cbb47e javadoc
16 years ago
lotus 8d07607d1d update to resource observer:
16 years ago
f1ori d0543a7c39 * fix the debug ant-target
16 years ago
low012 baae3d91b1 *) fixed warning when compiling listManager
16 years ago
danielr a4fb76e93c undo r5300 (not fixed as seen after longer run)
16 years ago
low012 a99a629ed4 *) quick fix to prevent comments for blog entries which don't exist (http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1554)
16 years ago
low012 00e27e5050 *) fixed bug which made it possible to write files outside of the DATA/LIST directory when creating a new blacklist
16 years ago
danielr 0f9c0bd0d5 fix for ConcurrentModificationException at de.anomic.index.indexContainerHeap$heapCacheIterator.next(indexContainerHeap.java:324)
16 years ago
danielr 103ad2a437 some javadoc
16 years ago
orbiter b098522977 some very small advances to index utf-8 (not working yet), inserted also debugging code
16 years ago
orbiter 2f49666908 integrated the character decoding into the parser, removed old code
16 years ago
orbiter 49293c1358 fix for deadlock in new encoder :-(
16 years ago
orbiter 0edec2b760 FULL redesign of algorithms in htmlTools to encode/decode strings from/to unicode and html.
16 years ago
orbiter 958ec20cd0 removed specialized umlaute-handling in html parser. This has to be replaced by something that is able to transfer all possible html encodings into utf-8. Please see SVN 5293 for test cases.
16 years ago
f1ori 2e53cbc66a should compile now
16 years ago
f1ori f3bf2e379e should compile again
16 years ago
f1ori dd8441f102 fix bug: data from plasmaParser is allready converted to UTF-8
16 years ago
orbiter 6941bf42b1 performance hacks
16 years ago
orbiter 9b0c4b1063 redesign of parts of the new BLOB buffer
16 years ago
orbiter 1778fb420d - added some performance tweaks to the new BLOB buffer
16 years ago
orbiter 9663e61449 added another class to handle BLOB writings to the new HTCACHE data storage:
16 years ago
orbiter 382226da94 fix for bug introduced in SVN 5281: parameters were switched
16 years ago
danielr f2fd043797 refactoring (moved duplicate code into methods)
16 years ago
danielr c612046e5e r5278 java 1.5 compatible
16 years ago
f1ori af71ec93bf ops, forgot to import something
16 years ago
f1ori 9e65e9141c * always use UTF-8 for encoding hashes
16 years ago
orbiter 826ca79735 refactoring and new architecture to store the files of the web cache:
16 years ago
danielr f095137238 - respecting httpdMaxBusySessions (refusing new connections if limit is hit)
16 years ago
orbiter 8ba33f104e fix for npe
16 years ago
orbiter 998861acfd - some refactoring in BLOBHeap to enable more gap processing functions
16 years ago
lotus 9d50bfd0b3 fix for npe: http://forum.yacy-websuche.de/viewtopic.php?p=10562
16 years ago
orbiter 766cad6e93 enhancement in memory management of BLOB Heap files / merging of deleted entries
16 years ago
orbiter 7860d5d632 fix for bug in seed list management (cause was bad class overloading, only visual effects!)
16 years ago
orbiter ffed5fc415 fixed problem with lost peers in database
16 years ago
orbiter 6fb865fbdc - fix of bug in iterator in kelondroBLOBHeap which caused bug in crawl profile listing
16 years ago
orbiter 2d65887723 - fix for bug in new profile handling
16 years ago
orbiter ff68f394dd fix for problem with balancer and lost crawl profiles:
16 years ago
lotus fb8d9850ea fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1462
16 years ago
lotus 0d1a2f6183 fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=1461
16 years ago
orbiter 9ac16f565b - fixed several bugs in database management functions
16 years ago
orbiter 820a03f9d6 - removed some warnings
16 years ago
lotus fe2792e9ce use accept-language header instead of user agent for language detection
16 years ago
orbiter c8bdd965ec - larger update time for status page
16 years ago
lotus dda771db9d - search result layout
16 years ago
orbiter ce4715e305 removed indexing of anchor links and tagging such words as part of urls (that was wrong)
16 years ago
orbiter ce57de6cb3 - fixed re-setting of DHT Send/Receive settings
16 years ago
lotus 31c31e54e4 new tray icon image for different icon sizes (e.g. linux)
16 years ago
f1ori 9589dfe080 * removed trayicon popupmenu title
16 years ago
lotus 5a637f004d localized tray
16 years ago
lotus 9d4f0325e1 - removed shutdown from search page (we have it in tray now!)
16 years ago
lotus 214277dad6 - revert r5202
16 years ago
f1ori 7afa084207 * add nativ java trayicon, using reflections
16 years ago
apfelmaennchen b97ff24b43 bookmarksDB / xbel.xml:
16 years ago
orbiter 6e7d113eac fix for wrong index initialization after network switch
16 years ago
lotus 0a0cc3bf67 added missing classes to build target "run"
16 years ago
orbiter 7b35d54c6c fixed some problems with network switching (was not completely 'clean')
16 years ago
orbiter f0b42e5a98 fixed NPE
16 years ago
orbiter 8e0de7f180 update to language statistic evaluation:
16 years ago
orbiter 1198eeecc7 added language selection to search query:
16 years ago
orbiter 00c1535f84 added ranking and evaluation of language type in a search
16 years ago
lotus a81cb78211 finally some putHTML on htroot/xml/
16 years ago
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information
16 years ago
apfelmaennchen 5e8bd0f29c small fixes to getpageinfo_p.xml and htmlFilterContentScraper.java with respect to keyword extraction
16 years ago
apfelmaennchen 5b2a57bfd0 - /xml/util/getpageinfo_p.xml added <desc> and <lang> tags
16 years ago
orbiter e1f67262f7 - added and removed some debugging output
16 years ago
orbiter ce2a7ed116 integrated language detection classes into condenser environment
16 years ago
orbiter 2b13705839 fixed a mistake in indexing queue processing: documents had been parsed before it was checked if they should be indexed or not. parsing was not necessary for this check, so the check was moved in the queue in front of the document parsing
16 years ago
orbiter 21dbb39afa switched two balancer cases
16 years ago
orbiter 1bbf362cef update to the crawl balancer: better organization and better crawl delay prediction
16 years ago
orbiter ddcf285499 - fixed a bug in performance setting (did not work with german translation)
16 years ago
orbiter 0cd0fee546 fixed bug with wrong proxy result enqueueing. See:
16 years ago
orbiter 670244849d fix for http://forum.yacy-websuche.de/viewtopic.php?p=9835#p9835
16 years ago
lotus fd9233244e configurable free disk space via disk.free
16 years ago
orbiter 25a62cdc3f small fixes
16 years ago
lotus 73f233bb11 * set resource observer to 1000MB
16 years ago
orbiter 5fbccfd75e fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1366&p=9348#p9348
16 years ago
orbiter a28faabfd2 fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1351&p=9242#p9242
16 years ago
apfelmaennchen 7b63c66a08 - bugfix in bookmarksDB.Tag.hasPublicItems()
16 years ago
orbiter 1fb1665e71 increased dht interval to avoid peer selection failure
16 years ago
orbiter 1eb813bd43 shifted index deletion-on-exit rule to the class where the errors are produced
16 years ago
f1ori ba76995d2c * fix for http://forum.yacy-websuche.de/viewtopic.php?f=6&t=1415
16 years ago
f1ori bea6c13139 * with r5137 robotParser didn't work at all -> fix
16 years ago