Commit Graph

4619 Commits (85c65475fa4c03dad5d6561c38d9a2b882540d1e)

Author SHA1 Message Date
orbiter aacf572a26 - enhancements for search speed
15 years ago
sixcooler 61c82f3105 gzip-compresson @ transferRWI & transferURL back again
15 years ago
orbiter 2c549ae341 fixed a number of small bugs:
15 years ago
orbiter f6eebb6f99 replaced auto-dom filter with easy-to-understand Site Link-List crawler option
15 years ago
orbiter 3057a0b939 - intranet scanner now produces urls with host names, not ips if possible
15 years ago
orbiter c60aed4435 no caching in browser of dynamic web pages sent by YaCy http
15 years ago
orbiter e63896f2a8 added an intranet scanner and a servlet which shows all intranet addresses and an option to start a site-crawl for all these addresses at once.
15 years ago
orbiter e54cb7fb0c more bugfixes (also for latest commit)
15 years ago
orbiter be6b48311c misc bugfixes
15 years ago
orbiter d2fd93135c - moved yacybot user agent string definition to MultiProtocolURI since there are basic access mechanisms where the bot string is needed
15 years ago
low012 afa708d552 *) added <s>...</s> tag to WikiCode -> works just as the HTML equivalent
15 years ago
orbiter a83186ac7d fix for bug in cytrails
15 years ago
orbiter 48c0d508ac fixes for crawling of smb links (file length not always available)
15 years ago
orbiter 0bc6284e27 - added bugfix for access tracker in case of concurrency conflicts
15 years ago
f1ori e670e1ef8e add charset auto-detection for htmlParser
15 years ago
f1ori ddcd5ae78c fix for http://forum.yacy-websuche.de/viewtopic.php?f=5&t=2989
15 years ago
f1ori 8fe1102452 fix http://forum.yacy-websuche.de/viewtopic.php?p=20889#p18426
15 years ago
orbiter 10a9cb1971 simplified snippet computation process and separated the algorithm into two classes
15 years ago
lotus 4450c240b7 npe fix http://forum.yacy-websuche.de/viewtopic.php?f=6&t=2982
15 years ago
orbiter 84a023cbc8 fixed several search bugs
15 years ago
orbiter 09c208a3ab patch for corrupted database files (just work on and forget key)
15 years ago
orbiter 97ee278931 enhanced search speed:
15 years ago
orbiter ee3820c9cc more logging for strange "java.lang.NoClassDefFoundError: de/anomic/http/server/RequestHeader" error
15 years ago
f1ori b392ca5024 * add option to show YaCy version, usage:
15 years ago
orbiter ac73072924 added a demonstration class: integrate the YaCy search results in own applications
15 years ago
orbiter 377f001e0d sorting of crawl profile names in crawl profile editor, see
15 years ago
orbiter 3552476fbe terminated migration from apache httpclient-3.1 to 4.1:
15 years ago
orbiter 8da4eb5de6 addition to patch in SVN 7111
15 years ago
orbiter a2f9974745 some redesign in the access tracker to realize sixcoolers question about "smartes way for deleting the first Object":
15 years ago
sixcooler 03f0414025 some minor correction of my last commit
15 years ago
sixcooler 42fa0eadb1 fix endless loop:
15 years ago
low012 5a9ea0308f *) further simplification of wiki code parser (less redundancy in code, less magic numbers), still not done with it...
15 years ago
orbiter 37baa8bae3 - fixes for concurrency exceptions and failed database integrity verification
15 years ago
orbiter 29fe401f93 - some layout and text enhancement for site crawl start
15 years ago
orbiter 461a2a6ec7 enhanced remote crawling:
15 years ago
orbiter 670ba4d52b - removed the remote crawl option from the network configuration submenu and
15 years ago
orbiter 89c2d8b81e better initial hash computation
15 years ago
orbiter 34e2f7f487 enhanced snippet fetch strategy: concurrent snippet fetch even for offline-snippet searches. This improves speed since it is now possible to fetch snippets offline and parsing of source files from the htcache can be enhanced using concurrency. This improves local and remote search.
15 years ago
orbiter 0cf006865e refactoring and enhanced concurrency
15 years ago
orbiter 83ac07874f - corrected return value of put() methods (not used anywhere, so it did not harm before)
15 years ago
orbiter 5702419194 fixed a bug in HTTPClient: keep-alive must be set to false, otherwise servers hold connections 2 seconds open until response.
15 years ago
orbiter 5870b13f3a - code cleanup / added debug line for further investigation in HTTPDemon.parseMultipart
15 years ago
orbiter ac1c08924e more performance hacks
15 years ago
orbiter 14c843d364 more performance hacks
15 years ago
orbiter 39f409a7bb performance hacks
15 years ago
orbiter 7ebef56add - redesign of a part of the remote search client to make it possible to have a test environment for remote search performance tests
15 years ago
orbiter 3c0e07ba72 removed all delays in shutdown process
15 years ago
orbiter 906c572621 - enhanced index create menu structure
15 years ago
orbiter 64860dc1bb enhanced search event logging (to be used for further improvements)
15 years ago
orbiter 7dbc357593 patch to identify corrupted database files
15 years ago
sixcooler 17eebd4ef8 counting crawler traffic again:
15 years ago
lotus d2a3d08c44 avoid div. by zero
15 years ago
orbiter 2c7edea35e - better shutdown behavior for the GUI (waits until data is written if GUI is killed)
15 years ago
orbiter 34a25856a5 - added navigation to next/prev search page using arrow keys (left/right)
15 years ago
orbiter 32f73d1aaa added copy for Info.plist for Mac application release updates (this file contains class paths and start parameters)
15 years ago
orbiter 4c21d8dc9d - changed default values for online caution (the pausing may not be necessary any more)
15 years ago
orbiter 570ca577c6 performance hacks
15 years ago
orbiter 348dece62f redesign of the SortStack and SortStore classes:
15 years ago
hermens 03eb021568 Fix for byte[] Objects as keys
15 years ago
orbiter 114bdd8ba7 fixed old sitemap importer which was not able to parse urls containing post elements
15 years ago
orbiter c0b08ac59b slighlty changed way of pdf parser integration
15 years ago
orbiter 6d83c7cb62 removed unnecessary Override statements (produces errors in strict validation)
15 years ago
lotus 6a09f1f7e5 fix dedicated upnp testing
15 years ago
orbiter 5fe828fa06 - replaced pdfbox and fontbox version 1.1.0 with 1.2.1
15 years ago
orbiter c757a4aa9f - corrected lifetime computation for search events
15 years ago
orbiter 24502fe3de performance hacks
15 years ago
orbiter ffaa9a1c51 avoiding double-loading of the same resource from the web in case that a seond attempt to load the resource is started while the first attempt is still loading the content from the web. This will delay the second attempt to the time when the first attempt has finished with the possible result that the second attempt reads only from the web cache, not from the web.
15 years ago
orbiter fb828f3767 - performance enhancements in search response time using faster query ID computation and an ID cache
15 years ago
orbiter d865ef77a8 removed re-read of index in case of a bad index. This may not solve the problem but it applies a 100% CPU problem on the peer. I'm afraid bad index files must be abandoned, and cannot be fixed this way.
15 years ago
orbiter b2c9db48ea Performance enhancement
15 years ago
orbiter ae07e11bc5 enhanced image search result display: concurrent loading of images before they are displayed
15 years ago
orbiter 22047ffad5 enhanced computation speed of many replaceAll string operations
15 years ago
orbiter e8228fba09 less locking in time format computation, caching and during secondary (remote) search evaluation
15 years ago
orbiter 9c0c94683c because of a bug in search result caching count search results had not been generated as fast as possible.
15 years ago
orbiter fa2eb9676e removed unused class
15 years ago
low012 5f391fcfa9 *) cleaned up in wikiCode parser (more to be done)
15 years ago
orbiter b3f0d06444 fixed a problem with restarts in YaCy mac applications: the DATA directory path was not submitted when doing a restart. This solves the problem by:
15 years ago
orbiter d4e4967e19 cleaned up code in yacyRelease (there will be work to do there)
15 years ago
sixcooler ca0a03e9ea ... migrating to HttpComponents-Client-4.x ...
15 years ago
orbiter 1da5241c2d do not block server session if maximum number of sessions is reached, just try to clean up once
15 years ago
orbiter 3988a95fb5 added ability in rss reader to parse atom feeds
15 years ago
orbiter 5de70c3d7c changed way of storage for search requests:
15 years ago
orbiter 9d080f387e change in handling of the all-visible home path for storage in YaCy:
15 years ago
orbiter 65eaf30f77 redesign of crawl profiles data structure. target will be:
15 years ago
f1ori 938676265f fix shutdown command, close HttpClient connection pool
15 years ago
f1ori 55da979291 disable revision detection for git
15 years ago
orbiter 104318d58a - added nice colors to feed indexing state messages
15 years ago
orbiter 4f22e2df41 bugfixes for
15 years ago
orbiter 42414a6ae3 added two more tables in rss reader interface:
15 years ago
orbiter 0010cd9db1 Support for indexing of RSS feeds!
15 years ago
orbiter 0f276dd63f - MapHeap now implements Map<byte[], Map<String, String>>
15 years ago
orbiter cf07b34c2d implemented the Map interface in the ARC classes so it will be possible to instantiate ARCs as
15 years ago
orbiter c60d0282fd more abstraction for tables stored in heaps:
15 years ago
orbiter d1be64d491 removed wrong assert
15 years ago
orbiter 3197ca42ed preparations to move the HTCache into cora:
15 years ago
orbiter 844f158686 - removed dependencies in header framework:
15 years ago
orbiter 80ba543d4c svn fix for uppercase problem
15 years ago
orbiter 5e7081cd19 refactoring towards a unified loading mechanism for MultiProtocolURIs
15 years ago
orbiter caece04f26 removed System.err and System.out usage from FTPClient; changed logging to log4j (preferred in yacy.cora)
15 years ago
orbiter 90531f78ff refactoring of the cora package to get subpackages for http and ftp (smb to come)
15 years ago