Commit Graph

167 Commits (f7aaeb3fad85719d6f9f73aa709b244a17b6b3f7)

Author SHA1 Message Date
apfelmaennchen 34e5422675 adjusted code for bookmarksDB.getFolderList()
17 years ago
apfelmaennchen 6f9f821481 added XBEL Export for YaCy Bookmarks. Tags are strored as
17 years ago
orbiter f7c5ccedc7 more generics
17 years ago
fuchsi 1cb6e431a6 Replace the ISO8601 aka W3C datetime parser by one that supports every representation allowed by this standard, see http://www.w3.org/TR/NOTE-datetime
17 years ago
fuchsi 3c30c2da75 more cleanup and API consistency changes, more to come...
17 years ago
orbiter 89b9b2b02a redesigned remote crawl process:
17 years ago
orbiter 55c87b3b12 changed behavior of crawl stacker
17 years ago
orbiter a31b9097a4 preparations for mass remote crawls:
17 years ago
fuchsi 0e1738899f * Complete number localization and provide a more reasonable interface to serverObjects:
17 years ago
orbiter 01e0669264 re-designed some parts of DHT position calculation (effect is the same as before)
17 years ago
fuchsi 5b0c1449e1 various fixes and cleanups for blacklist handling:
17 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation:
17 years ago
orbiter 4779f314fe first version of next-generation search interface:
17 years ago
orbiter d9472b6a3a * fixed problem with watch crawler
17 years ago
orbiter e332b844b2 - enhanced remote search: during waiting time for remote crawls
17 years ago
orbiter b3c830271c fix in xml header
17 years ago
orbiter 947fc46904 refactoring of search process:
18 years ago
orbiter 62347b50f4 added security layer for ViewImage:
18 years ago
orbiter 9ca46a8c69 indexing of local (intranet) urls enabled
18 years ago
orbiter 511dcbb172 fixed encoding bug made in SVN 3993
18 years ago
orbiter 40b0547611 - documentaton changes (removed old forum links)
18 years ago
orbiter a4e8ad95ab enhancements to news and switchboard queue processing
18 years ago
orbiter 36a37f758b fix for oom exception during release download
18 years ago
karlchenofhell 71ca9aa6d4 - fix for changed blacklist types
18 years ago
theli 339153d40e *) favicons that are specified in the document content via html link-tags
18 years ago
theli 051a65f7af *) Snippet fetching:
18 years ago
allo 5fc00871a9 getpageinfo/sitemap bugfix
18 years ago
allo e7da3d2340 fixed sitemap url in getpageinfo
18 years ago
(no author) 92351c4dcb *) SOAP: bookmarks list now indicates if a bookmark is private (requested by KoH)
18 years ago
orbiter a585b4d41b added web structure image
18 years ago
orbiter 33ad0c8246 added a web structure computation and logging:
18 years ago
karlchenofhell 601fc7d1c5 - added source to J7Zip-modifed.jar and it's license (changelog is still to come)
18 years ago
theli 7d9259e44d *) Bugfix for umlaut problem
18 years ago
theli 0b5fc3c28c *) moving date functions to serverDate class
18 years ago
theli 6f46245a51 *) Bookmarks: Ajax icon is displayed while loading title
18 years ago
orbiter 6e7340ef52 added exclusion search
18 years ago
theli 91c2a042a7 *) bugfix for wrong proxy traffic accounting
18 years ago
orbiter 861f41e67e redesigned NURL-handling:
18 years ago
orbiter 9f929b5438 better snippet handling in case of snippet load fail
18 years ago
karlchenofhell bf7a69197d - fix for possible NPE in queues_p
18 years ago
orbiter 306c50ac40 QPM (queries per minute) statistic stub
18 years ago
allo 29aa7031d3 workaround for the snippets
18 years ago
allo 8803f813c5 partly fixed snippets
18 years ago
allo 0c81bd39d4 XSS-safe put as default.
18 years ago
rramthun 00ca6ecf58 -made snippet-timeout for text and media configurable
18 years ago
karlchenofhell 41bc31d2c2 - ConfigAdvanced_p => XHTML (no invalid IDs)
18 years ago
orbiter 1d2d1854b9 added size of rwi and urls to WatchCrawler
18 years ago
orbiter 0a050bc043 enhanced ranking
18 years ago
orbiter 61798f0ae6 added option to distinguish between text crawl and media crawl
18 years ago
orbiter febe6b114a design update of crawler monitor
18 years ago
orbiter e4570bffaf -implemented a specialized snippet-fetch for media content
18 years ago
orbiter 1377c53aa3 extraction of media links from search results
18 years ago
orbiter fb9e0f0284 preparations for media snippets
18 years ago
orbiter 937ccd4e76 fix for snippet-generation
18 years ago
orbiter 9a85f5abc3 cleanup
18 years ago
orbiter 109ed0a0bb - cleaned up code; removed methods to write the old data structures
18 years ago
orbiter ceb9e3aa17 - enhanced parser: collection of audio, video, image and application links
18 years ago
orbiter b5a29e9651 - fix for snippets that are too short
18 years ago
orbiter 30888e7a2f implementation of search constraints
18 years ago
orbiter d34f10c63d some tests with reverse dns lookup
18 years ago
orbiter 497428c8ec refactoring
18 years ago
allo a75f895884 memory and traffic informations
18 years ago
allo 2ba56f70a8 XML-safe put.
18 years ago
allo a17c43779f removed wrong part of template
18 years ago
allo 27f9e0b1c6 xml interface for blacklists
18 years ago
allo 74f09a0510 some more xml-backend files.
18 years ago
allo e25172853a fixed license notice
18 years ago
allo 1d0c0edda3 first version of posts/get from the del.icio.us api
18 years ago
orbiter 5a40ea7866 refactoring of wget string list generation
18 years ago
orbiter dbc2e039bb added time-out option parameter to call hierarchy
18 years ago
orbiter b59d4576af increased version number to emphasise that the snippet fix
18 years ago
orbiter d4c239e4be - fixed problem in collection index with deletion of single url references
18 years ago
orbiter df1629b05a - code cleanup
18 years ago
orbiter 3aac5b26da - added automatic tag generation when a web page from the search results is added
18 years ago
orbiter 5015e780c2 - simplified watchCrawler code
18 years ago
orbiter c89d8142bb replaced old 'kCache' by a full-controlled cache
18 years ago
theli 92e986bb91 *) adding missing return prop (requested by allo)
18 years ago
allo f0529fe53e update for ftp urls
18 years ago
theli 413e6b9855 *) direct access to responseheaders of sbQueue.Entry removed to make it more http independent
18 years ago
theli eb9b138986 *) next step of restructuring for new crawlers
18 years ago
theli 1395aae742 *) starting restructuring which is needed to add crawlers for additional protocols
18 years ago
orbiter 7df572756a fist step+attempt so solve the snippet marking problem.
18 years ago
orbiter 3879a0ecd0 replaced java.net.URL usage by use of new class de.anomic.net.URL
19 years ago
allo 933a9e02ab fix for broken build
19 years ago
allo 360056b30c fix ajax bug (no valid xml)
19 years ago
orbiter 90d569d70f refactoring of index management:
19 years ago
allo 44d72f06c4 more Caching
19 years ago
allo 1a13c8b78e right wordCachesize after orbiters commit.
19 years ago
allo 6b056610e3 updated watchcrawler for the recent changes
19 years ago
orbiter bcd99fe83e introduced a second RAM cache for DHT transfer
19 years ago
orbiter bae3783d38 added a snippet marking
19 years ago
allo fb5d8fdc59 removed encoding attribute
19 years ago
allo f1b91b1266 xml with right encoding
19 years ago
orbiter 3703f76866 - fixed re-search bug: after a search with several words, a second search could not
19 years ago
theli dc9174c809 *) Implementing snippet fetching via ajax
19 years ago
allo 7e7a72b108 display wordcaches number on WatchCrawler.html
19 years ago
allo 3fd1641893 queuesizes in queues_p.xml
19 years ago
allo 62664d7252 AJAX Check for robots.txt before crawling.
19 years ago
allo 26d7e8dd0d more escapes
19 years ago
allo 127396436f more queues in the xml backend
19 years ago