Commit Graph

200 Commits (77b1e921a954247a49d92f8bfc6f78b5056f4c9b)

Author SHA1 Message Date
orbiter bead0006da replaced tmp file extensions by prt
16 years ago
orbiter 89aeb318d3 enhanced the wikimedia dump import process
16 years ago
orbiter 5fb77116c6 added a submenu to index administration to import a wikimedia dump (i.e. a dump from wikipedia) into the YaCy index: see
16 years ago
orbiter c097531e3d added a catch Exception to all thread to check if any of them silently dies without any other notification
16 years ago
orbiter 9c6ac43f66 fixes for wiki parser
16 years ago
orbiter d079d6dfdb small changes in surrogate reader, wiki code and portal test
16 years ago
orbiter 2e3186189b fix for mediawikiIndex surrogate producer + added concurrency
16 years ago
orbiter 1b9e532c87 some concurrency for wikipedia dump reader
16 years ago
orbiter 16baa7ad24 To translate a mediawiki dump into the YaCy surrogate format do the following:
16 years ago
orbiter 0b2c98edc9 some more work on the wikipedia-dump exporter (not finished yet)
16 years ago
f1ori d93a2a6552 * ignore whitespaces so you can copy&paste signatures better
16 years ago
orbiter fbcbcc5bdb export of yacy document objects as dublin core record in xml
16 years ago
f1ori 44daec7936 * introduce signatures to autoupdate
16 years ago
orbiter 8a24350036 - fix for join method with new generalized RWI data structure (caused by latest commit)
16 years ago
orbiter d4d87d90c4 - extended experimental wikipedia dump parser
16 years ago
orbiter c08f9b36a4 refactoring of wiki parser.
16 years ago
orbiter 9da69d6b68 - better selection of files to be merged
16 years ago
orbiter d39a5b42ca more care about open file handles. Now files also close on windows and can be deleted afterwards.
16 years ago
orbiter 96eaecda3e - added migration class to go from index collections to the index cell data structure.
16 years ago
f1ori c545fcb9fa * add class to handle keys and signatures
16 years ago
lotus 39a177649b * added upnp listener for devices that do not respond to discovery but advertise themselves
16 years ago
orbiter c12bb8a6d0 - refactoring of the http client
16 years ago
orbiter 62505bb3cb more bugfixes as recommendet by findbugs
16 years ago
lotus 4aad461100 added UPnP support
16 years ago
lotus e8ae2599fd * some refactoring/moves to consoleInterface
16 years ago
f1ori 76cdc59789 * added some convertions to and from UTF-8
16 years ago
orbiter 94110df85a moved logging partially to kelondro
16 years ago
orbiter 024da2916b refactoring of logging
16 years ago
orbiter 83ce65707a (almost) completed partition of classes in kelondro
16 years ago
orbiter 7ee494fde5 more refactoring of kelondro:
16 years ago
orbiter bf93767ec6 refactoring of kelondro database classes
16 years ago
orbiter fc27bf8c4c refactoring of kelondro classes:
16 years ago
orbiter e004da48d3 - added fast fingerprint computation for files (any). Will be used in new index dump method
16 years ago
orbiter 47292e696a more performance hacks
16 years ago
orbiter b0f2003792 fast database initialization and fast start.up of yacy:
16 years ago
orbiter 0ca4bc7b79 - added reader and visualization for mediawiki-export files:
16 years ago
orbiter 6941bf42b1 performance hacks
16 years ago
danielr f095137238 - respecting httpdMaxBusySessions (refusing new connections if limit is hit)
16 years ago
orbiter 8ba33f104e fix for npe
16 years ago
lotus 9d50bfd0b3 fix for npe: http://forum.yacy-websuche.de/viewtopic.php?p=10562
16 years ago
lotus fe2792e9ce use accept-language header instead of user agent for language detection
16 years ago
orbiter 00c1535f84 added ranking and evaluation of language type in a search
16 years ago
orbiter bfcf9b7aa3 - added language detection using metadata from documents: html and odt documents provide this information
16 years ago
orbiter 536e77e8b7 modifications towards a single database operation to read/write http header and cached file at once:
16 years ago
orbiter bdae051d9a - extended new performance graph (better timing)
16 years ago
danielr 621b473b18 * removed some warnings of findbugs (http://findbugs.sf.net)
17 years ago
danielr 17b7845eb5 * refactoring
17 years ago
danielr 3bb870bfcd added final where possible
17 years ago
orbiter 50ef5c406f - refactoring of robots parser (removed opaque Objects[] result vector)
17 years ago
orbiter c3d461d191 - removed superfluous copyright statement
17 years ago
lotus 5488543b8f disabled disk usage logpoints
17 years ago
orbiter 7052f2f61f - added copyright header of ResourceObserver
17 years ago
orbiter 1400cdc91e - refactoring of resourceObserver (moved it to crawler)
17 years ago
f1ori b6301a54fa * added class ListDirs to provoid generic listing of directories in systemdirectories and jar-files
17 years ago
orbiter e81be7d4f2 added many missing user-agent declarations for yacy http client connections.
17 years ago
orbiter 69aac0d74c modified the diskUsage class regarding the following two aspects:
17 years ago
danielr f7f9ceb967 diskUsage: replaced blocking sleep with semaphore
17 years ago
danielr 63eadfdf84 fixed unlimited FileSizeLimit
17 years ago
det 609aaf0df3 rework of the windows part
17 years ago
det 1a4f26ba30 exclude HTDOCS from recursiv scan
17 years ago
det 6c07e894d9 add needed sleep
17 years ago
danielr 6b7e873962 resourceObserver refactoring and some synchronisation for console output
17 years ago
danielr 68c38c2d34 - WatchCrawler shows status without JavaScript
17 years ago
det c0dfe49743 also exclude collection.0028.commons and RANKING at startup check
17 years ago
det 11656741f1 exclude LOCALE and RELEASE at startup check
17 years ago
det 0727bb1e63 rework of console message handling; add of debugging output
17 years ago
orbiter f5ef7f222e - fixed a bug in parser (directory paths had not been recognized)
17 years ago
lotus ed24eab737 small fix for windows in resource observer
17 years ago
det 6afeb535cd another bugfix for the windows drive check
17 years ago
det b416af7568 bugfix for the windows drive check
17 years ago
danielr 7feae906aa - organize imports
17 years ago
det f597185026 Initial import of the resource observer framework
17 years ago
orbiter 03438ee977 added missing implementation of network-path reference
17 years ago
lotus 4a48717017 * automatic update for windows
17 years ago
danielr d4bce6affd refactoring (initialized static fields, removed empty if/else, serialized some fields in serializable classes)
17 years ago
danielr 7a35126e91 http timeouts von alten httpc wieder gesetzt
17 years ago
danielr d96e2badc7 - fixed POST in proxy
17 years ago
danielr 5c3c1fdf41 replaced httpc with Apache Jakarta Commons HttpClient (includes some refactoring ;)
17 years ago
orbiter 3e44293f07 - fixed a problem with thread pools in row collection
17 years ago
orbiter 7cc4ff05c9 some code enhancements and bugfixes
17 years ago
orbiter 4fdf695064 - fixed a bug in remote search that prevented that any results had been generated (!)
17 years ago
orbiter 83860507c9 - added punycode class from gnu idn library
17 years ago
low012 652086159a *) Replaced System.err.println() by logging function. Left System.err.println()s as comments to be able to quickly revert changes since gzip is an application with it's own main method and Orbiter maybe wants to keep it this way.
17 years ago
orbiter 36b898ca7a - tested successfully z-presentation of yacy seed encoding
17 years ago
orbiter 4e70dff8cf more generics
17 years ago
borg-0300 3cab85158c update for last commit
17 years ago
borg-0300 53367d941a more information (BASE64)
17 years ago
orbiter 4dc438f7e7 moved to Java 1.5:
17 years ago
fuchsi f717beecb1 - Changed yFormatter handling to be more flexible and produce more readable code for server pages. There are serverObject.putNum() methods to allow adding of number type values in a formatted form, and put() methods for number types that add them without formatting. This reduces the need to transform them into Strings in server pages and removes the HTML encoding step which is unecessary for numbers.
17 years ago
fuchsi 3352474dd8 Remove grouping separator in Network.xml (yacystats will woork without it) and format a few more numbers.
17 years ago
fuchsi 06e6a1ff62 Add a generalized Formatter class yFormatter inspired by http://forum.yacy-websuche.de/viewtopic.php?f=5&t=437
17 years ago
orbiter daf0f74361 joined anomic.net.URL, plasmaURL and url hash computation:
17 years ago
orbiter c48c9735cb bugfix for property table loader (did not exclude comment lines)
18 years ago
low012 d95a06e2e4 *) cosmetics: fixed a typo
18 years ago
orbiter 36a37f758b fix for oom exception during release download
18 years ago
orbiter ad1e4aa88e added selection of audio, video, image and application resources
18 years ago
orbiter fb7902aa68 fix for http://www.yacy-forum.de/viewtopic.php?p=26142#26142
18 years ago
orbiter 76fceb9997 refactoring
18 years ago
orbiter b79e06615d - added new LURL.Entry class for next database migration
18 years ago
orbiter bcf2b800b4 applied UTF-8 encoding parameter to yacy-internal protocol communication
18 years ago