Commit Graph

5728 Commits (ad62609ec7e62b0f39880dd2beb0ffb167940ae9)

Author SHA1 Message Date
cominch 3c255c025b Show tags in search results (if activated in ConfigPortal_p.html)
13 years ago
Michael Peter Christen 16d8f33795 added objectlink generation to vocabulary generation and editor
13 years ago
cominch f49d92d8da Cleanup of interaction class and helper routines
13 years ago
cominch 56b0115054 Triplestore: modify routines to access per user store
13 years ago
cominch a95127c9af Triplestore: initalize per-user triplestores
13 years ago
Michael Peter Christen d45718251e refactoring (Localization -> Location)
13 years ago
Michael Peter Christen b8b3c87ba7 - renamed localization to location (that was confusing)
13 years ago
Michael Peter Christen e89747bb67 - added automated generation of vocabularies from url stubs
13 years ago
Michael Peter Christen 79464189a4 The 'Locale' vocabulary, which is generated by geo data, has now the
13 years ago
Michael Peter Christen eca38c53e7 added a vocabulary editor
13 years ago
Michael Peter Christen 61bb52d55c - using http://purl.org/dc/terms/references to refer from an
13 years ago
Michael Peter Christen 2bbb6c52cf added option to clean the triplestore when deleting the index
13 years ago
Michael Peter Christen 50c576599b allow multiple parser options instead of printing an error
13 years ago
Michael Peter Christen c02d742e53 proper namespaces in triplestore dump
13 years ago
Michael Peter Christen 8b53771db2 changed behavior of navigation processing:
13 years ago
Michael Peter Christen 5fc6524ca8 - moved triple store to net.yacy.cora.lod (should be generalized there
13 years ago
cominch 8d2e6355f8 augmented browsing: remove non-existing external snippet file
13 years ago
cominch c90f174799 preparation and generalization of augmented browsing methods
13 years ago
Michael Peter Christen bef823c247 close the reader if finished
13 years ago
Michael Peter Christen 4ee6fb1de9 added missing blacklist dht cache storage (maybe due to mistakes in
13 years ago
Roland 'Quix0r' Haeder e4d36fa5eb Fix to make all values lower-case (this should make all existing blacklists compatible with the new enum)
13 years ago
Roland 'Quix0r' Haeder edaa09b9b1 Rewrote all String blacklist types to enum 'BlacklistType', closes bug
13 years ago
Roland 'Quix0r' Haeder af5a597e47 Scroogle is not comming back, remove dead code
13 years ago
cominch 7a4dab6d1d - removed unused variables
13 years ago
Michael Peter Christen ca93835713 removed usage of deprecated methods
13 years ago
Michael Peter Christen 23e38bd918 do not load the "_triplestore.rdf" files which are of special use in
13 years ago
Michael Peter Christen 90c6fc4b63 load all - but not the persistent local.rdf - triples from
13 years ago
cominch bbfc53b663 bugfix
13 years ago
cominch 65c5826d93 bugfix
13 years ago
cominch aa0295917c augmentation
13 years ago
cominch ed2ea0f08e augmented browsing modification
13 years ago
cominch 6b32f7c1f6 re-enable augmented proxy
13 years ago
cominch 3b08edec2e bugfix
13 years ago
cominch 5f8ba7f4f2 small changes
13 years ago
cominch 300b235ce8 Updated Demo Servlet
13 years ago
cominch 90512640bf Added config switches for custom parser
13 years ago
cominch b5a8fb5fd8 Catch malformed URL when submitted in encoded style
13 years ago
cominch df47f31235 interaction: add special table interaction
13 years ago
cominch e14f2881ae interaction: add special table interaction
13 years ago
cominch d7326079a8 interaction: add global variable store
13 years ago
cominch 4e4e7a99f8 interaction: add global variable store
13 years ago
cominch 8e80894812 create virtual web folder /currentyacypeer/ which always points to local
13 years ago
cominch bde07ed7a8 Add tagging overlay element
13 years ago
cominch b0bc0b4572 Add new demonstration module for client-side key-value store (backend:
13 years ago
cominch c9dc6cda02 Demonstration: include value from interaction in search results
13 years ago
cominch ae8adb0e58 Small changes
13 years ago
cominch bcbd8eee33 Add several parsers, for RDFa and rdf files.
13 years ago
cominch 9ef5a80f4e add interaction for triples and selector for augmented browsing
13 years ago
cominch 282c1620d6 Allow TripleStore to be persistent after reboot
13 years ago
cominch 5d20cd324a Add Triplestore and RDF query interface
13 years ago
cominch bc9a618e0a augmented browsing: ignore js and css, integrate more user interaction
13 years ago
cominch b21048892b augmentedParser add features and integrate external html parser to
13 years ago
cominch 9cbfc1a1c0 augmentedProxy, which forwards every proxy request to a
13 years ago
Michael Peter Christen 3b992e6b00 using utf8 String compression in Webstructure database
13 years ago
Michael Peter Christen 26301a538d bugfix in Domains - dns-lookup
13 years ago
Michael Peter Christen cde20911bb saved a bit more ram using UTF8 String compression for OpenGeoDB and
13 years ago
Michael Peter Christen 225ee42879 made the GeoLocation into an interface with the current
13 years ago
Michael Peter Christen 2280a7b276 - changed initialization order to prefer allocation of memory for table
13 years ago
Michael Peter Christen 0746308bc2 only the metadata tables shall be able to use the tail cache
13 years ago
Michael Peter Christen 7ec9bef0c3 fix for OOM
13 years ago
Michael Peter Christen 41c02cb10e - less restrictions for usage of Table RAM copy
13 years ago
Michael Peter Christen b8f56a9803 npe bugfix
13 years ago
Michael Peter Christen dd14b19c26 lazy initialization of block rank table ... only normal web search uses
13 years ago
Michael Peter Christen ba10caf89a lazy initialization of database tables
13 years ago
Michael Peter Christen 701b9a28a0 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen ab7107b34b fixed RWIProcess queue limits: now discovering hidden results for mass
13 years ago
Michael Peter Christen 10c9c17d51 fixed handlemap spread factor and null iterator handling
13 years ago
Michael Peter Christen b0095c8d3c flush the compressor cache when a cleanup is done
13 years ago
Michael Peter Christen a61f44f9e4 lazy initialization of block rank table.
13 years ago
Michael Peter Christen 96e9d77270 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 00f2df1120 a variety of possible memory leak fixes
13 years ago
Michael Peter Christen 3dd8376825 added automatic cleaning of cache if metadata and file database size is
13 years ago
Michael Peter Christen d0ec8018f5 fixes for bad long computation
13 years ago
Michael Peter Christen 6bb07afcc3 accept also files with other file prefix; used to read 'foreign' cache
13 years ago
Michael Peter Christen 96c8119b50 added GeoLocation / GeoPoint classes which uses less memory than
13 years ago
Michael Peter Christen 461a0ce052 removed warnings
13 years ago
Michael Peter Christen 62ae9bbfda allow more POIs, get more at once
13 years ago
Michael Peter Christen 407fdf6968 more bug fixes and performance hacks for search process
13 years ago
Michael Peter Christen a1fe65b115 performance hacks
13 years ago
Michael Peter Christen 2fe207f813 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 5aee19daa4 added show from cache in search results (not yet finished)
13 years ago
Michael Peter Christen 5e562dcdb7 adopted vocabulary usage within anotation/naviagtion feature of search
13 years ago
Michael Peter Christen 514700291a moved Vocabulary to cora package (added in git
13 years ago
Michael Peter Christen 0284a4d88f more fixes for double precision of coordinates
13 years ago
Michael Peter Christen 964406ad17 added concurrency enhancement to xml parser
13 years ago
Michael Peter Christen 240045cf7c fix for bad distance computation
13 years ago
Michael Peter Christen e0d8643226 - performance hacks
13 years ago
Michael Peter Christen 7a329465b3 using pre-compile pattern in blacklist; should enhance search speed
13 years ago
Michael Peter Christen 6e83b02b83 - bugfix for surrogate file reader
13 years ago
Michael Peter Christen 9b4c699526 ehanced location search:
13 years ago
Michael Peter Christen 834dc6b263 store more data from interface access
13 years ago
Michael Peter Christen 1f48d1528b performance hacks
13 years ago
Michael Peter Christen c70aaccdc9 better location to generate a guid for rss messages
13 years ago
Michael Peter Christen 10da7335ea performance hack: use a hash cache for all hashes that are computed by a
13 years ago
Michael Peter Christen f8a0cf6d7c RSSMessages do not need a concurrent hash map -> removed overhead
13 years ago
Michael Peter Christen 07ca7e4dd1 enhanced RSS parsing by ensuring that it is parsed with a buffered input
13 years ago
Michael Peter Christen 7c1feefb28 introduced a default 10 second time-out in rwi normalization time
13 years ago
Michael Peter Christen 8d997d55b6 better logging
13 years ago
Michael Peter Christen 65d37e6a20 only ASCII needed in seed bitflags
13 years ago
Michael Peter Christen 0f82fb3628 using double instead float for a better release ordering
13 years ago
Michael Peter Christen 43c2c6e588 better logging
13 years ago
sixcooler 56087c1f23 bump to httpclient- httpcore-, httpmime- 4.2
13 years ago
Michael Peter Christen 20e0cc0822 fix for bad location evaluation
13 years ago
Michael Peter Christen 71c3163f3d - fixes to node identification
13 years ago
Michael Peter Christen 4d3cc02168 replaced old bzip2 library against better documented commons-compress
13 years ago
Michael Peter Christen ad222be7f8 added node state icon in network list
13 years ago
Michael Peter Christen eff7667554 fix for http://bugs.yacy.net/view.php?id=188
13 years ago
Michael Peter Christen 3c2bec681f added a root node flag: identifies peers with short ping time
13 years ago
Michael Peter Christen c846e9ca14 redesign of the crawler monitor page: show crawled pages instead of
13 years ago
Michael Peter Christen 8b974905ee changed log-in text for all servlets with authentication:
13 years ago
Michael Peter Christen 16b21f7a5b Added more steering in Crawler_p.html interface
13 years ago
Michael Peter Christen acc19e190d hack against 100% cpu during crawl delete
13 years ago
Michael Peter Christen c15fcde1c8 add-on to latest commit
13 years ago
Michael Peter Christen cf47d94888 performance hack to parse numbers inside of substrings without actually
13 years ago
Michael Peter Christen 7e0ddbd275 added a "fromCache" flag in Response object to omit one cache.has()
13 years ago
Michael Peter Christen 81737dcb18 removed stack trace from swf parser since we cant do anything there
13 years ago
Michael Peter Christen 7bf421b9dd - fixed image search page navigation
13 years ago
Michael Peter Christen 125d47b3c1 added more interruptions in DidYouMean because that was the cause for
13 years ago
Michael Peter Christen c6a09eab0b synchronization needed
13 years ago
Michael Peter Christen fb94b47b1a changed queue sizes to have less memory occupied during indexing
13 years ago
Michael Peter Christen 76157dc2c3 bugfix for http://bugs.yacy.net/view.php?id=173
13 years ago
reger 6696cb1313 bugfix: lookup of peernames no result for active peer in page IndexControlRWIs_p.html -> Transfer RWI to other Peer
13 years ago
Michael Peter Christen c6558cba08 more classification bugs
13 years ago
Michael Peter Christen 082831b9d6 search contentdom was checked in wrong way - fixed
13 years ago
reger ee553d971e correct typo in scripts_txt comment
13 years ago
Michael Peter Christen f294f2e295 bugfix to http://bugs.yacy.net/view.php?id=181
13 years ago
Michael Peter Christen acf8d521a2 fix for http://bugs.yacy.net/view.php?id=126
13 years ago
Michael Peter Christen bb88878b4d the last commit was incomplete..
13 years ago
Michael Peter Christen d320a31ae1 bugfix for http://bugs.yacy.net/view.php?id=186
13 years ago
Michael Peter Christen fa735f4f04 Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 3e1bc9477f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 6f8a2fef1f small speed enhancement using a column factory
13 years ago
Roland 'Quix0r' Haeder d10627d591 More sync in close() methods
13 years ago
Roland 'Quix0r' Haeder b3ae2aa41f With or without 'final'? At least please try it in other methods
13 years ago
Roland 'Quix0r' Haeder fbb946f913 Made a method static (Eclipse suggested it), removed unused import, pk=null check does now output a warning in logfile
13 years ago
Michael Peter Christen 52d307c735 prevent that the snippet fectch process removes catchall entries
13 years ago
Michael Peter Christen 7eece0256f moved yacy.logging to defaults according to request in
13 years ago
Michael Peter Christen 5b3acc12cd Pattern.quote() replaces \\Q and \\E according to publication in
13 years ago
Michael Peter Christen 89142d1e8d removed (not all) warnings
13 years ago
Michael Peter Christen 5deebd02ea added serialization
13 years ago
reger b2175ea4ef Add possibility to set custom Solr field names for the YaCy default Solr attributes.
13 years ago
Michael Peter Christen 15db703808 added missing serialization to remove all warnings
13 years ago
Michael Peter Christen 1795a7325b made HandleSet serializable
13 years ago
Michael Peter Christen e7e381d110 added configuration to switch off redirection following in crawler
13 years ago
Michael Peter Christen 2717c1b749 fixed bug in solr interface
13 years ago
Michael Peter Christen 70505107ca enhanced crawler/balancer: better remaining waiting-time guessing
13 years ago
Michael Peter Christen f150bc218b fixed bug in solr error document
13 years ago
Michael Peter Christen cb54c1737b solrj connector bugfix
13 years ago
Roland 'Quix0r' Haeder a093ccf5eb Now used synchronization in all close() methods to make sure all objects
13 years ago
Michael Peter Christen 49cab2b85f Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen 0d58fea210 made multiple connector default
13 years ago
Michael Peter Christen 7740c02c56 - enhanced the solr connector
13 years ago
Michael Peter Christen 0cf3d36eae more tolerance in case of corrupted file
13 years ago
Michael Peter Christen acc6db28ff added missing classes for solr interface
13 years ago
Michael Peter Christen adeb33bb36 better abstraction for solr objects
13 years ago
Michael Peter Christen 8864141872 more abstraction in solr connection classes
13 years ago
Michael Peter Christen c00efc2717 made the solr connection more generic
13 years ago
Michael Peter Christen ea2bd43b28 patch for broken configurations
13 years ago
Michael Peter Christen e5ca7f22b1 enhancement in circle drawing
13 years ago
Michael Peter Christen 34f4225d7e less 'wellformed' calls without asserts
13 years ago
Marc Nause a691023d04 *) better formatting for network QPM
13 years ago
Michael Peter Christen 77f8e9fb9b Merge branch 'master' of ssh://git@gitorious.org/yacy/rc1.git
13 years ago
Michael Peter Christen ba6aaabc51 refactoring + parser bugfixes
13 years ago
Michael Peter Christen 2a0434efa4 Merge commit 'c1f6b4fb5226d3d2f8b2bec9e361f6b3476e03ff'
13 years ago
Michael Peter Christen 942896fe46 removed methods not supported by new solrj connector for httpclient 4
13 years ago
Michael Peter Christen 22e1f68c0b solrj user authentication patch
13 years ago
Michael Peter Christen 09484955dc added new entry class for embed tags
13 years ago
Michael Peter Christen 62f2554a01 - fixed build problems (deprecated methods using httpclient 3.1)
13 years ago
Michael Peter Christen a6d60fc21f concurrency enhancement in ConfigurationSet
13 years ago
Michael Peter Christen 453010bd68 - solved problems with backpath normalization
13 years ago
Michael Peter Christen 5f5ed33ed8 patch for media search (audio, video apps)
13 years ago
Michael Peter Christen 7860c1df80 fix needed for new solrj library
13 years ago
Michael Peter Christen 0e13022147 - enhanced solr field documentation
13 years ago
Michael Peter Christen 19efbf1b0f - apply directDocByURL to NOLOAD Queue
13 years ago
Michael Peter Christen 659178942f - Redesigned crawler and parser to accept embedded links from the NOLOAD
13 years ago
Michael Peter Christen a3badd3205 changed search process for images: no more media snippet load process,
13 years ago
Michael Peter Christen f5efdb21fd refactoring
13 years ago
reger c1f6b4fb52 lookupByIP: prevent comparing of port parameter if called with port -1 (=unknown)
13 years ago
Michael Peter Christen f8cd57c92f new indexing strategy: ALL links that appear anywhere are indexed, not
13 years ago
Michael Peter Christen 14f67f217c refactoring of ContentDomain: now subclass of Classification
13 years ago
Michael Peter Christen 8a08c96a82 removed dependency from logging
13 years ago
Michael Peter Christen a1a5b015d8 refactoring: moved document Classification to cora package
13 years ago
Michael Peter Christen a5d7da68a0 refactoring: removed dependency from switchboard in Balancer/CrawlQueues
13 years ago
Michael Peter Christen 33d1062c79 refactoring: the cache belongs to the crawler
13 years ago
Michael Peter Christen 4d5da75814 fix for parser problem if a <a>-tag is 'within' html tags with unclosed
13 years ago
Michael Peter Christen 91a86f0b06 fixed to network graph testing
13 years ago
Michael Peter Christen 7b5b9baee0 added citation rank to ranking profile
13 years ago
Michael Peter Christen 046f3a7e8d check if httpc has decompressed the release file and rename the file
13 years ago
Michael Christen 02e4dedff2 fix to url citation collection
13 years ago
Michael Christen e32055aa15 added stub classes for
13 years ago
Michael Christen ac5d124ee0 experimental implementation of a citation ranking as post-ranking
13 years ago
Michael Christen 8fc86fe397 added storage of full anchor link structure:
13 years ago
Michael Christen 22f05c83ff fixed default must-match filter for full domain crawls - the old filter
13 years ago
Lotus 0b3f39136e allow custom ppm lower than minimum button on /Crawler_p.html
13 years ago
Michael Peter Christen 532c7cf827 added physics experiment to the graph plotter. not active by default
13 years ago
Michael Peter Christen aba9b1bfa0 better names for elements of a linked graph
13 years ago
Michael Peter Christen 0cc0290978 bugfix for a must-not-match pattern check. This bug did not make the
13 years ago
Michael Peter Christen 2fc8ecee36 ConcurrentLinkedQueue has a VERY long return time on the .size() method.
13 years ago
Michael Peter Christen 8aba045ba1 if a new pop-up page is set in config portal, then this page applies
13 years ago
Michael Peter Christen 8c06925984 animation of the web structure picture
13 years ago