Improving Scalability in Pub-Sub Knowledge-Based Networking by Semantic Clustering
John Keeney, Dominic Jones, Dominik Roblek, David Lewis, Declan O’Sullivan
Knowledge & Data Engineering Group (KDEG) - Trinity College Dublin, Ireland
{ John.Keeney | jonesdh | roblekd | Dave.Lewis | Declan.OSullivan }@cs.tcd.ie
http://kdeg.cs.tcd.ie/technologies/kbn

The three main types of pub/sub system (type-based, content-based and topic-based networks) allow for normalised subscriptions and publications using a combination of push/pull message delivery. Knowledge-Based Networks (KBNs) extend content-based networks by allowing subscriptions to be matched not only on the contents of messages, but also on some semantics of the message contents [1,2,3]. This creates a fuller, richer and more meaningful system in which publishers and subscribers can be matched using a more expressive subscription mechanism. Here we discuss how a Knowledge-Based Network implementation was extended to support node clustering based on subscription semantics, thereby improving performance and scalability. In extending the Knowledge-Based Network to incorporate semantic-based clustering, this research aims to provide a network environment in which routing nodes, publishers and subscribers are clustered based on their semantic footprint and interests. The benefits of this are twofold. Firstly, it reduces the processing time involved in making routing decisions based on message content: it takes fewer hops to get from source to destination, as these are already closely linked based on the likelihood of a match between the two. Secondly, it allows for natural grouping of like-minded publishers and subscribers, as seen in traditional web forums and newsgroups. The cluster-based approach to pub/sub networks inverts the normal user-based search paradigm: network data is passed from node to node towards those who are most likely to be interested in it, rather than users searching out that same data. A KBN, based on the Siena CBN, has been implemented; it enables the efficient routing of distributed heterogeneous knowledge to, and only to, nodes that have expressed a specific interest in that knowledge. This KBN implementation currently operates on distributed PlanetLab nodes.
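The difference between content-based and semantic matching described above can be illustrated with a minimal Python sketch. It is not the KBN or Siena implementation: the toy ontology, the function names and the choice of a subclass test as the semantic operator are all assumptions made for illustration.

```python
# Toy ontology: child concept -> parent concept.
# Hypothetical data, for illustration only; not taken from the KBN.
SUBCLASS_OF = {
    "Car": "Vehicle",
    "Truck": "Vehicle",
    "Vehicle": "Thing",
    "Sensor": "Thing",
}


def ancestors(concept):
    """Return the concept followed by all of its superclasses, nearest first."""
    chain = [concept]
    while concept in SUBCLASS_OF:
        concept = SUBCLASS_OF[concept]
        chain.append(concept)
    return chain


def content_match(subscribed, published):
    """Content-based filter: the two values must be equal."""
    return subscribed == published


def semantic_match(subscribed, published):
    """Semantic filter (assumed operator): the published concept is the
    subscribed concept itself or one of its subclasses."""
    return subscribed in ancestors(published)


# A subscriber interested in "Vehicle" receives a "Car" publication only
# when the filter takes the ontology into account.
plain_result = content_match("Vehicle", "Car")
semantic_result = semantic_match("Vehicle", "Car")
```

Under these assumptions, a subscription to a general concept also captures publications about its more specific concepts, which is the kind of added expressiveness the text attributes to semantic subscriptions.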
Initially, clusters are statically designed and operated: nodes are assigned to clusters without the possibility of changing clusters once they have joined. Later, users will be able to join and leave clusters independently. Clusters will then be seen as organic structures which users join and leave as their own personal interests change, grow, reform and are refined. Preliminary evaluations show the importance of semantic clustering for efficient performance and network scalability. These evaluations demonstrate that even inflexible, static clustering can have a substantial positive effect. Ongoing research is focusing on how clustering can be performed dynamically as the semantics of the data in the network change. Such clustering may also provide a viable means of forming KBN sub-domains, thereby sharing the load of development and supporting incremental deployment.
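As a rough illustration of static semantic clustering, the sketch below assigns each node once to the cluster whose concept set best overlaps the node's interests. The use of Jaccard similarity, the cluster footprints and the node names are assumptions for this example; the source does not specify how its clusters are formed.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def assign_static(node_interests, cluster_footprints):
    """Assign each node exactly once to the cluster whose footprint
    overlaps most with the node's interests (no later re-assignment)."""
    assignment = {}
    for node, interests in node_interests.items():
        assignment[node] = max(
            cluster_footprints,
            key=lambda name: jaccard(interests, cluster_footprints[name]),
        )
    return assignment


# Hypothetical cluster footprints and node interests (sets of concept names).
clusters = {
    "transport": {"Car", "Truck", "Vehicle"},
    "sensing": {"Sensor", "Thermometer"},
}
nodes = {
    "n1": {"Car", "Vehicle"},
    "n2": {"Thermometer"},
    "n3": {"Truck", "Sensor", "Vehicle"},
}

assignment = assign_static(nodes, clusters)
```

A dynamic scheme of the kind the text anticipates would re-run or incrementally update such an assignment as node interests change; this sketch deliberately does not.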

Current work is also focusing on integrating policy-based cluster management for the KBN [1,3] to support much more sophisticated cluster schemes, in which sub-clusters intercommunicate over a super-peer network. This will support overlapping clusters and hierarchies of clusters under separate administrative control. Policy-driven clustering enables the size of the super-peer network and the size and granularity of peer clusters to reflect the needs of different application domains. For example, the clustering policy may be specified in terms of accuracy and latency as well as the semantic spread of the queryable knowledge base, or in terms of queries across a peer population and the subscription and notification load across that population. This use of clustering policies supports innovation in clustering strategies by allowing peers to introduce new policy elements and the supporting super-peer matching capabilities. The scalability and flexibility of the KBN under high load of heterogeneity will be evaluated using a combination of small-scale and large-scale simulations and deployments, in order to test the message overhead involved in clustering and the effectiveness of semantic load sharing. To facilitate these investigations, a tool has recently been completed that allows the parameterised generation of arbitrary ontologies with defined characteristics, suitable for a diverse set of application scenarios. Ultimately we aim to design and validate differing clustering policies, and to tune a range of semantic distance calculations to make semantic clustering more effective. We will also assess the impact of policies on the coexistence of different reasoning capabilities in KBN nodes. In addition, the effect of semantic interoperability in node matching functions and in inter-cluster communications is being assessed [2].
This requires evaluation of different schemes for injecting newly discovered semantic interoperability mappings into the ontological corpus held by any given cluster, as well as of how these mappings are shared between clusters [2]. We expect that any practical system will need to adapt its clustering to reflect the constantly changing profile of semantics being sent and subscribed to via the KBN. This also raises the interesting possibility of the KBN operating as a knowledge discovery tool. One of the main open questions surrounding the use of ontologies deep in the network at the routing layer is the resulting performance overhead. Previous small-scale studies in this area [1,3] show a definite performance penalty, but this may be acceptable when offset against the increased flexibility and expressiveness of the KBN subscription mechanism. This research will continue to evaluate how the performance of off-the-shelf ontology tools affects the scalability of the KBN at larger scales.

This material is based upon works supported by the Science Foundation Ireland under Grant No 05/RFP/CMS014.

[1] Lewis, D., Keeney, J., O'Sullivan, D., Guo, S., "Towards a Managed Extensible Control Plane for Knowledge-Based Networking", Distributed Systems: Operations and Management Large Scale Management, (DSOM 2006), at Manweek 2006, Dublin, Ireland, October 2006.
[2] Guo, S., Keeney, J., O'Sullivan, D., Lewis, D., "Adaptive Semantic Interoperability Strategies for Knowledge Based Networking", Workshop On Scalable Semantic Web Knowledge Based Systems at OTM 2007, Vilamoura, Algarve, Portugal, November 2007.
[3] Keeney, J., Lewis, D., O'Sullivan, D., "Ontological Semantics for Distributing Contextual Knowledge in Highly Distributed Autonomic Systems", Journal of Network and System Management (JNSM), Volume 15, Number 1, March 2007.
