Machine Learning of Harmonic Relationships which Maximise Source Detection and Discrimination

Thomas A. Lampert and Simon E. M. O’Keefe

Department of Computer Science, University of York, York, U.K.
{tomal, sok}@cs.york.ac.uk

1 Extended Abstract

Typically, acoustic data received via passive sonar systems in underwater environments is transformed into the frequency domain using the Short-Time Fourier Transform. This allows for the construction of a spectrogram image in which time and frequency are the axes and intensity represents the power at a particular time and frequency. It follows that if a (stationary or non-stationary) periodic component is present during a number of consecutive time frames, a track or line will be present within the spectrogram.

The problem of automatically detecting these tracks drew increasing attention in the literature during the mid-1980s, and research expanded during the 1990s and the early 21st century. It is an ongoing area of research with contributions from a variety of backgrounds, ranging from statistical modelling and image processing to expert systems. Track detection forms a critical stage in the detection and classification of sources in passive sonar systems and in the analysis of vibration data. Applications are wide ranging and include identifying and tracking marine mammals via their calls; identifying ships, torpedoes or submarines via the noise radiated by their mechanics; meteor detection; and speech formant tracking.

Recently, track detection algorithms have been proposed which aim to boost detection rates in low signal-to-noise ratio spectrograms by integrating information from locations in the image determined by harmonic relationships in the signal [1]. These relationships, the relative spacing between tonal harmonics and the fundamental frequency, are characteristic of the particular mechanical components within a source, such as the propulsion and auxiliary machinery (engines, motors, reduction gears, generators, pumps, etc.) [2]. Algorithms of this sort can be tailored to detect a particular source even when its harmonic relationships are not integer multiples of the fundamental but follow some arbitrary linear relationship.
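
The idea of integrating spectrogram power at harmonically related locations can be illustrated with a minimal Python sketch. It is not the active contour algorithm of [1]; it is a simplified stand-in that scores each candidate fundamental by summing the power found at the frequency bins predicted by a set of harmonic ratios. The function names (stft_power, harmonic_score, best_fundamental), the Hann window, the frame sizes and the ratios 1, 2 and 3.5 are all assumptions made for illustration.

```python
import cmath
import math


def stft_power(signal, frame_len, hop):
    """Spectrogram as a list of frames; each frame holds the power in
    bins 0 .. frame_len // 2 (naive DFT of a Hann-windowed frame)."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    n_bins = frame_len // 2 + 1
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        chunk = [signal[start + n] * window[n] for n in range(frame_len)]
        frame = []
        for k in range(n_bins):
            coeff = sum(chunk[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                        for n in range(frame_len))
            frame.append(abs(coeff) ** 2)
        frames.append(frame)
    return frames


def harmonic_score(frame, fundamental_bin, ratios):
    """Sum the power at bin fundamental_bin * r for every ratio r.
    The ratios need not be integers (an assumed linear relationship)."""
    total = 0.0
    for r in ratios:
        k = round(fundamental_bin * r)
        if 0 <= k < len(frame):
            total += frame[k]
    return total


def best_fundamental(frame, candidate_bins, ratios):
    """Candidate fundamental whose harmonic pattern collects the most power."""
    return max(candidate_bins, key=lambda k: harmonic_score(frame, k, ratios))


# Synthetic source (assumed): components at 1x, 2x and 3.5x a 4 Hz fundamental.
# With fs = 64 and 64-sample frames, bin k corresponds to k Hz.
fs = 64
ratios = (1.0, 2.0, 3.5)
signal = [sum(math.sin(2 * math.pi * 4 * r * t / fs) for r in ratios)
          for t in range(256)]
spec = stft_power(signal, frame_len=64, hop=32)
estimates = [best_fundamental(frame, range(1, 10), ratios) for frame in spec]
print(estimates)
```

Summing over the whole harmonic pattern, rather than inspecting one bin, is what allows a weak source to be separated from noise: a spurious peak in a single bin rarely coincides with peaks at all of the other predicted locations.
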
Currently these harmonic relationships are manually determined, either through observation, or through analysis of a source’s mechanical structure. In remote sensing applications it may not be possible to have a priori knowledge regarding a source’s mechanical components. Additionally, different operating conditions may excite or inhibit the mechanisms which produce particular harmonics and therefore the components which are observed. This complicates the manual identification of a source’s characteristic harmonics. Machine learning techniques can be applied to this problem, to determine
automatically the linear relationships of harmonic components which identify the source under varying conditions.

One drawback of supervised machine learning is the requirement for manually labelled ground truth data. If this is not available, there are two ways to overcome the problem: utilising unsupervised learning techniques, which removes the requirement for ground truth data; or employing supervised learning techniques but using noisy, automatically generated, ground truth data. This noisy ground truth data can be generated using a detection mechanism which has a high true positive as well as a high false positive detection rate (a common trade-off when performing detection within noisy data). If a suitable supervised machine learning technique is applied, and enough training data is available, the relationships between true frequency components, which are common between multiple observations, will be reliably discovered.

An additional complication in the automatic discrimination of sources based upon harmonic components is that subsets of these components belonging to distinct sources may overlap. The degree to which they overlap will directly influence a system’s ability to distinguish between the sources which share common subsets. Multi-objective optimisation can be employed to minimise these effects by determining the optimal combination of components which uniquely identifies each source with respect to all other sources, thus optimising the system’s ability to discriminate between sources.

This type of optimisation problem is well suited to supervised machine learning techniques which are able to optimise complex hypotheses. Evolutionary computing methods such as genetic algorithms are one such technique [3]. These stochastic search algorithms search a large space of hypotheses, progressively refining multiple competing hypotheses until an optimal solution is found according to a predefined fitness function.
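
The combination of noisy, automatically generated ground truth and a genetic search can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' system: the candidate ratios, the two sources "A" and "B", the detector rates (0.9 true positive, 0.3 false positive) and all function names are hypothetical, and the fitness is a single scalar for one target source rather than a genuine multi-objective formulation. With more than two sources, the second argument to make_fitness could hold, for each component, the highest hit rate among the other sources.

```python
import random

# Hypothetical candidate harmonic ratios (component frequency / fundamental).
CANDIDATES = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5]

# Assumed true components per source, used only to generate synthetic data.
TRUE_COMPONENTS = {
    "A": {1.0, 2.0, 3.5},
    "B": {1.0, 2.5, 4.0},
}


def noisy_detections(true_set, n_obs, rng, tp_rate=0.9, fp_rate=0.3):
    """Simulate a permissive detector: most true components are reported,
    but so are many spurious ones."""
    observations = []
    for _ in range(n_obs):
        obs = set()
        for ratio in CANDIDATES:
            rate = tp_rate if ratio in true_set else fp_rate
            if rng.random() < rate:
                obs.add(ratio)
        observations.append(obs)
    return observations


def hit_rates(observations):
    """Fraction of observations in which each candidate was detected."""
    return [sum(r in obs for obs in observations) / len(observations)
            for r in CANDIDATES]


def make_fitness(target_rates, other_rates):
    """Reward components seen often for the target and rarely elsewhere."""
    margins = [t - o for t, o in zip(target_rates, other_rates)]

    def fitness(mask):
        return sum(m for m, bit in zip(margins, mask) if bit)

    return fitness


def evolve(fitness, n_bits, rng, pop_size=30, generations=60, p_mut=0.125):
    """Small generational GA: tournament selection, one-point crossover,
    bit-flip mutation, and one elite individual carried over unchanged."""
    pop = [[rng.random() < 0.5 for _ in range(n_bits)] for _ in range(pop_size)]
    for _ in range(generations):
        elite = max(pop, key=fitness)
        children = [elite[:]]
        while len(children) < pop_size:
            a = max(rng.sample(pop, 3), key=fitness)
            b = max(rng.sample(pop, 3), key=fitness)
            cut = rng.randrange(1, n_bits)
            child = a[:cut] + b[cut:]
            # Flip each bit independently with probability p_mut.
            child = [bit != (rng.random() < p_mut) for bit in child]
            children.append(child)
        pop = children
    return max(pop, key=fitness)


rng = random.Random(0)
obs_a = noisy_detections(TRUE_COMPONENTS["A"], 200, rng)
obs_b = noisy_detections(TRUE_COMPONENTS["B"], 200, rng)
fitness_a = make_fitness(hit_rates(obs_a), hit_rates(obs_b))
best = evolve(fitness_a, len(CANDIDATES), rng)
selected = [r for r, bit in zip(CANDIDATES, best) if bit]
print(selected)
```

In this toy setting the components unique to source A (2.0 and 3.5) have a large positive margin and those unique to source B a large negative one, so the search separates them reliably. The shared component (1.0) and the purely spurious candidates have margins near zero, so whether they are selected depends on sampling noise; a small per-component penalty in the fitness would be one way to favour compact sets.
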
As these algorithms perform searches in large spaces, the optimisation can take time. However, once the system has been designed, the optimisation is a fully automatic process which is performed off-line and only needs to be repeated when a new set of sources is to be included.

In conclusion, as far as we are aware, machine learning techniques have not been applied to the area of automatic detection and discrimination within acoustic data in underwater environments. This extended abstract has outlined two areas in which their application could improve existing systems: the automatic identification of reliable time-invariant features for remote sources, and the optimisation of these features for source discrimination and detection. Issues concerning the application of these methods have also been outlined and methods to resolve them have been proposed.

References

1. Lampert, T.A., O’Keefe, S.E.M.: Active contour detection of linear patterns in spectrogram images. In: Proc. of ICPR’08. (December 2008) 1–4
2. Urick, R.: Principles of Underwater Sound. 3rd edn. McGraw-Hill, New York (1983)
3. Mitchell, T.M.: Machine Learning. McGraw-Hill, New York (October 1997)
