“Lossless Value Directed Compression of Complex User Goal States for Statistical Spoken Dialogue Systems” Paul A. Crook and Oliver Lemon, Heriot-Watt University, Edinburgh

1

Outline • The problem: • Real user goals vs. simplified dialogue state representation in POMDP SDSs

• The idea: • Automatic state space compression • Value–Directed Compression (VDC)

• Experiment and results: • Lossless VDC (using Krylov iteration) 2

The Problem • A spoken dialogue system (SDS) must: 1. Determine the user’s goal (e.g. plan suitable meeting times or find an Indian restaurant ) 2. Do this under uncertainty (e.g. from ASR) 3. Compute the optimal next system action (e.g. offer a restaurant, ask for clarification).

• POMDP systems address these problems e.g. Young et al. [2010 CSL]



….. but they use impoverished user goal representations to ensure tractability of planning. 3

Typical POMDP b

b(s) = P(s|b)

s

s

s’

a o

o’

A POMDP is defined as a tuple 4

Current POMDP Systems • Use simplified state spaces and/or hand-crafted state space compressions • For example the state s is typically factored into some dialogue history h and some user goal g 5

Current POMDP Systems • The next design decision is what level of complexity should ‘g’ represent

• Typically the set of user goals G (g ∈ G) is either • the set of domain objects, e.g. the set of restaurants { Tail-end, Pizza Express, Maison Bleue, … }

• or features of the domain objects that are assumed to be independent, e.g. food { fish, pizza, french … }, price { budget, mid-range, expensive }, location { city centre, ... } 6

Current POMDP Systems • Even when a independence assumptions aren’t used the state space is often summarised and a compressed state space is used by the policy learner/executer.

7

“Real User Goals” • Independence assumptions dramatically reduce the size of the POMDP belief space: • e.g. from over 300K real-valued space to a 4-valued space

• But `real user goals’ can be sets of targets, with complex combinations of attributes [Crook and Lemon 2010 SIGDIAL]

8

“Real User Goals” “A cheap Thai nearby or an expensive Italian downtown” “Chinese as long as it’s not too cheap”

• The former would be a problem for a system where the user goal states assume users are only interested in one domain object (one restaurant) at a time. • The latter is a problem for systems where food type, price range, quality are modelled as independent. 9

“Real User Goals” • Use of real, complex, user goals should lead to more flexible and natural SDS.

• However….

10

“Real User Goals”

Simple example:

sets of objects with two attributes attribute u with values u1, u2, u3, and attribute v with values v1

Generates 8 possible user goal states.

In general: leading to a very large state spaces!

11

Research Question • Can we build POMDP systems with more realistic representations of user goals WHILE maintaining tractability ? • And can we automatically compute the compressed state spaces — thus reducing design effort ?

• Initial method: Value Directed Compression using Krylov iteration for lossless compression [Poupart 2005 PhD thesis] 12

VDC using Krylov Iteration • This is an off-line compression algorithm. • Data driven: • The POMDP to compress has to be fully specified including transition and observation probabilities.

13

Algorithm • Construct a vector for each action which contains the associated rewards for that action in each state. • Retain those vectors that are linearly independent — these are the initial basis vectors • Generate new vectors by applying observation and transition matrices

• Test and retain new linearly independent vectors • Repeat until no new linearly independent vectors found or number of vectors = number of states. 14

Algorithm • Basis vectors = compressed state space • Value of being in a state is computed via linear sum of basis functions

• No loss in precision • Policy can then be learnt and executed in the reduced state space.

15

Example Dialogue Task • Search task over objects with 3 attributes: • One attribute with 3 different values, the other two attributes can each take 2 values.

• • • •

Generates 4,096 user goal sets (states) System has 23 dialogue actions There are 49 possible observations Reward function: • +10 if presented with goal • -10 if presented with non-goal • -1 per step 16

Transitions & Observation Probabilities • Assume a user goal doesn’t change during a dialogue, i.e. transition probabilities = identity • Thus degree of compression obtained is indirectly related to the observation probabilities • Artificial but realistic set of observation frequencies e.g. system confirms an attribute value in users goal set user response: 0.16 yes, 0.16 yes & provide an additional goal attribute, 0.13 no & provide alternative attribute goal value, 0.11 provide an additional goal attribute, ⋮ 0.01 no & provide object from goal set. 17

Results • 4096 state problem gets compressed to 630 states. • a compression of approximately 6.5 times

• The same task using two values per attribute (=256 states) failed to compress. • Speculate: larger tasks may exhibit greater compression? 18

Summary • A lossless 6 fold reduction in the state space of a small but fairly realistic POMDP SDS task • The first time that automatic compression has been demonstrated for complex user goals • Should result in more natural statistical SDS, without loss in tractability.

19

Future Work • Apply this work to a real system. • To that end – coming soon... a statistical SDS for obtaining restaurant recommendations in Edinburgh

http://sites.google.com/site/abcpomdp

Thank you for listening! 20

“Lossless Value Directed Compression of Complex User Goal States ...

Real user goals vs. simplified dialogue state ... price { budget, mid-range, expensive }, location { city centre, ... } Current POMDP Systems. 6 ... Data driven:.

851KB Sizes 0 Downloads 82 Views

Recommend Documents

Lossless Value Directed Compression of Complex ... - Semantic Scholar
School of Mathematical and Computer Sciences (MACS). Heriot-Watt University, Edinburgh, UK. {p.a.crook, o.lemon} @hw.ac.uk .... 1In the case of a system that considers N-best lists of ASR output. 2Whether each piece of information is filled, ...

Lossless Value Directed Compression of Complex ... - Semantic Scholar
(especially with regard to specialising it for the compression of such limited-domain query-dialogue SDS tasks); investigating alternative methods of generating ...

Universal lossless data compression algorithms
2.7 Families of universal algorithms for lossless data compression . . 20 .... A full-length movie of high quality could occupy a vast part of a hard disk.

Lossless Compression – Slepian Wolf
May 5, 2009 - decoder has access to K, it can decode Y and get X=Y+K. • In general case, partition the space into cosets associated with the syndromes of the principal underlying channel. (repetition code here). • Encoding – Compute Syndrome co

EBOOK Lossless Compression Handbook - Khalid ...
Aug 15, 2002 - *Invaluable resource for engineers dealing with image processing, signal processing, multimedia systems, wireless technology and more.

Universal lossless data compression algorithms
4.1.3 Analysis of the output sequence of the Burrows–Wheeler transform . .... main disadvantages of the PPM algorithms are slow running and large memory.

level-embedded lossless image compression
information loss, in several areas–such as medical, satellite, and legal imaging– lossless ..... tation of picture and audio information - progressive bi-level.

Anchors-based lossless compression of progressive triangle meshes
PDL Laboratory, National University of Defense Technology, China. [email protected] ..... Caltech Multi-Res Modeling Group. References. [1] P. Alliez ...

Factorization-based Lossless Compression of ... - Research at Google
A side effect of our approach is increasing the number of terms in the index, which ..... of Docs in space Θ. Figure 1 is an illustration of such a factor- ization ..... 50%. 60%. 8 iterations 35 iterations. C o m p re ssio n. R a tio. Factorization

Factorization-based Lossless Compression of Inverted ...
the term-document matrix, resulting in a more compact inverted in- ... H.3.1 [Information Storage And Retrieval]: Indexing methods .... an approximate solution.

LEARNING OF GOAL-DIRECTED IMITATION 1 ...
King's College London. Cecilia Heyes ... Over the past decade, considerable evidence has accumulated to support the suggestion that .... computer keyboard; immediately beyond the keyboard was a manipulandum; and 30 cm beyond the ...

A Lossless Color Image Compression Architecture ...
Abstract—In this paper, a high performance lossless color image compression and decompression architecture to reduce both memory requirement and ...

Gray-level-embedded lossless image compression
for practical imaging systems. Although most ... tion for the corresponding file size or rate. However ... other values generalize this notion to a partition- ing into ...

Optimal synchronization of directed complex networks
Jun 23, 2016 - L is the Laplacian matrix defined for directed networks with entries defined ... values ri populate the diagonal matrix R ј diagрr1,…,rNЮ.

Modeling goal-directed spatial navigation in the rat ... - Boston University
data from the hippocampal formation ... encoding in ECIII and CA3, which enabled us to simulate data on theta phase precession of place ... Previous analytical.

110801 Early Goal-Directed Therapy in the Treatment
Nov 8, 2001 - ated mortality rates have remained constant or have decreased only ...... also be ordered for a fee through the Internet (http://www.nejm.org).

Modeling goal-directed spatial navigation in the rat ...
encoding in ECIII and CA3, which enabled us to simulate data on theta phase precession of place ... different animal species during exploration and orientation.

110801 Early Goal-Directed Therapy in the Treatment
Nov 8, 2001 - 30 ml per kilogram of body weight over a 30-minute period) or .... www.nejm.org at GALTER HEALTH SCIENCES LIBRARY on April 22, 2007 .

110801 Early Goal-Directed Therapy in the Treatment
Nov 8, 2001 - partment of Emergency Medicine, Henry Ford Hospital, 2799 West Grand .... saturation was less than 70 percent, dobutamine administration was ...... Rangel-Frausto MS, Pittet D, Costigan M, Hwang T, Davis CS, Wenzel. RP.

110801 Early Goal-Directed Therapy in the Treatment
Nov 8, 2001 - Physiology and Chronic Health Evaluation (APACHE II) .... CVP central venous pressure, MAP mean arterial pressure, ScvO2 central.

Hitwe meets its goal of optimal user engagement ... Services
eliminated discrepancies between our internal BI and attribution tool, so we're more confident in our marketing spend.” Hitwe meets its goal of optimal user engagement with Firebase. - Ann Lavrisheva. PPC Manager, Hitwe. Goals. • Understand user

Dynamics of affective states during complex learning
contexts include a multitude of computer environments, such as preparation .... inevitably accompanied by failure, so the learner experiences a host of affective ...