Learning Context Conditions for BDI Plan Selection

Dhirendra Singh (1), Sebastian Sardina (1), Lin Padgham (1), Stéphane Airiau (2)

(1) School of Computer Science & Information Technology, RMIT University, Australia
(2) Institute for Logic, Language and Computation, University of Amsterdam, The Netherlands

Summary

We address the plan selection problem in Belief-Desire-Intention (BDI) agent systems.

Context conditions of plans determine their applicability in given situations and must be specified upfront. However, new environments often require changes to these selection conditions. Easing this constraint would allow conditions to be refined after deployment, improving adaptability. Our learning framework augments each plan's context condition with a decision tree, allowing plan applicability to be learnt from experience. Using a probabilistic plan selection function, the agent balances exploration and exploitation of plans while learning online.

Experimentation

We study the impact of goal-plan structures on learning performance. We use synthetic hierarchies that model some features of real BDI programs.

Learning Task

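[Figure: example goal-plan hierarchy. Top-level goal G has plans P1 ... Pi ... Pn; lower levels contain subgoals GA, GB, GA1, GA2, GB1, GB2 and plans PA, PB. Nodes are marked √ (success), × (failure), or ? (unknown).]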

How to record training set: We compare two approaches, a conservative one (BUL) that only records failures when all plan choices are considered well-informed, and an aggressive one (ACL) that records all outcomes.
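The difference between the two recording policies can be sketched as follows. This is an illustrative simplification, not the authors' implementation: the function names, the sample format, and the `well_informed` flag (standing in for the check that all plan choices were well-informed) are assumptions, as is the choice to record successes unconditionally under BUL.

```python
from typing import Dict, List, Tuple

# A training sample: (world state, did the plan succeed?)
Sample = Tuple[Dict[str, bool], bool]

def record_acl(training_set: List[Sample], state: Dict[str, bool],
               succeeded: bool) -> None:
    """Aggressive recording (ACL): every outcome becomes a training sample."""
    training_set.append((state, succeeded))

def record_bul(training_set: List[Sample], state: Dict[str, bool],
               succeeded: bool, well_informed: bool) -> None:
    """Conservative recording (BUL): keep a failure only when the plan
    choices involved are considered well-informed.
    Assumption: successes are always kept."""
    if succeeded or well_informed:
        training_set.append((state, succeeded))

# Hypothetical outcomes: (state, succeeded, well_informed)
outcomes = [
    ({"raining": True}, False, False),   # failure, choices not yet well-informed
    ({"raining": True}, False, True),    # failure, choices well-informed
    ({"raining": False}, True, False),   # success
]

acl: List[Sample] = []
bul: List[Sample] = []
for state, ok, informed in outcomes:
    record_acl(acl, state, ok)
    record_bul(bul, state, ok, informed)

print(len(acl), len(bul))
```

Under these assumptions BUL discards the first failure, since it may reflect an uninformed choice lower in the hierarchy rather than a poor choice of the plan itself.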


1. The imposed BDI hierarchy implies that high-level plans may fail not because they were poor choices for the situation, but because of poor choices further below.
2. Learning is performed online while acting in the environment, so care must be taken over how much confidence to place in each decision tree at any given time.

BDI Architecture

A plan is a rule e : ψ ← δ, where the program δ is a strategy for achieving goal e when the context condition ψ holds. The burden is on the programmer to specify the logical formula ψ perfectly.
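As a minimal sketch of this rule form, a plan can be modelled as a goal, a context condition over beliefs, and a body. The names `Plan` and `applicable_plans` and the travel example are hypothetical illustrations, not taken from the poster or from any BDI platform.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Beliefs = Dict[str, object]

@dataclass
class Plan:
    """Hypothetical encoding of a rule e : psi <- delta."""
    name: str
    goal: str                            # e: the goal the plan handles
    context: Callable[[Beliefs], bool]   # psi: hand-written context condition
    body: List[str]                      # delta: actions or subgoals, as labels

def applicable_plans(library: List[Plan], goal: str,
                     beliefs: Beliefs) -> List[Plan]:
    """Return the plans for `goal` whose context condition holds."""
    return [p for p in library if p.goal == goal and p.context(beliefs)]

# Illustrative plan library with two plans for the same goal.
library = [
    Plan("walk", "travel", lambda b: b["distance_km"] <= 2,
         ["walk_to_destination"]),
    Plan("drive", "travel", lambda b: b["has_car"],
         ["get_keys", "drive_to_destination"]),
]

beliefs = {"distance_km": 1, "has_car": False}
names = [p.name for p in applicable_plans(library, "travel", beliefs)]
print(names)
```

Here each `context` is fixed at design time, which is exactly the constraint the learning framework aims to relax.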

BDI Learning Framework

Each plan's logical context condition is augmented with a decision tree. A probabilistic plan selection function balances exploitation of what the decision trees have learnt so far against further exploration of the state space.
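A minimal sketch of probabilistic selection, assuming each applicable plan already has a numeric selection weight derived from its decision tree. The helper `select_plan` and the example weights are hypothetical.

```python
import random
from typing import Dict, Optional

def select_plan(weights: Dict[str, float],
                rng: Optional[random.Random] = None) -> str:
    """Pick a plan name with probability proportional to its weight."""
    rng = rng or random.Random()
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]

# Seeded generator so the run is repeatable.
rng = random.Random(0)
weights = {"plan_a": 0.8, "plan_b": 0.2}
picks = [select_plan(weights, rng) for _ in range(10_000)]
share_a = picks.count("plan_a") / len(picks)
print(f"plan_a chosen in {share_a:.0%} of selections")
```

Because selection is proportional rather than greedy, the lower-weighted plan is still tried occasionally, which is what lets the agent keep exploring while it exploits what it has learnt.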

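[Figure: BDI architecture with learning. Components: events, pending events, beliefs, BDI engine, plan library, intention stacks, and actions, with components labelled as dynamic or static. Annotations: record outcomes for chosen plans to train decision trees; probabilistically select plans based on ongoing learning.]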

Acting and learning are interleaved. Ongoing learning influences the choice of future actions, which in turn affects subsequent learning and whether a good solution is eventually found.

How to use decision trees: A confidence measure is applied to the decision tree prediction to calculate plan selection weights. Confidence is related to the coverage of paths below a plan.
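One way to combine a tree's prediction with a coverage-based confidence is to pull the predicted success probability toward a neutral 0.5 when coverage is low. The poster does not give the exact formula, so the blending rule, the path-counting definition of coverage, and the function names below are assumptions for illustration only.

```python
def coverage(paths_tried: int, total_paths: int) -> float:
    """Fraction of execution paths below a plan that have been tried.
    Assumption: confidence is taken to be this fraction directly."""
    if total_paths <= 0:
        raise ValueError("total_paths must be positive")
    return min(paths_tried, total_paths) / total_paths

def selection_weight(p_success: float, confidence: float) -> float:
    """Blend the tree's predicted success probability with a neutral 0.5.
    confidence = 0 ignores the tree; confidence = 1 trusts it fully."""
    return 0.5 + confidence * (p_success - 0.5)

# A tree predicts 0.9 success, but only 2 of 8 paths below the plan were tried.
c = coverage(2, 8)
w = selection_weight(0.9, c)
print(c, w)
```

With no coverage every plan receives weight 0.5, so selection amounts to uniform exploration; as coverage grows, the decision tree's prediction increasingly dominates the weight.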

Plans perform primitive actions or post subgoals to be handled in a hierarchical manner.

[Figure: four panels (T1, T2, T3, and T2, 20%) plotting Success (0.0 to 1.0) against Iterations.]

Results comparing ACL+coverage (crosses) and BUL (circles) for various goal-plan hierarchies.

D. Singh, S. Sardina, L. Padgham, S. Airiau, Learning context conditions for BDI plan selection. In Proceedings of Autonomous Agents and Multi-Agent Systems (AAMAS), Toronto, Canada, 2010.
