Hierarchical Planning in BDI Agent Programming Languages Sebastian Sardina

Lavindra de Silva

Lin Padgham

RMIT University [email protected]

RMIT University [email protected]

RMIT University [email protected]

M OTIVATION & O BJECTIVES 1. What is the problem? No lookahead deliberation in available BDI Agent Programming languages and systems: (a) only reactive planning; (b) implemented systems with no formal semantics. 2. What is the aim? Accommodate planning capabilities into BDI Agent Programming language (e.g., A GENT S PEAK) with: (a) formal semantics; (b) direct implementation. 3. What is our solution/approach? Extend the BDI agent language C AN with a new language construct Plan(P ): on-demand planning on P ! 4. What is the outcome? C AN P LAN = C AN + HTN planning 5. What do we leave out? Plan monitoring & re-planning, goal reasoning, and much more...

BDI-P ROGRAMMING

PROPERTIES OF C AN P LAN

T HE C AN L ANGUAGE [WPHT02] • A C AN agent is defined as A = hN , Π, B, A, Γi, where: 1. N is the agent name. 2. Π is a plan library containing plan rules e : ψ ← P e is the triggering event and ψ is the context condition; P is the plan-body from the following language: P ::= act | ?φ | +b | −b | !e | P1; P2 | P1kP2 | Goal(φs, P1, φf ). 3. B is the belief base: current agent’s knowledge. 4. A is the sequence of executed actions. 5. Γ is the intention base: partially instantiated plan-bodies. • Special features in C AN: – Includes failure-handling when a sub-goal fails: =⇒ retries alternative plans for the sub-goal. – Construct Goal(φs, P, φf ): declarative+procedural goal =⇒ achieve goal φs by using plan P ; failing if φf .

Beliefs

Plan Library

A very popular agent-oriented programming methodology. • Key features of BDI agent-oriented systems: Beliefs: information about the world (set of facts). Events: goals/desires to resolve; internal or external. Plan library: recipes for handling goals-events: e : ψ ← P . Intentions: partially instantiated programs; commitment. • A Personal Scheduler Manager example: Events: reqM eeting(p, t) (external); !f reeSlot(t) (internal). Plan library: plans for handling a meeting request with different priorities depending on the person p. 2 Intentions: a partial plan to make slot W ed(3pm) free; a plan to inform John that a meeting is not possible.

BDI Execution Engine 1. Absorb external events 2. Pick an intention 3. Perform step Plan(P) Event

Successful Decomposition

4. Failure recovery • Properties: – Goals-to-do instead of goals-to-be: goT oLocation(Y Y Z) – Decomposition of high-level tasks. – User provides (procedural) domain knowledge. – Some similarities with “programming”. – Subsumes first-principle “STRIP” planning. – Well understood semantics and implementations. • Components of HTN: – State: set of atoms + CWA. – Task: a common goal; primitive or compound. – Task Network: set of tasks T + order/state constraints φ. – Method: a task network to solve high-level task e. HTN solution: a plan σ that stands for a full decomposition of a top-level task. Operational and model-theoretic semantics were given in [Erol, Henlder & Nau 94]:

=⇒ sol(d, B, D): set of all plans that solve d.

BDI

AND

HTN: S IMILARITIES

BDI S YSTEMS belief base plan library event action plan-body/program plan rule plan rule context test ?φ in plan-body sequence in plan-body parallelism in plan-body

HTN S YSTEMS state method library high-level task primitive task network task method method precondition state constraints ordering constraint ≺ no ordering constraint

external events sensors

action commands actuators

Enviroment

F ORMALIZATION

1. Two basic transitions: hB, A, P i −→ hB ′, A′, P ′i 2. Agent level transition: hN , Λ, Π, B, A, Γi =⇒ hN , Λ, Π, B ′, A′, Γ′i bdi

P ∈ Γ hB, A, P i −→ hB ′, A′, P ′i Astep ′ ′ ′ hN , Λ, Π, B, A, Γi =⇒ hN , Λ, Π, B , A , (Γ \ {P }) ∪ {P }i

!e −→ Lψ1 : P1, . . . , ψn : PnM −→ Pi ⊲ Lψ1 : P1, . . . , ψi−1 : Pi−1, ψi+1 : Pi+1, . . . , ψn : PnM −→ Lψ1 : P1, . . . , ψi−1 : Pi−1, ψi+1 : Pi+1, . . . , ψn : PnM −→ . . . ∆ = {ψiθ : Piθ | e′ : ψi ← Pi ∈ Π ∧ θ = mgu(e, e′)} Event hB, A, !ei −→ hB, A, L∆Mi

bdi

plan





hB, A, P1i −→ 6

hB, A, P1 ⊲ P2i−→hB, A, P2i ′



plan∗

hB, A, P i −→ hB , A , P i hB , A , P i −→ hB f , Af , nili bdi

⊲f

bdi









FOR

D EC . G OALS (A) (B) √ √ √ √ √ √ √ √

(C) (D) (E) √ √ √ √ √ √ √ √

(A) P is used towards the eventual satisfaction of goal φs. (B) Allow partial executions of P . (C) Commitment on the goal φs. (D) Goal dropping mechanism. (E) P may be solved partially.

hB, A, Plan(P )i −→ hB , A , Plan(P ′)i

2. Using planning for learning new plans When no relevant plan is available =⇒classical planning. Add new plan to the BDI plan library Π 3. Planning in the context of other intentions. 4. Planning up to some level of abstraction Leave details to the BDI execution engine. 5. Use S HOP 2 for implementing planning in J ACK.

R ELATED W ORK 1. Other formal BDI-style systems (with no planning): A GENT S PEAK, 3APL, D M ARS.

ψi : Pi ∈ ∆ B |= ψiθ hB, A, L∆Mi −→ hB, A, Piθ ⊲ L∆ \ PiMi Sel hB, A, nil ⊲ P2i−→hB, A, nili

But: • No concurrency k and no goal-programs Goal(φs, P, φf ). • Converts JS HOP’s total-order plans into partial-order plans. • Planner returns the relevant methods and bindings; the BDI execution engine then follows step-by-step the decomposition suggested by the planner.

1. Planning upon failure (like P ROPICE -P LAN & C PEF): =⇒ P ⊲ Plan(P ′).

bdi/plan

⊲t

I MPLEMENTATION OF C AN P LAN

F UTURE W ORK

C AN P LAN

OF

1. If Plan(P ) can perform a step, then Plan(P ) never fails in an environment-free agent execution. 2. All executions of Plan(P ) corresponds to HTN solutions of P w.r.t. the agent’s plan-library Π. 3. P simulates Plan(P ) in every C AN P LAN agent with no concurrency k and no goal-programs Goal. =⇒ Plan(Plan(P1)kP2) = Plan(P1kP2) 6= Plan(P1)kP2.

A LTERNATIVES Plan(P ; ?φs) Plan(Goal(φs, P, φf )) Goal(φs, Plan(P ), φf ) Goal(φs, Plan(P ; ?φs), φf ) Goal(φs, Plan(Goal(φs, P, φf )), φf )

Select

HTN P LANNING

• Technical results:

P LANNING

Action

HTN Planner

Retry

1. Plan is built-in within the BDI execution cycle. 2. Plan’s semantics is defined “in terms” of the BDI cycle. 3. But failure-handling is left to the BDI cycle only.

Theorem 2 is the basis for an implementation: =⇒ J ACK+ JS HOP [Lavindra & Padgham 05]

C AN P LAN = C AN + Plan Intentions

• Some remarks about Plan:

Plan

Most of the semantic rules are shared between the BDI cycle and the planning sub-module!

2. Systems/architectures mixing planning with BDI execution: • Planners that interleave execution: I PEM; S AGE. • Planning always: R ETSINA. • Planning on failure: P ROPICE -P LAN; C PEF. 3. I NDI G OLOG [De Giacomo & Levesque 99]: • Execution+planning in the situation calculus. • Inspired semantics of Plan(P ) from search construct Σ(δ). • Not a typical programming language in the BDI tradition • G OLOG and HTN planning [Gabaldon 02].

Sebastian Sardina Lavindra de Silva Lin Padgham

RMIT University [email protected].au ... User provides (procedural) domain knowledge. – Some similarities with ... N is the agent name. 2. Π is a plan library ...

93KB Sizes 1 Downloads 154 Views

Recommend Documents

Dhirendra Singh Sebastian Sardina Lin Padgham ...
School of Computer Science & Information Technology, RMIT University, ... tion of plans, while learning online. ... level plans may fail not because they were.

Dhirendra Singh Sebastian Sardina Lin Padgham Geoff ...
CSIRO Energy Technology, Sydney, Australia. Summary. This paper extends our earlier work integrating learning to improve plan selection in the popular. Belief ...

Nitin Yadav, John Thangarajah, and Sebastian Sardina ...
Coverage g/p : 2. 3. 4 p/g : 2. 3. 4. FastDownward McMAS NuSMV. Percentage of instances completed in 10 minutes. Time comparison (2-2-8). 0.05. 0.50. 2.00.

Nitin Yadav and Sebastian Sardina. RMIT University ...
BDI agents outside the coalition: not augmented. 2. M |= 〈〈A〉〉ω,ϱϕ can be checked in exponential time on the number of agents |A| and goals maxa∈A(|ϱ[a]|).

Nitin Yadav and Sebastian Sardina RMIT University ...
Value of controller: Measures degree of target's expected realizability. Reward gained on ... Optimal policy for MS,T ≡ Optimal controller for T in S. • Existence of ...

PR De Silva Sadosky prize.pdf
Jun 3, 2015 - analysis, to advance the careers of the prize recipients, ... depth, and enormous technical skills are. evident ... PR De Silva Sadosky prize.pdf.

OTERO SILVA-La muerte de Honorio.pdf
Podrás verlo cuando aparezca, si. te inclinas un poco. Page 3 of 60. OTERO SILVA-La muerte de Honorio.pdf. OTERO SILVA-La muerte de Honorio.pdf. Open.

Lin-English.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Lin-English.pdf.

lin-bs.pdf
Keywords: CAPTCHAs Recognition; Handwriting recognition; Shape. context. Page 3 of 51. lin-bs.pdf. lin-bs.pdf. Open. Extract. Open with. Sign In. Main menu.

sebastian kneipp pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. sebastian ...

LIN CONG
2013. Stanford Asian American Award. 2013. The Gerald J. Lieberman Fellowship. 2012–2013. Peter F. DeVos Fellowship. 2012–2013. Dimitrijevic Fellowship. 2012–2013. Zephyr Prize for Best Paper in Corporate Finance, The 25th AFBC. 2012. Prize Win

Belle and sebastian fold
Rough guide pdf.57689107719 - Download Belleand sebastian fold.Theadorable ... Newyork undercover is_safe:1.Win 10 ... It's my life.Network datarecovery.

Dearly Beloved - Sebastian Wolff.pdf
Kingdom Hearts. Yoko Shimomura. Arrangement by .... Dearly Beloved - Sebastian Wolff.pdf. Dearly Beloved - Sebastian Wolff.pdf. Open. Extract. Open with.

Sheela Sebastian Vs R Jawaharaj.pdf
IPC is not satisfied in view of what has been stated under. 3. Page 3 of 19. Main menu. Displaying Sheela Sebastian Vs R Jawaharaj.pdf. Page 1 of 19.

Silva intuition system.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Silva intuition ...

Johann Sebastian Bach.pdf
Nov 25, 2013 - Suo padre Johann Ambrosius era. violinista di corte ad Arnstadt. Fedele alla tradizione, il giovane Johann Sebastian iniziò gli studi musicali in ...

Descargar algebra sebastian lazo pdf
Page 3 of 23. Descargar algebra sebastian lazo pdf. Descargar algebra sebastian lazo pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Descargar ...

noah-by-sebastian-fitzek.pdf
developer for numerous media companies in Europe. He lives in Berlin and is currently working in. the programme management of a major capital radio station.

INVITATION RENCONTRE LIN CHANVRE BIO.pdf
yesterday at Philadelphia Interna- tional Airport, Reagan denied he. remembered anything concerning a. scheme to divert funds from the. Wharton ..... Whoops! There was a problem loading this page. Retrying... INVITATION RENCONTRE LIN CHANVRE BIO.pdf.

PC-PL-14 - Debra de Silva [Compatibility Mode].pdf
more supported. 50% of families. say they feel. supported. • Ask at clinic /. homes visits. • Quick survey. Survey. Please answer these questions to help us.

JANAINA DA SILVA MATIAS.pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Retrying... Whoops! There was a problem previewing this document. Retrying.

ONLI OUP chapter Lin clean
and C'est livre est de Charles 'This book is of Charles (literal translation)' and in the ..... DP2), which has been raised onto the specifier position above the VP to ...