COMPLEXITY OF BEZOUT’S THEOREM VI: GEODESICS IN THE CONDITION (NUMBER) METRIC MICHAEL SHUB

Abstract. We introduce a new complexity measure of a path of (problems, solutions) pairs in terms of the length of the path in the condition metric which we define in the article. The measure gives an upper bound for the number of Newton steps sufficient to approximate the path discretely starting from one end and thus produce an approximate zero for the endpoint. This motivates the study of short paths or geodesics in the condition metric.

1. Introduction In a series of papers we have studied the complexity of solving systems of homogeneous polynomial equations by applying Newton’s method to a homotopy of (system,solution) pairs [7], [8],[9], [10], [11]. The latest word in this direction is [1],[2]. A key ingredient is an estimate of the number of Newton steps by the maximum condition number along the path multiplied by the length of the path of solutions. The main result of this paper is to show that the maximum condition number times the length may be replaced by the integral of the condition number times the length of the tangent vector to the path, Theorem 3. The result suggests using the condition number to define a Riemannian metric on the solution variety and to study the geodesics of this metric. Finding a geodesic is in itself not easy. So we do not immediately obtain a practical algorithm. Rather the study may help to understand in some systematic fashion the geometry of homotopy algorithms, especially those that attempt to avoid ill-conditioned problems. We note that recentering algorithms in linear programming theory may be seen as adaptively avoiding ill-conditioning. [6] compares the central path of linear programming theory to geodesics in an appropriate metric. In [3] we begin studying distances in the condition metric. 2. Definitions and Theorems We begin by recalling the context. For every positive integer l ∈ N, let Hl ⊆ C[X0 , . . . , Xn ] be the vector space of all Q homogeneous polynomials of degree l. For n (d) := (d1 , . . . , dn ) ∈ Nn , let H(d) := i=1 Hdi be the set of all systems f := (f1 , . . . , fn ) of homogeneous polynomials of respective degrees deg(fi ) = di , 1 ≤ i ≤ n. So f : Cn+1 → Cn . We denote by D := max{di : 1 ≤ i ≤ n} the maximum of the degrees. Date: February 1,2007. 2000 Mathematics Subject Classification. Primary 65H10, 65H20. Key words and phrases. Approximate zero,homotopy method, condition metric. This work was partly supported by an NSERC Discovery Grant. 1

2

MICHAEL SHUB

The solution variety Vˆ ⊂ H(d) ×(Cn+1 \{0}) is the set of points {(f, x)|f (x) = 0}. Since the equations are homogeneous, for all λ1 , λ2 ∈ C \ {0}, λ1 f (λ2 x) = 0 if and only if f (x) = 0. So Vˆ defines a variety V ⊂ P(H(d) ) × P(Cn+1 ) where P(H(d) ) and P(Cn+1 ) are the projective spaces corresponding to H(d) and Cn+1 respectively. Vˆ and V are smooth. We speak interchangeably of a path (ft , ζt ) in Vˆ and its projection (ft , ζt ) in V. Most quantities we define are defined on Vˆ but are constant on equivalence classes so are defined on V. For (g, x) ∈ P(H(d) ) × P(Cn+1 ) let 1

di −1 2 µnorm (g, x) = ||g||||Dg(x)|−1 )|| Nx ∆(di ||x||

where ||g|| is the unitarily invariant norm defined by the unitarily invariant Hermitian structure on H(d) considered in [7] and sometimes called the BombieriWeyl or Kostlan Hermitian structure, ||x|| is the standard norm in Cn+1 , Nx is the Hermitian complement of x, and ∆(ai ) for ai ∈ C, i = 1 . . . n is the n × n diagonal matrix with ith diagonal entry ai . If Dg(x)|−1 Nx does not exist we take µnorm = ∞. µnorm , also called µproj in some of our papers, is a normalized version of the condition number which we usually denote by µ. We have also used various notions of distance in projective space. In this paper we use only the Riemannian distance inherited from the Hermitian structure on the vector space, i.e. the angle. We denote this distance by d(., .). The space and Hermitian structure we are considering will be clear from the context. In previous papers we have paid careful attention to the constants. In this paper we are more cavalier. We begin with an analysis of how the normalized condition number varies. Proposition 1. Given  > 0 there is a constant C > 0 such that if gP(H(d) ) and ζ, ηP(Cn+1 ) with d(ζ, η) <

C D3/2 µnorm (g, ζ)

Then µnorm (g, η) ≤ (1 + )µnorm (g, ζ). Proof. Use proposition 2.3 of [10]. In the notation of that proposition note that u ≤ D3/2 µnorm (g, η) · d(η, ζ) < C. r0 ∼ d(η, ζ), the η(f, x) in the definitions of K D−1 ∼ (1 + d(ζ, η))D−1 < ec .  is ≤ 1 and kηk kζk Proposition 2. Given  > 0 there is a constant C > 0 such that if f, gP(H(d) ) and ζP(Cn+1 ) and d(f, g) <

C , D1/2 µnorm (f, ζ)

then µnorm (g, ζ) ≤ (1 + )µnorm (f, ζ)

COMPLEXITY OF BEZOUT’S THEOREM VI

3

Proof. By Proposition 5b) of [7] SectionI-3. µnorm (g, ζ) ≤ ≤

µnorm (f, ζ)(1 + d(f, g)) 1 − D1/2 d(f, g)µnorm (f, ζ) µnorm (f, ζ) 1 +

C D 1/2 µnorm (f,ζ)



1−C

Recall that µnorm (f, ζ) ≥ 1.



Theorem 1. Given  > 0 there is a constant C > 0 such that if f, gP(H(d )) and ζ, ηP(Cn+1 ) and C

d(f, g) <

D1/2 µnorm (f, ζ) C

d(ζ, η) <

D3/2 µnorm (f, ζ)

,

then 1 µnorm (g, η) ≤ µnorm (f, ζ) ≤ (1 + )µnorm (g, η). 1+ Proof. Apply Propositions 1 and 2 to prove the left hand inequality. Then given the left hand inequality apply Propositions 1 and 2 again to prove the right hand inequality applying the theorems with g, η in place of f, ζ. Adjust C and  as necessary.  The next proposition is useful for our Main Theorem 3. Proposition 3. Given  > 0 there is a C > 0 with the following property: Let (ft , ζt ) be a C 1 path in V for t0 ≤ t ≤ t1 . Define S0 = t0 and Si to be the first value of t ≤ t1 such that ZSi

(kf˙t k + kζ˙t k)dt =

Si−1

C D3/2 µnorm (fSi−1 , ζSi−1 )

or t1 .

Then Sk = t1 for (1 + ) 3/2 k ≤ max(1, D C

Zt1

µnorm (ft , ζt )(kf˙t k + kζ˙t k)dt)

t0 k

and µnorm (ft1 , ζt1 ) ≤ (1 + ) µnorm (ft0 , ζt0 ).

4

MICHAEL SHUB

Proof. ZSi

ZSi

1 µnorm (ft , ζt )(kf˙t k + kζ˙t k)dt ≥ (1 + )

Si−1

µnorm (fSi−1 , ζSi−1 ) (kf˙t k + kζ˙t k)dt

Si−1



1 1+ C D3/2

if Si < t1 .

Consequently ZSk

µnorm (ft , ζt ) (kf˙t k + kζ˙t k)dt

S0



(k − 1)C and (1 + )D3/2

C ≤ (k − 1) (1 + )D3/2

Zt1

µnorm (ft1 ζt ) (kf˙t k + kζ˙t k)dt,

t0

so k − 1 ≤

(1 + )D3/2 C

Zt1

µnorm (ft1 ζt ) (kf˙t k + kζ˙t k)dt.

t0

 Since we are working in the metric d we require an approximate zero theorem in this metric. First we prove a lemma. Recall the following quadratic polynomial ψ(u) = 1 − 4u + 2u2 and the definition of the projective Newton iteration Nf (x) = x − (Df (x)|x⊥ )−1 f (x). Here x⊥ is the Hermitian complement to x. Nf : P(Cn+1 ) → P(Cn+1 ) except that it fails to be defined where (Df (x)|x⊥ )−1 does not exist. If f (ζ) = 0 and d(Nfk (x), ζ) ≤ 1 d(x, ζ) for all positive integers k, then x is called an approximate zero of f with 22k −1 associated zero ζ. √

Lemma 1. Let u < 3−4 7 . Let f P (H(d) ) and ζP (Cn+1 ) with f (ζ) = 0. If u d(x, ζ) ≤ D3/2 µnorm , then (f,ζ) d(Nf (x), ζ) ≤

4u d(x, ζ). ψ(2u)

Proof. In the range of angles under consideration Tan d(x, ζ) ≤ 2d(x, ζ). So apply Lemma 1 of P263 of [5] to conclude that d(Nf (x), ζ) ≤ Tan d(Nf (x), ζ) ≤ ≤

4u d(x, ζ). ψ(2u)

2u Tan d(x, ζ) ψ(2u)

COMPLEXITY OF BEZOUT’S THEOREM VI

5

 Now let u0 solve the equation 4u0 1 = ψ(2u0 ) 2

√ 16 − 224 or u0 = ∼ 0.06458. 16

Theorem 2. (Approximate zero Theorem) Let f P (H(d) ), ζP (Cn+1 ) with f (ζ) = 0. If d(x, ζ) < d(Nfk (x), ζ)



1 22k −1

u0 D 3/2 µnorm (f,ζ)

then

d(x, ζ).

Proof. By induction. Suppose d(Nfk (x), ζ) ≤ ≤ Let u1 =

2k −1 4u0 u0 . ψ(2u0 )

4u0 2k −1 d(x, ζ) ψ(2u0 ) 2k −1 4u0 u0 ψ(2u0 ) D3/2 µnorm (f, ζ).

Note u1 ≤ u0 and so ψ(2u1 ) ≥ ψ(2u0 ). By the lemma 4u1 d(Nfk (x), ζ) ψ(2u1 ) 4u1 ≤ · d(Nfk (x), ζ) ψ(2u0 ) 2k −1 4u0 u0 4u0 2k −1 ψ(2u0 ) · d(x, ζ) ≤4 ψ(2u0 ) ψ(2u0 ) 4u0 2k+1 −1 = d(x, ζ). ψ(2u0 )

d(Nfk+1 (x), ζ) ≤

 Let (ft , ζt ) be a (piecewise) C 1 path in V, d k dt (ft , ζt )k the length of its tangent vector.

d dt (ft , ζt )

its tangent vector and

Theorem 3. (Main Theorem) There is a constant C1 > 0, such that: if (ft , ζt ) t0 ≤ t ≤ t1 is a C 1 path in V , then Z

t1

d (ft , ζt )kdt dt to steps of projective Newton method are sufficient to continue an approximate zero x0 of ft0 with associated zero ζ0 to an approximate zero x1 of ft1 with associated zero ζ1 . C1 D3/2

µnorm (ft , ζt )k

6

MICHAEL SHUB

Proof. Choose C < µ0 and  small enough such that Theorem 1 and Theorem 2 √ 4u 1 C apply, u = 2C(1 + ) < 3−4 7 and ψ(2u) < 2(1+) . Hence, if d(f, g) < D1/2 µnorm , (f,ζ) C , f (ζ) = 0 ,g(η) = 0 D 3/2 µnorm (f,ζ) C and d(x, ζ) ≤ D3/2 µnorm (f,ζ) , C then d(Ng (x), η) ≤ D3/2 µnorm . So Ng (x) (g,η)

d(ζ, η) ≤

is an approximate zero of g with associated zero η. Now apply proposition 1 to produce S0 , . . . , Sk and x0 such that d(x0 , ζt0 ) <

C . D3/2 µnorm (fto , ζt0 )

Then xi = NfSi (xi − 1) is approximate zero of fSi with associated zero ζSi and d(xi , ζSi ) <

C D3/2 µnorm (fSi , ζSi )

. 

Corollary 1. There is a constant C2 > 0, such that: if (ft , ζt ) t0 ≤ t ≤ t1 is a C 1 path in V , then

C2 D

3/2

Z

t1

µ2norm (ft , ζt )kf˙t kdt

to

steps of projective Newton method are sufficient to continue an approximate zero x0 of ft0 with associated zero ζ0 to an approximate zero x1 of ft1 with associated zero ζ1 . Proof. kζ˙t k ≤ µnorm (ft , ζt )kf˙t k.



By comparison the results of the papers [7] and [10] give for the Theorem 3 C3 D

3/2

Z

t1

k

supt (µnorm (ft , ζt )) to

d (ζt )kdt dt

steps of projective Newton’s method and for the corollary C3 D3/2 supt (µ2norm (ft , ζt ))

Z

t1

kf˙t kdt

to

steps of projective Newton’s method for some constant C3 . Theorem 3 suggests that if we wish to continue a solution ζP (Cn+1 ) of f P (H(d) ) to a solution ηP (Cn+1 ) of gP (H(d) ) an efficient way might be to follow a geodesic joining (f, ζ) to (g, η) in the metric given by the locally Lipschitz Riemann˙ 2 = µnorm (f, ζ)2 (kf˙k2 + kζk ˙ 2 ). We call this structure the ian structure k(f˙, ζ)k k condition (number) Riemannnian structure and the induced metric (see below) the condition (number)metric and quickly drop the ”number” from the names.

COMPLEXITY OF BEZOUT’S THEOREM VI

7

P0 P0 1 Let ⊂ V = {(f, ζ)V | µnorm (f, ζ) = ∞} and W = V − . Note that µ2norm (f, ζ) is not differentiable everywhere on W .2 In any case, k kk defines a metric dk on W by dk (x, y) = inf Length(γ) over piecewise differentiable paths γ in W joining x to y. Theorem 4. W is a locally compact and complete in the metric dk . Lemma 2. There is a constant C4 > 0 such that (ft , ζt ) in a C 1 path in W , t0 ≤ 3/2 t ≤ t1 of length L in the k kk metric, then µnorm (ft1 , ζt1 ) ≤ C4D L µnorm (ft0 , ζt0 ). Proof of lemma From proposition 1 it follows that µnorm (ft1 , ζt1 ) ≤ (1 + )(

1+ 3/2 C )D



For an appropriate , C. Let C4 = (1 + ) Proof of Theorem Fix (f0 , ζ0 ) for example f0i = and ζ0 = (1, 0, . . . , 0). Then µnorm (f0 , ζ0 ) = n1/2 .



2L

µnorm (ft0 , ζt0 )

2( 1+ C )

1/2

di X X di −1 i n1/2 i 0

D 3/2 dk (f,ζ),(f0 ,ζ0 ) C4

= 1, . . . , n



Hence, µnorm (f, ζ) ≤ n1/2 . P0 So any Cauchy Sequence in W stays a bounded distance away from , hence in a compact region of W where it converges in the usual metric but also in the metric induced by dk . Local compactness is obvious. The condition metric dk makes W a path metric space in the sense of Gromov [4]. Theorem then allows the application of the Hopf-Rinow Theorem from [4] to conclude that any two points of W may be joined by a minimizing geodesic. I thank one of the referees for pointing this reference out to me. References [1] Beltran, C.and L.M.Pardo: Smale’s 17th Problem: A Probabilistic Positive Solution. Foundations of Computational Mathematics (2007), Online First, DOI 10.1007/s10208-005-0211-0 [2] Beltran, C.and L.M.Pardo: Smale’s 17th Problem: Average Polynomial Time to compute affine and projective solutions. preprint [3] Beltran, C.and M.Shub: Complexity of Bezout’s Theorem VII: Distance estimates in the condition metric. to appear Foundations of Computational Mathematics [4] Gromov,M.: Metric Structures for Riemannian and Non-Riemannian Spaces, Birkh¨ auser, 1998. [5] Blum,L.,F.Cucker,M.Shub and S.Smale: Complexity and Real Computation, Springer, 1998. [6] Nesterov,Yu.E. and M.J. Todd: On the Riemannian geometry defined by self-concordant barriers and interior-point methods, Foundations of Computational Mathematics, 2,(2002), 333–361. [7] Shub,M and S. Smale: Complexity of Bezout’s Theorem I: Geometrical Aspects, Journal of AMS 6, (1993), 459–501. 1It follows from the condition number theorem see [5] and theorem 1 that µ norm (f, ζ) is P comparable to the reciprocal of the distance of (f, ζ) to 0 in the metric d. ln(µnorm P (f, ζ)) might be an interesting function to study. It has the same asymptotics with respect to 0 as a Green’s function. 2Instead of µ norm one might consider a smooth approximation to make the Riemannian geometry easier.

8

MICHAEL SHUB

[8] Shub,M and S. Smale: Complexity of Bezout’s Theorem II: Volumes and Probabilities in Eyssette,F. and A. Galligo, eds.: Computational Algebraic Geometry,Progress in Mathematics, 109, Birkhauser, 1993, 267–285. [9] Shub,M and S. Smale: Complexity of Bezout’s Theorem III: Condition Number and Packing, Journal of Complexity 9, (1993), 4-14. [10] Shub,M and S. Smale: Complexity of Bezout’s Theorem IV: Probability of Success; Extensions, SINUM 33, (1996), 128–148. [11] Shub,M and S. Smale: Complexity of Bezout’s Theorem V: Polynomial Time , Theoretical Computer Science 133,(1994), 141–164. Department of Mathematics, University of Toronto, 40 St. George Street, Toronto, Ontario, M5S 2E4, Canada E-mail address: [email protected] URL: http://www.math.toronto.edu/shub

COMPLEXITY OF BEZOUT'S THEOREM VI

gests using the condition number to define a Riemannian metric on the solution variety and to study the geodesics of this metric. Finding a geodesic is in itself ...

181KB Sizes 0 Downloads 227 Views

Recommend Documents

COMPLEXITY OF BEZOUT'S THEOREM VII
References. [BFM96] Bernard Beauzamy, Jean-Louis Frot, and Christian Millour, Massively parallel com- putations on many-variable polynomials, Ann. Math.

COMPLEXITY OF BEZOUT'S THEOREM VII
For any integer k ≥ 1 be denote by Ik the identity square matrix of size k. Lemma 1. Let k ≥ 1 and U ∈ Uk be a unitary matrix. Then, there exists a smooth path Ut ⊆ Uk, 0 ≤ t ≤ 1, such that U0 = Ik, U1 = U and. Length(Ut) ≤ π. √ k, w

COMPLEXITY OF BEZOUT'S THEOREM VII
all systems f = (f1,...,fn) of homogeneous polynomials of respective degrees deg(fi) = di, 1 ≤ i ≤ n. .... The Theorem and corollaries above are a consequence of the two following tech- ...... Online First DOI 10.1007/s10208-005-0211-0. [BP06b].

Coase theorem, complexity and transaction costs
Next, let T denote the end of the negotiation game and at ∈ A the ..... Second, consider any period, call it + 1, in which player j obtains his maximum continu-.

Theorem of math.pdf
ሻ − ૚ࡼሺ. Page 3 of 11. Theorem of math.pdf. Theorem of math.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Theorem of math.pdf. Page 1 of 11.

proof of bayes theorem pdf
Loading… Page 1. Whoops! There was a problem loading more pages. proof of bayes theorem pdf. proof of bayes theorem pdf. Open. Extract. Open with.

The Fundamental Theorem of Calculus
Problem: Using the information above found in example 5, sketch a graph of ( ). g x . The solutions for example 5: SOLUTIONS: We use the modified diagram above. (a) To calculate. 0. 2. (0). ( ) g. f t dt. -. = ∫. , we note that A1 = A2 and that thi

Complexity Anonymous recover from complexity addiction - GitHub
Sep 13, 2014 - Refcounted smart pointers are about managing the owned object's lifetime. Copy/assign ... Else if you do want to manipulate lifetime, great, do it as on previous slide. 2. Express ..... Cheap to move (e.g., vector, string) or Moderate

An extension of Carnot's theorem
Jan 28, 2005 - d2 a + zx y d2 b + xy z d2 c) ≥ 2dadbdc(x + y + z). (c) 2(dbdc + dcda + dadb) ≤ R( dbdc da. + dcda db. + dadb dc. ). B. C. A. O da db dc. Figure 6: With circumcenter. Proof. (a) Using Theorem 2 with I ≡ O(sin 2A,sin 2B,sin 2C) an

cyberpatriot vi -
Visit www.uscyberpatriot.org for more information. CYBERPATRIOT VI. PRE-REGISTRATION NOW OPEN. “Have you got what it takes?” Air Force Association ...

VI-SPDAT.pdf
If YES to question 14 or NO to questions 15 or 16, score 1. YES NO REFUSED Prescreen. Score. 14. Is there anybody that thinks you owe them money?. 15. Do you have any money coming in on a regular basis, like a job or government. benefit or even worki

Bank of India PO MT-VI & SP-VI Reserve List [email protected] ...
Click here for PO/MT-VI Reserve list. 2. Click here for SP-VI Reserve list. 3. Date: 12.04.2018 General Manager (HR). www.GovNokri.in. Page 1 of 1. Bank of India PO MT-VI & SP-VI Reserve List [email protected]. Bank of India PO MT-VI & SP-VI Reser

Bank of India IBPS PO MT-VI & SP-VI Results [email protected] ...
Click the link given below for the List of candidates provisionally. selected and allocated by IBPS. 1. Click here for PO/MT-VII list. 2. Click here for SP-VII list. 3. Date: 12.04.2018 General Manager (HR). www.GovNokri.in. Page 1 of 1. Bank of Indi

Comparison Theorem
part), l'Hôpital's rule, and some other tools and the geometric intuition can be illustrated .... Ui(t) := 1. √fi(t0). Ji(t). (3.9) for i = 1, 2. Therefore, by the Jacobi equations and the ... Then define a map ϕ from the set of vector fields alo

Maximum Power Transfer Theorem
from the power source to the load is when the resistance ... Function generator on the Velleman oscilloscope that is used in a ... Th . http://www.vellemanusa.com ...

Curse-of-Complexity Attenuation Using Semidefinite Programming ...
Curse-of-Complexity Attenuation Using Semidefinite Programming in. Solution ... namic programming equation reduces to a Hamilton-Jacobi- ..... the kth step. The algorithm keeps only the L(k) quadratics,. ̂ak i with the highest values according to th

Quantum query complexity of state conversion
Aug 23, 2011 - Theoretical Computer Science, 288(1):21–43, 2002. ... and Trends in Theoretical Computer Science, 3(4):263–399, 2007. ... B. S. Tsirelson.

On the Complexity of Explicit Modal Logics
Specification (CS) for the logic L. Namely, for the logics LP(K), LP(D), LP(T ) .... We describe the algorithm in details for the case of LP(S4) = LP and then point out the .... Of course, if Γ ∩ ∆ = ∅ or ⊥ ∈ Γ then the counter-model in q

The Dynamics of Policy Complexity
policymakers are ideologically extreme, and when legislative frictions impede policy- making. Complexity begets complexity: simple policies remain simple, whereas com- plex policies grow more complex. Patience is not a virtue: farsighted policymakers

ON INITIAL SEGMENT COMPLEXITY AND DEGREES OF ...
is 1-random, then X and Y have no upper bound in the K-degrees (hence, no ... stitute for Mathematical Sciences, National University of Singapore, during the ..... following result relates KZ to unrelativized prefix-free complexity when Z ∈ 2ω is

the complexity of exchange
population of software objects is instantiated and each agent is given certain internal states ... macrostructure that is at least very difficult to analyse analytically.

ON INITIAL SEGMENT COMPLEXITY AND DEGREES OF ...
2000 Mathematics Subject Classification. 68Q30 ... stitute for Mathematical Sciences, National University of Singapore, during the Computational. Aspects of ...... In Mathematical foundations of computer science, 2001 (Mariánské Lázn˘e),.