35

Correspondence Analysis and Data Coding

M 2 (NJ (I)) =

X

fi kfJi − fJ k2fJ =

i∈I

X

fi ρ2 (i)

(2.3)

i∈I

In the latter term, ρ is the Euclidean distance from the cloud center, and fi is the mass of element i. The mass is the marginal distribution of the input data table. Let us take a step back: the given contingency P table data is denoted kIJ = {kIJ (i, j) = k(i, j); i ∈ I,P j ∈ J}. We have k(i) = j∈J k(i, j). Analogously k(j) is defined, and k = i∈I,j∈J k(i, j). Next, fIJ = {fij = k(i, j)/k; i ∈ I, j ∈ J} ⊂ RI×J , similarly fI is defined as {fi = k(i)/k; i ∈ I, j ∈ J} ⊂ RI , and fJ analogously. Next back to the first right hand side term in equation 2.3: the conditional distribution of fJ knowing i ∈ I, also termed the jth profile with coordinates indexed by the elements of I, is fJi = {fji = fij /fi = (kij /k)/(ki /k); fi 6= 0; j ∈ J} and likewise for fIj . The cloud of points consists of the couple: profile coordinate and mass. We have NJ (I) = {(fJi , fi ); i ∈ I} ⊂ RJ , and again similarly for NI (J). From equation 2.3, it can be shown that X M 2 (NJ (I)) = M 2 (NI (J)) = kfIJ − fI fJ k2fI fJ = (fij − fi fj )2 /fi fj i∈I,j∈J

(2.4) The term kfIJ − fI fJ k2fI fJ is the χ2 metric between the probability distribution fIJ and the product of marginal distributions fI fJ , with as center of the metric the product fI fJ . In correspondence analysis, the choice of χ2 metric of center fJ is linked to the principle of distributional equivalence, explained as follows. Consider two elements j1 and j2 of J with identical profiles: i.e., fIj1 = fIj2 . Consider now that elements (or columns) j1 and j2 are replaced with a new element js such that the new coordinates are aggregated profiles, fijs = fij1 + fij2 , and the new masses are similarly aggregated: fijs = fij1 + fij2 . Then there is no effect on the distribution of distances between elements of I. The distance between elements of J, other than j1 and j2 , is naturally not modified. This description has followed closely [47] (chapter 2). The principle of distributional equivalence leads to representational selfsimilarity: aggregation of rows or columns, as defined above, leads to the same analysis. Therefore it is very appropriate to analyze a contingency table with fine granularity, and seek in the analysis to merge rows or columns, through aggregation.

2.2.3

Notation for Factors

Correspondence analysis produces an ordered sequence of pairs, called factors, (Fα , Gα ) associated with real numbers called eigenvalues 0 ≤ λα ≤ 1. The

2.2.3 Notation for Factors - multiresolutions.com

The principle of distributional equivalence leads to representational self- similarity: aggregation of rows or columns, as defined above, leads to the same analysis. Therefore it is very appropriate to analyze a contingency table with fine granularity, and seek in the analysis to merge rows or columns, through aggregation.

35KB Sizes 0 Downloads 207 Views

Recommend Documents

Asymptotic Notation - CS50 CDN
Like searching through the phone book. • Identify ... as you go. If array[i + 1] < array[i], swap them! ... Grab the smallest and swap it with whatever is at the front of ...

Jit epartmetit of (notation
Apr 29, 2016 - Public Elementary and Secondary Schools Heads. All Others .... FL'. • co 0 F". •. 000P0,0. • oo. 0 0 0 17. 5. O co ••4 aq cc g. 0 co Et ,ct. • co. 4.

Using Scientific Notation
If the decimal is shifted to the right, the exponent is negative. Example (Final Answer): The exponent is positive, and the final answer is 3.750 x 103. Converting Scientific Notation to Integers. Step 1: Write the decimal number. Example: 3.750 x 10

Credit.Card.Visa.Hack.Ucam.Cl.Tr.560.[223.kB_www.netz.ru].pdf ...
Credit.Card.Visa.Hack.Ucam.Cl.Tr.560.[223.kB_www.netz.ru].pdf. Credit.Card.Visa.Hack.Ucam.Cl.Tr.560.[223.kB_www.netz.ru].pdf. Open. Extract. Open with.

1 Notation
Aug 29, 2013 - by the definition of eigenvalues and eigenvectors, ˆFPC = (NT)−1zz′ ˆFPCV−1. Thus, letting. H = Q(T−1F′ ˆFPC)V−1, ...... was to be shown. □. Lemma PC4. Under the conditions of Lemma PC1,. 1. T. N. C i=1 η′i dPC = 1.

Asymptotic Notation - CS50 CDN
break – tell the program to 'pause' at a certain point (either a function or a line number) step – 'step' to the next executed statement next – moves to the next ...

the z notation - Terry Marris
and c. EXERCISE 1.1. Identify five sets drawn from your surroundings and ...... pp 50. WORDSWORTH J.B. 1992 Software Development with Z Addison-Wesley ...

Set notation Lesson.pdf
Page 3 of 9. Set notation Lesson.pdf. Set notation Lesson.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Set notation Lesson.pdf. Page 1 of 9.

critical success factors for offshore
(n=122). %. Organisations track record of successful projects. 53. 43. Efficient Project Manage- ment. 47. 39. Efficient Contract Manage- ment. 45. 37. SPI Certification. 41. 34. Knowledge of the Clients. Language and Culture. 39. 32. Timely Delivery

Testing for Common GARCH Factors
Jun 6, 2011 - 4G W T ¯φT (θ0) +. oP (1). vT. 2. + oP (1) ..... “Testing For Common Features,” Journal of Business and Economic. Statistics, 11(4), 369-395.

nepartinent of (notation
Mar 17, 2016 - 7, S. 2016. (School Year (SY) 2015-2016 End of School Year Rites) ... vocational institutions (TVIs) and higher education institutions ... 2. Awarding of honors to learners from Grades 1 to 12 may be conducted during.

the z notation - Terry Marris
5 the set of departments in a small, manufacturing business. .... There is just one inbuilt type that is part of the Z Notation; it is Z, the set of all integers. .... NORCLIFFE A. & SLATER G. 1991 Mathematics of Software Construction Ellis Horwood.