AAAI-08 Chicago

Trace Ratio Criterion for Feature Selection Feiping Nie1, Shiming Xiang1, Yangqing Jia1, Changshui Zhang1, Shuicheng Yan2 1Department

of Automation, Tsinghua University, China

2National

University of Singapore, Singapore

Outline

¾ Feature

Selection ¾ Our Method ¾ Experiments ¾ Conclusion

2

Feature Selection vs. Subspace Learning ¾ Feature

selection is often faster than the corresponding subspace learning algorithm ¾ The result of the selection is physically explainable ¾ We only need to process a small subset of features for further data processing. 3

Feature Selection ¾ Select

a subset of m features from the d original feature set. We denote a selection option by . ¾ It can be viewed as a special subspace learning:

¾ where

the corresponding matrix W is constrained to be a 0-1 “selection” matrix. 4

Selection Matrix ¾ We

define where each column-vector comes from the set

,

¾ Also, 5

An Example ¾ If

we are going to choose two features from the original 3dimensional data, a possible option is:

6

Trace Ratio Criterion for Feature Selection ¾A

general graph-based framework is to maximize a trace-ratio criterion:

¾B

is to reflect the between-class or global affinity relationship of the data, E is to reflect the within-class relationship or the local affinity relationship.

7

Examples ¾ Supervised:

Fisher Score [Bishop 1995], using the within-class and between-class scatter matrices ¾ Unsupervised: Laplacian Score [He, Cai, & Niyogi, 2005], using graph Laplacian and its degree matrix ¾ Semi-supervised: Can readily extended based on this framework. 8

Scores ¾ Subset

Score:

¾ Feature

Score:

¾ The

goal of feature selection is to find the largest subset score 9

Previous Methods ¾ Without

loss of generality, suppose

¾ Then

the first m vectors are selected to form the matrix W. ¾ However, this can actually be viewed as a greedy algorithm that essentially maximizes but not the subset-score

10

Main Goal ¾ We

aim to maximize the trace ratio criterion for feature selection, and finds the global optimum solution

¾ It

appears that we need to search the solution space containing options. 11

The Trace Difference function ¾ Suppose

we have the global optimum solution, then

12

The Trace Difference function ¾ Define

the trace difference function

¾ It

can be verified that f is monotonic decreasing ¾ The trace ratio problem will be equivalent to solving the equation 13

How to calculate f and the corresponding W?

¾ For

a given , we can calculate the trace difference score

¾ and

select the m vectors with the largest score to form W, and calculate the corresponding function value. 14

An iterative Algorithm

15

An Example

16

Datasets

17

UCI Results

18

Face Datasets

19

Conclusion ¾A

general feature selection framework ¾ An algorithm to find the global optimum solution for the subsetscore ¾ Experiments show the superiority of the subset-score.

20

Trace Ratio Criterion for Feature Selection Feiping Nie, Shiming Xiang, Yangqing Jia, Changshui Zhang, Shuicheng Yan

THANK YOU!

21

This page is intentionally left blank

22

PowerPoint Presentation - Instance-level Multiple ...

denote a selection option by . ➢It can be viewed ... features from the original 3- dimensional data, a possible option is: 6 ... graph Laplacian and its degree matrix.

2MB Sizes 2 Downloads 173 Views

Recommend Documents

PowerPoint Presentation on BIMSTEC.pdf
GDP growth in BIMSTEC (approx 6%) much higher than world's (2.5% in. 2016). FDI inflows was ... Page 3 of 12. PowerPoint Presentation on BIMSTEC.pdf.

PowerPoint プレゼンテーション - PowerPoint Presentation
KYUSHU UNIVERSITY / FUKUOKA. Aim: To examine whether advanced learners of Japanese process. SRCs faster than ORCs in Japanese, as predicted by SDH. Participants: 21 advanced Chinese-speaking learners of Japanese (CLJ) from Hiroshima University. 19 of

DDOT Projects Update PowerPoint Presentation for Ward 1 - District ...
Oct 27, 2014 - 6:30 p.m. Call to Order and Welcome – Don Edwards, Facilitator. 6:35 p.m. Opening Remarks ..... through the Mayor Call. Center at 311.

DDOT Projects Update PowerPoint Presentation for Ward 1 - District ...
Oct 27, 2014 - Dedicated Bus Lanes Georgia Avenue. • Safety Improvements 15th Street. • Resurfacing 7th Street. • Reconstrucbon of U Street, Phase II.

Multiple Intracellular Routes in the Cross-Presentation ...
Copyright © 2005 by The American Association of Immunologists, Inc. 0022-1767/05/$ ... ognized with Db by CD8 T cells that express the F5 TCR. The 65-P1 ...... Booth, J. W., M. K. Kim, A. Jankowski, A. D. Schreiber, and S. Grinstein. 2002.

Presentation
A fast, cheap and simple analytical method. .... limited data from Jordan ... data. • Some of those: Mishor Yamin,. Revivim – Mashabim, Sde-. Boker, Shivta ...

powerpoint template -
Four SICA project meeting. Imperial College, London, 24 May 2013. Florin Grigorescu MD, PhD ... Consortium. Taragona. Cantazaro. Rome. Bologna. Bucharest.

מצגת של PowerPoint - Editorial Express
Overconfidence. Introduction. Example. Results. Variants. Evolution. Model. People report 80 .... Principal wants agents with the most accurate private signals to.

PowerPoint bemutató - Tárki
Dec 15, 2012 - Data archives. Scientific analytic papers. Survey instruments. Administrative data collection. Other forms of data collection. Outline: the structure of the .... Form of access (web access, file transfer, remote statistical analysis ..

PowerPoint bemutató - Tárki
Dec 15, 2012 - (Tarki Social Research Institute, Budapest) with contributions by .... Project websites ... content, frequency, etc. so that all will understand the.

מצגת של PowerPoint
information → a few overconfident agents survive. ▫. Second-best outcome; compensates another bias (e.g., excess risk aversion): Wang (91), Blume & Easly ...

Presentación de PowerPoint
Job Description: GRADIANT, Galician Research and Development Center in Advanced Telecommunications, leader in the generation and transfer of ICT knowledge in Galicia, needs to incorporate a ... Work with a qualified team with flexible schedules. •

PowerPoint Sunusu -
Flexible Production. • Specialization of labor. • Keynesian Institutions are failed. • Labor unions are getting out of picture. • Social security system declined.

016 PowerPoint Slides.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. 016 PowerPoint ...

Presentation Title Presentation Sub-Title
April 2010, Prahran, Melbourne. • Direct impacts ... Victoria. Currently infrastructure and facilities are designed based on past climate, not future climate. ... Sensitivity of Materials to Climate Change Impacts. Material. CO. 2. Cyclones. & Stor

Presentation Title Presentation Sub-Title
Climate change impacts – impact upon cycling conditions and infrastructure. Infrastructure and climate change risks for Vic. Primary impacts – impact upon ...

PowerPoint bemutató - Tárki
Dec 15, 2012 - (Tarki Social Research Institute, Budapest) with contributions ... Social monitoring and social reporting. Scientific community .... Project websites ...

PowerPoint Handout
During the PPT activity, take notes on Romanticism from the screen so that during our class ... Romantic Period, take notes on the pieces of art and poetry.

Presentazione di PowerPoint Services
What we discussed so far connected. Lives. Internet of me. Speed ... A moment we reflexively turn to a device to act on a need we have that moment – to learn, discover, find or buy ... Smartphone. Mobile search for info on purchasing during free ti