Tutorial of the STRUCTURE software Dr. Sung-Chur Sim Tomato Genetics and Breeding program The Ohio State Univ., OARDC

STRUCTURE software A model-based clustering method (Pritchard et al. 2000) • Free software (http://pritch.bsd.uchicago.edu/software/structure2_1.html) • Bayesian approach (MCMC: Markov Chain Monte Carlo) • Detects the underlying genetic population among a set of individuals genotyped at multiple markers • Computes the proportion of the genome of an individual originating from each inferred population (quantitative clustering method)

Input data A matrix where the data for individuals are in rows, the loci are in column • n consecutive rows have the data for each individual of nploid species • Integer should be used for coding genotype • Missing data should be indicated by a number which doesn’t occur elsewhere in the data (e.g. -1) • The data file should be a text file (.txt) not an excel file (.xls) for running STRUCTURE

Information of user-defined populations (market class)

2 consecutive rows for alleles

Missing data

Running STRUCTURE from a graphical interface, Front End

The Front End organizes data analysis into “project”

Importing input data into a project

Importing input data into a project (cont.)

Importing input data into a project (cont.)

Importing input data into a project (cont.)

Importing input data into a project (cont.)

Importing input data into a project (cont.)

Configuring a parameter set

Configuring a parameter set (cont.)

Length of Burnin Period: how long to run the simulation before collecting data to minimize the effect of the starting configuration Number of MCMC Reps after Burnin: how long to run the simulation after burnin to get accurate parameter estimates

Configuring a parameter set (cont.)

Configuring a parameter set (cont.)

Configuring a parameter set (cont.)

Configuring a parameter set (cont.)

Running STRUCTURE: a single run

Running STRUCTURE: a single run (cont.)

Running STRUCTURE: a batch run

Running STRUCTURE: a batch run (cont.)

Ln P(D): Estimated probability of Ks

Inference of true K (number of populations) The log likelihood for each K, Ln P(D) = L(K) Two approaches to determine the best K 1. Use of L(K): When K is approaching a true value, L(K) plateaus (or continues increasing slightly) and has high variance between runs (Rosenberg et al. 2001). Nonparametric test (Wilcoxon test) 2. Use of an ad hoc quantity (∆K): Calculated based on the second order rate of change of the likelihood (∆K) (Evanno et al. 2005). The ∆K shows a clear peak at the true value of K. ∆K = m([L’’K])/s[L(K)]

Evanno et al. 2005. Molecular Ecology 14: 2611-2620

SAS code for the nonparametric method

Inference of best K using the delta K method

The best K = 8

L(K) = an average of 20 values of Ln P(D) L’(K) = L(K)n – L(K)n-1 L’’(K) = L’(K)n – L’(K)n-1 Delta K = [L’’(K)]/Stdev

Q-matrix

An example of steps to identify the best K Format the marker data Run STRUCTURE w/10K for burnin and 50K for MCMC reps 20 times at each of K=1 to 10 Infer true K (5~7) Run STRUCTURE w/500K for burnin and 750K for MCMC reps 20 times at each of K=3 to 8

Identify the best K based on L(K) and ∆K

We may not always be able to know the TRUE value of K, but we should aim for the smallest value of K that captures the major structure in the data Pritchard et al. (2000)

Enjoy running STRUCTURE

Tutorial of the STRUCTURE software by Dr. Sung-Chur Sim.pdf ...

... problem loading more pages. Tutorial of the STRUCTURE software by Dr. Sung-Chur Sim.pdf. Tutorial of the STRUCTURE software by Dr. Sung-Chur Sim.pdf.

2MB Sizes 2 Downloads 225 Views

Recommend Documents

spss software tutorial pdf
Download now. Click here if your download doesn't start automatically. Page 1 of 1. spss software tutorial pdf. spss software tutorial pdf. Open. Extract. Open with.

procast software tutorial pdf
There was a problem previewing this document. Retrying... Download. Connect more apps. ... procast software tutorial pdf. procast software tutorial pdf. Open.

A Linux Software RAID Tutorial - IBM
Mar 10, 2006 - risk by protecting data stored on hard disks and improving disk ... any one of the disks will render the RAID unusable and data will have been ...

system software tutorial pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Whoops! There was a problem previewing this document. Retrying... Download. Connect ...

sas software tutorial pdf
Sign in. Loading… Whoops! There was a problem loading more pages. Whoops! There was a problem previewing this document. Retrying... Download. Connect ...

A Linux Software RAID Tutorial - IBM
Mar 10, 2006 - Creating a Software RAID on an Existing Linux System. ...... Other company, product, and service names may be trademarks or service marks of ...

[Read] Ebook Creo Simulate 3.0 Tutorial: Structure and ...
modeling. This textbook is written for first-time FEA users in general and Creo. Simulate users in particular. After a brief introduction to finite element modeling, the tutorial introduces the major concepts behind the use of. Creo Simulate to perfo

Mastering the Art of Self Hypnosis by Dr. Kenneth Grossman.pdf ...
Mastering the Art of Self Hypnosis by Dr. Kenneth Grossman.pdf. Mastering the Art of Self Hypnosis by Dr. Kenneth Grossman.pdf. Open. Extract. Open with.

DAS CABINET DES DR. CALIGARI. (THE CABINET OF DR ...
Page 2 of 2. DAS CABINET DES DR. CALIGARI. (THE CABINET OF DR. CALIGARI) (1920.pdf. DAS CABINET DES DR. CALIGARI. (THE CABINET OF DR.

ORGANIZATIONAL STRUCTURE OF THE CMU JOURNAL OF ...
ORGANIZATIONAL STRUCTURE OF THE CMU JOURNAL OF SCIENCE.pdf. ORGANIZATIONAL STRUCTURE OF THE CMU JOURNAL OF SCIENCE.pdf.

DR. OZ'S - The Dr. Oz Show
DR. OZ'S. INSTALL SMOKE ALARMS IN EVERY SLEEPING ROOM,. OUTSIDE EACH SLEEPING AREA AND ON EVERY LEVEL OF. THE HOME. MOUNT FIRE ...

DR. OZ'S - The Dr. Oz Show
DR. OZ'S. INSTALL SMOKE ALARMS IN EVERY SLEEPING ROOM, ... MAKE SURE YOU AND YOUR KIDS KNOW HOW TO OPEN ALL WINDOWS. SET UP A ...

Morphology and structure of silkworm egg Tutorial 1.pdf
There was a problem previewing this document. Retrying... Download. Connect more apps... Try one of the apps below to open or edit this item. Morphology and ...