Structure Harvester

Structure Harvester & STRUCTURESelector in Population Genetics

Structure Harvester & STRUCTURESelector

Powerful tools for interpreting population structure and genetic clustering

Introduction

Understanding population structure is a cornerstone of genetics, plant breeding, and evolutionary biology. The STRUCTURE software provides a Bayesian clustering method to assign individuals to populations based on multilocus genotype data. However, interpreting results and identifying the optimal number of clusters (K) can be challenging. Tools like Structure Harvester and STRUCTURESelector have become essential for analyzing and visualizing STRUCTURE output.

What is Structure Harvester?

Structure Harvester is a web-based and standalone program designed to visualize STRUCTURE results. It implements the popular Evanno method for determining the most likely number of genetic clusters (K). The tool generates clear plots of likelihood values and ΔK, helping researchers identify meaningful population partitions.

How It Works

Structure Harvester processes the results_summary.txt files produced by STRUCTURE. It extracts log-likelihood scores (LnP(D)) and applies the Evanno ΔK method (Evanno et al., 2005). This allows researchers to pinpoint the value of K that best explains population structure.

Step Description
Upload Results STRUCTURE output files are provided to Structure Harvester.
Visualization Generates line graphs of likelihood values for each K.
ΔK Calculation Applies Evanno’s method to detect the optimal K value.
Output Provides plots and tables summarizing genetic clustering.

Using STRUCTURESelector

STRUCTURESelector is an advanced online platform that integrates multiple statistical approaches beyond the Evanno method. It supports Puechmaille’s methods, MedMedK, MedMeanK, and others, providing more robust inference when ΔK alone is insufficient.

STRUCTURESelector accepts results directly from STRUCTURE or Structure Harvester, producing interactive plots, summary statistics, and downloadable reports.

Applications in Plant Breeding

In plant breeding and crop genetics, understanding population structure is vital for:

  • Identifying distinct genetic groups for association mapping
  • Detecting admixture and hybridization
  • Improving the accuracy of GWAS (Genome-Wide Association Studies)
  • Designing effective breeding strategies based on genetic diversity

By combining STRUCTURE, Structure Harvester, and STRUCTURESelector, breeders gain deeper insights into the genetic architecture of germplasm collections.

Conclusion

Structure Harvester and STRUCTURESelector extend the functionality of STRUCTURE by providing clear, reliable methods to interpret genetic clustering. These tools are indispensable for modern plant breeding, helping researchers make informed decisions about germplasm selection, population analysis, and breeding program design.

References

  • Evanno, G., Regnaut, S., & Goudet, J. (2005). Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology, 14(8), 2611–2620. https://doi.org/10.1111/j.1365-294X.2005.02553.x
  • Earl, D. A., & vonHoldt, B. M. (2012). STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conservation Genetics Resources, 4(2), 359–361. https://doi.org/10.1007/s12686-011-9548-7
  • Li, Y. L., & Liu, J. X. (2018). StructureSelector: A web-based software to select and visualize the optimal number of clusters using multiple methods. Molecular Ecology Resources, 18(1), 176–177. https://doi.org/10.1111/1755-0998.12719