I will provide a complete population genetics analysis from vcf data
About this Gig
I will provide a complete population genetics analysis from VCF data, using robust and reproducible methods to characterize genetic diversity, neutrality patterns, and population structure.
Standard analysis includes:
- Basic VCF statistics and data checks
- Observed and expected heterozygosity (Ho/He)
- Inbreeding coefficient
- Nucleotide diversity (π; genome-wide)
- Tajimas D (fixed, non-overlapping windows)
Add-on 1: Population-level
Enables population-level analyses using a client-provided sample-to-population mapping file. Includes population-stratified metrics, PCA (LD-pruned), ADMIXTURE (K=210, 3 replicates per K, cross-validation), and pairwise F_ST (genome-wide).
Add-on 2: Sliding Windows
Extends the analysis with genome-wide sliding-window metrics to detect local patterns and outlier windows. Includes π in sliding windows (with step size) and Tajimas D in fixed windows, with plots and tabulated results.
Bonus: If both add-ons are selected, pairwise F_ST is also computed in sliding windows.
Notes: The VCF must contain sample genotypes. Population-level analyses require a mapping file. Dataset size (variants, samples, populations) is limited by the selected package. Outliers refe
