Skip to main content
A self tuning sliding window method for detecting phenotype linked regional poly-methylation architecture in sparse wildlife methylomes

A self tuning sliding window method for detecting phenotype linked regional poly-methylation architecture in sparse wildlife methylomes

This is a Preprint and has not been peer reviewed. This is version 1 of this Preprint.

Add a Comment

You must log in to post a comment.


Comments

There are no comments or no comments have been made public for this article.

Downloads

Download Preprint

Authors

Thomas Stocker 

Abstract

Despite featuring extreme physiological adaptations integration of wildlife species into the modern ‘omics’ frameworks are limited due to the sparsity in the data. To address the sparsity limitation a self-tuning sliding-window framework was developed for the identification of the regional poly-CpG methylation architecture associated with phenotypic traits. Under the framework the iteratively expanded window aligns against each chromosome for two constraints: (i) the window must increase its penalised R2 statistic derived from univariate (ADVI-EWAS) and multivariate (Horseshoe) CpG-phenotype models and (ii) both models but agree upon the direction of the methylome-phenotype association. Any windows not passing the two conditions were discarded if they lacked sufficient CpG density, otherwise were retained but ceased to grow. Application of the framework to the blood methylome of 69 California Sea Lions (Zalophus californianus) detected three ‘trusted’ windows across the entire genome. Each of the windows were uniquely mapped to a single gene, these genes were GCSH, DNMT1, LMX1A. All three genes are mechanistically linked to the phenotypic trait and are consistent with the extreme phenotype of marine mammals. The results were independent of the age of the individuals tested. The results were not predictive of phenotype when applying epigenome wide association studies (EWAS); however, application of the poly-CpG architecture provided a phenotypically linked signal that was biologically interpretable. The sliding window method is computationally efficient, isolates short, directionally stable genome regions and outperforms single-site analyses. The generalisable statistical framework provided within the study could extract phenotype-linked molecular architecture from sparse wildlife methylomes.

DOI

https://doi.org/10.32942/X2QX1T

Subjects

Bioinformatics, Cellular and Molecular Physiology, Comparative and Evolutionary Physiology, Evolution, Genetics and Genomics, Marine Biology, Other Ecology and Evolutionary Biology

Keywords

Physiology, Sliding-window, Methylation, Phenomics

Dates

Published: 2026-06-26 15:50

Last Updated: 2026-06-26 15:50

License

CC BY Attribution 4.0 International

Additional Metadata

Conflict of interest statement:
None

Data and Code Availability Statement:
All relevant data/ code can be sourced from the GitHub repository: https://github.com/tsto3616/poly-CpG

Language:
English

Metrics

Views: 6

Downloads: 0