2025 Hyper Recent •CC0 1.0 Universal

This work is dedicated to the public domain. No rights reserved.

Access Preprint From Server
May 21st, 2025
Version: 3
UCD School of Agriculture and Food Science, University College Dublin, Belfield, Ireland; UCD Conway Institute of Biomolecular and Biomedical Research, Universi
genomics
biorxiv

Inferring domestic goat demographic history through ancient genome imputation

Erven, J. A. M.Open in Google Scholar•Etourneau, A.Open in Google Scholar•Mashkour, M.Open in Google Scholar•Neupane, M.Open in Google Scholar•Bardou, P.Open in Google Scholar•Stella, A.Open in Google Scholar•Talenti, A.Open in Google Scholar•Masiga, C. W.Open in Google Scholar•Van Tassel, C. P.Open in Google Scholar•Clark, E. L.Open in Google Scholaret al.

Goats were among the earliest managed animals, making them a natural model to explore the genetic consequences of domestication. However, a challenge in ancient genomic analysis is the relatively low genome coverage for most samples, limiting analysis to pseudohaploid genotypes. Genotype imputation offers potential to alleviate this limitation by improving information content and accuracy in low coverage genomes. To test this we used published high coverage (>8x) goat palaeogenomes, imputing downsampled genomes using the VarGoats dataset (1,372 individuals) as a reference panel. Measuring concordance between imputed and high coverage genotypes, we find high concordance after filtering for common (>5%), high confidence variants, with 0.5x genomes reaching >0.97 concordance. There is a trade-off between coverage, genotype probability (GP) thresholds, and genotype recovery, where higher coverage and more lenient GP thresholds result in higher recovery, and a reduction in heterozygous false positive rates with stricter thresholds. We then imputed 36 goat palaeogenomes with [≥]0.5x coverage to examine runs-of-homozygosity (ROH) and identity-by-descent (IBD) patterns. Using a novel approach combining ROH profiles across tools, we find that among Neolithic goats, ROH increases with distance from the Zagros Mountains, suggesting a large effect of the initial dispersal of managed herds. Inbreeding levels decrease across Southwest Asia in more recent periods. IBD mirrored this pattern, with less relatedness in the early herding site of Ganj Dareh compared to higher relatedness in goats from later in the dispersal process. These findings provide insights into the genetic consequences of early goat management on demography, and confirm the utility of imputation in leveraging low coverage palaeogenomes.

Similar Papers

biorxiv
Fri May 23 2025
A catalog of ancient proxies for modern genetic variants
The ability to observe the genomes of past human populations using ancient DNA provides an extraordinary perspective on many fundamental questions in human genetics, including understanding the evolutionary history of variants that underlie human disease and other phenotypes. However, ancient DNA is often damaged and degraded, yielding low-coverage of most nucleotides. Further, many publicly avail...
Brand, C. M.
•
Capra, J. A.
biorxiv
Fri May 23 2025
Designing DNA With Tunable Regulatory Activity Using Score-Entropy Discrete Diffusion
Designing regulatory DNA sequences with precise, cell-type-specific activity is critical for applications in medicine and biotechnology, but remains challenging due to the vast combinatorial space and complex regulatory grammar governing gene expression. Recent deep generative models---including genomic language models and diffusion-based approaches---offer new tools for sequence design, yet lack ...
Sarkar, A.
•
Kang, Y.
•
Somia, N.
•
Mantilla, P.
...•
Koo, P.
biorxiv
Fri May 23 2025
The CLAMP GA-binding transcription factor regulates heat stress-induced transcriptional repression by associating with 3D chromatin loops
To survive exposure to heat stress (HS), organisms activate stress response genes and repress constitutive gene expression, thereby preventing the accumulation of potentially toxic RNA and protein products. Although many studies have elucidated the mechanisms that drive HS-induced activation of stress response genes across species, little is known about the mechanisms that repress constitutively e...
Aguilera, J.
•
Duan, J.
•
Cortez, K.
•
Lee, R.
...•
Larschan, E.
biorxiv
Thu May 22 2025
Functional genomic analysis of non-canonical DNA regulatory elements of the aryl hydrocarbon receptor
The aryl hydrocarbon receptor (AHR) is a ligand-dependent transcription factor that is activated by environmental toxicants, like halogenated and polycyclic aromatic hydrocarbons, and then binds to DNA and regulates gene expression. AHR is involved in various physiological processes, including liver and immune system function, cell cycle regulation, oncogenesis, and metabolism. In the canonical pa...
Shahriar, S.
•
Patel, T. D.
•
Nakka, M.
•
Grimm, S. L.
...•
Gorelick, D. A.
biorxiv
Thu May 22 2025
A complete reference genome assembly and annotation of the Black Redstart (Phoenicurus ochruros)
The Black Redstart (Phoenicurus ochruros) is one of the most widely distributed species, occupying diverse habitats and exhibiting remarkable altitudinal migration, making it suitable model for studying altitudinal migration and high-altitude adaptation. In this study, we present the first reference genome of Phoenicurus ochruros, generated using PacBio HiFi long-read sequencing. The nuclear genom...
Ghimire, P.
•
Wang, N.
•
Lamichhaney, S.
biorxiv
Thu May 22 2025
Draft genome and transcriptomic sequence data of three invasive insect species
Cydalima perspectalis (the box tree moth), Leptoglossus occidentalis (the western conifer seed bug), and Tecia solanivora (the Guatemalan tuber moth) are three economically harmful invasive insect species. This study presents their genomic and transcriptomic sequences, generated through whole-genome sequencing, RNA-seq transcriptomic data, and Hi-C sequencing. The resulting genome assemblies exhib...
Lombaert, E.
•
Klopp, C.
•
Blin, A.
•
Annonay, G.
...•
Deleury, E.
biorxiv
Thu May 22 2025
Phenotypic tolerance for rDNA copy number variation within the natural range of C. elegans
The genes for ribosomal RNA (rRNA) are encoded by ribosomal DNA (rDNA), whose structure is notable for being present in arrays of tens to thousands of tandemly repeated copies in eukaryotic genomes. The exact number of rDNA copies per genome is highly variable within a species, with differences between individuals measuring in potentially hundreds of copies and megabases of DNA. The extent to whic...
Hall, A. N.
•
Morton, E.
•
Walters, R.
•
Cuperus, J. T.
•
Queitsch, C.
biorxiv
Wed May 21 2025
Fusion transcription factor dosage controls cell state in rhabdomyosarcoma
In the fusion-positive subset of rhabdomyosarcoma, the PAX3::FOXO1 oncoprotein is the most common fusion driver. We previously established a human myoblast system for inducible expression of PAX3::FOXO1. In the current study, we modulate PAX3::FOXO1 protein expression to understand the epigenetic and phenotypic functions at different PAX3::FOXO1 levels. Proliferative and oncogenic outcomes depend ...
Hoffman, R. A.
•
Wang, M.
•
Sunkel, B. D.
•
Nguyen, T. H.
...•
Stanton, B. Z.
biorxiv
Wed May 21 2025
The coordination between CTCF, cohesin and TFs impacts nucleosome repositioning and chromatin insulation to define state specific 3D chromatin folding
CTCF-mediated chromatin folding plays a key role in gene regulation, however the mechanisms controlling chromatin organization across cell states are not fully elucidated. Comprehensive analyses reveal that CTCF binding stability and cohesin overlap in mice and humans, are regulated by species specific differences in CTCF binding site (CBS) accessibility and enrichment of motifs corresponding to e...
Do, C.
•
Jiang, G.
•
Cova, G.
•
Zappile, P.
...•
Skok, J. A.