A likelihood ratio approach for identifying three-quarter siblings in genetic databases


Por: Galvan-Femenia, I, Barcelo-Vidal, C, Sumoy, L, Moreno, V, de Cid, R and Graffelman, J

Publicada: 1 mar 2021 Ahead of Print: 1 ene 2021
Resumen:
The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an important aspect of standard quality control procedures. Relatedness research is usually based on an allele sharing analysis of identity by state (IBS) or identity by descent (IBD) alleles. Existing IBS/IBD methods mainly aim to identify first-degree relationships (parent-offspring or full siblings) and second degree (half-siblings, avuncular, or grandparent-grandchild) pairs. Little attention has been paid to the detection of in-between first and second-degree relationships such as three-quarter siblings (3/4S) who share fewer alleles than first-degree relationships but more alleles than second-degree relationships. With the progressively increasing sample sizes used in genetic research, it becomes more likely that such relationships are present in the database under study. In this paper, we extend existing likelihood ratio (LR) methodology to accurately infer the existence of 3/4S, distinguishing them from full siblings and second-degree relatives. We use bootstrap confidence intervals to express uncertainty in the LRs. Our proposal accounts for linkage disequilibrium (LD) by using marker pruning, and we validate our methodology with a pedigree-based simulation study accounting for both LD and recombination. An empirical genome-wide array data set from the GCAT Genomes for Life cohort project is used to illustrate the method.

Filiaciones:
:
 Univ Girona, Dept Comp Sci, Appl Math & Stat, Girona, Spain

 Inst Hlth Sci Res Germans Trias & Pujol IGTP, Genomes Life GCAT Lab, Can Ruti Campus, Barcelona, Spain

Barcelo-Vidal, C:
 Univ Girona, Dept Comp Sci, Appl Math & Stat, Girona, Spain

:
 Inst Hlth Sci Res Germans Trias & Pujol IGTP, High Content Genom & Bioinformat Unit, Can Ruti Campus, Barcelona, Spain

Moreno, V:
 Catalan Inst Oncol ICO, Oncol Data Analyt Program, Badalona, Spain

 Bellvitge Biomed Res Inst IDIBELL, Oncobell Program, Barcelona, Spain

 Consortium Biomed Res Epidemiol & Publ Hlth CIBER, Madrid, Spain

 Univ Barcelona, Dept Clin Sci, Barcelona, Spain

:
 Inst Hlth Sci Res Germans Trias & Pujol IGTP, Genomes Life GCAT Lab, Can Ruti Campus, Barcelona, Spain

Graffelman, J:
 Univ Politecn Cataluna, Dept Stat & Operat Res, Barcelona, Spain

 Univ Washington, Dept Biostat, Seattle, WA 98195 USA
ISSN: 13652540





Heredity
Editorial
Nature Publishing Group, CAMPUS, 4 CRINAN ST, LONDON, N1 9XW, ENGLAND, Reino Unido
Tipo de documento: Article
Volumen: 126 Número: 3
Páginas: 537-547
WOS Id: 000607998300001
ID de PubMed: 33452467
imagen Green Published, hybrid

MÉTRICAS