Mutual Independence of Loci in databases of Multi-locus Genotypes: Application in Human Identification and Population Genetics

dc.contributor.advisor: Planz, John V.
dc.contributor.committeeMember: Woerner, August E.
dc.creator: Song, Bing
dc.date.accessioned: 2022-02-02T23:05:51Z
dc.date.available: 2022-02-02T23:05:51Z
dc.date.issued: 2020-12
dc.description.abstract: Multi-locus genotype data are widely used in population genetics and disease studies. When the utility of multi-locus data is evaluated, the independence of markers is a common consideration in genomic assessments. Non-random associations are generally tested pairwise for linkage disequilibrium (LD); however, dependence within a panel may involve triplets, quartets, or larger sets of loci. Compatible, user-friendly software is therefore needed for testing and assessing global linkage disequilibrium among mixed genetic data. This study describes a software package for testing the mutual independence of mixed genetic datasets. The new R package "mixIndependR" calculates basic genetic parameters such as allele frequency, genotype frequency, and heterozygosity, and tests Hardy-Weinberg equilibrium and mutual independence from population data, regardless of the type of marker: simple nucleotide polymorphisms, short tandem repeats, insertions and deletions, or any other genetic markers. A novel method of assessing the dependence of mixed genetic panels is developed in this study and functionally analyzed in the software package. Overall independence is tested by comparing the observed distributions of two common summary statistics (the number of heterozygous loci [K] and the number of shared alleles [X]) with their expected distributions under the assumption of mutual independence. The package "mixIndependR" is compatible with all categories of genetic markers and detects overall non-random associations. Compared to pairwise disequilibrium testing, the approach described herein tends to have higher power, especially when the number of markers is large. With this package, more multi-functional genetic panels can be developed, such as mixed panels that combine different kinds of markers.
In population genetics, the package "mixIndependR" offers a more powerful method of detecting LD, which makes it possible to learn more about admixture, natural selection, genetic drift, and population demographics. Moreover, this new approach can optimize variant selection in disease studies and contribute to panel combination for treatments in multimorbidity. Application of this approach to real data is expected in the future and may substantially advance genetic technology. The R package "mixIndependR" is available on the Comprehensive R Archive Network (CRAN) at: https://cran.r-project.org/web/packages/mixIndependR/index.html
dc.format.mimetype: application/pdf
dc.identifier.uri: https://hdl.handle.net/20.500.12503/30797
dc.language.iso: en
dc.subject: mutual independence
dc.subject: linkage disequilibrium
dc.subject: R Package
dc.subject: non-random association
dc.subject: statistical genetics
dc.subject.mesh: Genetic Loci / genetics
dc.subject.mesh: Genotype
dc.subject.mesh: Genetics, Population
dc.title: Mutual Independence of Loci in databases of Multi-locus Genotypes: Application in Human Identification and Population Genetics
dc.type: Thesis
dc.type.material: text
thesis.degree.department: Graduate School of Biomedical Sciences
thesis.degree.discipline: Microbiology and Immunology
thesis.degree.grantor: University of North Texas Health Science Center at Fort Worth
thesis.degree.name: Doctor of Philosophy
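
Illustrative note on the abstract's test of K: if loci are mutually independent, the number of heterozygous loci K in one individual is a sum of independent Bernoulli draws, one per locus, so its expected distribution is Poisson binomial. The Python sketch below builds that expected distribution and compares it with observed counts using a plain chi-square statistic. It is a minimal sketch under stated assumptions, not the mixIndependR API and not necessarily the statistic used in the thesis: the function names, the per-locus heterozygosities, and the observed values are all hypothetical.

```python
from collections import Counter


def heterozygous_count_pmf(het_probs):
    """Expected PMF of K under mutual independence of loci.

    het_probs[i] is the (assumed known) heterozygosity at locus i.
    Returns a list where entry k is P(K = k).
    """
    pmf = [1.0]  # with zero loci, K = 0 with probability 1
    for h in het_probs:
        nxt = [0.0] * (len(pmf) + 1)
        for k, p in enumerate(pmf):
            nxt[k] += p * (1.0 - h)  # this locus is homozygous
            nxt[k + 1] += p * h      # this locus is heterozygous
        pmf = nxt
    return pmf


def chi_square_statistic(observed_k, pmf):
    """Pearson chi-square statistic of observed K values against the PMF."""
    n = len(observed_k)
    counts = Counter(observed_k)
    stat = 0.0
    for k, p in enumerate(pmf):
        expected = n * p
        if expected > 0:
            stat += (counts.get(k, 0) - expected) ** 2 / expected
    return stat


# Hypothetical example: three loci, each with heterozygosity 0.5,
# and K observed in eight individuals.
het_probs = [0.5, 0.5, 0.5]
observed_k = [0, 1, 1, 1, 2, 2, 2, 3]

pmf = heterozygous_count_pmf(het_probs)
stat = chi_square_statistic(observed_k, pmf)
print(pmf, stat)
```

A large statistic relative to the reference distribution would suggest that the observed spread of K departs from what independence predicts. In practice, sparse tail categories would need pooling before a chi-square comparison is reliable.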

Files

Original bundle

Name: 2020_12_gsbs_Song_Bing_dissertation.pdf
Size: 26.26 MB
Format: Adobe Portable Document Format