About 99.5 percent of a person's DNA is the same as any unrelated person's DNA. Differences in the sequence of DNA among individuals are called genetic variation. Genomic variation explains some of the differences among people, such as eye color and blood group, as well as whether a person has a higher or lower risk for getting particular diseases. Variants in single genes cause some traits and diseases, such as the ABO blood group and cystic fibrosis. Complex interactions among multiple genes and the environment cause common diseases such as diabetes, heart disease, many cancers, stroke, Alzheimer's disease, Parkinson's disease, depression, arthritis, and asthma.
Currently, about 99 percent of variants with a frequency of 1 percent or higher have been found, with a false detection rate of 5 percent. About 96 percent of the genome can be studied with high confidence. More than 80 million variant sites in the genome have been found so far, including single nucleotide polymorphisms (SNPs), insertions and deletions (indels), and other structural variants; these variants have been put onto haplotypes (phased), showing which variants at nearby sites are on the same chromosome. The program also seeks to relate there genetic variants to functional variation and phenotype.
The Genomic Variation Program supports large-scale studies of human genetic variation as part of projects such as the International HapMap Project and the 1000 Genomes Project. The program supports many studies developing data analysis methods for how to relate variation to traits, diseases, and responses to drugs and environmental factors, such as association and admixture approaches, and how to use patterns of variation to infer demographic history, selection, and other population genetic processes. The program also works with other programs to support experimental and analysis development studies and databases relating variation to differences in gene function and regulation and to clinical effects.