ABI Bioinformatics Guide 2024
  • INTRODUCTION
    • How to use the guide
  • MOLECULAR BIOLOGY
    • The Cell
      • Cells and Their Organelles
      • Cell Specialisation
      • Quiz 1
    • Biological Molecules
      • Carbohydrates
      • Lipids
      • Nucleic Acids (DNA and RNA)
      • Quiz 2
      • Proteins
      • Catalysis of Biological Reactions
      • Quiz 3
    • Information Flow in the Cell
      • DNA Replication
      • Gene Expression: Transcription
      • Gene Expression: RNA Processing
      • Quiz 4
      • Chromatin and Chromosomes
      • Regulation of Gene Expression
      • Quiz 5
      • The Genetic Code
      • Gene Expression: Translation
    • Cell Cycle and Cell Division
      • Quiz 6
    • Mutations and Variations
      • Point mutations
      • Genotype-Phenotype Interactions
      • Quiz 7
  • PROGRAMMING
    • Python for Genomics
    • R programming (optional)
  • STATISTICS: THEORY
    • Introduction to Probability
      • Conditional Probability
      • Independent Events
    • Random Variables
      • Independent, Dependent and Controlled Variables
    • Data distribution PMF, PDF, CDF
    • Mean, Variance of a Random Variable
    • Some Common Distributions
    • Exploratory Statistics: Mean, Median, Quantiles, Variance/SD
    • Data Visualization
    • Confidence Intervals
    • Comparison tests, p-value, z-score
    • Multiple test correction: Bonferroni, FDR
    • Regression & Correlation
    • Dimentionality Reduction
      • PCA (Principal Component Analysis)
      • t-SNE (t-Distributed Stochastic Neighbor Embedding)
      • UMAP (Uniform Manifold Approximation and Projection)
    • QUIZ
  • STATISTICS & PROGRAMMING
  • BIOINFORMATICS ALGORITHMS
    • Introduction
    • DNA strings and sequencing file formats
    • Read alignment: exact matching
    • Indexing before alignment
    • Read alignment: approximate matching
    • Global and local alignment
  • NGS DATA ANALYSIS & FUNCTIONAL GENOMICS
    • Experimental Techniques
      • Polymerase Chain Reaction
      • Sanger (first generation) Sequencing Technologies
      • Next (second) Generation Sequencing technologies
      • The third generation of sequencing technologies
    • The Linux Command-line
      • Connecting to the Server
      • The Linux Command-Line For Beginners
      • The Bash Terminal
    • File formats, alignment, and genomic features
      • FASTA & FASTQ file formats
      • Basic Unix Commands for Genomics
      • Sequences and Genomic Features Part 1
      • Sequences and Genomic Features Part 2: SAMtools
      • Sequences and Genomic Features Part 3: BEDtools
    • Genetic variations & variant calling
      • Genomic Variations
      • Alignment and variant detection: Practical
      • Integrative Genomics Viewer
      • Variant Calling with GATK
    • RNA Sequencing & Gene expression
      • Gene expression and how we measure it
      • Gene expression quantification and normalization
      • Explorative analysis of gene expression
      • Differential expression analysis with DESeq2
      • Functional enrichment analysis
    • Single-cell Sequencing and Data Analysis
      • scRNA-seq Data Analysis Workflow
      • scRNA-seq Data Visualization Methods
  • FINAL REMARKS
Powered by GitBook
On this page

Was this helpful?

  1. MOLECULAR BIOLOGY
  2. Mutations and Variations

Genotype-Phenotype Interactions

PreviousPoint mutationsNextQuiz 7

Last updated 10 months ago

Was this helpful?

Mutations are the primary source of genetic variation within a species, creating variants of nucleotide sequences in a gene called alleles. While many alleles do not significantly affect gene function, some result in distinct phenotypic traits.

A classic example of allele variation is observed in pea plants, particularly in genes determining flower colour. In these plants, a gene encoding a protein which controls the production of a purple pigment called anthocyanin has two alleles. One allele codes for a functional protein, leading to purple flowers, while the other allele is faulty, resulting in no pigment production and thus white flowers.

Most multicellular organisms are diploid, meaning they possess two sets of chromosomes in their cells. Consequently, genes are typically represented by two alleles in a cell. If both alleles are identical, the organism is termed homozygous for that gene. Conversely, if the alleles differ, the organism is termed heterozygous.

In a heterozygous organism, if only one of the alleles affects the observable characteristics it is called dominant, while the second allele is called recessive. For instance, in pea plants with heterozygous alleles for flower colour, where one allele codes for a functional protein and the other for a non-functional protein, the purple flower trait is dominant.

Examples of phenotypes discussed above demonstrate discontinuous variation, which means that every individual can be assigned to a particular group according to its phenotype. However, plenty of traits, such as human body mass or skin tone are distributed continuously, in other words, an individual can have any characteristic value within a certain range. Such characteristics are commonly polygenic, which means, they are controlled by multiple genes, and/or are influenced by the environment.

Skin tone is polygenic, meaning it's controlled by multiple genes. At least 150 genes are known to contribute to skin colour, and even a simplified model considering just 3 genes, each with 2 alleles, can illustrate polygenic inheritance and its effect on phenotype. In this model, an individual who is homozygous recessive for all three genes would have a very light skin tone, while a homozygous dominant individual would have very dark skin. The combination of different alleles of these genes produces the wide spectrum of skin tones observed in human populations.

Interactions between alleles can often be more nuanced than a binary dominant-recessive relationship. For example, human blood groups are determined by a gene with three alleles IIIIAI^{A}IA, IIIIBI^{B}IB, and iii. This gene governs the presence of a specific carbohydrate on the surface of red blood cells, with each allele resulting in a different structure of this carbohydrate. Alleles IAI^{A}IAand IBI^{B}IB lead to the production of carbohydrate structures A and B, respectively, while allele iii results in the absence of the carbohydrate. Both IAI^{A}IAand IBI^{B}IB are dominant over allele iii, meaning individuals with genotypes IAiI^{A}iIAi and IBiI^{B}iIBi will have blood A and B, respectively. Only individuals with the genotype iiiiii will have no carbohydrate on their red blood cells and will have the blood group O. However, individuals with the genotype IAIBI^{A}I^{B}IAIB will exhibit blood group AB, indicating the presence of both carbohydrate structures. This type of allele interaction, where both alleles contribute to the phenotype in a heterozygous individual, is termed codominance.

Alleles are variants of a nucleotide sequence in a gene
Blood groups Image source: OpenStax College - Anatomy & Physiology, Connexions Web site. http://cnx.org/content/col11496/1.6/, Jun 19, 2013., CC BY 3.0, https://commons.wikimedia.org/w/index.php?curid=30148183 with changes
A simplified model of polygenic inheritance of skin colour