In complex [eukaryotic] organisms like humans, cats and cacti, genes are not single unbroken sequences of nucleotides from start codon to stop codon. Instead, complex organisms are broken up into sections which are translated called exons, with gaps of untranslated sequence between them called introns (Figure 1).
Introns and exons allow flexibility in whether particular parts of the genes sequence are included in the final protein sequence (Figure 2). This means that one gene can encode for many different proteins. Being able to flexibly make slightly different proteins has advantages for the organism, as the proteins produced can be varied according to what the cell or organism needs at a given moment, but it can do so within a smaller genomic space than an organism that had different genes for each version of the protein.
The regions of the gene sequence where an exon becomes an intron are called splice sites. The regions where a splice site ends an exon and begins an intron are splice site donors, while those that end an intron but begin an exon are splice site acceptors. The full mechanism of how these sites work, and how variable transcription happens is not fully understood, nevertheless there are a number of splice site sequence patterns which have been identified (Figure 3), and a number of algorithms which have been developed to predict splice sites and changes in them.
Changes to splice sites can cause catastrophic damage to a gene. A famous example of this is haemophilia in European royalty in the 19th and early 20th centuries. Thanks to the discovery of the bodies of the Romanov family (Figure 4), the cause of this disease was determined to be a mutation that changed a splice site in the gene for Factor IX, which caused haemophilia B, as the Factor IX gene was no longer functional, and so males who inherited a copy of this X chromosome gene from their mothers had this clotting disorder.
It should be noted that bacteria (which are not eukaryotes) do not have introns or exons, rather their entire genes are translated whole. This has important implications, as bacteria are common expression systems for proteins, which often come from eukaryotic organisms. This means that the genes for these proteins have to be modified in order to remove the introns in order for them to be expressed properly in bacterial cells, this can be achieved by identifying the protein or mRNA sequence and producing the DNA sequence based on it (Figure 5).
On April 24th 1915, authorities in the Ottoman Empire arrested over 200 Armenian notables in the city of Constantinople, who were later executed or expelled from the capital. This decapitation strike targeted Armenian community leaders and is marked annually as the anniversary of the beginning of the Armenian genocide, during which around 1 million Armenians were murdered.