From Carolina Biologicals; Developed by A. Malcolm Campbell, Ph.D. and Todd Eckdahl, Ph.D.
For information about bacterial promoters please read: Pierce: Genetics: A conceptual approach, 6th ed. bacterial promoters are section 13.3, pg 381-386
The timing of gene expression is the mechanism through which an organism's genetic information controls cellular metabolism and many other critical life processes. Inappropriate gene expression can lead to diseases such as cancer and diabetes. Gene expression involves transcription of DNA into RNA and the translation of RNA to create proteins. Promoters are DNA sequences that initiate transcription and are among the most important and most studied elements controlling gene expression. Information about how promoters function has applications in basic science research and biological engineering.
In this lab, you will clone promoters and indirectly test their ability to initiate transcription of a gene that encodes a blue protein, AmilCP Blue. You will be supplied two different pairs of complementary strands of single-stranded DNA (ssDNA) molecules, and also design your own. From these ssDNA molecules, you will build double-stranded DNA (dsDNA) promoters that you will clone into a plasmid called pClone Blue, using a cloning method called Golden Gate assembly (GGA). After cloning the promoters, you will transform E. coli with GGA reactions containing the promoter mixed with the plasmid and will then plate the transformed bacteria to produce ampicillin-resistant colonies. Once the bacterial colonies express the colored protein, you will measure the level of these proteins by using the free software program lmageJ.
One of the fundamental concepts in biology is the transfer of information from the genome to the cell. This transfer is accomplished through the transcription of DNA to RNA, and the subsequent translation of the RNA into protein.
Appropriate regulation of gene expression is critical for the function of all organisms. Transcription of RNA plays a key role in ultimately controlling what protein is made in what cell and when. The change in the set of proteins in a cell serves many functions. In bacteria, many genes are transcribed only under conditions in which their protein products are needed. In multicellular organisms, each cell contains the same genes, but not all of the genes are expressed, so different cells contain different proteins. Control of gene expression allows the different cells to carry out their different functions in their specific environments. The growth of different tissues during development also requires that different cells express different proteins.
The main modulator of gene transcription is called the promoter. This laboratory focuses on the role of the promoter in regulating gene expression. In this lab, you will work with slightly different promoter sequences to measure their effects on the expression of reporter proteins encoded on plasmids. A reporter protein is used as an indicator of transcription from a specific promoter. Reporter proteins have a characteristic, such as color, that makes it easy to measure how much protein is present in the cell.
Figure 1 shows the elements required for expression of a gene in bacteria. The numbering of the nucleotides composing the coding region of a gene starts at+ 1, which corresponds to the first nucleotide transcribed by RNA polymerase and the first nucleotide in the mRNA produced by transcription. Nucleotides upstream of+ 1 are numbered consecutively with negative numbers, with -1 immediately adjacent to+ 1. Bacterial promoters are always described using negative numbers because they are located upstream of the+ 1 nucleotide.
Promoters in eukaryotes, bacteria, and viruses contain two important binding sites for RNA polymerase. The first binding site attracts RNA polymerase to the DNA, while the second one facilitates conversion of the DNA from double stranded to single stranded so that the DNA can be transcribed. In E. coli and other bacteria, a common promoter structure is one in which RNA polymerase binds first to the nucleotides centered approximately on base -35 and then to the nucleotide centered approximately on -10 (Figure 1). Once the RNA polymerase has bound to the promoter, transcription proceeds from the +1 nucleotide through the rest of the gene. In bacterial mRNA, shortly after the +1 nucleotide is the ribosomal binding site (RBS), which often begins at around nucleotides +6 to +8. The RBS is where the ribosome binds to the mRNA to initiate translation. In bacteria, translation commences before transcription is completed, meaning that mRNAs that are not fully produced can have ribosomes bound to them that are in the process of translating the encoded protein.
Figure 1. Diagram of the DNA required for gene expression in bacteria. RNA polymerase binds to the promoter (blue curved arrow) and initiates transcription at nucleotide number +1 upstream of the ribosomal binding site (RBS). Transcription proceeds through the coding DNA in the direction of the arrows.
Bacterial promoter sequences are not identical to one another, but they do have consensus sequences. The consensus sequence represents the nucleotides that most frequently occur at a given location in a DNA sequence for a given species. For example, the -10 consensus sequence in E. coli promoter is TATAAT. The consensus sequence for the -35 region is TTGACA. Not every promoter has this sequence; individual promoter sequences usually vary by one or more bases. The degree of variation from the consensus is the major determinant of how well the RNA polymerase will bind to the promoter. The affinity of RNA polymerase determines the baseline transcriptional strength of a promoter.
The baseline expression of a promoter can be affected by other factors. Promoters in genomes are longer than the 41 bases shown in Figure 1. The additional bases make it possible to control the timing of expression and the amount of mRNA made. Typically, a bacterial promoter is active unless transcription is prevented by the binding of a protein called a repressor to the promoter sequence. Alternatively, some promoters require that an additional protein called an activator bind to the promoter before transcription can begin.
In this experiment, you will use a simplified promoter that has -10 and a -35 elements but does not contain any additional regulatory elements. During GGA, you will remove the promoter that drives the transcription of green fluorescent protein (GFP) from the plasmid pClone Blue. In its place, you will clone three different promoters, one functional, one nonfunctional and one you have designed yourself. The functional promoter (P5) has the sequence shown in Figure 2 and is a constitutive promoter, meaning it is always active. The nonfunctional promoter (P5-33A) has a single base variation from the wild-type P5 that greatly reduces the binding of RNA polymerase and therefor the rate at which the reporter gene is transcribed. You and your partner will engineer a novel promoter based on P5 to test the function of the various elements of the promoter.
To generate these three promoters, you will be provided with pairs of single-stranded (ss) short polymers of DNA (also called oligonucleotides) that you will assemble into double-stranded (ds) DNA. Notice that the dsDNA has a short single-stranded segment of four bases on each end that are called "sticky ends"; these will be described in more detail later.
Figure 2. Diagram showing the sequence of the oligonucleotides used to produce the PS and P5-33A promoters and how they anneal to form promoters that can be cloned into pClone Blue or pClone Red using Golden Gate assembly, the cloning method that you will use in this lab.
To avoid many of the common difficulties of cloning, this laboratory uses the Golden Gate assembly (GGA) cloning method adapted for use with the pClone Blue plasmid (Weber, 2011; Werner, 2012).
Figure 3. Restriction enzyme recognition (red font) and cut sequences. A) Type II enzyme EcoRI restriction site GAATTC is cut in the middle by the enzyme and produces sticky ends (bold font). B) Type IIS enzyme Bsal recognition site is GGTCTC, but the enzyme cuts to the right as shown. The "N"s may be any of the four DNA bases. The sticky ends are four bases (bold font), the sequence of which vary depending on the nucleotides located adjacent to the recognition sequence. C) The Bsal restriction enzyme can cut to the left of the recognition site as indicated.
The GGA protocol relies on the type IIS restriction enzyme Bsal (Figure 3). The use of this enzyme allows you to cut the plasmid DNA and ligate the desired insert in the same reaction. Unlike many of the commonly used type II restriction enzymes that cut within their recognition sequences (e.g., EcoRI, see Figure 3A), Bsal cuts outside its recognition sequence. Regardless of whether the 5' GGTCTC 3' recognition site is on the top or bottom strand, the Bsal cuts 3' to the sequence. To keep a clear idea of this in mind, it is useful to label each Bsal site with an arrow indicating whether the cut is to the left or to the right of the binding site. The Bsal staggered cut produces a 5' overhang four bases long whose sequence is independent of the enzyme's recognition sequence (Figure 3). Because the sequence of this overhanging sequence is independent of the enzyme's recognition site, the DNA fragments to be connected by GGA can be engineered to any desired sequence. Thus, the fragments can be engineered to have overhanging ends that are complementary, which allows them to be ligated by the enzyme DNA ligase, and to have no Bsal recognition sites present in the final ligation product. The importance of the lack of Bsal recognition sites will become clear as the GGA reaction is described further.
In this lab, you will use the plasmid pClone Blue, which contains two outward-facing Bsal sites flanking the promoter region. When the plasmid is cut with the Bsal enzyme, the DNA between the two Bsal cut sites, along with the Bsal recognition sites, is excised to make room for the new promoter (Figure 4).
Figure 4. Diagram of the pClone plasmids. Sticky ends used for cloning are shown inside the white boxes. These sticky ends are generated when either pClone Red or pClone Blue is cut with Bsal. The location of the recognition sites for Bsal are indicated by black arrows and explained in the GGA section. pClone Blue contains a left-facing GFP gene including a functional promoter that initiates transcription to the left. GFP is produced unless a new promoter initiating transcription to the right is cloned in to initiate transcription of the blue chromoprotein AmilCP blue.
To carry out the GGA reaction, the new promoter insert and the pClone Blue plasmid are mixed with Bsal and DNA ligase simultaneously in a single tube under conditions that allow both enzymes to function. The reaction is placed into a thermal cycler and the temperature is cycled between 37°C (the optimum temperature for digestion by Bsal) and 16°( (the optimum temperature for ligation). During digestion by BsaI, the original promoter is cut out of the pClone Blue plasmid, leaving overhanging base pairs (sticky ends) that are complementary to the overhanging base pairs of the new promoter insert (see Figure 3). When the temperature is shifted to 16° ligation by DNA ligase occurs. During ligation, the new promoter with its complementary overhanging nucleotides is ligated into the plasmid.
The original promoter may also religate into the plasmid, but because reinsertion of this original DNA fragment regenerates the Bsal sites, the recently religated original promoter will be cut out again during the next digestion step. Plasmid with the new promoter ligated into it cannot be digested, because the Bsal recognition sites are no longer present.
The GGA reaction is put through multiple cycles, reducing the number of plasmid molecules containing the original promoter and increasing the number of plasmid molecules with the new promoter. The GGA reaction terminates with a 15-minute incubation at 37°C to cleave as many as possible of the remaining Bsal sites in plasmids still containing the original promoter.
The pClone Blue plasmid has been engineered specifically for GGA. As mentioned above, each plasmid contains two opposing Bsal sites so that the DNA between the two Bsal sites can be excised by Bsal digestion to make room for the new promoter (Figure 4). In addition, these plasmids allow you to distinguish between colonies containing the following:
• plasmid with the original promoter (they are still able to produce GFP using the original promoter).
• plasmid with the new promoter (they can use this promoter to produce AmilCP Blue).
• plasmids without a functional promoter (they are yellowish white).
In its original form, pClone Blue produces GFP. The GFP gene faces "backward," meaning that the promoter is not shown in the conventional right-facing direction.
Note: A gene cannot really be "backward" since DNA can move within the cell. Furthermore, because a gene can be encoded by either strand of dsDNA, backward applies only to the way we look at genes on paper. The left-facing functional promoter driving GFP expression is flanked by two Bsal sites, so the original promoter is removed during GGA. The introduction of the new P5 or P5-33A promoter changes the direction of transcription. Thus, pClone Blue no longer expresses GFP but instead expresses a blue chromoprotein called AmilCP blue.
The gene that codes for AmilCP blue is from the coral Acropora millepora. The protein reflects blue light and does not fluoresce. Green fluorescent protein (GFP) was isolated from the jellyfish Aequorea victoria. GFP can sometimes be seen under white light but is more easily seen under UV light.
The GFP expressed from the default promoter is also visible under white light. However, in order to distinguish between the green colonies containing the original promoter and the yellowish white colonies without a functional promoter, it is best to take photos of the plates while they are on a UV transilluminator.
In this lab, you will design and build your own promoter. You can develop and test your own hypotheses regarding changing bases within known promoter elements such as the -10 and -35 sites within the P5 promoter used in this study. You can introduce any SNPs, deletions, or insertions and test your hypotheses about the functional impact of mutating the promoter. Keep in mind that mutations may have no effect, or may increase or decrease transcription.
You will first determine the sequence of the promoters that you want to clone. Then generate top and bottom strands of your promoter and add the two sticky ends, as shown in Figure 2 above. Make sure the top strand has the four-base sticky ends of CGAC on its 5' end as shown. The bottom strand should have the CCGC sticky end on its 5′ end. Keep track of the 5′ to 3′ orientation of oligonucleotides to avoid a common mistake that will produce aberrant cloning results. Be sure that the BsaI recognition sequence of 5′GGTCTC 3′ does not occur in either of the two oligonucleotides. Keep the oligonucleotides limited to 60 or 70 bases in length, so they can be produced inexpensively and quickly by a range of suppliers. DNA oligonucleotides are always ordered in a 5′ to 3′ direction.
Once you have designed your two oligonucleotides show your instructor and discuss your hypothesis regarding promoter function with your instructor. Then add your oligonucleotides to the class ordering sheet in the 5' to 3' orientation for BOTH oligos.