Jmol Tutorial

1. Downloading Jmol:

Jmol is an open source molecular visualization program that runs on a Java platform. You will need Java on your computer to run Jmol on your desktop. Java can be downloaded for free from https://java.com/en/download/.

For this workshop we will be using Jmol Version 14.15.3. This file is a single .jar file, and does not need to be "installed" on your computer. Once the .jar file has been downloaded, the program will be ready to use. Click on the .jar file to start Jmol on your computer.

For convenience, you may want to create a shortcut to the Jmol .jar file on your desktop (CREST Jmol Resources, n.d.).

2. Jmol Windows:

Once launched, Jmol will display one window that looks like this:

To open the console window, click file, then click console.

You should have 2 windows open now. They should look like this:

3. Opening your Insulin .pdb File

We will be using the 2hiu pdb file. To open this file click file, then click get PDB.

An input window should appear. In the window enter the file name, 2hiu, and then click okay.

Once you have clicked okay, you should see the 2hiu file in your display window. It will look like this:

To get the entire image on your screen, use the wheel on your mouse and scroll up to zoom out. In the future, if you want to zoom in, scroll down on your mouse. To zoom in/out without a mouse, hold the shift-key, then click and drag up to zoom in or click and drag to zoom out.

4. Deciding how to display your molecule:

One of the most powerful features of Jmol is the ability to completely customize how your structure is displayed. Every protein or molecular structure can be visualized in an endless variety of ways, each highlighting different features of the structure. When designing your model you may want to consider the following:

  • Amino acids involved in binding a substrate or ligand
  • Protein co-factors, ATP binding sites, DNA binding sites
  • Does the protein function in a dimeric (or multimeric) state? If so, do you want to display the dimer> And if you model the dimer, do you want to show different things in the two monomers?
  • Are there structural features you want to emphasize in the model?
    • Beta Sheets
    • Alpha helices
    • A loop that undergoes a major conformational change when activated
    • Disulfide bonds
    • Individual chains in a multichain protein
    • An unusual gorge or pocket or domain that is important to protein function
    • An amino acid that, when mutated, impacts protein function
    • Any ligands associated with the protein in your pdb file? (You will need to know how to call these up in Jmol; the pdb webpage will include a list of ligands and their abbreviations.) (CREST Jmol Resources, n.d.)

See protein data bank page on where to find this information on the 2hiu protein databank page.

5. Backbone Display Formats:

Your current view of the the insulin protein does not show you much about the structure of the protein. By using some simple commands, you can change the way the molecule is displayed and highlight different structures within the protein.

In order to clearly see the secondary structures within the protein you can use the backbone format. The backbone format displays the position of the alpha carbon in each amino acid by a bend in the backbone. All other atoms within the amino acid are not displayed.

To turn all other formats off and turn only backbone on:

backbone only

There are other display formats and commands that you can use when modeling in Jmol. To explore those formats and associated commands, check the Jmol commands page here.

6. Adding thickness to Display Formats:

Many of the display formats can be further altered by changing the thickness. To do this, simply add a number after any display format when typing the command into the console. The number added after the display format must include a decimal.

To display the backbone format with 1.5 angstrom (a unit of length equal to one hundred-millionth of a centimeter, 10^-10) thickness:

backbone 1.5

7. Selecting Chains and Coloring Selected Chains:

Many protein structures have more than one polypeptide chain. When a protein has more than one polypeptide chain, the protein is considered to have quarternary structure. These chains are labeled with single letter identifiers in the .pdb file and can be selected by entering a colon (:) followed by the letter identifier. Only the atoms in chain you select will be affected by future commands.

The protein insulin has two chains. Chain A and Chain B.

To select for chain A:

select :a

To color chain A blue:

color blue


To select for chain B:

select :b

To color chain B white:

color white


You may decide to highlight secondary structure instead of the two polypeptide chains that make up the insulin protein. You would use the color structure command to color the model by secondary structures. Helices will be colored magenta, beta sheets will be colored yellow, and loop regions will be colored white.

Since you currently have your B chain selected you will have to select all again to color the entire protein by structure.

To select all:

select all

To color by structure:

color structure

There are other select commands and color commands that you can use when modeling in Jmol. To explore those commands, check the Jmol commands page here.

8. Adding Bonds:

Jmol has the ability to render different bonds within the protein like Hydrogen Bonds and Disulfide Bonds between Cysteine amino acids. If you wanted to 3D print your protein, you could also add additional structural supports called Struts.

One important feature of our insulin protein is the presence of Disulfide bonds. In order to show all of the disulfide bonds in insulin, you will need to select all.

To select all:

select all

To show your disulfide bonds at 1.0 angstrom thickness:

ssbonds 1.0


Disulfide bonds occur between two cysteine amino acids. Your disulfide bonds currently look like they are floating, because the backbone format only shows the position of the alpha carbon.

To set your disulfide bonds in the backbone:

set ssbonds backbone

To color the disulfide bond yellow:

color ssbonds yellow

To see explore to add how to add and format hydrogen bonds, check the Jmol commands page here.

9. Adding sidechains:

We are currently using the backbone format. This format only displays the position of the alpha carbon in each amino acid by a bend in the backbone. All of the other atoms within the amino acid are not displayed, including the sidechain (also known as an R-Group).

Sometimes you may wish to display an amino acid's sidechain with the rest of your protein structure in backbone format, because it is crucially important to telling the story of your protein. For example, instead of highlighting the disulfide bonds, you may wish to show the cysteine sidechains that form the disulfide bonds in your insulin protein (CREST Jmol Resources, n.d.).

To select the particular sidechain, you can use the amino acid sidechain three letter abbreviation and the number of the amino acid in the protein. You need to be careful here because there can be more than one amino acid with a given number if there are more than one polypeptide chains in a structure file (CREST Jmol Resources, n.d.).

Remember this map of the insulin protein from the Insulin: mRNA to Protein activity:

This map shows that Cys7 of the B-chain forms a disulfide bond with Cys7 of the A-chain. Cys19 of the B-chain forms a disulfide bond with Cys20 of the A-chain. A third disulfide bond forms between Cys6 and Cys11, both from the A-chain.

To add your sidechains you are going to do a series of commands in Jmol. You will use the select, spacefill, and wireframe commands.

To select for Cys7 to show a clean sidechain (note that this command selects both the Cys7 of the A-chain and Cys7 of the B-chain):

Select cys7 and sidechain

The use of and in this command is a boolean operator. This command selects for atoms in cys7 and the sidechain (R-group) of the amino acids. Remember an 'and' boolean operator has to fulfill both conditions.

To add the spacefill format with a 1.25 Angstrom thickness:

spacefill 1.25

To add the wireframe format with a 1.0 Angstrom thickness:

wireframe 1.0

To color the carbon's gray, the hydrogen's white, oxygen's red, nitrogen's blue, and sulfur's yellow:

color cpk

You can select for multiple sidechains at the same time.

To select for cys19 and cys20, spacefill to 1.25 angstrom thickness, wireframe to 1.0 angstrom thickness, and color cpk:

select cys19, cys20 and sidechain
spacefill 1.25
wireframe 1.0
color cpk

If you play with the viewer on Jmol, you may notice that we have not yet added sidechains to one of the disulfide bonds. What is the argument to not showcase the disulfide bond between cys 6 from the A chain and cys11 from the A chain?

If you choose to not highlight this bond and did not want to show the bond at all, you can turn the disulfide bonds off.

To turn disulfide bonds off:

select all
ssbonds off

10. Coloring the N-terminus and the C-terminus:

You may want to highlight where the N-terminus and C-terminus on each sidechain is. Remember due to the way that amino acids link together, all proteins show the same pattern in connection. Look at the image below and see if you can find the pattern. This means that you polypeptide chain has directionality.


Using the protein data bank sequencing page for 2hiu (https://www.rcsb.org/pdb/explore/remediatedSequence.do?structureId=2HIU), you can find the N-terminus and C-terminus for each polypeptide.

The sequence for chain A looks like this:

The N-terminus is found on the first amino acid in the chain. In this chain, the first amino acid is G or Gly. The protein databank only uses one letter abbreviations, you can find a great amino sidechain chart here.

To select for Gly:

select gly1

To color blue:

color blue

To select for the C-terminus:

select asn21

To color red:

color red

The sequence for chain B looks like this:

To select for the N-terminus on chain B:

select phe1

To color blue:

color blue

To select for the C-terminus on chain B:

select thr30

To color red:

color red

You could simplify this code and select for both of your N-termini at the same time:

select phe1, gly1

To color blue:

color blue

The same applies for your c-termini:

select thr30 or asn21
color red

11. Other commands you may find useful:

Some molecular structures include additional small molecules that are not proteins. These molecules are given a three character alpha-numeric identifier that can be used to select them. The file 2hiu.pdb does not contain any small and unique molecules. Some of the .pdb files you will be looking at later today may contain small and unique molecules that you want to select for and highlight in your model (CREST Jmol Resources, n.d.).

To determine the three character identifier, you can click on the Jmol display window or review the structure summary page for the .pdb file on the Protein Databank (www.pdb.org) website (CREST Jmol Resources, n.d.). When looking at the structure summary page of the .pdb file, you can scroll down to see if it contains any small molecules. Let's use hemoglobin (1a3n) as an example. The structure summary will look like this:


To select for the small molecule HEM:

select hem

You may also want to restrict or only show listed residues or small molecules.

To only show the hem group listed above:

restrict hem


References:

CREST Jmol Resources. (n.d.). Retrieved from http://cbm.msoe.edu/crest/crestJmolResources.php