Protein Investigator
- The Protein Investigator Screen
- Entering and Editing Protein Sequences
- Folding Your Proteins
- Using the History List
- Printing Your Structures
- Assumptions, approximations, etc.
(1) The Protein Investigator Screen
The Protein Invesitgator Screen is shown below with key parts numbered.

- The Amino Acid Table. This is a reference for the 20 amino acids
found in proteins. Both the one-letter and three-letter codes for
each amino acid are shown.
- The shades of the amino acids indicate
their relative hydrophobicities: the shades blend from
white (hydrophilic) through gray (intermediate)
to black (hydrophobic).
- Amino acids with positively-charged side chains have
a + in their symbol.
- Amino acids with negatively-charged side chains have
a - in their symbol.
- Amino acids whose side chains can make hydrogen bonds
have a * in their symbol.
- Upper Folding Window: Amino acid Sequence Box. Enter the amino acid
sequence you want to fold here. See under Entering
for instructions on how to enter and edit sequences. See under
Folding for how to fold sequences.
- Upper Folding Window: Folded Protein Pane. The structure of the folded
protein will appear here.
- Lower Folding Window. You can enter and fold another protein in
this window. You can then compare the two protein sequences and
structures.
- History List. Each time you fold a protein, it is saved in the
History List. Sequences in the history list can be loaded into either
folding window for further analysis.
(2) Entering and Editing Protein Sequences
You can enter a protein sequence by clicking in the Amino Acid Sequence Box and
typing the single letter code for the desired amino acids. The program
will automatically fill in the three-letter code for easier reading.
You can edit the amino acid sequence as you would any line of text: you can
click on a place to insert an amino acid (type the single-letter code); you
can select amino acids and delete them with the delete or backspace key
(the program will automatically delete complete amino acids).
Note that once you change the amino acid sequence, the border of the Folded
Protein Pane changes to pink - this indicates that the amino acid sequence
in the Sequence Box no longer matches that of the Folded Protein. The "Fold"
button is then activated so you can fold your new protein.
(3) Folding Your Proteins
To fold your protein, click on the "Fold" button in the appropriate Folding
Window. In a few seconds, depending on the length of your protein and the
speed of your computer, the folded protein will appear in the Folded Protein
pane. The border of the Folded Protein Pane will then turn gray to indicate
that your protein is folded and the "Fold" button will be deactivated until
you change the Amino acid sequence. The backbone of your protein will be shown
in magenta.
By selecting either "Disulfide Bonds On" or
"Disulfide Bonds Off", you can fold the protein under conditions
where disulfide bonds can form (oxidizing - typically outside of a cell) or not
(reducing - typically inside a cell), respectively. When enabled, disulfide bonds
can form between pairs of cysteines; they are stronger than ionic bonds and are shown
by yellow lines.
(4) Using the History List
The History List shows all the proteins you have folded during this session.
All proteins are lost once you quit the program.
You can move a folded protein from the History List to one of the
Folding Windows by double-clicking on the desired protein in the History List; its
border will change to green indicating that it has been selected. Then
choose either the "Send to Upper Panel" or "Send to Lower Panel" items
from the menu that pops up to load that protein into
the upper or lower folding window, respectively. You can then compare the
two proteins in the different folding windows as well as edit and re-fold
either sequence. You can also add a note to the item in the history list
that will appear when you leave the cursor over the item for a few seconds.
If you leave the cursor over a protein in the History List, a small window
will pop up that shows that protein's amino acid sequence and/or any notes
you have added.
You can save the proteins in your History List to a file by choosing
"Save As..." from the "File" menu. Once you have designated a file
to save into, choosing "Save" from the "History List" menu will automatically
save to that file. History List files must end with the extension
.histlist; the program will add it if you don't.
You can load in a History List that you have saved in a .histlist file
by choosing "Load" from the "History List" menu and selecting the desired
.histlist file. The current History List will be replaced by the one
in the file.
You can delete an protein from the History List by clicking on the protein
in the History List that you want to delete and choosing "Delete Selected"
from the "History List" menu. This action cannot be un-done.
You can clear the History List by selecting "Clear" from the "History
List" menu. This action cannot be un-done.
You can save all the proteins in the History List as a web page by selecting
"Save as Web Page..." from the "History List" menu. You will be
asked to supply the name of a folder in which all the relevant files
will be stored. You can view the web page by double-clicking the file
named "index.html" in the directory you have named. The web page
cannot be loaded into Protein Investigator; only a .histlist file can
do this.
(5) Printing Your Structures
You can print the amino acid sequences and folded structures of the
proteins in both Folding Windows by choosing "Print" from the "File"
menu.
(6) Assumptions, approximations, etc.
This is a highly-simplified model of protein folding. It is not
intended to predict the correct structures of any proteins; it is designed
to illustrate the major principles involved in that process. Our model of
protein folding includes the following factors:
- The hydrophobic effect: Proteins are folded so as to minimize
the number of hydrophobic amino acids exposed to the water surrounding
the protein. Different amino acids have different hydrophobicities that
span a wide range of values. This is the weakest force controlling the
shape of the proteins.
- Hydrogen Bonds: Proteins are folded so as to maximize contacts
between amino acids that can form hydrogen bonds. Amino acids are
either able to make hydrogen bonds or not; there are no intermediate
levels of hydrogen bonding ability. Hydrogen bonds have an intermediate
strength between the other two interactions.
- Ionic Bonds: Proteins are folded so as to maximize contacts
between side chains with unlike charges and to minimize contacts
between side chains with like charges. Amino acids are (+), (-),
or neutral; no intermediate levels of charge are allowed. Ionic
bonds are the strongest of the interactions.
Even though these provide some important insights into protein folding,
you should always keep in mind that this is an approximation. The most
important "gotcha's" to be aware of are:
- This program folds proteins in 2-dimensions only; real proteins
fold in 3-dimensions.
- This program treats all amino acids as equal-sized circles; real
amino acids have different sizes and shapes.
- This program folds the protein strictly based on the interactions
between the side chains; in real proteins, side chains can interact
with the backbone and proteins are often folded with the help
of chaperones.
- This program does not model secondary or quaternary structure;
real proteins often have these.
- This program assumes that all side chains with hydrogen bonding
capability can bond with each other; in real proteins this is
not always possible.