C687 Midterm Questions and Answers
A subset of the following questions will appear on the mid-term exam.
No questions will appear on the exam that are not included in this list.
The mid-term exam is scheduled for 1:30-3pm, Wednesday, April 7, in room A400.
Questions are in bold.
Acceptable answers are in plain text. Note that more than one acceptable answer may exist for some questions.
Visualization
- Q: What is "visualization"?
A: changing the view to improve the information content.
- Q: Identify all types of Visualization Models in the list below:
- Graphical Interpretation
- Simplification
- Symbolic Reduction
- Analgization
- Mathethmatical Description
- All of the above
- none of the above
A: 2, 4, and 5
- Q: Identify all types of Visualization Models in the list below:
- Fundamental Cartoon
- Comprehensive Perspective
- Didactical Illustration
- Dynamic Animation
- All of the above
- none of the above
A: 3
- Q: Identify all types of Visualization Models in the list below:
- Mathethmatical Description
- Mathethmatical Generalization
- Mathethmatical Correlation
- Mathethmatical Simplification
- All of the above
- none of the above
A: 1
- Q: True or False: The molecular modeling specialist must judge whether proper techniques have been used
("is the result accurate?").
A: True
- Q: True or False: The molecular modeling specialist must judge whether proper controls and control experiments have been used ("is the result reliable or sgnificant?").
A: True
- Q: True or False: The molecular modeling result depends as much on the quality of the modeler as the quality of the modeling technique.
A: True
- Q: True or False: The molecular modeling specialist must judge whether sufficient parameter space been searched if not all possibilities have been calculated.
A: True
- Q: True or False: The molecular modeling specialist must judge whether the basis set has been sufficiently large to cover all appropriate possibilities.
A: True
- Q: A stand-alone computer must have:
- CPU
- memory
- disk
- graphics processor
- monitor
- keyboard
- All of the above
- none of the above
A: 7
- Q: True or False: A "local computer" is the computer with which you physically interact during your computing session.
A: True
- Q: True or False: A "remote computer" is the computer with which you interact through a network (e.g., via 'telnet') from a local computer.
A: True
- Q: Why use UNIX workstations?
- UNIX workstations are easier to program and support.
- UNIX workstations have historically been used for scientific computing.
- The specific parts of a UNIX workstation are still faster than parts of a PC or Mac workstation.
- The interaction of parts of a UNIX workstation is still faster with UNIX workstations than with PCs and Macs.
- All of the above
- none of the above
A: 1, 2, and 4
- Q: In practice, a "networked computer" may consist of:
- monitor and keyboard of a local workstation
- monitor and keyboard of a remote workstation
- CPU, memory, graphics processor of a local workstation
- CPU, memory, graphics processor of a remote workstation
- disk in local workstation
- disk in remote workstation
- All of the above
- none of the above
A: 1, 3, 4, 5, and 6
Database Searching, Sequence Alignment, Homology Modeling,
and Evaluation of the Quality of a Molecular Model
- Q: True or False: Sequence alignments are evolutionary hypotheses.
A: True
- Q: True or False: "sequence homology" is a binary property. Two sequences ARE homologous or ARE NOT homologous.
A: True.
- Q: True or False: "sequence homology" is a quantitative measure. The percentage of homology between two sequences is an appropriate description.
A: False.
- Q: True or False: "sequence identity" is a quantitative measure. The percentage of sequence identity between two sequences is an appropriate description.
A: True.
- Q: True or False: "sequence identity" is a binary property. Two sequences ARE identical or ARE NOT identical.
A: False
- Q: If two sequences are greater than 100 residues, and they have >25% sequence identity, are they related?
A: Probably YES.
- Q: If two sequences are greater than 100 residues, and they have >15% and <25% sequence identity, are they related?
A: Teh amount of sequence identity is too ambiguous to answer this question.
- Q: If two sequences are greater than 100 residues, and they have <15% sequence identity, are they related?
A: Probably NO.
- Q: If two sequences are less than 100 residues, and they have >25% sequence identity, are they related?
A: Due to the relatively short sequence lengths, this question can't be answered with confidence.
- Q: If two sequences are less than 100 residues, and they have >15% and <25% sequence identity, are they related?
A: Due to the relatively short sequence lengths, this question can't be answered with confidence.
- Q: If two sequences are less than 100 residues, and they have <15% sequence identity, are they related?
A: Due to the relatively short sequence lengths, this question can't be answered with confidence.
- Q: Identify all sequence alignment scoring matrices in the following list (select all that apply):
- codon substitution scoring matrices
- electrostatic scoring matrices
- hydrophobicity scoring matrices
- loops & gaps scoring matrices
- hydrogen bond scoring matrices
- all of the above
- none of the above
A: 1 and 3
- Q: Identify all sequence alignment scoring matrices in the following list (select all that apply):
- biosynthetic frequency matrices
- sequence homology matrix
- mutation frequency scoring matrices
- sequence identity matrix
- residue volume matrices
- all of the above
- none of the above
A: 3 and 4
- Q: Which sequence alignment scoring matrices include secondary structure information?
A: None! This is a trick question.
- Q: What is a gap in a sequence alignment?
A: A segment in a sequence that does not have residues that correspond to existing residues in the other aligned sequence or sequences.
- Q: What is a loop in a sequence alignment?
A: A segment in a sequence for which there are no corresponding residues in the other aligned sequence or sequences.
- Q: True or False: sequence alignment methods often have difficulty with regularly repeating segments
(e.g. repeating sequences in helices) or pseudo-symmetric dimers.
A: True.
- Q: True or False: gaps and loops usually exist within structurally conserved regions.
A: False.
- Q: Sequence alignments can be used to perform the following (select all that apply):
- secondary structure prediction
- designing PCR primers
- development of molecular interaction scales
- inferring functional importance from sequence conservation
- testing evolutionary hypotheses (i.e., phylogenetic analyses)
- all of the above
- none of the above
A: 2, 4, 5
- Q: True or False: Specific types of side chain rotamer angles are found within specific types of structural motifs.
A: True.
- Q: True or False: about 80% of identical residues have the same rotameric state for all side chains.
A: True
- Q: True or False: about 70% to 75% of mutated residues have the same rotameric state for all side chains.
A: True
- Q: Identify options available for evaluating the quality of a protein model (select all that apply):
- evaluation of solvent-accessible hydrogen bond donors
- evaluation of mainchain torsion angle distributions
- evaluation of peptide bond angle distributions
- evaluation of solvent-accessible polar groups
- evaluation of chirality of alpha-carbon atoms
- all of the above
- none of the above
A: 2, 3, 5
- Q: Identify options available for evaluating the quality of a protein model (select all that apply):
- evaluation of bond length distributions
- evaluation of the model's free energy thermodynamic cycle ("delta-delta-G")
- evaluation of bond angle distributions
- evaluation of planarity of aromatic ring systems
- evaluation of electrostatic charge distributions
- all of the above
- none of the above
A: 1, 3, 4
- Q: Identify options available for evaluating the quality of a protein model (select all that apply):
- evaluation of planarity of sp2-hybridized end groups
- evaluation of van der Waals overlaps ("bump check")
- evaluation of hydrogen bond qualities
- evaluation of the distribution of polar amino acids on surface & interior
- evaluation of backbone atom types
- all of the above
- none of the above
A: 1, 2, and 4
- Q: Identify options available for evaluating the quality of a protein model (select all that apply):
- evaluation of energies of force field cross terms
- evaluation of the distribution of non-polar amino acids on surface & interior
- evaluation of buried hydrogen bond donors
- evaluation of solvent-accessible hydrogen bond acceptors
- evaluation of 3D RMSD superpositions of model structures and template structures
- all of the above
- none of the above
A: 2, 3, and 5
- Q: Identify options available for evaluating the quality of a protein model (select all that apply):
- evaluation of the geometry of secondary structural elements
- evaluation of non-bond energies
- evaluation of buried polar groups
- evaluation of the location of secondary structural elements
- evaluation of the model's radius of gyration
- all of the above
- none of the above
A: 1, 3, and 4
- Q: Identify options available for evaluating the quality of a protein model (select all that apply):
- evaluation of the model's surface area
- evaluation of buried hydrogen bond acceptors
- evaluation of the model's center of mass
- evaluation of chirality of serine side chains
- evaluation of chirality of threonine side chains
- all of the above
- none of the above
A: 1, 2, and 5
- Q: What is a Ramachandran plot?
A: a graph of a protein's mainchain torsion angle distribution (i.e., phi angle vs psi angle for each residue).
- Q: True or False: Incorrectly folded protein models almost always have larger surface areas than correctly folded protein models.
A: True.
- Q: How many unsatisfied hydrogen bond donors or hydrogen bond acceptors are acceptable in an ideal high-quality protein model?
A: Ideally, there should be ZERO unsatisfied hydrogen bond donors or hydrogen bond acceptors.
- Q: True or False: The total energy of two or more homology models can be used to determine which homology model is correct.
A: False.
- Q: Identify databanks and databases that contain 3D structures (select all that apply):
- the Swis-Prot database
- the Protein Data Bank
- the TIGR Database
- the EC Enzyme Database
- the Cambridge Structure Database
- all of the above
- none of the above
A: 2 and 5
- Q: Identify databanks and databases that contain 3D structures (select all that apply):
- the NDB (Nucleic Acid Databank)
- the PIR (Protein Information Resource)
- the GenBank Database (National Center for Biotechnology Information)
- the PROSITE database
- all of the above
- none of the above
A: 1
- Q: What is the difference between a DATABANK and a DATABASE?
A: A DATABASE stores information in a parallel, relational, or "interconnected" arrangement, allowing for rapid search and retrieval by a
multidimensional search engine using a query language.
A DATABANK stores information in a serial, or "linear" arrangement, requiring an exhaustive search and retrieval by a one-dimensional search engine, usually using a "string search".
- Q: Identify information that you can use to search for information (select all that apply):
- search by taxonomy
- search by taxon
- search by literature reference
- search by keyword
- search by sequence similarity of local alignments
- search by 3D coordinates
- search by function
- all of the above
- none of the above
A: 8
Energy Minimization and Molecular Dynamics
- Q: What are force field "cross terms"?
A: Cross terms are combinations of the four general bond terms.
- Q: When must you include cross terms?
A: Cross terms are required for high-precision minimizations, such as minimizations required for vibrational analyses.
- Q: Name the "covalent bond terms" in most classical newtonian force fields.
A: bond term, angle term, dihedral angle term, and planar term.
- Q: Name the "nonbond terms" in most classical newtonian force fields.
A: van der Waals term and electrostatic term.
- Q: When would you choose to use a Morse bond term?
A: A Morse bond term is required for high-precision minimizations, such as minimizations required for vibrational analyses,
or for dynamics that are far from a minimum.
- Q: Name the "universal force field" appropriate for all molecules.
A: There is no universal force field! This is a trick question.
- Q: A force field parameter (e.g., "spring constant" for stretching a bond) is a:
- Microscopic Descriptor (e.g., a description of the exact energy needed to stretch the bond)
- Practical Descriptor (e.g., a description of bond stretch energy that interacts with other terms to produce realistic models).
A: a Practical Descriptor
- Q: Name the steps necessary to "plug" a new parameter into a force field.
A: Force field parameters should NEVER just be plugged into a force field without careful testing.
This is a trick question!
- Q: A diagonal force field has:
A: no cross terms
- Q: A matrix force field has:
A: cross terms
- Q: A Class II force field has:
A: parameters derived from quantum mechanics
- Q: A "Fast" force field has:
A: only dihedral torsion angle terms and non-bond terms
- Q: What terms do some (less accurate) force fields hold fixed in order to speed up dynamics calculations?
A: bond, angle, and planar (improper torsion) terms.
- Q: Which force field or force fields available in InsightII are parameterized only for standard amino acids?
A: The CFF91 force field.
- Q: Which force field or force fields available in InsightII are parameterized only for standard amino acids and nucleotides?
A: The AMBER and AMBER95 force fields.
- Q: Which force field or force fields available in InsightII are parameterized for many types of atoms?
A: The CVFF and ESFF force fields.
- Q: What force field terms are often scaled for vicinal atom pairs?
A: Non-bond force field terms.
- Q: What are "Periodic Boundary Conditions"?
A: A solute surrounded by an isolated cube of solvent is replicated periodically in three dimensions to represent
an infinitely large bulk or crystalline environment.
- Q: How can you simulate a system with infinite size?
A: By using periodic boundary conditions.
- Q: What is a "Minimim Image Model"?
A: A Minimum Image Model within Periodic Boundary Conditions allows each real molecular model in the central unit cell to interact with
only one image of each real molecular model in the central cell or nearest adjacent cells.
- Q: What is an "Explicit Image Model"?
A: An Explicit Image Model within Periodic Boundary Conditions allows each real molecular model in the central unit cell to interact with
all images of all molecular models in the central unit cell or the nearest adjacent cells.
- Q: What dielectric constant is generally used to simulate the dielectric medium inside a protein structure?
A: 4
- Q: What is the dielectric constant of a vacuum?
A: 1
- Q: What is the dielectric constant at the surface of a protein?
A: about 10
- Q: If solvent molecules are explicitly included in the system, what dielectric constant should be used?
A: 1
- Q: The dielectric constant is approximately uniform for:
- only small molecules
- only large molecules
- both small and large molecules
A: small molecules
- Q: What force field terms are most responsible for the majority of calculation times for large molecules?
A: the non-bond force field terms (i.e., van der Waals and electrostatic interaction terms).
- Q: What is a non-bond energy cutoff value?
A: Non-bond energies are only calculated for atom pairs that are
separated by distances smaller than the non-bond energy curoff value.
- Q: What nonbond cutoff value should be used for proteins?
A: at least 15 angstroms
- Q: What nonbond cutoff value should be used for nucleic acids?
A: at least 20 angstroms
- Q: What nonbond cutoff value should be used for aliphatic polymers?
A: at least 10 angstroms
- Q: What nonbond cutoff value should be used for carbohydrates?
A: at least 20 angstroms
- Q: What is the Cell Multipole Method?
A: The electrostatic potential felt by atom i may be divided
into a near-field potential from surrounding atoms, and a far-field potential due to the rest
of the atoms. The Cell Multipole method represents the far-field potentials as the general
potential originating at the center of cells far from atom i.
- Q: What is the Ewald Summation Method?
A: A Fourier Transform method to evaluate the electrostatic potentials of periodic systems.
- Q: Identify one or more types of constraints that can be applied to make calculations faster:
- Torsion constraints
- Fixed Atom constraints
- Tethered Atom constraints
- Distance Constraints
- Template Force constraints
- All of the above
- none of the above
A: 2
- Q: When minimizing the energy of a molecular model, what derivative value should you select to ensure that your molecule has reached a minimum energy?
A: There is no exact answer---this is a trick question! Only a "molecular modeling experiment" can determine if you have reached
a sufficient energy minimum during minimization calculations for your purposes.
- Q: When should you use the Steepest Decents algorithm?
- When you are far from the minimum.
- When you are near "saddle points" on the energy surface.
- When you have nonstandard amino acid or nucleic acid residues
- When you minimize with cross terms
- All of the Above.
- None of the Above.
A: 1 and 2
- Q: Identify limitations that may prevent you from using a matrix-based minimization method, such as the Newton-Raphson or VAO9A algorithms.
Select one or more correct answers.
- Minimizing a model with a Morse Term
- Minimizing a model that is far from the energy minimum.
- Minimizing a model with the Amber95 force field parameters
- Minimizing a model with more than 200 atoms
- Minimizing a model with fixed atoms
- All of the above
- None of the Above
A: 2 & 4
- Q: What is the basic difference between molecular dynamics and simulated annealing?
A: Simulated annealing calculations run extra dynamics steps at a cooler temperatures before the final minimization.
- Q: What is the time step for most molecular dynamics calculations?
A: 1 to 3 femtoseconds
- Q: What is a cluster graph?
A: A graph showing the RMSD difference between each pair of conformations of an ensemble of conformers.
- Q: Identify limitations that may prevent you from using the Conjugate Gradients minimization method.
Select one or more correct answers.
- Minimizing a model that is far from the energy minimum
- Minimizing a model with a Morse Term
- Minimizing a model with a potential energy surface with many saddle points
- Minimizing a model with the Amber95 force field parameters
- Minimizing a model with more than 200 atoms
- All of the above
- None of the Above
A: 1 & 3
- Q: how do you determine if a Discover or Discover_3 minimization calculation has reached the derivative threshold that you requested?
A: you must examine the contents of the file that ends with ".inp".
- Q: The units of the Discover or Discover_3 miniization derivative is:
- kcal mol-1
- kcal mol-1 angstrom-1
- kcal angstrom-1
- None of the Above
A: 2
- Q: Discover and Discover_3 miniizations are reported as "successful" if:
- The minimum derivative is less than the derivative threshold
- The maximum derivative is less than the derivative threshold
- The average derivative is less than the derivative threshold
- The total number of iterations is reached.
- All of the above
- None of the Above
A: 2
- Q: To retain the "core conformation" of a structure during minimization:
(select all that apply)
- minimize the structure with fixed heavy atoms
- minimize the structure with fixed backbone
- minimize the structure with fixed structually conserved regions
- all of the above
- none of the above
A: 4
- Q: Quantum Mechanical Calculations are required for:
(select all that apply)
- photon absorption
- bond formation
- protein folding
- acid/base reactions
- all of the above
- none of the above
A: 1, 2, and 4
- Q: What is the "Active Analog Approach"?
A: conformers of active analogs are systematically generated via torsion angle rotations; only
conformers that match the geometry of an active analog that has few degrees of freedom (i.e., is relatively rigid
or conformationally restricted) are selected for further analysis.
- Q: What is the Rigid Rotor Technique?
A: conformers are systematically generated via torsion angle rotations; no energy minimization
is performed after each torsion angle rotation.
- Q: What is the Dihedral Driver Technique?
A: conformers are systematically generated via torsion angle rotations; energy minimization
IS performed after each torsion angle rotation.
- Q: What is the difference between the Rigid Rotor Technique and the Dihedral Driver Technique?
A: energy minimization is performed after each torsion angle rotation in the Dihedral Driver Technique;
no energy minimization is performed in the Rigid Rotor Technique.
- Q: How do you prove that a Monte Carlo search has sampled sufficiently long enough?
A: Start from different starting points; if each run generates the same result, the calculation has converged to the same ensemble.
- Q: What conformational search method is best for ring systems?
A: The Monte Carlo method.
- Q: Name the conformational search method that uses the following protocol:
- move atomic coordinates in a random direction
- minimize the structure
- if the conformation's absolute energy is below a pre-set threshold, save the new conformation
- if the conformation's absolute energy is above a pre-set threshold, and a random number is selected that is greater than a "temperature factor", save the new conformation
- if the conformation's absolute energy is above a pre-set threshold, and a random number is selected that is greater than a "temperature factor", return to the original conformation
- return to the first step and repeat many times.
- reduce the "temperature factor" and repeat steps 1-4. Continue to reduce the energy threshold until the energy is near a minimum.
A: the Monte Carlo method.
- Q: Name the conformational search method that uses the following protocol:
- move atomic coordinates in a random direction
- if the conformation's absolute energy is below a pre-set threshold, increase the average atomic displacements during randomization
- if the conformation's absolute energy is above a pre-set threshold, reduce the average atomic displacements during randomization
- return to the first step and repeat many times.
A: the Molecular Dynamics method.
- Q: Name the conformational search method that uses the following protocol:
- move atomic coordinates in a random direction
- if the conformation's absolute energy is below a pre-set threshold, increase the average atomic displacements during randomization
- if the conformation's absolute energy is above a pre-set threshold, reduce the average atomic displacements during randomization
- return to the first step and repeat many times.
- reduce the pre-set energy threshold and repeat steps 1-3. Continue to reduce the energy threshold until the energy is near a minimum.
A: the Simulated Annealing method.
- Q: What is the RATTLE method?
A: bond lengths, or bond lengths and bond angles, are held relatively fixed.
- Q: What is the advantage of the RATTLE method?
A: the molecular dynamics time step can be increased from 1 fsec to 3 fsec.
- Q: What is the practical advantage of holding angles fixed via the RATTLE method?
A: Almost none! (Only holding bonds fixed substantially improves computation speed.)
- Q: What is the minimum time required to equilibrate most systems during molecular dynamics?
A: at least 10 picoseconds.
- Q: Which technique is least able to ensure complete sampling of conformational space?
- Systematic Search Methods
- Monte Carlo Methods
- Molecular Dynamics Methods
- All methods are equal
- The answer depends on the system being studied
- None of the Above
A: 3
- Q: True or False: energies of systems with different numbers of bonds can be directly compared.
A: False
- Q: True or False: energies of systems with different numbers of atoms can be directly compared.
A: False
- Q: True or False: energies of systems with the same total charge can always be directly compared.
A: False
- Q: True or False: energies of systems with the same bonds and atoms can be directly compared.
A: True
- Q: What are the requirements for a successful Free Energy Peturbation method?
Assume that
A + Protein <---> A:Protein
^ ^
| |
B + Protein <---> B:Protein
(select all that apply)
- Changes between A and B must be small
- A and B must bind to the protein in the same general location and orientation
- solvent must be excluded
- The free energy change must be large to overcome inaccuracies in this method
- Only energy-minimized cnformers can be used in this method.
- All of the above
- none of the above
A: 1, 2, and 5
- Q: True or False: (dG1 - dG2) = (dG3 - dG4)
Assume that
dG1
A + Protein ---> A:Protein
^ ^
| dG4 | dG3
dG2
B + Protein ---> B:Protein
dG1 = A:Protein - (A + Protein)
dG2 = B:Protein - (B + Protein)
dG3 = A:Protein - B:Protein
dG4 = (A + Protein) - (B + Protein)
A: True
- Q: True or False: (dG1 - dG2) = (dG3 - dG4)
Assume that
dG1
A + Protein ---> A:Protein
^ ^
| dG3 | dG4
dG2
B + Protein ---> B:Protein
dG1 = A:Protein - (A + Protein)
dG2 = B:Protein - (B + Protein)
dG3 = (A + Protein) - (B + Protein)
dG4 = A:Protein - B:Protein
A: False
- Q: True or False: (dG1 - dG2) = (dG3 - dG4)
Assume that
dG2
A + Protein ---> A:Protein
^ ^
| dG4 | dG3
dG1
B + Protein ---> B:Protein
dG1 = B:Protein - (B + Protein)
dG2 = A:Protein - (A + Protein)
dG3 = A:Protein - B:Protein
dG4 = (A + Protein) - (B + Protein)
A: False
- Q: True or False: (dG1 - dG2) = (dG3 - dG4)
Assume that
dG2
A + Protein ---> A:Protein
^ ^
| dG3 | dG4
dG1
B + Protein ---> B:Protein
dG1 = B:Protein - (B + Protein)
dG2 = A:Protein - (A + Protein)
dG3 = (A + Protein) - (B + Protein)
dG4 = A:Protein - B:Protein
A: True
Docking & Electrostatics
- Q: True or False: Electrostatics can be used to measure relative binding constants and relative association rates.
A: True.
- Q: True or False: Electrostatics can be used to measure absolute binding constants and relative association rates.
A: False.
- Q: True or False: Electrostatics can be used to measure relative binding constants and absolute association rates.
A: False.
- Q: True or False: Electrostatics can be used to measure absolute binding constants and absolute association rates.
A: False.
- Q: What is "polarity"?
A: The density of charged groups and dipolar groups
- Q: What is "polarization"?
A: The potential for reorganizing charges, orienting dipoles, and inducing dipoles
- Q: A folding protein has (select all that apply):
- high polarity, high polarizability
- high polarity, low polarizability
- low polarity, high polarizability
- low polarity, low polarizability
A: 1
- Q: A folded protein has (select all that apply):
- high polarity, high polarizability
- high polarity, low polarizability
- low polarity, high polarizability
- low polarity, low polarizability
A: 2
- Q: What is "the measure of non-polar surface area of a conformer"?
A: lipophilicity
- Q: What is "the measure of the partition coefficient of a molecule between water and an organic solvent"?
A: hydrophobicity
- Q: What is "lipophilicity"?
A: "the measure of non-polar surface area of a conformer"
- Q: What is "hydrophobicity"?
A: "the measure of the partition coefficient of a molecule between water and an organic solvent"
- Q: What solvent is often cited as having the same dielectric constant and
(hydrogen bond/molecular weight) ratio that is similar to the "core" of a protein?
A: octanol
- Q: True or False: lipophilicity is used as a qualitative measure of free energy of solvation.
A: False
- Q: True or False: lipophilicity is used as a quantitative measure of free energy of solvation.
A: False
- Q: True or False: hydrophobicity is used as a qualitative measure of free energy of solvation.
A: True
- Q: True or False: hydrophobicity is used as a quantitative measure of free energy of solvation.
A: False
- Q: Electrostatics can be used to measure:
- changes in acid/base reactions
- changes in pKa values
- changes in electron transport
- changes in redox midpoints
- changes in stabilization energies
- All of the above
- none of the above
A: 2, 4, and 5
- Q: True or False: the polarization of large protein dipole reorientations can be represented by a dielectric model.
A: False
- Q: True or False: the polarization of large protein dipole reorientations can be represented by explicit charge distributions.
A: True
- Q: True or False: the polarization of small protein dipole reorientations can be represented by a dielectric model.
A: True
- Q: True or False: the polarization of small protein dipole reorientations can be represented by explicit charge distributions.
A: True
- Q: Dipole reorientations within a protein approximate a dielectric medium with a dielectric constant of:
A: 2.5 to 4
- Q: Charge rearrangements within a protein approximate a dielectric medium with a dielectric constant of:
A: greater than 30
- Q: electronic polarizabilities within a protein approximate a dielectric medium with a dielectric constant of:
A: 2 to 2.5
- Q: What effects within a protein cause the protein to approximate a dielectric medium with a dielectric constant greater than 30?
- Dipole reorientations
- Charge rearrangements
- Electronic polarizabilities
- All of the above
- none of the above
A: 2
- Q: What effects within a protein cause the protein to approximate a dielectric medium with a dielectric constant of 2 to 2.5?
- Dipole reorientations
- Charge rearrangements
- Electronic polarizabilities
- All of the above
- none of the above
A: 3
- Q: What effects within a protein cause the protein to approximate a dielectric medium with a dielectric constant of 2.5 to 4?
- Dipole reorientations
- Charge rearrangements
- Electronic polarizabilities
- All of the above
- none of the above
A: 1
- Q: What is the name of the electrostatic field created by an atom as it responds to electrostatic fields from other atoms?
A: The "reaction field".
- Q: What is "QSAR"?
A: Quantitative Structure-Activity Relationships
- Q: True or False: 2D QSAR methods usually focus only on the initial interaction between a ligand and receptor; all subsequent
processes leading to biological function are generally not taken into account.
A: True
- Q: 2D QSAR techniques include:
- chemical information systems
- reaction databases
- pharmacophore generation based on covalent structure
- all of the above
- none of the above
A: 4
- Q: 3D QSAR techniques include:
- de novo ligand design
- analyses of principle geometric components
- pharmacophore generation based on conformation
- prediction of properties based on types of functional groups
- all of the above
- none of the above
A: 1, 2, and 3
- Q: True or False: The binding conformation must exist among the low-energy conformers generated during an exhaustive conformational search of all conformational space.
A: False.
- Q: True or False: The binding conformation is likely to exist among the low-energy conformers generated during an exhaustive conformational search of all conformational space.
A: True.
- Q: What are "bioisosteric ligands"?
A: ligands that have all necessary functional groups that interact with the binding site in the same way.
- Q: What is a "pharmacophore"?
A: a ligand with all necessary functional groups in the correct conformation for binding.
(The literal Greek translation is "medicine carrier")
- Q: What is the definition of "CoMFA"?
A: "Comparitive Molecular Field Analysis"
- Q: Describe CoMFA (Comparitive Molecular Field Analysis).
A: conformers are superimposed based on Molecular Interaction Fields to find common biologically-active properties.
- Q: conformers can be superimposed based on (select all that apply):
- atom-by-atom pairwise RMSD fit
- fit of Molecular Interaction Fields and associated properties
- fit of molecular surfaces
- all of the above
- none of the above
A: 4
- Q: What are the most important problems with CoMFA?
- depends on initial conformation of rigid ligands
- depends on molecular alignments
- depends on parameters describing Molecular Interaction Fields
- depends on scoring schemes
- doesn't explicitly account for hydrogen bonds
- All of the above
- None of the Above
A: 1 and 4
- Q: What are the most important problems with docking methods?
- requires information about the binding site before the docking can proceed
- approximate methods are too inaccurate to generate meaningful results
- scoring schemes are not reliable
- must consider water molecules in the active site
- docked receptors are rigid
- All of the above
- None of the Above
A: 3 and 5
- Q: Name the automated ligand design method that covers all possibilities
A: There is NO ligand design method that covers all possibilities---this is a trick question!
- Q: Which automated ligand design method is better for ligands with conformational and structural variations?
- the atom-by-atom ligand design method
- the fragment-addition ligand design method
- the pharmacore-pattern ligand design method
- all methods are equal
A: the atom-by-atom ligand design method
- Q: Which automated ligand design method is more likely to generate ligands that are chemically or synthetically plausible?
- the atom-by-atom ligand design method
- the fragment-addition ligand design method
- the pharmacore-pattern ligand design method
- all methods are equal
A: the fragment-addition ligand design method
- Q: Which automated ligand design method is faster?
- the atom-by-atom ligand design method
- the fragment-addition ligand design method
- the pharmacore-pattern ligand design method
- all methods are equal
A: the fragment-addition ligand design method
- Q: Which automated ligand design method is best for ring structures?
- the atom-by-atom ligand design method
- the fragment-addition ligand design method
- the pharmacore-pattern ligand design method
- all methods are equal
A: the pharmacore-pattern ligand design method
- Q: Which automated ligand design method is best if you have no receptor structure available?
- the atom-by-atom ligand design method
- the fragment-addition ligand design method
- the pharmacore-pattern ligand design method
- all methods are equal
A: the pharmacore-pattern ligand design method
- Q: Match the "Interaction" with the "Free Energy" according to Bohm's Ligand Design Scoring Function
| Interaction | Free Energy |
| 1: loss of translational & rotational motion of ligand | A: -4.7 kJ/mol |
| 2: loss of freedom for each rotatable bond of ligand | B: -8.3 kJ/mol |
| 3: generation of ideal hydrogen bond | C: +5.4 kJ/mol |
| 4: generation of ideal ionic interaction | D: -0.17 kJ/ang2 |
| 5: lipophilic contact | E: +1.4 kJ/mol |
Answer: 1=C, 2=E, 3=A, 4=B, 5=D
- Q: According to the LUDI Ligand Design Empirical Scoring Function, what is the distance range (from hydrogen to hydrogen bond acceptor) for an ideal hydrogen bond?
A: 1.7 to 1.9 angstroms
- Q: According to the LUDI Ligand Design Empirical Scoring Function, what is the distance range (from hydrogen to hydrogen bond acceptor) for any hydrogen bond?
A: 1.3 to 2.5 angstroms
- Q: According to the LUDI Ligand Design Empirical Scoring Function, what is the angle range (from hydrogen bond donor, to hydrogen, to hydrogen bond acceptor) for an ideal hydrogen bond?
A: -150 to +150 degrees
- Q: According to the LUDI Ligand Design Empirical Scoring Function, what is the angle range (from hydrogen bond donor, to hydrogen, to hydrogen bond acceptor) for any hydrogen bond?
A: -100 to +100 degrees
- Q: According to the LUDI Ligand Design Empirical Scoring Function, how many square angstroms of lipophilic contact equal 1 ideal hydrogen bond (in terms of interaction energies)?
A: 60 angstroms2 equals 1 ideal hydrogen bond.
- Q: What types of interaction energies are incorporated into automated ligand design methods? Select all that apply.
- electrostatic interactions
- hydrogen bond interactions
- ionic interactions
- hydrophobic interactions
- all of the above
- none of the above
A: 5
- Q: What is the "Link Score" of a fragment-based ligand design method?
A: A score that measures how well the new fragment can attach to the existing fragment by a covalent bond, assuming that the new
fragment is oriented for optimized binding, and the existing fragment doesn't change it's initial orientation.
UNIX Commands
UNIX commands are shown in bold text. Note that more than one acceptable answer may exist for some questions.
- Q: How can you find all files in your home directory & all sub-directories that end with .psv?
A: find $HOME -name "*psv" -print
- Q: How can you list all lines of zinc_finger.out that include the text "Writing"?
A: more zinc_finger.out | grep Writing
- Q: How can you list ALL files in a directory?
A: ls -a
- Q: How can you rename a file named junk.data to the name published.data?
A: mv junk.data published.data
- Q: How can you learn more about the ls command in a UNIX shell?
A: man ls
- Q: How can you learn more about commands related to the keyword priority?
A: man -k priority
- Q: How can you delete all files that end with .ps in your current working directory?
A: rm *.ps
- Q: How can you determine your present working directory?
A: pwd
- Q: How can you list your currently running processes?
A: ps -ef | grep $LOGNAME
- Q: How can you change directories to your home directory?
A: cd or cd $HOME or cd path_to_your_home_directory
- Q: How can you list the date & time that file master.data was last modified?
A: ls -l master.data
- Q: How can you set the permissions for your file named master.data so that only you
can read, write, and execute it?
A: chmod 700 master.data
- Q: How can you list the most active processes on your workstation?
A: top
- Q: How can you list all users currently logged on to your workstation?
A: who
- Q: How can you list the contents of file master.data screen page-by-screen page?
A: more master.data
- Q: How can you list the first lines of the file master.data?
A: head master.data or head -10 master.data
- Q: How can you list the first 5 lines of the file master.data?
A: head -5 master.data
- Q: How can you list the last lines of the file master.data?
A: tail master.data or tail -10 master.data
- Q: How can you list the last 5 lines of the file master.data?
A: tail -5 master.data
- Q: How can you list the differences between original.data and new.data?
A: diff original.data new.data
- Q: How can you list the size of your home directory, including all sub-directories?
A: du -sk $HOME
- Q: How can you list the size of all files in your present working directory?
A: du -sk *
- Q: How can you list the size of all files in your directory, sorted by size (smallest listed first)?
A: du -sk * | sort -n
- Q: How can you list the size of all files in your directory, sorted by size (largest listed first)?
A: du -sk * | sort -rn
- Q: How can you list the size of the 10 largest files in your directory, sorted by size (largest listed first)?
A: du -sk * | sort -rn | head -10
- Q: How can you create a new directory named calc.results?
A: mkdir calc.results
- Q: How can you remove a directory named calc.results and it's contents?
A: rm -r calc.results
- Q: How can you transfer files between worktations that don't share disks across the network?
A: use ftp
- Q: How do you list the contents of the directory calc.results without changing directories?
A: ls calc.results
- Q: How do you list the existence of a directory calc.results in your current working directory?
A: ls -d calc.results
- Q: How do you list the process number of your process named fdiscover?
A: ps -ef | grep $LOGNAME | grep fdiscover
- Q: How do you immediately kill process 3278 (assumng that you own this process)?
A: kill -9 3278
- Q: How do you suspend process 3278 (assumng that you own this process)?
A: kill -23 3278
- Q: How do you restart process 3278 (assumng that you own this process)?
A: kill -25 3278
- Q: How do you list the files of the parent directory of your current working directory?
A: ls ../
- Q: How can you list the previous commands that you typed?
A: history
- Q: How can you repeat your last command?
A: !!
- Q: How can you repeat the 153rd command that you've typed?
A: !153
- Q: How can you repeat the last command that started with mo that you typed?
A: !mo
- Q: How can you execute a command named run.calc in the background?
A: run.calc &
- Q: How can you print an expanded list of all processes?
A: ps -ef
- Q: How can you interrupt the current process running in the foreground?
A: <control-c>
- Q: How can you suspend the current process running in the foreground?
A: <control-z>
- Q: How can you display background and suspended processes?
A: jobs
- Q: How can you remove suspended process #1?
A: kill %1
- Q: How can ou display the current, most computer-intensive commands?
A: top
- Q: How can you display the operating system statitistics?
A: osview
- Q: How can you graphically display the operating system statitistics?
A: gr_osview
- Q: How can you end your terminal session or close your UNIX shell?
A: logout or exit
- Q: How can you change your password?
A: passwd
- Q: How can you print the current date & time?
A: date
- Q: How can you log onto a remote workstation named stereo3?
A: telnet stereo3
- Q: How can you print a list of disks and current disk capacity (in kilobytes)?
A: df -k
- Q: How can you edit the contents of a file named master.data?
A: jot master.data or vi master.data or nedit master.data
- Q: How can you show the entire contents of a file named master.data on the screen?
A: cat master.data or more master.data
- Q: How can you reduce the size of a file named master.data?
A: compress master.data
- Q: How can you restore the size & content of a compressed file named master.data.Z?
A: uncompress master.data.Z
- Q: What is the file name created by typing compress master.data?
A: master.data.Z
- Q: How can you display the 18th and 19th lines of a file named master.data?
A: head -19 master.data | tail -2
- Q: How can you print the file named master.data to the default printer for your workstation?
A: lpr master.data
- Q: How can you print a file to a particular printer?
A: type lpr and follow the directions.
- Q: How can you move all files that end with .psv to a directory named all_psv?
A: mv *.psv all_psv
- Q: How can you create a new sub-directory named March.data in a new directory named master.data?
A: mkdir -p master.data/March.data
- Q: How can you remove all files named core in your home directory and sub-directories?
A: find $HOME -name core -exec rm '{}' ';'
- Q: How can you list all files in your home directory and sub-directories last modified today?
A: find $HOME -mtime 1 -print
- Q: How can you list all files in your home directory and sub-directories created more than 4 days ago?
A: find $HOME -ctime +4 -print
- Q: How can you determine the location of a command named run.calc that you want to execute?
A: which run.calc
- Q: How can you copy a file named current.data to a file named new.data?
A: cp current.data new.data
- Q: How can you automatically cont the number of lines that contain the text oregano in a file named spices?
A: grep oregano spices | wc -l
- Q: After you ftp a program to your home directory that worked on an identical workstation, can you immediately execute the program?
A: No. ftp sets all execute permissions to "off".
- Q: After you copy a program to your home directory that worked on an identical workstation, can you immediately execute the program?
A: Yes. cp retains all permissions.
- Q: How can you determine the file type of a file named master.data?
A: file master.data
- Q: How can you determine the file type of all files in your current working directory?
A: file *
- Q: How can you make a tar image named master.data.tar of a directory named master.data?
A: tar -cvf master.data.tar master.data
- Q: How do you restore a tar file named master.data.tar?
A: tar -xvf master.data.tar
- Q: How can you list all files and sub-directories that have names that start with zinc_finger?
A: ls zinc_finger*
END OF THE LIST OF QUESTIONS!
Back to | C687 Spring 1999 |
Courses & Instruction | MolViz
Home |
Send comments to chemvis@indiana.edu
Last updated: 01/23/2001