Browse chapters
1A · Biomolecules and the chemistry of life
Structure and function of proteins and their constituent amino acids
Proteins are polymers of amino acids. The 20 amino acids' side chains determine how a chain folds into a 3-D shape, and that shape determines function — most visibly in enzymes, the catalysts that run nearly every reaction in the cell. This category covers amino-acid chemistry, the four levels of protein structure, enzyme catalysis and kinetics, and how proteins are purified and analyzed in the lab.
Amino acids
Every amino acid has a central α-carbon bonded to an amino group, a carboxyl group, a hydrogen, and a variable side chain (R group). The R group is everything — it sets the amino acid's polarity, charge, and chemistry.
All 20 standard amino acids share the same backbone — an α-carbon carrying an amino group (–NH₂), a carboxyl group (–COOH), an H, and an R group — and differ only in that R group. Because the α-carbon has four different groups (in all but glycine), it is a chiral center; nearly all biological amino acids are the L-isomer (S configuration, except cysteine). Memorizing the 20 by side-chain class is the highest-leverage hour in FC1.
Classifying the 20 by side chain
Sort the 20 by R group: nonpolar (hydrophobic), polar uncharged, acidic (negatively charged), and basic (positively charged). The class predicts where a residue sits in a folded protein and how it behaves on a gel.
The four classes:
- Nonpolar / hydrophobic — Gly, Ala, Val, Leu, Ile, Pro, Met, Phe, Trp. Bury in the protein's hydrophobic core, away from water.
- Polar uncharged — Ser, Thr, Cys, Tyr, Asn, Gln. Sit on the surface; form hydrogen bonds.
- Acidic (negative at physiological pH) — Asp (D) and Glu (E); carboxyl side chains, deprotonated at pH 7.
- Basic (positive at physiological pH) — Lys (K), Arg (R), and His (H); amino/guanidinium/imidazole side chains.
A useful shortcut: the aromatic amino acids (Phe, Trp, Tyr) absorb UV light at 280 nm, which is how protein concentration is measured.
How AAMC tests it
A passage mutates a buried hydrophobic residue to a charged one and asks what happens — the answer is misfolding/destabilization, because you've put a charge in the water-excluding core.
The special amino acids worth knowing cold
A handful of side chains have outsized exam value: glycine (achiral, tiny, flexible), proline (rigid, a helix breaker), cysteine (forms disulfide bonds), histidine (buffers and catalyzes near pH 7), and the aromatics (UV absorbance).
- Glycine — R group is just H, so it is the only achiral amino acid and the smallest; its flexibility lets it fit tight turns.
- Proline — its side chain loops back to the backbone nitrogen (a secondary amine), making it rigid; it disrupts α-helices and is common at turns and kinks.
- Cysteine — its thiol (–SH) can oxidize with another cysteine to form a covalent disulfide bond, a key stabilizer of tertiary/quaternary structure (and the bond that reducing agents and SDS break).
- Histidine — its imidazole side chain has a pKa near 6, so it can pick up or give off a proton at physiological pH; this makes it both a biological buffer and a frequent acid–base catalyst in enzyme active sites.
Don't confuse
Disulfide bonds (covalent, between cysteines, part of tertiary/quaternary structure) vs. hydrogen bonds (weak, hold secondary structure together). A reducing agent breaks the former; heat/urea disrupt the latter.
Charge, zwitterions, and the isoelectric point (pI)
Amino acids are amphoteric — they act as both acid and base. At physiological pH most exist as a zwitterion (–NH₃⁺ and –COO⁻, net neutral). The isoelectric point (pI) is the pH at which the molecule carries no net charge.
Each amino acid has at least two pKa values — the α-carboxyl (~2) and the α-amino (~9–10) — plus a third for any ionizable side chain. As pH rises, groups deprotonate in order of pKa. At a pH below the pI the molecule is net positive (protonated); at a pH above the pI it is net negative. The pI is the average of the two pKa values that flank the neutral (zwitterionic) form: for an amino acid with no ionizable side chain, that's the average of the α-COOH and α-NH₃⁺ pKa values; for an acidic amino acid it's the average of the two lowest pKa values; for a basic one, the average of the two highest.
How AAMC tests it
Electrophoresis questions: at a given buffer pH, which way does a residue migrate? Below its pI it's positive and moves toward the cathode; above its pI it's negative and moves toward the anode; at its pI it doesn't move.
Related
This is the principle behind isoelectric focusing.
The peptide bond
Amino acids link when the carboxyl of one and the amino of the next join in a condensation (dehydration) reaction, forming a covalent peptide bond (an amide) and releasing water. Chains are read and built N-terminus → C-terminus.
The peptide bond forms by condensation: a molecule of water is removed as the bond is made (and hydrolysis — adding water, usually via a protease — breaks it). The bond has partial double-bond character from resonance, so it is planar and rigid, and there is no rotation about the C–N bond; conformational freedom comes from rotation at the φ (phi) and ψ (psi) backbone angles. Peptide bonds are kinetically stable — they don't fall apart on their own, which is why proteins last and why digestion needs enzymes.
Don't confuse
Condensation/dehydration (bond formed, water released) vs. hydrolysis (bond broken, water consumed). Same logic governs glycosidic and ester bonds.
The four levels of protein structure
Protein structure is described at four levels — primary (sequence), secondary (local backbone folding), tertiary (overall 3-D shape of one chain), and quaternary (assembly of multiple chains). Each higher level is determined by the one below it.
The sequence dictates the fold, the fold dictates the function. The four levels differ in what holds them together: primary structure is covalent (peptide bonds, plus disulfides); secondary structure is backbone hydrogen bonding; tertiary structure is side-chain interactions; quaternary structure is the same interactions, between separate polypeptides.
Primary structure
The linear sequence of amino acids, joined by peptide bonds (and including disulfide linkages). It encodes everything else.
Primary structure is the order of residues from N- to C-terminus. A single substitution can be catastrophic — sickle-cell anemia is one Glu→Val change in β-globin that creates a hydrophobic patch and makes hemoglobin polymerize. Primary structure is determined experimentally by sequencing (Edman degradation, mass spectrometry).
Secondary structure (α-helix & β-sheet)
Local, repeating folds held by hydrogen bonds between backbone atoms (not side chains): the α-helix and the β-pleated sheet.
In the α-helix, the backbone coils and each carbonyl O hydrogen-bonds to the amide H four residues ahead (i → i+4); side chains point outward. Proline kinks and glycine destabilizes helices. In the β-pleated sheet, extended strands lie side by side and hydrogen-bond across to one another, running parallel or antiparallel. The defining point: secondary structure is backbone hydrogen bonding — independent of which side chains are present.
Tertiary structure
The overall 3-D shape of a single polypeptide, driven mainly by hydrophobic interactions (the nonpolar core) and stabilized by hydrogen bonds, ionic/salt bridges, and covalent disulfide bonds.
Tertiary structure is where side chains do the work. The dominant driving force is the hydrophobic effect: nonpolar residues cluster away from water, which is entropically favorable. Salt bridges (between acidic and basic side chains), hydrogen bonds, and disulfide bonds then lock the fold in place. Tertiary structure is what gives an enzyme its active-site geometry.
Quaternary structure
The assembly of two or more polypeptide subunits into one functional protein. Not all proteins have it.
Quaternary structure is held by the same interactions as tertiary, but between subunits. The classic example is hemoglobin (four subunits), whose subunit cooperation produces sigmoidal O₂ binding. Multi-subunit assembly enables cooperativity and allosteric regulation.
Related
See hemoglobin vs. myoglobin for what quaternary structure buys you.
Folding, chaperones & denaturation
Proteins fold toward their lowest-energy native shape, often helped by chaperones. Denaturation unfolds them — destroying secondary/tertiary/quaternary structure (and function) while leaving the primary sequence intact.
Folding is driven by the sequence (Anfinsen's classic experiment: denatured ribonuclease refolds on its own once the denaturant is removed), but in the crowded cell chaperones (e.g., heat-shock proteins, chaperonins) prevent misfolding and aggregation. Denaturants — heat, extreme pH, urea, detergents like SDS, and reducing agents (which break disulfides) — disrupt the higher-order structure but do not break peptide bonds. Misfolding underlies diseases like the prion encephalopathies and amyloidoses.
Don't confuse
Denaturation (loss of 3-D shape; primary structure preserved) vs. hydrolysis/proteolysis (peptide bonds actually cut). Denaturation can be reversible; cutting the backbone is not.
Ubiquitination & proteasomal degradation
Ubiquitin is a small protein covalently attached to a target as a "destroy me" tag; a chain of ubiquitins routes the protein to the proteasome, which degrades it.
Ubiquitination is a post-translational modification: enzymes covalently link ubiquitin to a lysine of the target, and a polyubiquitin chain marks it for destruction. The barrel-shaped proteasome then recognizes the tag, unfolds the protein, and chops it into peptides — the cell's main route for regulated, selective protein turnover. This is how short-lived regulatory proteins are kept in check: it is the mechanism behind the fall in cyclin levels that drives the cell cycle forward and behind the rapid turnover of damage-sensing proteins like p53.
Don't confuse
Proteasomal degradation (ubiquitin-tagged, ATP-dependent, cytosolic, selective for specific proteins) vs. lysosomal degradation/autophagy (acidic-hydrolase digestion of bulk material and organelles). Both destroy proteins, but only the proteasome is the targeted ubiquitin pathway. Also keep it separate from denaturation (loss of shape, no covalent change) and proteolytic cleavage of zymogens (activates, rather than destroys).
How AAMC tests it
A passage notes that a regulatory protein disappears on cue (a cyclin between cell-cycle phases, or a transcription factor after signaling) and asks for the mechanism — recognize ubiquitin-tagging and proteasomal degradation, not transcriptional shutdown.
Enzymes
Enzymes are biological catalysts (mostly proteins; some are RNA, the ribozymes) that speed reactions by lowering activation energy. They don't change the reaction's equilibrium or ΔG, aren't consumed, and are exquisitely specific. This is the most heavily tested topic in FC1.
An enzyme binds its substrate at the active site and stabilizes the transition state, lowering the activation energy (Eₐ) for both the forward and reverse directions equally. Crucially, an enzyme changes only the rate — it does not change ΔG, the equilibrium constant, or the position of equilibrium. Enzymes are reusable and highly specific, and most operate best within a narrow optimal pH and temperature.
Active site, specificity & cofactors
Substrate specificity is explained by lock-and-key (rigid complementary site) and, more accurately, induced fit (the active site molds around the substrate). Many enzymes need a cofactor (inorganic ion) or coenzyme (organic, often vitamin-derived) to work.
The lock-and-key model says the substrate fits a pre-shaped active site; the induced-fit model (Koshland) — the better description — says binding induces a conformational change that brings catalytic groups into position. Helpers: cofactors are inorganic (e.g., Zn²⁺, Mg²⁺, Fe²⁺); coenzymes are small organic molecules, many derived from vitamins (NAD⁺ from niacin, FAD from riboflavin, coenzyme A from pantothenate). An enzyme without its cofactor is an inactive apoenzyme; with it, an active holoenzyme. A tightly/covalently bound cofactor is a prosthetic group (e.g., heme).
Don't confuse
Cofactor (inorganic ion) vs. coenzyme (organic molecule). Both assist catalysis; the exam asks you to tell them apart.
Michaelis–Menten kinetics
Reaction rate rises with substrate until the enzyme saturates at Vmax. Km is the substrate concentration at half-Vmax and is an inverse measure of affinity — a low Km means high affinity.
The Michaelis–Menten equation, v = Vmax·[S] / (Km + [S]), gives a hyperbolic curve: rate is roughly first-order in substrate at low [S] and plateaus (zero-order) at Vmax when every active site is occupied. Km equals the [S] at which v = ½Vmax; a small Km means the enzyme reaches half-speed at low substrate, i.e., binds tightly. kcat (turnover number) is Vmax/[E]ₜₒₜₐₗ, and kcat/Km measures catalytic efficiency. The Lineweaver–Burk double-reciprocal plot linearizes the data (y-intercept = 1/Vmax, x-intercept = −1/Km), which is why inhibitor questions are usually drawn that way. Cooperative (allosteric) enzymes break the rule with a sigmoidal curve.
How AAMC tests it
You'll get a graph or a table of rates and be asked to read off Km/Vmax or to say how each changes when an inhibitor is added — see the next node.
Competitive vs. noncompetitive vs. uncompetitive
Inhibitors are classified by where they bind and by how they change Km and Vmax. This is the most reliable enzyme question on the exam.
The four cases:
| Inhibitor | Binds | Km (apparent) | Vmax | Beaten by more substrate? |
|---|---|---|---|---|
| Competitive | Active site | ↑ increases | unchanged | Yes |
| Noncompetitive | Allosteric site (E and ES equally) | unchanged | ↓ decreases | No |
| Uncompetitive | ES complex only | ↓ decreases | ↓ decreases | No |
| Mixed | E and ES (unequal affinity) | ↑ or ↓ | ↓ decreases | No |
The intuition: a competitive inhibitor competes for the active site, so flooding with substrate outcompetes it (Vmax recoverable, apparent Km worsens). A noncompetitive inhibitor binds elsewhere and takes enzyme out of play no matter how much substrate you add (Vmax falls; affinity, hence Km, unchanged). An uncompetitive inhibitor binds only after substrate binds, locking the ES complex (both Km and Vmax fall).
Don't confuse
The single highest-frequency discriminator: competitive raises apparent Km with Vmax unchanged and is reversible by adding substrate; noncompetitive lowers Vmax with Km unchanged and is not. Memorize that pair.
Allosteric regulation, cooperativity & control
Cells control enzymes through allosteric effectors (activators and inhibitors that bind a regulatory site), feedback inhibition, covalent modification (e.g., phosphorylation), and activation of inactive zymogens.
Allosteric enzymes have regulatory sites distinct from the active site; binding of an effector shifts the enzyme between high- and low-activity states, producing a sigmoidal (cooperative) rate curve. In feedback (end-product) inhibition, a pathway's final product inhibits an early enzyme, preventing overproduction. Covalent modification — most often phosphorylation by kinases (reversed by phosphatases) — switches enzymes on or off. Zymogens (proenzymes like pepsinogen, trypsinogen, and the clotting factors) are made inactive and switched on by proteolytic cleavage, so destructive enzymes aren't active until and where they're needed. Cooperativity (as in hemoglobin) means binding at one site changes affinity at the others.
How AAMC tests it
A passage describes a metabolite that inhibits the first enzyme of its own synthesis pathway — recognize feedback inhibition — or a hormone that triggers phosphorylation and asks whether the target is activated or inhibited.
Irreversible vs. reversible inhibition
A reversible inhibitor binds non-covalently and lets go (its four kinetic classes — competitive, noncompetitive, uncompetitive, mixed — are the previous node); an irreversible inhibitor forms a covalent bond and permanently disables the enzyme.
This is the top-level split that sits above the four kinetic classes. Reversible inhibitors associate and dissociate freely, so the enzyme regains activity when the inhibitor is diluted out or outcompeted by substrate, and they shift Km/Vmax in the patterns of the kinetics table. An irreversible inhibitor instead reacts covalently — usually with a catalytic residue in or near the active site (e.g., a serine or cysteine) — so that one enzyme molecule is permanently knocked out. Removing or washing out the inhibitor does not restore activity; because the enzyme is destroyed rather than merely blocked, the only way for a cell to recover function is to synthesize new enzyme. Classic examples: organophosphates and nerve agents covalently modifying acetylcholinesterase, penicillin acylating a bacterial transpeptidase, and aspirin acetylating cyclooxygenase (COX).
Don't confuse
Irreversible inhibition is not the same as noncompetitive inhibition. Both can leave apparent Km unchanged while lowering Vmax, but a noncompetitive inhibitor binds reversibly at an allosteric site (wash it out and activity returns), whereas an irreversible inhibitor is covalently bonded and gone for good — recovery requires making fresh enzyme, not removing the inhibitor.
How AAMC tests it
A passage describes a compound that covalently modifies an enzyme and asks whether adding more substrate (or dialyzing the inhibitor away) restores activity — the answer is no, and the cell must express new enzyme to recover.
Proteins at work
Beyond catalysis, proteins are structural (collagen, keratin), motor (myosin, kinesin), transport (membrane carriers, hemoglobin), signaling (receptors), and defensive (antibodies). Binding specificity and conformational change are the recurring themes.
Function follows structure: fibrous proteins like collagen (the most abundant protein in the body) and keratin are built for tensile strength; motor proteins convert ATP into movement along cytoskeletal tracks; transport proteins move ions and molecules across membranes or through blood. The unifying behaviors AAMC tests are specific binding and conformational change in response to a ligand — exactly what makes hemoglobin a teaching favorite.
Hemoglobin vs. myoglobin
Hemoglobin is a four-subunit (quaternary) transporter with cooperative, sigmoidal O₂ binding; myoglobin is a single subunit with hyperbolic, higher-affinity binding that stores O₂ in muscle.
Hemoglobin's four subunits cooperate: binding one O₂ raises affinity for the next, giving the S-shaped curve that lets it load O₂ in the lungs and unload it in tissues. Its affinity is tuned by the environment — the Bohr effect: lower pH, higher CO₂, higher temperature, and higher 2,3-BPG all right-shift the curve (lower affinity), dumping O₂ where metabolism is high. Myoglobin has one heme, no cooperativity, and a hyperbolic curve sitting to the left (higher affinity) — perfect for grabbing and holding O₂ in muscle.
Don't confuse
Sigmoidal/cooperative (hemoglobin — multi-subunit) vs. hyperbolic (myoglobin — single subunit). A right-shift means lower affinity / more O₂ released.
Isolating & analyzing proteins
Proteins are separated by electrophoresis (in a gel) and chromatography (through a column), each exploiting a different property — size, charge, or specific binding.
These techniques recur across the section and are nearly pure recognition once you know what each separates by. The trick is matching the method to the property: size, charge, isoelectric point, or affinity for a ligand.
Gel electrophoresis
An electric field pulls proteins through a gel. SDS-PAGE separates strictly by size; native PAGE keeps proteins folded (size + shape + charge); isoelectric focusing (IEF) separates by pI.
In SDS-PAGE, the detergent SDS denatures proteins and coats them with uniform negative charge, so migration depends only on molecular weight — smaller proteins travel farther. Native PAGE omits SDS, so folded proteins separate by a mix of size, shape, and charge. Isoelectric focusing runs proteins through a pH gradient; each protein migrates until it reaches the pH equal to its pI, where its net charge is zero and it stops. Running IEF and then SDS-PAGE perpendicular to it is 2-D gel electrophoresis, which resolves complex mixtures.
Don't confuse
SDS-PAGE (size only — charge is masked) vs. IEF (pI only). If a question says SDS is present, charge no longer matters.
Column chromatography
Proteins pass through a column packed with beads; size-exclusion separates by size, ion-exchange by charge, and affinity by specific binding to an immobilized ligand.
In size-exclusion (gel-filtration) chromatography, large proteins can't enter the porous beads and elute first, while small ones are delayed (counterintuitive — big comes out fast). Ion-exchange columns carry a charge and retain oppositely charged proteins, which are then released by raising salt or changing pH. Affinity chromatography uses a column-bound ligand (an antibody, a substrate analog, or a metal for a His-tag) to capture only the protein of interest — the most specific and most powerful purification step.
Don't confuse
In size-exclusion chromatography, large molecules elute first (they skip the beads). This reverses many students' intuition.
ELISA
An enzyme-linked immunosorbent assay uses an antibody to detect a specific protein and an enzyme-linked antibody to generate a colorimetric signal whose intensity is proportional to how much of that protein is present.
A primary antibody binds the target antigen; a secondary antibody carrying an attached enzyme then binds, and adding the enzyme's substrate produces a colored product. The absorbance read by a plate reader is proportional to the amount of antigen, which makes ELISA the go-to quantitative answer when a passage asks how much protein (or hormone, or antibody) is in a sample. The classic format is the sandwich ELISA: a capture antibody coats the well, antigen binds, and a detection antibody completes the sandwich.
Don't confuse
ELISA vs. the Western blot. Both use antibodies to detect a protein, but Western blot first separates by size on a gel and answers is the protein present (and at what size)? — a qualitative readout — whereas ELISA skips separation and answers exactly how much? If a question stresses quantity or a numeric concentration, choose ELISA. (See Western/Southern/Northern blots and antibodies & antigens.)
Reducing vs. non-reducing SDS-PAGE
Plain SDS breaks only non-covalent interactions, so subunits held together by covalent disulfide bonds still run as one band; adding a reducing agent (β-mercaptoethanol or DTT) breaks those disulfides so every subunit separates by size.
SDS denatures a protein and masks its charge, but it cannot cleave the covalent disulfide bonds that link cysteines across separate chains. So under non-reducing conditions, disulfide-bonded subunits migrate together as a single high-molecular-weight band — SDS-PAGE is "size only" for what SDS can pull apart, not for covalently tethered subunits. Adding a reducing agent breaks the disulfides; now each chain runs as its own band at its true size. The textbook AAMC example is an antibody (~150 kDa) that appears as one band non-reduced but resolves into separate heavy and light chains only when a reducing agent is added.
Don't confuse
A reducing agent breaks disulfides (covalent); SDS handles the non-covalent denaturation. A change in banding only upon adding the reducing agent is the fingerprint of inter-chain disulfide bonds holding subunits together. (See gel electrophoresis and cysteine & disulfide bonds.)
Worked question
A researcher measures an enzyme's reaction rate across a range of substrate concentrations, with and without compound X. With X present, the measured Vmax is unchanged but the apparent Km increases, and adding a large excess of substrate restores the original rate. Compound X is most likely: