L07 Proteins Tertiary Structure

Tertiary structure of proteins

Tertiary structure describes the folding of the polypeptide chain to assemble the different secondary structure elements in a particular arrangement (a unique shape).

(蛋白质的三维结构描述了蛋白质的折叠,将不同的二级结构以特定的方式结合在一起)

Models of proteins:

  1. stick model: showing atomic positions
  2. cartoon model: show the backbone
  3. surface model: show the solvent accessible surface

1571463955537

Two major structural classes of proteins

The proteins have two major classes——Fibrous protein(纤维蛋白) and Globular protein(球状蛋白质)

1571465006459

1. Fibrous Protein

Fibrous proteins include the major proteins of skin and connective tissue and of animal fibers like hair and silk.

  • Fibrous proteins are distinguished from globular proteins by their filamentous, or elongated form.

  • Most of them play structural roles(Provide mechanical support,提供了机械支持) in animal cells and tissues—they hold things together.

  • The amino acid sequence of each of these proteins favors a particular kind of secondary structure(具有特定的二级结构), which confers on each protein a particular set of appropriate mechanical properties(赋予了特定蛋白特定的机械机能).

1571465854860

keratins 角蛋白

在生物体中存在$\alpha-$ 和 $\beta-$keratins,其中$\alpha-$keratins结构主要为alpha-helix,而beta–keratins包含更多的beta-sheet

keratins are adapted to helical form.

(1) $\alpha-$keratins

$\alpha-$keratins是Hair, Fingernails, Animal skin 的主要成分.

The α-keratins are members of a broad group of intermediate filament proteins, which play important structural roles in the nuclei, cytoplasm, and surfaces of many cell types.

Predominantly α-helical structure. The structure is also called a kind of coiled-coil.

α-keratins have cysteine-rich sequences, adapt to disulfide cross-links

The structure of a typical α-keratin, that of hair, is depicted in Figure:

1571467470470

The individual molecules contain long sequences—over 300 residues in length— that are wholly α-helical. Pairs of these right-handed helices twist about one another in a left-handed coiled-coil structure

In intermediate filaments,pairs of coiled coils themselves tend to associate into a four-chain protofilament (c),and two of these in turn pack together to form a protofibril (d).

在一些组织中, $\alpha-$keratins形成的总体结构可以很柔软,但是在某些组织中(如指甲)也可以变得很坚硬。这个变化的主要原因在于 disulfide cross-links within the several levels of fiber structure

由之前的图中可以看出,α-keratins含有较高比例的cysteine,在人类的“卷发”形状中,就是通过减少二硫键,蛋白质纤维的重新排列以及再氧化作用等使得特定的头发样式定型。

(2) $\beta-$keratins

The beta-keratins, as their name implies, contain much more beta-sheet structure. Indeed, they represented the second major structural class described by Pauling and his coworkers. The beta-keratins are found mostly in birds and reptiles,in structures like feathers and scales.

Fibroin 丝蛋白

Fibroin is the main component of silk created by silkworms or spiders

Glycine and alanine rich sequence (45% and 29%) ,The β-sheet regions comprise almost exclusively multiple repetitions of the sequence

1571469041801

Contains long regions of antiparallel β‐sheet, with the polypeptide chains running parallel to the fiber axis

fibroin contains small amounts of other, bulky amino acids like valine and tyrosine, which would not fit into the structure we have shown. These are carried in compact folded regions that periodically interrupt the β-sheet segments, and they probably account for the amount of stretchiness that silk fibers have.

collagen 胶原质

Because it performs such a wide variety of functions, collagen is the most abundant single protein in most vertebrates. Diverse forms include:1. Tendon 2. Bone 3. Skin

(1) Collagen Structure

The basic unit of the collagen fiber is the tropocollagen(原胶原(蛋白)) molecule,a triple helix of three polypeptide chains, each about 1000 residues in length.,这三条chain are in a right‐handed supercoil,但是单条链的螺旋方式是left-handed helices.

  • 3 residues/turn

  • Rise: 0.31 nm/residue: Collagen is more extended than α-helix

  • Repetitive motif: Gly-X-Y

    • X: Often Proline

    • Y: proline or hydroxyproline

      image-20191102213633692
  • This is Polyproline II helix

Examination of the model reveals that every third residue, which must lie near the center of the triple helix, can be only glycine. Any side chain other than —H would be too bulky.

1571470032303

Formation of the individual helices of the collagen type is also favored by the presence of proline or hydroxyproline in the tropocollagen molecule. A repetitive motif in the sequence is of the form Gly–X–Y, where X is often proline and Y is proline or hydroxyproline

(2) Poly-proline II helix

The PPII helix is defined by (φ,ψ) backbone dihedral angles of roughly (-75°, 150°) and trans isomers of the peptide bonds.

Substitution of the poly-Pro II (φ,ψ) dihedral angles into this equation yields almost exactly Ω = -120°, i.e., the PPII helix is a left-handed helix (since Ω is negative) with 3 residues per turn (360°/120° = 3). The rise per residue is approximately 3.1 Å. This structure is somewhat similar to that adopted in the fibrous protein collagen, which is composed mainly of proline, hydroxyproline, and glycine.

2. Globular proteins

Classification of globular proteins

Classification of globular proteins 与 Classification of protein domains 有区别

  1. All $\alpha$ proteins:
    All-α proteins are a class of structural domains in which the secondary structure is composed entirely of α-helices, with the possible exception of a few isolated β-sheets on the periphery.
    Common examples include the bromodomain, the globin fold and the homeodomain fold.

  2. All $\beta$ proteins :

    All-β proteins are a class of structural domains in which the secondary structure is composed entirely of β-sheets, with the possible exception of a few isolated α-helices on the periphery.

Common examples include the SH3 domain, the beta-propeller domain, the immunoglobulin fold and B3 DNA binding domain.

  1. $\alpha + \beta$ proteins:

α+β proteins are a class of structural domains in which the secondary structure is composed of α-helices and β-strands that occur separately along the backbone. The β-strands are therefore mostly antiparallel.
Common examples include the ferredoxin fold, ribonuclease A, and the SH2 domain.

1571547806048
  1. $\alpha / \beta$ proteins:

    α/β proteins are a class of structural domains in which the secondary structure is composed of alternating α-helices and β-strand(交替出现)s along the backbone. The β-strands are therefore mostly parallel.
    Common examples include the flavodoxin fold, the TIM barrel and leucine-rich-repeat (LRR) proteins such as ribonuclease inhibitor.

    1571548217599
  2. Others:

    • Contain few secondary structures
    • Intrinsically unstructured proteins

1571472807662

1571472843270

The majority of globular domainscan be broadly classified as: “mainly α ”,“mainly β ” and “ α + β ”

A small number of globular domains have little α or β secondary structure

Protein domains

The protein domains are structural units that fold more or less independently of each other.

A small number of globular domains have little α or β secondary structure

The majority of globular domains can be broadly classified as:

  • mainly α
  • mainly β
  • α + β

It is a substructure produced by any contiguous(接近的,临近的) part of a polypeptide chain that can fold independently of the rest of the protein into a compact, stable structure. A domain usually contains between 40 and 350 amino acids, and it is the modular unit from which many larger proteins are constructed.

A protein domain is a conserved part of a given protein sequence and (tertiary) structure that can evolve, function, and exist independently of the rest of the protein chain. $\rarr$ Same domains in different proteins usually have similar functions

Smaller proteins typically contain a single domain

Larger proteins may contain several domains

Alpha domain structure

(1) Four Helix Bundle
1571473787608

Beta domain structure

$\beta$ Sandwich
1571473968257

The sandwich has a edge, H-bond is influenced by that, so it is less stable than barrel.

$\beta$ barrel
1571474015601

Alpha/Beta domain structures

TIM barrel
1571474427361
Open twisted α/β
1571474617045

Domain structure

Complex domain structure are composed of simple domains

1571474920788

General rules governing tertiary structure

The environment is the key factor of protein folding

All globular proteins have a defined inside and outside

The hydrophobic residues to be packedmostly on the inside (hydrophobic core)

The hydrophilic residues are on the outside, in contact with water (hydrophilic surface)

Membrane proteins Transmembrane regions are highly hydrophobic

CATH database

The CATH Protein Structure Classification database is a free, publicly available online resource that provides information on the evolutionary relationships of protein domains.

CATH database (Class, Architecture, Topology, and “Homologous superfamily)

1571473546206

Protein purification

Chromatographic separations

1571471428683

色谱法利用不同物质在不同相态的选择性分配,以流动相对固定相中的混合物进行洗脱,混合物中不同的物质会以不同的速度沿固定相移动,最终达到分离的效果。

1. Ion exchange chromotography 离子交换色谱

1571471613873

离子交换色谱利用待分离组分与固定相之间发生离子交换的能力差异来实现分离。离子交换色谱的固定相一般为离子交换树脂,树脂分子结构中存在许多可以电离的活性中心,待分离组分中的离子会与这些活性中心发生离子交换,形成离子交换平衡,从而在流动相与固定相之间形成分配。固定相的固有离子与待分离组分中的离子之间相互争夺固定相中的离子交换中心,并随着流动相的运动而运动,最终实现分离。
离子交换色谱的分配系数又叫做选择系数,其表达式为:
$$
K_{s} = \frac{[RX^+]}{[X^+]}
$$

2. Gel filtration chromatography 凝胶色谱

凝胶色谱的原理比较特殊,类似于分子筛。待分离组分在进入凝胶色谱后,会依据分子量的不同,进入或者不进入固定相凝胶的孔隙中,不能进入凝胶孔隙的分子会很快随流动相洗脱,而能够进入凝胶孔隙的分子则需要更长时间的冲洗才能够流出固定相,从而实现了根据分子量差异对各组分的分离。调整固定相使用的凝胶的交联度可以调整凝胶孔隙的大小;改变流动相的溶剂组成会改变固定相凝胶的溶涨状态,进而改变孔隙的大小,获得不同的分离效果。

1571471984688

3. Affinity chromatography 亲和色谱法,亲和层析

High specificity

亲和色谱法(英语:Affinity chromatography,又称为亲和层析)是一种利用固定相的结合特性来分离分子的色谱方法。亲和色谱在凝胶过滤色谱柱上连接与待分离的物质有一定结合能力的分子,并且它们的结合是可逆的,在改变流动相条件时二者还能相互分离。亲和色谱可以用来从混合物中纯化或浓缩某一分子,也可以用来去除或减少混合物中某一分子的含量。

Protein folding

Factors Determining Secondary and Tertiary Structure

  • Amino Acids sequence
  • Physical and chemical conditions
    • High temperature
    • Extreme pH
    • Organic Solvents: Urea; Guanidine (胍); Alcohol
  • The process that a protein loses its native (functional) 3D structure is call denaturation
    • Random coil structure (maybe very complicated)
    • Loss of biological activities

Thermodynamics

Folding is a spontaneous process that is mainly guided by hydrophobic interactions, formation of intramolecular hydrogen bonds, van der Waals forces, and it is opposed by conformational entropy.

The protein folding process finally reach a state that the free energy minimum, the protein folding process is thermodynamic favoured.

1. Negative Free Energy Change

The negative free energy change is achieved by a balance of several thermodynamic factors :

  1. Conformational entropy
  2. Charge‐charge interactions: Charge–charge interactions can occur between positively and negatively charged side chain groups.
    1. Salt bridges: At neutral pH, one group will be charged positively and the other negatively, so an electrostatic attractive force exists between them.We could say that such a pair forms a kind of salt within the protein molecule; consequently, such interactions are sometimes called salt bridges.
  3. Internal H‐bonds
    1. H‐bond network:蛋白质中“蛋白质骨架”与“side chains”相互结合,共形成三种氢键。虽然单个氢键的作用较小,但是数量弥补了键强度的不足。
1571662080448
  1. van der Waals interactions
  2. Hydrophobic interactions (hydrophobic effects)
  3. Disulfide bonds (exist in some proteins): An important structure called disulfide bond stabilize the protein structure. The disufide bond is unstable in reductive environment. 在细胞内的环境通常是还原性的,而细胞外通常是氧化性的环境。在细胞内二硫键处于被还原(—SH HS—)的状态,而在细胞外处于被氧化的状态(—S—S—)。因此对于细胞产生的分泌蛋白以及位于外膜的蛋白,通常选择使用二硫键对蛋白质结构进行稳定。
1571658986922

Disulfide bonds are relatively rare, they mainly exists in:

  • Secreted proteins
  • Extracellular parts of membrane proteins

The different redox states

  • Inside of cells – reduced

  • Outside of cells– oxidized

2. Thermodynamics

  • The unfavorable conformational entropy change, which favors the unfolded state

  • The favorable enthalpy contribution arising from intramolecular interactions

  • The favorable entropy change arising from the burying of hydrophobic groups within the molecule

  • 1571658503456

Kinetics

Protein folding is not a completely random search through a vast conformational space

1. Free Energy Landscape

1571659433230

In physics, chemistry, and biochemistry, an energy landscape is a mapping of all possible conformations of a molecular entity, or the spatial positions of interacting molecules in a system, or parameters and their corresponding energy levels, typically Gibbs free energy.

2. rate limiting factors

  1. Protein size :
    • Smaller proteins fold faster than larger ones
    • Large numbers of disulfide bonds can slow folding
  2. Backbone topology:
    • All‐helical proteins tend to fold faster than all β‐sheet proteins

Factors Determining Secondary and Tertiary Structure

  1. Amino acid sequence:primary structure是蛋白质结构的根本决定因素
  2. Physical and chemical conditions:
    • Temperature
    • pH:The ionic bonds(salt bridge) are broken if the protein is taken to pH values high enough or low enough that either side chain loses its charge.
    • Organic solvents (Urea, guanidine, alcohol):可以通过竞争氢键来影响蛋白质的折叠

denaturation and renaturation 变性与复性

The process that a protein loses its native (functional) 3D structure is call denaturation

1571658253587

Anfinsen 表明,即使没有二硫键,蛋白质也可以正常折叠。

Anfinsen experiment表明,二硫键应该是最后形成的。

protein can self-assemble into its functional conformation, and it needs no information to guide it other than that contained in its sequence.

  • Uniqueness

    • Sequence determined
  • Stability

    • Free energy minimum

    • Folding is a thermodynamically favored process

    • Disulfide bonds help to stabilize folded structure

  • Kinetical accessibility

1. Thermal denaturation 热变性

Differences between the native and denatured conformations can be detected by several spectroscopic(借助分光镜的) methods

例:对于DNase A

image-20191102220059687
  • Solution viscosity (red)
  • Optical rotation at 365 nm (green)
  • UV absorbance at 287 nm (blue)

Protein folding in vivo

在体内存在一些分子蛋白伴侣(Molecular chaperones)可以帮助蛋白质正确折叠:

  • Heat Shock Proteins, HSP:
  1. Prevent improper folding
  2. Prevent and disrupt aggregation

diseases about protein folding

1. Prion

朊病毒是错误折叠的蛋白质,这种蛋白质可以导致其他正常的蛋白质也发生错误折叠

  • 疯牛病:

    1571660355690
1571660376952

2. Amyloid 淀粉样变性

Amyloid与许多疾病有关,比如阿尔茨海默症,帕金森症,二型糖尿病等…

1571660482400

Protein dynamics

Protein have dynamics structures

Protein dyanmicsis essential for protein activities

Categories

1. Atomic fluctuations 原子波动

  1. The vibrations and oscillations of individual atoms and bonds
  2. Time periods ranging from 10^‐15^to 10^‐12^s
  3. Spatial displacements between 0.01 and 1 Å

2. Collective motions 集体运动

The movement of groups of covalently-linked atoms as one unit

  1. Individual sidechain

  2. Helix

  3. Entire domain

Time periods ranging from 10^‐12^to 10^‐3^ s

Spatial displacements between 0.01 and >5 Å

3. Triggered conformational changes

The response motions (ligand binding) of groups of atoms

Individual sidechain

Whole interacting region

Time periods ranging from 10^‐9^to 10^3^ s

Spatial displacements between 0.5 and >10 Å

Prediction of secondary structure

High accuracy $\ \rarr \ $~80%

Differences are most frequently found in definition of the precise location of the ends of helices and strands

1571661077088

Prediction of tertiary structure

对三级结构的预测精确度很低,但是如果所预测的蛋白有已知的同源结构,准确度会提升

Largely depends on sizes of predicted proteins

The accuracy can be increase if a homologue structure exists

Higher sequence similarity, higher accuracy

Protein structure determination

X‐ray Crystallography

image-20191102222551196

Protein crystallography in applications

  • Academics

    • Protein structures
    • Structural genomics
    • Protein-protein or protein-ligand interactions
    • Catalytic mechanisms of enzymes
  • Industry

    • Understanding diseases
    • Drug design