麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 6 Ab initio structure prediction

7.91 Amy Keating Ab initio structure prediction Protein Design
Ab Initio Structure Prediction & Protein Design 7.91 Amy Keating

Ab initio prediction Ab initio = from the beginning"; in strictest sense uses first principles, not information about other protein structures In practice all methods rely on empirical observations about other structures Force fields Knowledge-based scoring functions Training sets Fragment structures a good review: Bonneau, R, and D Baker. "Ab Initio Protein Structure Prediction: Progress and Prospects. "Rev Biophys Biomol Struct 30(2001):17389
Ab initio prediction • Ab initio = “from the beginning”; in strictest sense uses first principles, not information about other protein structures • In practice, all methods rely on empirical observations about other structures – Force fields – Knowledge-based scoring functions – Training sets – Fragment structures A good review: Bonneau, R, and D Baker. "Ab Initio Protein Structure Prediction: Progress and Prospects." Rev Biophys Biomol Struct. 30 (2001): 173-89

Approaches to ab initio folding Full Md with explicit solvation( e.g. IBM Blue Gene) VERY expensive May not work Reduced complexity models No side chains(sometimes no main chain atoms either!) Reduced degrees of freedom On-or off-lattice Generally have a solvation-based score and a knowledge based residue-residue interaction term Sometimes used as first step to prune the enormous conformational space then resolution is increased for later fine-tuning
Approaches to ab initio folding • Full MD with explicit solvation (e.g. IBM Blue Gene) – VERY expensive – May not work • Reduced complexity models – No side chains (sometimes no main chain atoms either!) – Reduced degrees of freedom – On- or off-lattice – Generally have a solvation-based score and a knowledgebased residue-residue interaction term – Sometimes used as first step to prune the enormous conformational space, then resolution is increased for later fine-tuning

ROSETTA- the most successful approach to ab initio prediction David Baker, U. Washington, Seattle Based on the idea that the possible conformations of any short peptide fragment(3-9 residues) are well represented by the structures it is observed to adopt in the pdb Generate a library of different possible structures for each sequence segment Search the possible combinations of these for ones that are protein-like by various criteria
ROSETTA - the most successful approach to ab initio prediction • David Baker, U. Washington, Seattle • Based on the idea that the possible conformations of any short peptide fragment (3-9 residues) are wellrepresented by the structures it is observed to adopt in the pdb • Generate a library of different possible structures for each sequence segment • Search the possible combinations of these for ones that are protein-like by various criteria

ROSETTA fragment libraries Remove all homologs of the protein to be modeled(25% sequence identity) For each 9 residue segment in the target use sequence similarity and secondary structure similarity(compare predicted secondary stucture for target to fragment secondary structure) to select w 25 fragments Because secondary structure is influenced by tertiary structure, ensure that the fragments span different secondary structures The extent to which the fragments cluster around a consensus structure is correlated with how good a model the fragment is likely to be for the target LSERTVARS①e
ROSETTA fragment libraries • Remove all homologs of the protein to be modeled (>25% sequence identity) • For each 9 residue segment in the target, use sequence similarity and secondary structure similarity (compare predicted secondary stucture for target to fragment secondary structure) to select ~25 fragments • Because secondary structure is influenced by tertiary structure, ensure that the fragments span different secondary structures • The extent to which the fragments cluster around a consensus structure is correlated with how good a model the fragment is likely to be for the target LSERTVARS

ROSETTA search al algorithm Monte Carlo/ simulated Annealing Structures are assembled from fragments by Begin with a fully extended chain Randomly replace the conformation of one 9 residue segment with the conformation of one of its neighbors in the library Evaluate the move: Accept or reject based on an energy function Make another random move After a prescribed number of cycles switch to 3- residue fragment moves
ROSETTA search algorithm Monte Carlo/Simulated Annealing • Structures are assembled from fragments by: – Begin with a fully extended chain – Randomly replace the conformation of one 9 residue segment with the conformation of one of its neighbors in the library – Evaluate the move: Accept or reject based on an energy function – Make another random move… – After a prescribed number of cycles, switch to 3- residue fragment moves

ROSETTA scoring function p(sequence structure p(structure sequence)=p(structure)x (sequence) sequence is constant need to estimate for decoys built from fragments Main contributions to P(structure) secondary structure packing (e.g. ensureβ- strands formβ- sheets Vdw packing Simons et al. PROTEINs(1999)34, 82-95
ROSETTA scoring function P (structure | sequence) = P (structure) × P ( sequence | structure ) P( sequence) sequence is constant need to estimate for decoys built from fragments Main contributions to P(structure) - secondary structure packing (e.g. ensure β-strands form β-sheets) - VdW packing Simons et al. PROTEINS (1999) 34, 82-95

Native-like structures have characteristic secondary structure packing Example: b-strand dipeptide vector Simons, KT, I Ruczinski, C Kooperberg, BA Fox, C Bystroff, and D Baker. "Improved Recognition of Native-like Protein Structures using A Combination of Sequence-dependent and Sequence-independent Features of Proteins. "Proteins 34, no. 1(1 January 1999): 82-95
Native-like structures have characteristic secondary structure packing Example: b-strand dipeptide vector Simons, KT, I Ruczinski, C Kooperberg, BA Fox, C Bystroff, and D Baker. "Improved Recognition of Native-like Protein Structures using A Combination of Sequence-dependent and Sequence-independent Features of Proteins." Proteins 34, no. 1 (1 January 1999): 82-95

B-strand packing geometry can detect native-like structures Simons, KT, I Ruczinski, C Kooperberg, BA Fox, C Bystroff, and D Baker. "Improved Recognition of Native-like Protein Structures using A Combination of Sequence-dependent and Sequence- independent Features of Proteins. Proteins 34, no. 1(1 January 1999): 82-95
Simons, KT, I Ruczinski, C Kooperberg, BA Fox, C Bystroff, and D Baker. "Improved Recognition of Native-like Protein Structures using A Combination of Sequence-dependent and Sequence-independent Features of Proteins." Proteins 34, no. 1 (1 January 1999): 82-95. β-strand packing geometry can detect native-like structures

ROSETTA scoring function P(structure sequence)=P(structure)x p(sequence structure) sequence sequence is constant need to estimate for decoys built from fragments
ROSETTA scoring function P (structure | sequence) = P (structure) × P ( sequence | structure ) P( sequence) sequence is constant need to estimate for decoys built from fragments
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 7 DNA Microarrays Clustering.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 7 The protein interactome.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 2 Comparing protein Structures.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 3 For a molecular simulation or model.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 How are X-ray crystal structures.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 Review of protein structure hierarchy.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 5 Review -Homology Modeling.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 5 Markov models.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 6 Structure Prediction.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 4 Organization of topics.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 6 Predicting rna Secondary structure.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 3 Review of DNA Seq.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 Genome Sequencing.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 2 The Language of genomics.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 5 Molecular Phylogenetics.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 4 Database Searching.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 2 More Pairwise Sequence Comparisons.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 3 More Multiple Sequence Alignment.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 Michael Yaffe Introduction to Bioinformatics.pdf
- 《微生物遗传学》第四章 基因工程技术在改进微生物.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第三章 植物的矿质营养.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第二章 植物的水分代谢.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第五章 植物的呼吸作用.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第四章 植物的光合作用.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第一章 植物细胞和组织.ppt
- 四川农业大学:《生命科学概论》课程教学资源(PPT课件讲稿)植物鉴赏与人文精神.ppt
- 四川农业大学:《生命科学概论》课程教学资源(PPT课件讲稿)展望21世纪的生命科学.ppt
- 四川农业大学:《生命科学概论》课程教学资源(PPT课件讲稿)人兽共患病.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第1章 绪论(主讲:邹思湘).ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第2章 生命的化学特征.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第3章 蛋白质.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第4章 核酸.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第5章 糖类.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第6章 生物膜与物质运输.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第7章 生物催化剂一酶.ppt
- 南京农业大学:《动物生物化学 Animal Biochemistry》精品课程教学资源(PPT课件讲稿)第8章 糖代谢.ppt
- 《蛋白质与氨基酸》教学讲义(PPT课件).ppt
- 北京大学:《分子细胞生物学》课程教学资源(教学大纲,共十四章).doc
- 北京大学:《分子细胞生物学》课程教学资源(PPT课件讲稿,英文版)Chapter 10 Cytoskeleton System.ppt
- 北京大学:《分子细胞生物学》课程教学资源(PPT课件讲稿,英文版)Chapter 11 Cell proliferation and its regulation.ppt