麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 5 Molecular Phylogenetics

7.91-Lecture #5 Michael Yaffe Database Searching Molecular Phylogenetics ABCD ABc ((A,B)c)D)
7.91 – Lecture #5 Database Searching & Molecular Phylogenetics A A B B C D D (((A,B)C)D) C Michael Yaffe

Outline Distance matrix methods Neighbor-Joining Method and related Neighbor Methods · Maximum likelihood Parsimony Branch and bound Heuristic Seaching · Consensus Trees Software(PHYLIP, PAUP The tree of life
Outline • Distance Matrix Methods • Neighbor-Joining Method and Related Neighbor Methods • Maximum Likelihood • Parsimony Branch and Bound Heuristic Seaching • Consensus Trees • Software (PHYLIP, PAUP) • The Tree of Life

Transformed distance method UPGMA assumes constant rate Of evolution across all lineages-can lead to wrong tree topologies Can allow different rates of evolution across different lineages if you normalize using an external reference that diverged early.. e. use an outgroup Define dp =average distance a B d Between outgroup and all ingroups iD MiD )/2+d Now use d, to do the clustering basically just comes from the insight that ingroups evolved separately from each other ONLY AFTER they diverged from outgroup
Transformed Distance Method UPGMA assumes constant rate Of evolution across all lineages - can lead to wrong tree topologies Can allow different rates of evolution across different lineages if you normalize using an external reference that diverged early…i.e. use an outgroup Define dD =average distance A B C D Between outgroup and all ingroups d’ij = (dij –diD –djD)/2 + dD Now use d’ij to do the clustering ..basically just comes from the insight that ingroups evolved separately from each othe r ONLY AFTER they diverged from outgroup

EXample Species A B C B B is distance Between a and b C 8 D 15 10 A 6 Use d as outgroup 3 Species A B 2 B 10/3 C 16/316/3 dn=37/3 Now use UPgma to build tree
Example Species A B C B 9 dAB is distance Between A and B C 8 11 D 12 15 10 A B C D 6 Use D as outgroup 3 3 6 Species A B 2 1 B 10/3 C 16/3 16/3 dD = 37/3 Now use UPGMA to build tree

Neighbor's Relation Method Variant of UPGMA that pairs species in a way that creates a tree with minimal overall branch lengths Pairs of sequences separated by only 1 node are said to be neighbors. single central branch a e terminal branches D For this tree topology dac dbd= daD + dbc=a+b+C+d+ 2e=dAB dcD +2e For neighbor relations, four-point condition will be true dAB +dcd dac dBD ..and. daB+ dcd daD dBc So just have to consider all pairwise arrangements and determine which one satisfies the four-point condition
Neighbor’s Relation Method Variant of UPGMA that pairs species in a way that creates a tree with minimal overall branch lengths. Pairs of sequences separated by only 1 node are said to be neighbors. A B C D a b c d e terminal branches single central branch For this tree topology dAC + dBD = dAD + dBC = a + b + c + d + 2e =dAB + dCD +2e For neighbor relations, four-point condition will be true: dAB + dCD < dAC + dBD …and…dAB + dCD < dAD + dBC So just have to consider all pairwise arrangements and determine which one satisfies the four-point condition

Neighbor-Joining Methods Start with star-like tree. Find neighbors sequentially to minimize total length of all branches C B C D Studier& Kepler 1988 Q12=(N-2)d12-d1-d2 Where any 2 sequences can be 1 and 2 Try all possible sequence combinations. Whichever combination of pairs gives the smallest Q1 is the final tree
Neighbor-Joining Methods Start with star-like tree. Find neighbors sequentially to minimize total length of all branches A B D C D Studier & Kepler 1988: Q12=(N-2)d12 - Σ d1i - Σ d2i Where any 2 sequences can be 1 and 2 Try all possible sequence combinations. Whichever combination of pairs gives the smallest Q12 is the final tree! B A C

Maximum likelihood A purely statistical method Probablilities for every nucleotide substitution in a set of aligned sequences is considered Calculation of probabilities is complex since ancestor is unknown Test all possible trees and calculate the aggregate probablility Tree with single highest aggregate probablity is the most likely to reflect the true phylogenetic tree VERY COMPUTATONALLY INTENSE
Maximum Likelihood • A purely statistical method. • Probablilities for every nucleotide substitution in a set of aligned sequences is considered. • Calculation of probabilities is complex since ancestor is unknown • Test all possible trees and calculate the aggregate probablility. • Tree with single highest aggregate probablilty is the most likely to reflect the true phylogenetic tree. VERY COMPUTATIONALLY INTENSE

Parsimony Parsimony: a derogatory term from the 1930s and 1940s To describe someone who was especially careful with Spending money Biologically: Attach preference to an evolutionary pathway That minimizes the number of mutational events since (1)Mutations are rare events, and (2) The more unlikely events a model postulates, the less likely the model is to be true Parsimony: a character-based method, NoT a distance-based method
Parsimony Parsimony: a derogatory term from the 1930s and 1940s To describe someone who was especially careful with Spending money. Biologically: Attach preference to an evolutionary pathway That minimizes the number of mutational events since (1) Mutations are rare events, and (2) The more unlikely events a model postulates, the less l likely the model is to be true. Parsimony: a character-based method, NOT a distance-based method

Parsimony For parsimony analysis, positions in a sequence alignment fall into one of two categories: informative and uninformative Position Sequence 123456 GGGGGG GGG A G T 234 GG A T A G G A T C A T Only 3 possible unrooted trees you can make
Parsimony For parsimony analysis, positions in a sequence alignment fall into one of two categories: informative and uninformative. Position Sequence 1 2 3 4 5 6 1 G G G G G G 2 G G G A G T 3 G G A T A G 4 G A T C A T Only 3 possible unrooted trees you can make…

Parsimony Position Sequence 123456 GGGGGG GGG A G T 234 GG A T A G G A T C A T 4 Which tree is the right one?
Parsimony Position Sequence 1 2 3 4 5 6 1 G G G G G G 2 G G G A G T 3 G G A T A G 4 G A T C A T 1 3 1 2 1 2 2 4 3 4 4 3 Which tree is the right one?
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 4 Database Searching.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 2 More Pairwise Sequence Comparisons.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 3 More Multiple Sequence Alignment.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 Michael Yaffe Introduction to Bioinformatics.pdf
- 《微生物遗传学》第四章 基因工程技术在改进微生物.ppt
- 《分子生物学》课程教学资源(练习题)试题详解(含参考答案).doc
- 南京军区南京总医院:《组织芯片应用的现状与前景》讲义.pdf
- 《酶学》课程教学资源(讲义)第四章 酶的结构和功能.doc
- 《酶学》课程教学资源(讲义)第十一章 酶在医学方面的应用.doc
- 《酶学》课程教学资源(讲义)第六章 多种因素对酶反应速度的影响.doc
- 《酶学》课程教学资源(讲义)第八章 酶的别构效应.doc
- 《酶学》课程教学资源(讲义)第五章 酶催化动力学基础.doc
- 《酶学》课程教学资源(讲义)第二章 酶的一般性质和分类.doc
- 《酶学》课程教学资源(讲义)第九章 固定化生物催化剂.doc
- 《酶学》课程教学资源(讲义)第三章 酶活性的测定及分离纯化.doc
- 《酶学》课程教学资源(讲义)第七章 多底物酶反应动力学.doc
- 《酶学》课程教学资源(讲义)第一章 绪论.doc
- 孝感学院:《植物解剖学》第四章 种子植物的繁殖和繁殖器官.ppt
- 孝感学院:《植物解剖学》第三讲 叶.ppt
- 孝感学院:《植物解剖学》第一讲 花的重要组成结构及形成.ppt
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 2 The Language of genomics.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 Genome Sequencing.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 3 Review of DNA Seq.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 6 Predicting rna Secondary structure.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 4 Organization of topics.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 6 Structure Prediction.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 5 Markov models.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 5 Review -Homology Modeling.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 Review of protein structure hierarchy.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 1 How are X-ray crystal structures.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 3 For a molecular simulation or model.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 2 Comparing protein Structures.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 7 The protein interactome.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 7 DNA Microarrays Clustering.pdf
- 麻省理工大学:《Foundations of Biology》课程教学资源(英文版)Lecture 6 Ab initio structure prediction.pdf
- 《植物与植物生理学》课程PPT教学课件(高职高专)第三章 植物的矿质营养.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第二章 植物的水分代谢.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第五章 植物的呼吸作用.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第四章 植物的光合作用.ppt
- 《植物与植物生理学》课程PPT教学课件(高职高专)第一章 植物细胞和组织.ppt