Fudan University: Genomics Course Teaching Resources (Study Materials): Gene Annotation Websites

Stephen M. Mount
Cell Biology and Molecular Genetics
H. J. Patterson Hall
University of Maryland
College Park, MD 20742-5815
Phone: 301-405-6934
FAX: 301-314-9081
Permanent email address: sm193@umail.umd.edu

This is Steve Mount's web page for gene annotation and splice site selection. Much of the material here is relevant to a review article in the American Journal of Human Genetics.

Annotation

Gene annotation incorporates cDNA data (including ESTs), sequence similarity, and computational predictions based on the recognition of probable splice sites and coding regions. The state of the art was recently surveyed by the Genome Annotation Assessment Project (GASP1), the results of which were published in a special issue of Genome Research.

Ensembl - Ensembl is a joint project between EMBL-EBI and the Sanger Centre to develop a software system which produces and maintains automatic annotation on eukaryotic genomes.
Gene Ontology Consortium - The goal of the Gene Ontology Consortium is to produce a dynamic controlled vocabulary that can be applied to all eukaryotes even as knowledge of gene and protein roles in cells is accumulating and changing.
Oak Ridge National Laboratory Computational Biosciences Section - A project whose stated mission is to address fundamental questions in the life sciences and provide information and analytical resources to the wider biology research community.
TIGR Databases - The Institute for Genomic Research (TIGR) databases are a collection of curated databases containing DNA and protein sequence, gene expression, cellular role, protein family, and taxonomic data for microbes, plants, and humans.
Celera - Celera Genomics is a division of PE Corporation that produced the fruit fly and human genome sequences. They are currently working on the mouse, and have announced a plan to move into proteomics.

Genefinding

Many genefinding servers are available, and the following list is not complete.

GENSCAN - GENSCAN is a genefinder developed by Chris Burge.
GlimmerM - GlimmerM is a gene finder developed specifically for small eukaryotes with a gene density of around 20%.
Genie - Genie is a gene identification tool developed at the University of California, Santa Cruz, that uses Hidden Markov Models to find genes.
FGENES - This genefinder is available through the Sanger Centre's Computational Genomics Group.
GRAIL - The Gene Recognition and Assembly Internet Link is available through the Oak Ridge National Laboratory Computational Biosciences Section.
HMMGene - This genefinder is available through the Center for Biological Sequence Analysis at the Department of Biotechnology, The Technical University of Denmark.

Splice site prediction

Again, there are other sites, but the following are the ones known to me.

Splice Site Prediction by Neural Network - Hosted by the Berkeley Drosophila Genome Project and written by Martin Reese.
NetGene and NetPlantGene - Both of these are available through the Center for Biological Sequence Analysis at the Department of Biotechnology, The Technical University of Denmark.

cDNA alignment

SIM4 - SIM4 is described by Florea et al.
The Intronerator - A collection of tools for exploring the molecular biology and genomics of C. elegans, with a special emphasis on alternative splicing. This is specific to C. elegans, and does more than align cDNAs.

Alternative Splicing

PALS - Putative Alternative Splicing database. Searchable, limited to mouse and human.
ASDB - Alternative Splicing Database, based on GenBank entries.
HASDB - Human Alternative Splicing Database. Chris Lee, UCLA. 6201 alternative splice relationships in human genes identified through a genome-wide analysis of expressed sequence tags (ESTs).
Splicing anomalies in Arabidopsis - Put into categories that include alternative splicing, based on full-length cDNA sequences.

Splice Site Consensus

It is well established that nearly all splice sites conform to consensus sequences. These consensus sequences include nearly invariant dinucleotides at each end of the intron (GT at the 5' end and AG at the 3' end) and generally resemble MAG|GTRAGT at the 5' splice site and CAG|G at the 3' splice site.

The most common class of nonconsensus splice sites consists of 5' splice sites with a GC dinucleotide (Wu and Krainer 1999). GC sites conform extremely well to the standard consensus sequences at other positions: 42 of 44 sites have a consensus G residue at both position -1 and position 5. It is reasonable to assume that GC sites are recognized by the standard (U2-dependent) spliceosome.

The second class of exception to splice site consensus is U12 introns, a minor class of rare introns with splice site sequences that are very different from the standard consensus, but which are very similar to each other (reviewed by Burge et al. 1999 and Tarn and Steitz 1997). U12 introns can be identified by highly conserved sequences at the 5' splice site (RTATCCTY; R = A or G; Y = C or T) and branch site (TCCTRAY). U12 introns are found in many eukaryotes, including Drosophila melanogaster and Arabidopsis, but not C. elegans.

Finally, there are a small number of nonconsensus sites that fit into neither of the two categories mentioned above. Many reports of such variant splice sites can be traced to errors in annotation or interpretation, polymorphic differences between the sources of cDNA and genomic sequence, inclusion of pseudogene sequences, or failure to account for somatic mutation. However, there are many examples of sites that match the consensus very poorly, and experimental work has established that 5' splice sites do not absolutely require GT, and 3' splice sites do not absolutely require AG, to be recognized in vivo.
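The three classes described above (standard GT-AG introns, GC-AG introns, and U12 introns) can be told apart with simple pattern checks. The following Python sketch is only an illustration, not a splice site predictor: the function names, the labels, and the 40-base window searched for the branch site are assumptions introduced here, and it assumes the RTATCCTY motif begins at the first base of the intron.

```python
import re

# IUPAC codes used in the text: M = A or C, R = A or G, Y = C or T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "M": "[AC]", "R": "[AG]", "Y": "[CT]"}


def iupac_to_regex(motif):
    """Translate an IUPAC motif such as RTATCCTY into a regular expression."""
    return "".join(IUPAC[base] for base in motif.upper())


U12_DONOR = re.compile(iupac_to_regex("RTATCCTY"))   # 5' splice site motif
U12_BRANCH = re.compile(iupac_to_regex("TCCTRAY"))   # branch site motif


def classify_intron(intron, branch_window=40):
    """Label an intron sequence (DNA, written 5' to 3') by its splice site class.

    branch_window is a hypothetical parameter: the number of bases at the
    3' end of the intron that are searched for the branch site motif.
    """
    seq = intron.upper()
    # U12 candidates: donor motif at the start of the intron and a branch
    # site motif somewhere in the last branch_window bases.
    if U12_DONOR.match(seq) and U12_BRANCH.search(seq[-branch_window:]):
        return "U12"
    if seq.startswith("GT") and seq.endswith("AG"):
        return "GT-AG"
    if seq.startswith("GC") and seq.endswith("AG"):
        return "GC-AG"
    return "other"
```

The U12 test comes first because the motif RTATCCTY can itself start with GT, so a U12 intron would otherwise be labeled as a standard GT-AG intron. Anything labeled "other" deserves the scrutiny described above (annotation errors, polymorphism, pseudogenes) before it is accepted as a genuine variant site.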

Microexons

A list of selected documented microexons is available. Very small exons, or microexons, pose special problems for gene annotation. They are difficult to recognize using computational genefinding methods and can even confound the alignment of cDNA and genomic sequences. Furthermore, because microexons are very often the site of alternative splicing, an understanding of how they are recognized (and regulated) is key to understanding gene expression.
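One way to see why very short exons confound cDNA-to-genome alignment is to count how often a sequence of that length would match by chance. The sketch below is a rough illustration under a simplifying assumption that is not from the original page: the genomic region is treated as random sequence with equal base frequencies, and only one strand is considered. The function name and the 100,000-base region are hypothetical.

```python
def expected_chance_matches(exon_length, region_length):
    """Expected number of exact matches to one fixed sequence of exon_length
    bases on a single strand of a random region with equal base frequencies."""
    # Number of positions at which a match of this length could start.
    positions = max(region_length - exon_length + 1, 0)
    # Each position matches with probability (1/4) ** exon_length.
    return positions * 0.25 ** exon_length


# Compare exon lengths within a hypothetical 100,000-base genomic region.
for n in (3, 6, 9, 12, 15):
    print(n, round(expected_chance_matches(n, 100_000), 3))
```

Under this model a 6-base exon is expected to match at more than twenty places in the region, so sequence identity alone cannot place it; the flanking splice site signals and the neighboring exons have to carry the alignment.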