网格计算热点与综述_Issues with Production Grids

lssues with Production Grids Tony Hey Director of uk e-Science Core Programme
Issues with Production Grids Tony Hey Director of UK e-Science Core Programme

NGS S NGs“ Today Interfaces IERAGRI Projects ecee e-Minerals cabling Grids for OGSI:: Lite e-Materials Orbital Dynamics of Galaxies Bioinformatics(using BLAST GEODISE project Core Node Software Stack UKQCD Singlet meson project Census data analysis Foundations HPCM MIAKT project Middleware e-HTPX project. SR83 Globus( Reality Grid(chemistry) Backend ■盟sY UNIVERSIT Oxford PGIIntelGCc Total View Debugger UCL RedHat Enterprise Linux 3.0 Southampton Imperial Example Applications 器说★ 夥、 NAg Sheffield DL POLY Molecular Foundation QUB BBSRC CCLRC ecee Enabling Grids for -science in Europe
NGS “Today” Projects e-Min erals e-M aterials Orbit al Dyna mics of G ala xies Bioinform atics (usin g BLA ST) GEODISE proj e ct UKQCD Sin glet meson proj e ct Census data analysis MIAKT proj e ct e-HTPX proj e ct. R ealityGrid (chemistry) Users Leeds Oxford UCL Car diff Southampton Imperial Liverpool Sheffiel d C ambridge Edinburgh QUB BBSRC CCLRC. Interfaces OGSI::Lite

dustervision NGS Hardware Compute cluster Data Cluster .64 dual CPU Intel 3.06 GHz(1MB cache )nodes.20 dual CPU Intel 3.06 GHz nodes 2GB memory per node .4GB memory per node .2X 120GB IDE disks(1 boot, 1 data) 2X120GB IDE disks(1 boot, 1 data . Gigabit network .Gigabit network . Myrinet M3F-PCIXD-2 Myrinet M3FPCⅨXD2 . Front end (as node) Front end(as node) .Disk server(as node) with 2x Infortrend 2.1TB-18TB Fibre SAN(Infortrend F16F41TB Fibre U16U SCSI Arrays(Ultra Star 146Z10 disks) Arrays(Ultra Star 146Z10 disks) .PGI compilers .PGI compilers .Intel Compilers, Mr .Intel Compilers, MKL .PBSPro .TotalView Debugger TotalView Debugger . Redhat es 3.0 oracle 9i rac .Oracle Application server .RedHat ES 3.0
NGS Hardware Compute Cluster •64 dual CPU Intel 3.06 GHz (1MB cache) nodes •2GB memory per no d e •2x 120GB IDE disks (1 boot, 1 data) •Gigabit network •Myrinet M3F-PCIXD-2 •Front end (as node) •Disk server (as n o d e) with 2x Infortrend 2.1TB U16 U SCSI Arrays (UltraStar 1 4 6 Z10 disks) •PGI compilers •Intel Compilers, MKL •PBSPro •TotalView Debugger •RedHat ES 3.0 Data Cluster •20 dual CPU Intel 3.06 GHz nodes •4GB memory per no d e •2x120GB IDE disks (1 boot, 1 data) •Gigabit network •Myrinet M3F-PCIXD-2 •Front end (as node) •18TB Fibre SAN ( Infortrend F16F 4.1TB Fibre Arrays (UltraStar 1 46Z10 disks) •PGI compilers •Intel Compilers, MKL •PBSPro •TotalView Debugger •Oracle 9i R AC •Oracle Applicati on server •RedHat ES 3.0

NGS Software Core Node Software Stack Foundations OGSA- DAL Middleware VDT 1.2 SRB 3 Globus gr3 Backend Oracle RAC 9i Libraries Tools PGI Intel Total View Debugger RedHat Enterprise Linux 3.0 Example Applications NAg DL POLY Molecular Foundations. 确m0s Ab initi NCBI B MATLAB Reality Grid
NGS Software

Reality Grid AHM Experiment Measuring protein-peptide binding energies -44G ind is vital for e. g. understanding fundamental physical processes at play at the molecular level, for designing new drugs Computing a pept otide-protein binding energy traditionally takes weeks to months We have developed a grid ligand based method to accelerate this process We computed 44Ghind during the uK ami.e. Src SH2 domain in less than 48 hours
RealityGrid AHM Experiment • Measuring protein-peptide binding energies – ∆∆Gbind is vital for e.g. understanding fundamental physical processes at play at the molecular level, for designing new drugs. • Computing a peptide-protein binding energy traditionally takes weeks to months. • We have developed a gridbased method to accelerate this process. We computed ∆∆Gbind during the UK AHM i.e. in less than 48 hours ligand Src SH2 domain

Experiment Details A Grid based approach, using the reality Grid steering library enables us to launch, monitor checkpoint and spawn multiple simulations Each simulation is a parallel molecular dynamic simulation running on a supercomputer class machine At any given instant, we had up to nine simulations in progress(over 140 processors) on machines at 5 different sites e.g 1X TG-SDSC, 3X TG-NCSA, 3x NGS-OXford 1x NGS-Leeds 1X NGS-RAL
Experiment Details • A Grid based approach, using the RealityGrid steering library enables us to launch, monitor, checkpoint and spawn multiple simulations • Each simulation is a parallel molecular dynamic simulation running on a supercomputer class machine • At any given instant, we had up to nine simulations in progress (over 140 processors) on machines at 5 different sites: e.g 1x TG-SDSC, 3x TG-NCSA, 3x NGS-Oxford, 1x NGS-Leeds, 1x NGS-RAL

Experiment Details(2) In all 26 simulations were run over 48 hours We simulated over 6.8ns of classical molecular dynamics in this time Real time visualization and off-line analysis required bringing back data from sImulations in progress We used UK-light between UCL and the TeraGrid machines(SDSC, NCSa)
Experiment Details (2) • In all 26 simulations were run over 48 hours. We simulated over 6.8ns of classical molecular dynamics in this time • Real time visualization and off-line analysis required bringing back data from simulations in progress. • We used UK-light between UCL and the TeraGrid machines (SDSC, NCSA)

The e-Infrastructure UK NGS Starlight( Chicago) anchester US TeraGrid Netherlight Amsterdam) Oxford SDSC RAL NCSA PSC UCL UKLight AHM 2004 All sites connected by and manchester production network(not vncserver ll shown) Computation Steering clients O Network PoP O Service Registry
Computation Starlight (Chicago) Netherlight (Amsterdam) Leeds PSC SDSC NCSA Manchester Oxford RAL US TeraGrid UK NGS UCL UKLight The e-Infrastructure AHM 2004 Local laptops and Manchester vncserver All sites connected by production network (not all shown) Steering clients Network PoP Service Registry

The scientific results 400 Thermodynamic Integrations 300▲ ′d 200 100 0.2 0.4 0.6 08 100 lambda -200 Some simulations require extending and more sophisticated analysis needs to be performed
The scientific results … T h e r m odyna mic I nte g r ations -200 -100 0 100 200 300 400 0 0.2 0.4 0.6 0.8 1 la mbda dE/dl dp p o Some simulations require extending and more sophisticated analysis needs to be performed

and the problems Restarted the GridService container Wednesday evenin Numerous quota and permission issues, especially at TG-SDSC NGS-Oxford was unreachable Wednesday evening to Thursday morning The steerer and launcher occasionally fail We were unable to checkpoint two simulations The batch queuing systems occasionally did not like our simulations 5 simulations died of natural causes Overall, up to six people were working on this calculation to solve these problems
… and the problems • Restarted the GridService container Wednesday evening • Numerous quota and permission issues, especially at TG-SDSC • NGS-Oxford was unreachable Wednesday evening to Thursday morning • The steerer and launcher occasionally fail • We were unable to checkpoint two simulations • The batch queuing systems occasionally did not like our simulations • 5 simulations died of natural causes • Overall, up to six people were working on this calculation to solve these problems
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- C语言程序设计(下)_第9讲 位运算,枚举,类型定义与编译预处理.pps
- C语言程序设计(下)_第8讲 结构与联合.pps
- C语言程序设计(下)_第7讲 查找与排序算法.pps
- C语言程序设计(下)_第13讲 非线性结构及数据结构应用举例.pps
- C语言程序设计(下)_第12讲 数据结构基础(二).pps
- C语言程序设计(下)_第11讲 数据结构基础(一).pps
- C语言程序设计(下)_第10讲 文件.pps
- C语言程序设计(上)_第6讲 指针.pps
- C语言程序设计(上)_第5讲 函数.pps
- C语言程序设计(上)_第4讲 数组的概念及应用.pps
- C语言程序设计(上)_第3讲 C语言程序的基本控制结构.pps
- C语言程序设计(上)_第2讲 C语言基础.pps
- C语言程序设计(上)_第1讲 预备知识.pps
- C语言程序设计(上)_第0讲 前言.pps
- C语言程序设计(上)_cover.ppt
- 华北电力大学:数据结构_第8章(查找表).ppt
- 华北电力大学:数据结构_第7章(图).ppt
- 华北电力大学:数据结构_第6章(树).ppt
- 华北电力大学:数据结构_第5章(串).ppt
- 华北电力大学:数据结构_第4章(数组和广义表).ppt
- 操作系统原理试题.doc
- 新标准中文版Office XP五合一基础培训教程-目录.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第1章 Windows XP入门.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第3章 Windows XP附件程序.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第2章运行程序.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第5章系统设置.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第6章软件和硬件的安装与删除.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第4章文件(夹)和程序的管理.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第7章 Windows XP网络.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第9章初步使用Word2002.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第8章Word2002基本操作.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第11章格式化文档.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第12章使用表格.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第10章编辑文档.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第17章工作簿和工作表.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第13章图文混排.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第15章样式与模板.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第18章函数和公式.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第16章 Excel2002入门.ppt
- 《新标准中文版Office XP五合一基础培训教程》电子教案(PPT课件)第14章设置版面.ppt