《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-4-LSbM-tree

LSbM-tree:一个读写兼优的大数据存储结构
LSbM-tree:一个读写兼优的大数据存储结构 1

2MajorDataFormatsinStorageSystems: Sequentially archived data- Indexed data, e.g. sorted data by a defined key, ...- Read/write largely by B+-tree and LSM-tree.Relational tables- Structured data formats for relational databases, e.g. MySQL- Read/write operations by relational algebra/calculus·Key-value store-Apair ofkey/value fora data item, e.g.redis, memcached-Read/write: request -> index -> fetching data·Graph-databases? Free-style text files-A file may be retrieved by KV-store, indexed directory,
Major Data Formats in Storage Systems • Sequentially archived data – Indexed data, e.g. sorted data by a defined key, . – Read/write largely by B+-tree and LSM-tree • Relational tables – Structured data formats for relational databases, e.g. MySQL – Read/write operations by relational algebra/calculus • Key-value store – A pair of key/value for a data item, e.g. redis, memcached – Read/write: request -> index –> fetching data • Graph-databases • Free-style text files – A file may be retrieved by KV-store, indexed directory, . 2

3New Challenges to AccessPerformance in Big Data.Sequentially archived data-Canwemassivelyprocessbothreadsandwritesconcurrently?-butLSM-treefavorswritesandB+-treefavorsreads.Relationaltables- Tables must partitioned/placed among many nodes,e.g.Apache Hive- How to minimize data transfers among nodes and from local disks?·Key-valuestore-Keyindexingbecomesabottleneckas#concurrentrequestsincrease-Howtoacceleratedataaccessesforin-memorykey-valuestore?
New Challenges to Access Performance in Big Data • Sequentially archived data – Can we massively process both reads and writes concurrently? – but LSM-tree favors writes and B+-tree favors reads • Relational tables – Tables must partitioned/placed among many nodes, e.g. Apache Hive – How to minimize data transfers among nodes and from local disks? • Key-value store – Key indexing becomes a bottleneck as # concurrent requests increase – How to accelerate data accesses for in-memory key-value store? 3

Fast Accesses to Sequentially/ArchivedDatain both memory and disks
Fast Accesses to Sequentially Archived Data in both memory and disks 4

What is LSM-tree?It is a Log-structured merge-tree (1996):Multiple levels of sorted data, (e.g. each by a B+ tree)Each level increases exponentially, forming a “pyramid'The smallest level is in memory and the rest on diskCoDRAMHDDC1C2
What is LSM-tree? It is a Log-structured merge-tree (1996): • Multiple levels of sorted data, (e.g. each by a B+ tree) • Each level increases exponentially, forming a “pyramid” • The smallest level is in memory and the rest on disk C0 C1 C2 C0 C1 C2 HDD DRAM 5

LsM-treeiswidely usedinproduction systemsAPACHGoogleBigTableHBASEDTANLBSTMUEUROcassandraRocksDBMariaDBlevelDBb
LSM-tree is widely used in production systems 6 RocksDB

Why Log-structured merge-tree(LSM-tree)?B+treeInsert2.5- In-placeupdateseekRandoml/Os-Lowwritethroughput2.5dddad,LSM-treewrites- Log-structuredupdate-Merge/compactionforsortingDRAM-Sequentiall/OsHDDmerges-Highwritethroughput
Why Log-structured merge-tree (LSM-tree)? • B+ tree – In-place update – Random I/Os – Low write throughput • LSM-tree – Log-structure for sequential I/O – Merge for sorting – Batched I/O operations – High write throughput 7 2.5 d2 ’ seek Insert 2.5 • LSM-tree – Log-structured update – Merge/compaction for sorting – Sequential I/Os – High write throughput 7 writes merges HDD DRAM

8SQLite Experiences of LSM-tree by Richard HippLog StructuredMerge(LSM)BadGood-Slowerreads.FasterwritesBackgroundmergeReduced writeprocessamplificationMorespaceondisk-Linearwrites.Greatercomplexity.LessSSDwear
SQLite Experiences of LSM-tree by Richard Hipp 8

AbuffercacheproblemreportedbyCassandrain2014CASSANDRA-6916-readoaC*2.1 beta1 vs beta2110,000effective caching100,00090,00080.00070,00060.00050,00warmingup40.00030,00020,000disenabled caching10,000250t5o20010060Total operation time (seconds)*https://www.datastax.com/dev/blog/compaction-improvements-in-cassandra-219
A buffer cache problem reported by Cassandra in 2014 9 *https://www.datastax.com/dev/blog/compaction-improvements-in-cassandra-21 warming up effective caching disenabled caching

BasicFunctionsofBufferCacheBuffer cache is in DRAM or other fast devicesData entries in buffer cache refer to disk blocks (page)Cache the frequently read disk blocks for reuseBuffer cache.disk2a10
Basic Functions of Buffer Cache • Buffer cache is in DRAM or other fast devices • Data entries in buffer cache refer to disk blocks (page) • Cache the frequently read disk blocks for reuse 10 b c a disk Buffer cache a c b
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-3-MR-model-and-systems.pptx
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-2-access-patterns-in-big-data.pptx
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-1-balanced-systems-updated.pptx
- 《系统软件与软件安全》课程教学资源(文献资料)系统软件与软件安全文献合集.pdf
- 济南大学:研究生院《人工智能》专业课程教学大纲汇编.pdf
- 济南大学:研究生院《计算机技术》专业课程教学大纲汇编.pdf
- 济南大学:研究生院《计算机科学与技术》专业课程教学大纲汇编.pdf
- 北京信息科技大学:研究生院计算机学院课程教学大纲汇编.pdf
- 湖南工业大学:计算机与人工智能学院人工智能专业课程教学大纲汇编(2023版人才培养方案).pdf
- 湖南工业大学:计算机与人工智能学院智能科学与技术专业课程教学大纲汇编(2023版人才培养方案).pdf
- 湖南工业大学:计算机与人工智能学院物联网工程专业课程教学大纲汇编(2023版人才培养方案).pdf
- 湖南工业大学:计算机与人工智能学院网络工程专业课程教学大纲汇编(2023版人才培养方案).pdf
- 湖南工业大学:计算机与人工智能学院通信工程专业课程教学大纲汇编(2023版人才培养方案).pdf
- 湖南工业大学:计算机与人工智能学院软件工程专业课程教学大纲汇编(2023版人才培养方案).pdf
- 华中科技大学:计算机科学与技术学院《机器学习》课程教学大纲(2021版).pdf
- 华中科技大学:计算机科学与技术学院《计算机图形学》课程教学大纲(2021版).pdf
- 华中科技大学:计算机科学与技术学院《计算理论》课程教学大纲(2021版).pdf
- 华中科技大学:计算机科学与技术学院《计算思维》课程教学大纲(2021版).pdf
- 华中科技大学:计算机科学与技术学院《接口技术》课程教学大纲(2021版).pdf
- 华中科技大学:计算机科学与技术学院《命令式计算原理》课程教学大纲(2021版).pdf
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-7-big-volume-data-accesses.pptx
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-6-locks-and-CC.pptx
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-7-SSD-sys.pptx
- 《系统软件与软件安全》课程教学课件(PPT讲稿,英文)Lecture-8-SDS-vision.pptx
- 江苏科技大学:《计算机组成原理》课程教学资源(PPT课件,完整讲稿,共十章).pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter1_1计算机基础知识.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter1_2计算机中数的表示和编码.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter2_1 8086-8088微处理器结构.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter2_2 8086-8088的寻址方式.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter2_3 8086-8088的指令系统.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter2_4逻辑指令-控制转移指令.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter2_5处理机控制-串处理指令.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter3_1汇编语言及其程序结构.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter3_2汇编语言程序举例.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter3_3 BIOS和DOS中断功能调用.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter3_4 汇编语言程序设计.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter3_5 汇编语言程序设计小结.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter4_1 PC机的总线结构和时序.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter4_2 总线与时序.pptx
- 江苏科技大学:《微机原理与接口技术》课程教学资源(PPT课件)Chapter5_0接口概述.pptx
