同济大学:《大数据分析与数据挖掘 Big Data Analysis and Mining》课程教学资源(PPT课件讲稿)关联规则 Association Rule

Big Data Analysis and Mining Association Rule Qinpei Zhao赵钦佩 qinpeizhao@tongji.edu.cn 2015 Fall 2021/1/27
2021/1/27 1 Big Data Analysis and Mining Qinpei Zhao 赵钦佩 qinpeizhao@tongji.edu.cn 2015 Fall Association Rule

Frequent Pattern Analysis Frequent patten a pattern(a set of itemS,S subsequences, substructures, etc. )that occur frequently in a data set a First proposed by Agrawal, Imielinski, and Swami in the context of frequent itemsets and assocIation rule mining a Motivation Finding inherent regularities in data o What products were often purchased together? 口 Beer and diapers? What are the subsequent purchases after buying a PC?
Frequent Pattern Analysis 2 ◼ Frequent pattern: a pattern (a set of items, subsequences, substructures, etc.) that occurs frequently in a data set ◼ First proposed by Agrawal, Imielinski, and Swami in the context of frequent itemsets and association rule mining. ◼ Motivation: Finding inherent regularities in data ◆ What products were often purchased together? Beer and diapers? ◆ What are the subsequent purchases after buying a PC?

Association Rule Discovery a Supermarket shelf management- Market-basket model ■Goal o Identify items that are bought together by sufficiently many customers ■ Approach e Process the sales data collected with barcode scanners to find dependencies among items ■ a classic ru|e If someone buys diaper and milk, then he/she is likely to buy beer Don't be surprised if you find six-packs next to diapers!
Association Rule Discovery 3 ◼ Supermarket shelf management – Market-basket model ◼ Goal: ◆ Identify items that are bought together by sufficiently many customers ◼ Approach: ◆ Process the sales data collected with barcode scanners to find dependencies among items ◼ A classic rule: ◆ If someone buys diaper and milk, then he/she is likely to buy beer ◆ Don’t be surprised if you find six-packs next to diapers!

Applications-(1) a Items= products Baskets = sets of products someone bought in one trip to the store a Real market baskets: Chain stores keep TBs of data about what customers buy together o Tells how typical customers navigate stores, let them position tempting items Suggests tie-in tricks", e.g., run sale on diapers and raise the price of beer Need the rule to occur frequently a Amazon's people who bought X also bought Y
Applications – (1) 4 ◼ Items = products; Baskets = sets of products someone bought in one trip to the store ◼ Real market baskets: Chain stores keep TBs of data about what customers buy together ◆ Tells how typical customers navigate stores, let them position tempting items ◆ Suggests tie-in “tricks”, e.g., run sale on diapers and raise the price of beer ◆ Need the rule to occur frequently ◼ Amazon’s people who bought X also bought Y

ELSES PROPERTY PLAGIARISm ORK Applications-(2) Baskets sentences Items documents containing those sentences Items that appear together too often could represent plagiarism ◆ Notice items do not have to be“in” baskets Baskets= patients; Items drugs side-effects o has been used to detect combinations of drugs that result in particular side-effects But requires extension: Absence of an item needs to be observed as well as presence
Applications – (2) 5 ◼ Baskets = sentences; Items = documents containing those sentences ◆ Items that appear together too often could represent plagiarism ◆ Notice items do not have to be “in” baskets ◼ Baskets = patients; Items = drugs & side-effects ◆ Has been used to detect combinations of drugs that result in particular side-effects ◆ But requires extension: Absence of an item needs to be observed as well as presence

ELSES PROPERTY ARISm ORE IO Plagiarism Checker Similar Content search Online Plagiarism Paste Original content Here Paste Alternate content Here esting Plagiarism ABC College for Women is one of the most prestigious since the establishment of ABC College for Women and in early Januar institutions of London with a full time enrollment of about 8000 students 12, the University has tried its level best for improvement in Highes Government did variou institution have been shaped by its institutional history, which is spread foreign universities MoU with various national industries and linkages with t years. In 02, the University made all strong foreign universities have been established in the field of Pharmacy sions for the improvement in on. Established in May Electronics, Ent al Science, Fine Arts, Economics and Mass Communication. This is how they made the glorious academic values of University of the Oxford, it was housed in a building on XYZ Road,with his oldest premier post-graduate female institution very nicely ngth of 90 students and then the progress flourished with full shot And College started programs like Electronics, Environmental Science, ts, Economics and Mass Communication, Various national industries and linkages with Foreign Colleges helped a lot c842W:130s:6P:1 C:558W805:3P1 lear Highlight Clear all Import ar Highlight Clear all Occurances Density Matching Limit Case Sensitive Higher Education. Established 1% Scan Density 2.31% Statistics: Plagiarism ABC College 2.31% 30 %/o Duplicate Found! Education, Established in 231% a Export csy 39 Matches Detected Established in May Export相ML for Women s 231% Scan Now 6
6

Transaction data: a set of documents A text document data set, each document is treated as a“bag” of keywords doc 1 Student, Teach, School doc2 Student School docs Teach, School, City, Game doc Baseball, Basketball doc5 Basketball, Player, Spectator doc Baseball. Coach. Game, Team doc: Basketball, Team, City, Game
7 Transaction data: a set of documents ◼ A text document data set. Each document is treated as a “bag” of keywords doc1: Student, Teach, School doc2: Student, School doc3: Teach, School, City, Game doc4: Baseball, Basketball doc5: Basketball, Player, Spectator doc6: Baseball, Coach, Game, Team doc7: Basketball, Team, City, Game

The model: rules a transaction t contains x. a set of items (itemset)in / ifX c t An association rule is an implication of the form X→>Y, Where x,Ycl,andX∩Y= An itemset is a set of items + E.g., X=milk, bread, cereal) is an itemset ak-itemset is an itemset with k items E.g., milk, bread, cereal] is a 3-itemset
8 The model: rules ◼ A transaction t contains X, a set of items (itemset) in I, if X t. ◼ An association rule is an implication of the form: X → Y, where X, Y I, and X Y = ◼ An itemset is a set of items. ◆ E.g., X = {milk, bread, cereal} is an itemset. ◼ A k-itemset is an itemset with k items. ◆ E.g., {milk, bread, cereal} is a 3-itemset

Rule strength measures (the transaction data set) if sup gor sup in T Support: The rule holds with support transactions containⅩ∪Y ◆Sp=Pr(x∪Y Confidence. The rule holds in t with confidence conf if conf of tranactions that contain x also contain y conf=Pr(r X) a An association rule is a pattern that states When x occurs. y occurs with certain probability
9 Rule strength measures ◼ Support: The rule holds with support sup in T (the transaction data set) if sup % of transactions contain X Y. ◆ sup = Pr(X Y). ◼ Confidence: The rule holds in T with confidence conf if conf % of tranactions that contain X also contain Y. ◆ conf = Pr(Y | X) ◼ An association rule is a pattern that states when X occurs, Y occurs with certain probability

Support and Confidence Support count: The support count of an itemset X, denoted by X count, in a data set T is the number of transactions in t that contain X assume t has n transactions Then (X∪Y) count support= (X∪) count confidence Xcount
10 Support and Confidence ◼ Support count: The support count of an itemset X, denoted by X.count, in a data set T is the number of transactions in T that contain X. Assume T has n transactions. ◼ Then, n X Y count support ( ). = X count X Y count confidence . ( ). =
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 《程序设计基础》课程教学资源:实验教学大纲.pdf
- 白城师范学院:《数据库系统概论 An Introduction to Database System》课程教学资源(PPT课件讲稿)第二章 关系数据库(2.4 关系代数 2.5 关系演算 2.6 小结).ppt
- 安徽工贸职业技术学院:《计算机组装与维护》课程教学资源(PPT课件讲稿)项目五 微型计算机维护.ppt
- 曙光:并行程序设计简介(PPT讲座).ppt
- 《单片机原理与应用》课程教学资源(PPT课件讲稿)第7章 显示与开关/键盘输入及微型打印机接口设计.ppt
- 数据结构与算法(PPT课件讲稿)Data Structures and Algorithms.pptx
- 四川大学:《计算机操作系统 Operating System Principles》课程教学资源(PPT课件讲稿)第5章 死锁.ppt
- 四川大学:《Java面向对象编程》课程PPT教学课件(Object-Oriented Programming - Java)Unit 1.1 Java Applications 1.1.1 Applications in Java(熊运余).ppt
- 厦门大学:《大数据技术原理与应用》课程教学资源(PPT课件讲稿,2016)第8章 流计算.ppt
- Adaptive Dynamic Bipartite Graph Matching:A Reinforcement Learning Approach.pptx
- 中国科学技术大学:《网络安全协议》课程教学资源(PPT课件讲稿)第一章 网络安全综述 Network Security Protocols(薛开平).ppt
- 电子工业出版社:《计算机网络》课程教学资源(第五版,PPT课件讲稿)第二章 物理层.ppt
- Excel 2010高级使用技巧(PPT讲稿).ppt
- 《数据库原理》课程教学资源(PPT课件讲稿)第三章 关系数据库标准查询语言SQL.pps
- 河南中医药大学(河南中医学院):《计算机文化》课程教学资源(PPT课件讲稿)第一章 计算机网络概述(主讲:阮晓龙).pptx
- 《软件工程导论》课程教学资源(PPT课件讲稿)第9章 面向对象方法学.ppt
- 南京航空航天大学:《C++》课程电子教案(PPT课件讲稿)第3章 类的基础部分(主讲:陈哲).ppt
- 南京大学:使用失效数据来引导决定(PPT讲稿,计算机系:赵建华).ppt
- 南京大学:《Java语言程序设计》课程教学资源(PPT课件讲稿)第2章 Java语言语法基础.ppt
- 上海交通大学:并发理论(PPT课件诗篇)Concurrency Theory.ppt
- 《图像处理与计算机视觉 Image Processing and Computer Vision》课程教学资源(PPT课件讲稿)Chapter 11 Bundle adjustment Structure reconstruction SFM from N-frames.pptx
- 电子科技大学:《计算机操作系统》课程教学资源(PPT课件讲稿)第三章 处理机的调度和死锁.ppt
- 香港科技大学:Clustering(PPT讲稿).ppt
- 上海交通大学:TLS/SSL Security(PPT课件讲稿).pptx
- 山东大学计算机学院:《人机交互技术》课程教学资源(PPT课件讲稿)第7章 Web界面设计.ppt
- 山东大学:《微机原理及单片机接口技术》课程教学资源(PPT课件讲稿)第三章 IAP15W4K58S4单片机的硬件结构.ppt
- 南京大学:《面向对象技术 OOT》课程教学资源(PPT课件讲稿)面向方面的编程 Aspect Oriented Programming.ppt
- 武昌首义学院:Word的基本操作与技巧(PPT讲稿,主讲:张旋子).pptx
- 《VB程序设计》课程教学资源(PPT课件讲稿)第八章 过程.pps
- 湖南生物机电职业技术学院:《电子商务概论》课程教学资源(PPT课件)第五章 网络信息搜索.ppt
- 《电子商务》课程教学资源(PPT课件讲稿)第十章 网络营销.pptx
- 广西外国语学院:《计算机网络》课程教学资源(PPT课件讲稿)第7章 传输层协议——TCP与UDP.ppt
- 九州大学(日本国立综合大学):烟花算法爆炸因子分析及改良(艺术工学府:余俊).pptx
- 图像视频编码与表达的理论与方法(PPT讲稿)图像压缩标准JPEG.ppt
- 中国科学技术大学:《计算机视觉》课程教学资源(PPT课件讲稿)第九章 单幅图像深度重建 Depthmap Reconstruction Based on Monocular cues.ppt
- 电子工业出版社:《计算机网络》课程教学资源(第五版,PPT课件讲稿)第六章 应用层.ppt
- 《计算机导论》课程教学资源(PPT课件讲稿)第3章 计算机发展史和计算思维.pptx
- 武昌理工学院(武汉科技大学中南分校):Windows 2000/XP网络组建与系统管理(PPT课件讲稿,主讲:李燕).ppt
- 《网络编程实用教程(第三版)Network Application Programming》课程教学资源(PPT课件讲稿)第1章 概述.ppt
- 电子工业出版社:《计算机网络》课程教学资源(第五版,PPT课件讲稿)第十章 下一代因特网.ppt