合肥工业大学:使用大数据进行计算建模(PPT讲稿)Computing/Modeling with Big Data(主讲:吴信东)

Computing/Modeling with Big Data Xindong wu(吴信东) 中国·合肥工业大学计算机与信息学院; Department of Computer Science University of Vermont, USA
Computing/Modeling with Big Data Xindong Wu (吴信东) 中国 · 合肥工业大学计算机与信息学院; Department of Computer Science University of Vermont, USA

Outline The Era of Big Data 2 Big Data Character istics 3 A Big Data Processing Framework Streaming Data and Streaming Features 5 Concluding Remarks
1 The Era of Big Data 2 Big Data Characteristics 3 A Big Data Processing Framework 4 Streaming Data and Streaming Features Outline 5 Concluding Remarks 2

ICDM 13 Panel: Data Mining with Big data Panel Chair: Xindong wu Panelists: what useful content our Big Data: a hot topic, bu Chris Clifton (NSF Purdue) What new aspects? or is Vipin kumar(minnesota it just data mining FIEEE, FACM, FAAAS) · How does data mining Jian Pei(TKDE EiC, Canada, FIEEE chans nge with Big Data? Bhavani Thuraisingham What should data miners CUTDallas, Security, FIEEE, FAAAS) Geoff Webb(DMKD EiC, Australia) do to cope with these Zhi-Hua zhou changes (Nanjing, China, FIEEE Big Data
ICDM ’13 Panel: Data Mining with Big Data Panel Chair: Xindong Wu Panelists: • Chris Clifton (NSF & Purdue) • Vipin Kumar (Minnesota, FIEEE,FACM,FAAAS) • Jian Pei (TKDE EiC, Canada, FIEEE) • Bhavani Thuraisingham (UTDallas, Security, FIEEE, FAAAS) • Geoff Webb (DMKD EiC, Australia) • Zhi-Hua Zhou (Nanjing,China,FIEEE) • Big Data: a hot topic, but what useful content? • What new aspects? or is it just data mining? • How does data mining change with Big Data? • What should data miners do to cope with these changes? Big Data

Big Data, from 70s to Now, and 2046 The 1st International Conference on Very Large Data Bases (September 22-24, 1975, Framingham, MA, USA Very large big? The first Er model paper, QBE XLDB -EXtremely Large Databases and Data Management started on october 25, 2007 ·? LDB in2046? ULDB-Upmost Large Databases O Cent 01: being big is relative, going big is a deterministic trend Data mining: keep evolving J. Pei: Big Data Analytics 101
Big Data, from 70s to Now, and 2046 • The 1st International Conference on Very Large Data Bases (September 22-24, 1975, Framingham, MA, USA) – Very large = big? – The first ER model paper, QBE, … • XLDB – Extremely Large Databases and Data Management, started on October 25, 2007 • ?LDB in 2046? – ULDB – Upmost Large Databases ☺ • Cent 01: being big is relative, going big is a deterministic trend • Data mining: keep evolving J. Pei: Big Data Analytics 101 4

Some comments on big data David hand Imperial College, London David Hand: Some comments on big data December 2013
Some comments on big data David Hand Imperial College, London David Hand: Some comments on big data, December 2013

The power law theorem of data set size The number of data sets of size n is inversely proportional to n There are vastly more small data sets than very large ones So small data sets are likely to have a much larger impact on the world than big data sets David Hand: Some comments on big data December 2013
The power law theorem of data set size: • The number of data sets of size n is inversely proportional to n • There are vastly more small data sets than very large ones • So small data sets are likely to have a much larger impact on the world than big data sets David Hand: Some comments on big data, December 2013

No-one actually wants data What people want are answers Which may be extracted from data So data are only half the answer The other half is statistics, data mining machine learning and other data analytic sciplines David Hand: Some comments on big data December 2013
No-one actually wants data • What people want are answers • Which may be extracted from data • So data are only half the answer • The other half is statistics, data mining, machine learning, and other data analytic disciplines David Hand: Some comments on big data, December 2013

The manure heap theorem of data discoveries The probability of finding a gold coin in a heap of manure tends towards 1 as the size of the heap tends to infinity. (This theorem is false) David Hand: Some comments on big data December 2013
The manure heap theorem of data discoveries The probability of finding a gold coin in a heap of manure tends towards 1 as the size of the heap tends to infinity. (This theorem is false) David Hand: Some comments on big data, December 2013

0100 00100 Data Science not just for Big data Gregory piatetsky @kdnuggets nuggets Analytics, Big Data. Data mining, and data Science resources o KDnuggets 2013
Data Science not just for Big Data Gregory Piatetsky, @kdnuggets Analytics, Big Data, Data Mining, and Data Science Resources © KDnuggets 2013 9

What do we call it? Statistics, 1830 Same Core ldea Data mining, 1980 Finding Useful Knowledge Discovery in Patterns in Data Data(KDD),1989 Business analytics, 1997 Predictive analytics, 2002 Data analytics, 2011 Different · Data science,2011 Empl hasis Big Data, 2012 o KDnuggets 2013
What do we call it? • Statistics, 1830- • Data mining, 1980- • Knowledge Discovery in Data (KDD), 1989- • Business Analytics, 1997- • Predictive Analytics, 2002- • Data Analytics,2011- • Data Science, 2011- • Big Data, 2012 - © KDnuggets 2013 10 Same Core Idea: Finding Useful Patterns in Data Different Emphasis
按次数下载不扣除下载券;
注册用户24小时内重复下载只扣除一次;
顺序:VIP每日次数-->可用次数-->下载券;
- 人工神经网络(ANN)方法简介(PPT课件讲稿).ppt
- 清华大学:《数据中心网络 Data Center Networking》课程教学资源(PPT课件讲稿).pptx
- 上饶师范学院:《数据库系统原理 An Introduction to Database System》课程教学资源(PPT课件讲稿,共九章).ppt
- 北京大学:计算智能实验室(PPT讲稿)烟花算法算子分析.pptx
- 《Chemdraw 软件教程》教学资源(PPT讲稿)第一部分 ChemDraw简介.ppt
- 《数据库系统原理》课程PPT教学课件(SQLServer)第7章 Transact-SQL程序设计.ppt
- 清华大学出版社:《计算机导论 Introduction to Computer Science》课程配套教材教学资源(PPT课件讲稿,第3版)第4章 操作系统与网络知识.ppt
- 山东大学:《微机原理及单片机接口技术》课程教学资源(PPT课件讲稿)第三章 计算机系统的组成与工作原理 3.1 理解模型机的结构及工作过程 3.2 掌握单片机的结构.ppt
- 机器翻译研讨会(PPT讲稿)神经机器翻译前沿进展(PPT讲稿).pptx
- 西安电子科技大学:《计算机操作系统》课程PPT教学课件(讲稿)第六章 文件管理.ppt
- 厦门理工学院:《网页设计》培训课件教学资源(PPT课件).ppt
- 《数字图像处理》课程教学资源(PPT课件讲稿)第5章 图像编码与压缩.ppt
- 香港浸会大学:Community Search over Big Graphs:Models, Algorithms, and Opportunities.ppt
- 清华大学出版社:《JAVA程序设计实例教程》课程教材电子教案(PPT课件讲稿,共七章,主编:关忠).ppt
- 香港中文大学:Arm board tutorial Part 1 Using the ARM board And start working with C Tutorial 5 and 6.pptx
- 同济大学:《大数据分析与数据挖掘 Big Data Analysis and Mining》课程教学资源(PPT课件讲稿)Evaluation & other classifiers.pptx
- 面积对象编程(PPT讲稿)Object-Oriented Programming and Classes.ppt
- 《计算机网络概述》教学资源(PPT课件讲稿).ppt
- 《计算机组成原理》课程PPT教学课件(讲稿)第三章 计算机核心部件及其工作原理.ppt
- 《大型机系统管理技术》课程教学资源(PPT课件讲稿)第2章 大型服务器外存管理.ppt
- 《模式识别》课程教学资源(PPT讲稿)Learning with information of features.ppt
- 烟台大学:《C语言程序设计》课程电子教案(PPT课件讲稿)第五章 数组、字符串、指针(主讲:荆蕾).ppt
- 《数据结构》课程教学资源(PPT课件讲稿)第六章 树与二叉树.ppt
- 南京大学:《计算机图形学》课程教学资源(PPT课件讲稿)第6讲 图形观察与几何变换.pptx
- 《高级软件工程》课程教学大纲 Advanced Software Engineering.doc
- 《Android 程序设计基础》课程教学资源(PPT课件讲稿)第8章 数据存储和访问.ppt
- 新乡学院:《PHP动态网站开发》课程教学资源(教学大纲).pdf
- 南京大学:《面向对象技术 OOT》课程教学资源(PPT课件讲稿)构件化软件 Component Software.ppt
- MSC Software Corporation:Dynamic System Modeling, Simulation, and Analysis Using MSC.EASY5(Introductory Class).ppt
- 南京航空航天大学:《C++》课程电子教案(PPT课件讲稿)第2章 文件操作.pptx
- 《Java面向对象程序设计》课程教学资源(PPT课件讲稿)第四章 Java图形用户界面设计 4.3 事件处理.pptx
- 中国科学技术大学:《网络信息安全 NETWORK SECURITY》课程教学资源(PPT课件讲稿)Windows 操作系统.ppt
- 中国科学技术大学:《嵌入式操作系统 Embedded Operating Systems》课程教学资源(PPT课件讲稿)第七讲 存储器管理.ppt
- 华南理工大学:神经计算的生理和动力学指标(PPT讲稿).ppt
- 《编译原理与技术》课程教学资源(PPT课件讲稿)运行环境.ppt
- 同济大学:《大数据分析与数据挖掘 Big Data Analysis and Mining》课程教学资源(PPT课件讲稿)Data Preprocessing.ppt
- 中国科学技术大学:《算法基础》课程教学资源(PPT课件讲稿)第五讲 概率分析与随机算法.pptx
- Robust Networking Architecture and Secure Communication Scheme for Heterogeneous Wireless Sensor Networks.pptx
- 《数据结构》课程教学资源(PPT讲稿)二叉树和二叉搜索树 Trees, Binary Trees, and Binary Search Trees.ppt
- 《网页设计与制作》课程PPT教学课件(Fireworks Mx 2004)第九章 Firework图像处理.ppt