An Introduction to WEKA
Contributed by Yizhou Sun, 2008

Content
  What is WEKA?
  The Explorer:
    Preprocess data
    Classification
    Clustering
    Association Rules
    Attribute Selection
    Data Visualization
  References and Resources

What is WEKA?
  WEKA stands for Waikato Environment for Knowledge Analysis.
  It is a data mining/machine learning tool developed by the Department of Computer Science, University of Waikato, New Zealand.
  The weka is also a bird found only on the islands of New Zealand.

Download and Install WEKA
  Website: http://www.cs.waikato.ac.nz/~ml/weka/index.html
  Supports multiple platforms (written in Java): Windows, Mac OS X and Linux

Main Features
  49 data preprocessing tools
  76 classification/regression algorithms
  8 clustering algorithms
  3 algorithms for finding association rules
  15 attribute/subset evaluators + 10 search algorithms for feature selection

Main GUI
  Three graphical user interfaces:
    “The Explorer” (exploratory data analysis)
    “The Experimenter” (experimental environment)
    “The KnowledgeFlow” (new process-model-inspired interface)

Explorer: pre-processing the data
  Data can be imported from a file in various formats: ARFF, CSV, C4.5, binary
  Data can also be read from a URL or from an SQL database (using JDBC)
  Pre-processing tools in WEKA are called “filters”
  WEKA contains filters for: discretization, normalization, resampling, attribute selection, transforming and combining attributes, …
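
To make two of the filter types listed above concrete, here is a minimal sketch in plain Python of what a normalization filter and an equal-width discretization filter compute. It is not WEKA's implementation or API: the function names `normalize` and `discretize_equal_width` are hypothetical, and the choice of min-max scaling and equal-width bins is an assumption made for illustration.

```python
def normalize(values):
    """Rescale numeric values linearly to the range [0, 1] (min-max scaling)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no range to scale over.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def discretize_equal_width(values, bins):
    """Map each numeric value to the index of one of `bins` equal-width intervals."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0 for _ in values]
    width = (hi - lo) / bins
    # The maximum value would fall just past the last interval, so clamp it.
    return [min(int((v - lo) / width), bins - 1) for v in values]


# Example input: the four age values from the heart-disease sample data.
ages = [63, 67, 67, 38]
print(normalize(ages))
print(discretize_equal_width(ages, 3))
```

In both functions the input is a single numeric attribute (one column). Applying them to a whole dataset would mean running them once per numeric attribute.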

WEKA only deals with “flat” files
  Flat file in ARFF format:
    @relation heart-disease-simplified
    @attribute age numeric
    @attribute sex { female, male}
    @attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina}
    @attribute cholesterol numeric
    @attribute exercise_induced_angina { no, yes}
    @attribute class { present, not_present}
    @data
    63,male,typ_angina,233,no,not_present
    67,male,asympt,286,yes,present
    67,male,asympt,229,yes,present
    38,female,non_anginal,?,no,not_present
    ...
  In this file, an attribute declared as numeric (such as age) is a numeric attribute, and an attribute declared with a set of values in braces (such as sex) is a nominal attribute.
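
The ARFF layout above is simple enough to read with a few lines of code. The following Python sketch parses only the subset of ARFF that the example uses: `@relation`, numeric and nominal `@attribute` lines, and comma-separated `@data` rows in which `?` marks a missing value. The function name `parse_arff` is hypothetical. WEKA itself loads ARFF through its own Java classes, which this sketch does not reproduce.

```python
def parse_arff(text):
    """Parse a small ARFF document into (relation, attributes, rows)."""
    relation = None
    attributes = []   # list of (name, "numeric") or (name, [nominal labels])
    rows = []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):   # "%" starts an ARFF comment
            continue
        lower = line.lower()
        if in_data:
            cells = [c.strip() for c in line.split(",")]
            row = []
            for (name, kind), cell in zip(attributes, cells):
                if cell == "?":
                    row.append(None)           # missing value
                elif kind == "numeric":
                    row.append(float(cell))
                else:
                    row.append(cell)           # nominal label kept as text
            rows.append(row)
        elif lower.startswith("@relation"):
            relation = line.split(None, 1)[1]
        elif lower.startswith("@attribute"):
            _, name, spec = line.split(None, 2)
            if spec.startswith("{"):
                labels = [s.strip() for s in spec.strip("{}").split(",")]
                attributes.append((name, labels))
            else:
                attributes.append((name, spec.lower()))
        elif lower.startswith("@data"):
            in_data = True
    return relation, attributes, rows


sample = """\
@relation heart-disease-simplified
@attribute age numeric
@attribute sex { female, male}
@attribute chest_pain_type { typ_angina, asympt, non_anginal, atyp_angina}
@attribute cholesterol numeric
@attribute exercise_induced_angina { no, yes}
@attribute class { present, not_present}
@data
63,male,typ_angina,233,no,not_present
67,male,asympt,286,yes,present
67,male,asympt,229,yes,present
38,female,non_anginal,?,no,not_present
"""

relation, attributes, rows = parse_arff(sample)
print(relation)
print(attributes[1])
print(rows[3])
```

Each row comes back as a flat list with one cell per attribute, which is what “flat” means here: one table, one instance per line, no nested or linked records.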