Sklean数据集(5)-波士顿房价

2021-01-04 11:32:51 浏览数 (1)

顾老师新书《全栈软件测试工程师宝典》

https://item.m.jd.com/product/10023427978355.html

以前两本书的网上购买地址:

《软件测试技术实战设计、工具及管理》:

https://item.jd.com/34295655089.html

《基于Django的电子商务网站》:

https://item.jd.com/12082665.html

原文

Boston house prices dataset

**Data Set Characteristics:**

Number of Instances: 506

Number of Attributes: 13 numeric/categorical predictive. MedianValue (attribute 14) is usually the target.

Attribute Information (in order):

  • CRIM per capita crime rateby town
  • ZN proportion ofresidential land zoned for lots over 25,000 sq.ft.
  • INDUS proportion ofnon-retail business acres per town
  • CHAS Charles River dummyvariable (= 1 if tract bounds river; 0 otherwise)
  • NOX nitric oxidesconcentration (parts per 10 million)
  • RM average number ofrooms per dwelling
  • AGE proportion ofowner-occupied units built prior to 1940
  • DIS weighted distances tofive Boston employment centres
  • RAD index ofaccessibility to radial highways
  • TAX full-valueproperty-tax rate per $10,000
  • PTRATIO pupil-teacher ratioby town
  • B 1000(Bk - 0.63)^2where Bk is the proportion of blacks by town
  • LSTAT % lower status of thepopulation
  • MEDV Median value ofowner-occupied homes in $1000's

Missing Attribute Values: None

Creator: Harrison, D. and Rubinfeld, D.L.

This is a copy of UCI ML housing dataset.

https://archive.ics.uci.edu/ml/machine-learning-databases/housing/

This dataset was taken from the StatLib library which is maintainedat Carnegie Mellon University.The Boston house-price data of Harrison, D. andRubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ.Economics & Management,vol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch,'Regression diagnostics...', Wiley, 1980. N.B. Various transformations are used in the table on pages 244-261 ofthe latter.

The Boston house-price data has been used in many machine learningpapers that address regressionproblems.

topic:: References

  • Belsley, Kuh & Welsch, 'Regression diagnostics: IdentifyingInfluential Data and Sources of Collinearity', Wiley, 1980. 244-261.
  • Quinlan,R. (1993). Combining Instance-Based and Model-BasedLearning. In Proceedings on the Tenth International Conference of MachineLearning, 236-243, University of Massachusetts, Amherst. Morgan Kaufmann.

译文

波士顿房价数据集

**数据集特征:**

实例数:506

属性数:13数值/分类预测。中值(属性14)通常是目标。

属性信息(按顺序):

  • CRIM 按城镇划分的人均犯罪率
  • ZN 2.5万宗地住宅用地锌比例平方英尺.
  • INDUS 每个城镇的非零售商业面积所占比例
  • CHAS Charles-River虚拟变量(=1,如果是河流边界,则为0;否则为0)
  • NOX 氮氧化物浓度(百万分之几)
  • RM 每间住宅的平均房间数
  • AGE 1940年以前建造的自住单位的年龄比例
  • DIS 离波士顿五个就业中心的距离
  • RAD 辐射状公路可达性指数
  • TAX 每万元房产税完税率
  • PTRATIO按城镇划分的师生比例
  • B 1000(Bk-0.63)^2,其中Bk是按城镇划分的黑人比例
  • LSTAT 人口地位降低%
  • MEDV 业主自住房屋的MEDV中值(1000美元)

缺少属性值:无

创作者:哈里森D.和鲁宾菲尔德D.L。

这是UCI ML住房数据集的副本。

https://archive.ics.uci.edu/ml/machine-learning-databases/housing/

这个数据集取自卡内基梅隆大学的StatLib图书馆大学。那个Harrison,D.和Rubinfeld,D.L.的波士顿房价数据,“享乐价格和对清洁空气的需求”,J.Environ。《经济学与管理》,第5卷,81-1021978年。用于Belsley,Kuh&Welsch,“回归诊断…”,Wiley,1980年。N、 B.后者第244-261页的表格中使用了各种变换。

波士顿房价数据已被用于许多机器学习论文,以解决回归问题。

主题::参考文献

  • Belsley,Kuh&Welsch,“回归诊断:识别共线性的影响数据和来源”,Wiley,1980年。244-261。

昆兰,R.(1993年)。结合实例学习和基于模型的学习。第十届机器学习国际会议论文集,236-243,马萨诸塞大学,阿默斯特。摩根考夫曼。

—————————————————————————————————

顾老师课程欢迎报名

软件安全测试

https://study.163.com/course/courseMain.htm?courseId=1209779852&share=2&shareId=480000002205486

接口自动化测试

https://study.163.com/course/courseMain.htm?courseId=1209794815&share=2&shareId=480000002205486

DevOps 和Jenkins之DevOps

https://study.163.com/course/courseMain.htm?courseId=1209817844&share=2&shareId=480000002205486

DevOps与Jenkins 2.0之Jenkins

https://study.163.com/course/courseMain.htm?courseId=1209819843&share=2&shareId=480000002205486

Selenium自动化测试

https://study.163.com/course/courseMain.htm?courseId=1209835807&share=2&shareId=480000002205486

性能测试第1季:性能测试基础知识

https://study.163.com/course/courseMain.htm?courseId=1209852815&share=2&shareId=480000002205486

性能测试第2季:LoadRunner12使用

https://study.163.com/course/courseMain.htm?courseId=1209980013&share=2&shareId=480000002205486

性能测试第3季:JMeter工具使用

https://study.163.com/course/courseMain.htm?courseId=1209903814&share=2&shareId=480000002205486

性能测试第4季:监控与调优

https://study.163.com/course/courseMain.htm?courseId=1209959801&share=2&shareId=480000002205486

Django入门

https://study.163.com/course/courseMain.htm?courseId=1210020806&share=2&shareId=480000002205486

啄木鸟顾老师漫谈软件测试

https://study.163.com/course/courseMain.htm?courseId=1209958326&share=2&shareId=480000002205486

0 人点赞