基于Python的二手车市场的数据分析
基于Python的二手车市场的数据分析(任务书,开题报告,外文翻译,论文19000字,参考代码)
摘要
本文借助面向对象解释类型的计算机程序设计语言python对二手车市场的数据进行分析、建模、预测,首先针对某拍车网站上二手车拍卖成交的相关记录进行数据抓取,该过程主要利用了python中的第三方库requests库、pandas库;其次对可能对拍卖价格产生影响的因素进行提取并对提取出的数据进行分析清洗,使之成为后续建模需要的数据类型;接着根据数据特征和问题类型选择合适的算法模型,选择合适的参数优化训练模型,并对用随机森林随机梯度下降回归和随机梯度下降回归两种情况下的预测模型进行了比较,所得结果对于二手车拍卖价格的预测具有重要的指导意义。研究结果表明随机森林模型对本数据集能很好的解释,可用于对类似数据集的预测分析以及对各特征重要度的讨论。本文的特色在于提出一种新的预测拍卖二手车价格的预测模型,在某种程度上对于想要出售自己二手车的消费者来说也是一种可以用来对自己爱车进行估值的方法,同时利用本文的随机森林算法模型可以作为一种新型的对二手车保值率的评判标准。
关键词:二手车;网上拍卖价格;随机森林;随机梯度下降回归 [来源:http://Doc163.com]
Abstract
In this paper, the data of the second-hand car market is analyzed, modeled and predicted with the aid of the computer programming language python of the object oriented interpretation type. First, the data is captured for the related records of the second-hand car auction on the car site. The process mainly uses the third party library requests library and the pandas Library in the python; secondly, it is used in this process. The factors that may affect the price of the auction are extracted and analyzed and cleaned to make it a data type needed for subsequent modeling. Then, the appropriate algorithm model is selected according to the characteristics of the data and the type of the problem, and the appropriate parameter optimization training model is selected, and the random forest random gradient is used. The results are of great guiding significance for the prediction of the second-hand car auction price in two cases.
This paper mainly studies the prediction model of second-hand car price and the factors that affect the auction price.
[来源:http://www.doc163.com]
The results show that the random forest model can explain the data set well, and can be used to predict and analyze similar data sets.
The feature of this paper is to propose a new forecast model for the forecast of second-hand car price, and to some extent, it is also a method for the consumers who want to sell their used cars. At the same time, the model of the random forest algorithm in this paper can be used as a new type of second-hand car. The evaluation criterion of the rate of value preservation.
Key Words:Used car;Online auction price;Random forest;Random gradient descent regression
[来源:http://www.doc163.com]
目录
第1章绪论 1
1.1 研究背景及意义 1
1.1.1 研究背景 1
1.1.2 研究意义 2
1.2研究现状 3
1.2.1二手车价值评估国内外研究现状 3
1.2.2网上拍卖成交价格影响及价格预测的国内外研究现状 3
1.3研究框架 4
1.3.1研究内容 4
1.3.2技术路线图 5
第2章数据分析 6
2.1二手车拍卖价格影响因素分析 6
2.2二手车拍卖数据采集 8
2.2.1使用工具 8
2.2.2算法设计 8
2.2.3爬取结果展示 8
2.3二手车拍卖数据处理 10
2.3.1数据分类 10
2.3.2数据清洗 10
2.3.3数据可视化 14
2.3.4数值分析 17
2.4本章小结 20
第3章预测模型研究设计 21
3.1 系统分析 21
3.1.1基本训练模型方案 21
3.1.2模型效果的测量指标 21
3.1.3开发环境 21
3.2 模型介绍 22
3.2.1随机森林模型 22
3.2.2随机梯度递减回归模型 22
3.3模型训练与评价 23
3.3.1模型训练 23
3.3.2评价预测模型 23
3.4结果分析 25
3.5本章总结 29 [资料来源:http://Doc163.com]
第4章绪论 30
4.1总结 30
4.2 展望 30
参考文献 32
致谢 33 [资料来源:Doc163.com]