资源简介
Python-KaggleInstacart市场篮子分析
orders (3.4m rows, 206k users):
order_id: order identifier
user_id: customer identifier
eval_set: which evaluation set this order belongs in (see SET described below)
order_number: the order sequence number for this user (1 = first, n = nth)
order_dow: the day of the week the order was placed on
order_hour_of_day: the hour of the day the order was placed on
days_since_prior: days since the last order, capped at 30 (with NAs for order_number = 1)
products (50k rows):
product_id: product identifier
product_name: name of the product
aisle_id: foreign key
department_id: foreign key
aisles (134 rows):
aisle_id: aisle identifier
aisle: the name of the aisle
deptartments (21 rows):
department_id: department identifier
department: the name of the department
order_products__SET (30m+ rows):
order_id: foreign key
product_id: foreign key
add_to_cart_order: order in which each product was added to cart
reordered: 1 if this product has been ordered by this user in the past, 0 otherwise
where SET is one of the four following evaluation sets (eval_set in orders):
"prior": orders prior to that users most recent order (~3.2m orders)
"train": training data supplied to participants (~131k orders)
"test": test data reserved for machine learning competitions (~75k orders)
orders (3.4m rows, 206k users):
order_id: order identifier
user_id: customer identifier
eval_set: which evaluation set this order belongs in (see SET described below)
order_number: the order sequence number for this user (1 = first, n = nth)
order_dow: the day of the week the order was placed on
order_hour_of_day: the hour of the day the order was placed on
days_since_prior: days since the last order, capped at 30 (with NAs for order_number = 1)
products (50k rows):
product_id: product identifier
product_name: name of the product
aisle_id: foreign key
department_id: foreign key
aisles (134 rows):
aisle_id: aisle identifier
aisle: the name of the aisle
deptartments (21 rows):
department_id: department identifier
department: the name of the department
order_products__SET (30m+ rows):
order_id: foreign key
product_id: foreign key
add_to_cart_order: order in which each product was added to cart
reordered: 1 if this product has been ordered by this user in the past, 0 otherwise
where SET is one of the four following evaluation sets (eval_set in orders):
"prior": orders prior to that users most recent order (~3.2m orders)
"train": training data supplied to participants (~131k orders)
"test": test data reserved for machine learning competitions (~75k orders)
代码片段和文件信息
- 上一篇:爬取百度poi数据.py
- 下一篇:2019NCT_Python_1级测试卷及答案
相关资源
- django图片浏览+scrapy实现数据抓取功能
- Python数据结构.pdf60078
- 葡萄牙银行客户营销数据
- Django+MySql增删改查入门案例(附数据
- scrapy框架爬取58同城数据
- Python爬虫数据分析可视化
- 《大数据数学基础(Python语言描述)
- python 采集京东商品数据
- python实现逻辑回归
- pyqt5窗体数据传输简单
- python数据预处理.ipynb
- python数据类型学习思维导图
- 小说阅读项目源码(附数据库脚本)
- 基于PCA模型的鸢尾花数据可视化
- 以树莓派为基础,连接有毒气体传感
- pycaret数据挖掘实践
- python数据结构
- Python操作Excel表格并将其中部分数据写
- MNIST手写体数字训练/测试数据集(图
- 带书签-数据结构与算法 Python语言描
- 微博用户评论情感分析python代码数据
- python分析国家统计局数据网站本情况
- python数据分析源代码Ivan Idris
- 用python的pyecharts模块绘制世界地图疫
- tensorflow制作自己的灰度图像数据集并
- Python 数据挖掘入门与实践--代码与文
- Python3.x+PyQtChart实现数据可视化界面
- 基于自编写的随机森林算法的adult数据
- django+mysql家具购物网站,包含部署教
- 西电数据挖掘作业——医院数据处理
评论
共有 条评论