python采集起点小说并保存到txt文件

大小: 1.76KB

文件类型: .py

金币: 1

下载: 0 次

发布日期: 2021-01-30
语言: Python
标签: 爬取小说

高速下载

资源简介

Python爬取小说网站

资源截图

小图大图

代码片段和文件信息

import requests
import pymysql
from lxml import etree
import os

# 设计模式 -- 面向对象 继承、封装
class Spider（object）:

    def start_request（self）:
        # 1. 请求网站拿到HTML源代码，抽取小说名、小说链接 创建文件夹
        response = requests.get（“https://www.qidian.com/all“）
        html = etree.HTML（response.text）   # lxml 中的 etree 来解析 HTML
        Bigtit_list = html.xpath（‘//div[@class=“book-mid-info“]/h4/a/text（）‘）
        Bigsrc_list = html.xpath（‘//div[@class=“book-mid-info“]/h4/a/@href‘）
        for Bigtit Bigsrc in zip（Bigtit_list Bigsrc_list）:
            if os.path.exists（Bigtit） == False:
                os.mkdir（Bigtit）
            self.file_data（Bigtit Bigsrc）

    def file_data（self Bigtit Bigsrc）:
        # 2. 请求小说拿到HTML源代码，抽取章名、章链接
        response = requests.get（“http:“ + Bigsrc）

上一篇：《Django框架（django_2.0.1）手册》pdf
下一篇：Python采集尤图网美女图片

共有条评论

python采集起点小说 并保存到txt文件

资源简介

资源截图

代码片段和文件信息

评论

相关资源

python采集起点小说并保存到txt文件