怎么用python爬取div内容

lewis 9年前 (2017-09-05) 阅读数 20 #程序编程

文章标签 python

要使用Python爬取div内容，可以使用第三方库如BeautifulSoup或者Scrapy来实现。下面是使用BeautifulSoup的一个例子：

import requests
from bs4 import BeautifulSoup

# 发起请求获取网页内容
url = "http://example.com"
response = requests.get(url)
html_content = response.content

# 创建BeautifulSoup对象解析网页内容
soup = BeautifulSoup(html_content, "html.parser")

# 通过标签和属性定位到需要爬取的div元素
div_element = soup.find("div", class_="content")

# 提取div内容
div_content = div_element.text

# 打印结果
print(div_content)

这个例子中，首先使用requests库发起GET请求获取网页内容。然后使用BeautifulSoup库将网页内容解析成BeautifulSoup对象soup。通过find方法定位到需要爬取的div元素，可以使用标签名和属性来定位。最后通过text属性获取div内容。