BeautifulSoup怎么从网页中抓取数据

lewis 2024-04-17 26次阅读

使用BeautifulSoup从网页中抓取数据的步骤如下：

from bs4 import BeautifulSoup
import requests

url = 'https://example.com'
response = requests.get(url)

soup = BeautifulSoup(response.text, 'html.parser')

# 找到所有的标题
titles = soup.find_all('h2')

# 找到所有的链接
links = soup.find_all('a')

# 找到特定class的元素
specific_class = soup.find_all(class_='specific-class')

for title in titles:
    print(title.text)

for link in links:
    print(link['href'])

for element in specific_class:
    print(element.text)

通过以上步骤，您可以使用BeautifulSoup从网页中抓取数据并提取出需要的内容。

◎欢迎参与讨论，请在这里发表您的看法、交流您的观点。