python如何批量保存图片

在Python中，可以通过使用库如requests、os和PIL（Pillow）来实现批量保存图片。具体步骤包括：下载图片、处理图片、保存图片。使用requests下载图片、PIL进行图像处理、os模块创建目录。首先，定义图片URL列表，然后遍历列表，下载并保存图片。

一、准备工作

在开始批量保存图片之前，需要准备一些必要的工具和库。Python提供了一些强大的库可以帮助我们高效地完成这个任务。

安装所需库

首先，确保已经安装了requests库和PIL库（Pillow）。requests库用于从互联网下载图片，而Pillow则用于处理和保存图片。可以使用以下命令安装这些库：

pip install requests pip install pillow

在脚本开始之前，首先需要定义一个保存图片的目录。可以使用os库来创建这个目录。os库提供了创建目录的功能，可以保证在保存图片时，目录已经存在。

import os
save_dir = 'downloaded_images'
if not os.path.exists(save_dir):
    os.makedirs(save_dir)

二、获取图片URL

在批量下载图片时，首先需要获取图片的URL。这些URL可以来自一个文件、一个API响应，或者直接在脚本中定义一个列表。

从文件读取URL

如果图片的URL存储在一个文本文件中，可以通过读取文件的方式来获取这些URL。

def read_urls_from_file(file_path):
    with open(file_path, 'r') as file:
        urls = file.readlines()
    return [url.strip() for url in urls]

直接定义URL列表

如果URL是已知的，可以直接在代码中定义一个列表。

urls = [ 'http://example.com/image1.jpg', 'http://example.com/image2.jpg', 'http://example.com/image3.jpg' ]

三、下载和保存图片

接下来，使用requests库下载图片，并使用PIL库保存图片。

下载图片

使用requests.get()方法获取图片数据。可以通过遍历URL列表，逐个下载图片。

import requests
def download_image(url):
    try:
        response = requests.get(url)
        response.raise_for_status()  # 检查请求是否成功
        return response.content
    except requests.RequestException as e:
        print(f"Failed to download {url}: {e}")
        return None

保存图片

使用PIL库中的Image.open()方法将图片数据转化为图片对象，并保存到指定目录中。

from PIL import Image
from io import BytesIO
def save_image(image_data, file_name):
    try:
        image = Image.open(BytesIO(image_data))
        image_path = os.path.join(save_dir, file_name)
        image.save(image_path)
    except Exception as e:
        print(f"Failed to save image {file_name}: {e}")

整合下载和保存流程

将下载和保存图片的步骤整合到一个流程中。

def download_and_save_images(urls):
    for url in urls:
        print(f"Downloading {url}")
        image_data = download_image(url)
        if image_data:
            file_name = url.split('/')[-1]
            save_image(image_data, file_name)

四、优化下载流程

在批量下载图片的过程中，可以对流程进行优化，以提高效率和稳定性。

使用多线程

使用Python的多线程可以加快下载速度。在批量下载图片时，使用线程池可以同时下载多个图片，提高效率。

from concurrent.futures import ThreadPoolExecutor
def download_and_save(url):
    image_data = download_image(url)
    if image_data:
        file_name = url.split('/')[-1]
        save_image(image_data, file_name)
def download_images_concurrently(urls, max_workers=5):
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        executor.map(download_and_save, urls)

异常处理

在批量下载过程中，可能会遇到各种问题，如网络不稳定、URL无效等。需要添加异常处理以提高程序的鲁棒性。

def download_image_with_retry(url, retries=3):
    for attempt in range(retries):
        try:
            return download_image(url)
        except Exception as e:
            print(f"Attempt {attempt + 1} failed for {url}: {e}")
    return None

五、验证和测试

在批量下载和保存图片后，验证下载的图片是否完整和正确是非常重要的。

验证图片完整性

在保存图片后，可以通过PIL库检查图片的完整性，以确保图片没有损坏。

def is_image_valid(image_path):
    try:
        with Image.open(image_path) as img:
            img.verify()
        return True
    except Exception:
        return False

测试代码

最后，测试整个流程，确保从下载到保存的每个步骤都正常运行。

if __name__ == "__main__":
    urls = read_urls_from_file('image_urls.txt')
    download_images_concurrently(urls)
    for file_name in os.listdir(save_dir):
        image_path = os.path.join(save_dir, file_name)
        if not is_image_valid(image_path):
            print(f"Image {file_name} is not valid or corrupted.")

通过以上步骤，你可以使用Python批量下载和保存图片。该流程不仅简单易用，还能够通过多线程和异常处理提高效率和稳定性。