python里curl如何使用

在Python中使用cURL的方法有很多种，如使用requests库、pycurl库和subprocess库。其中，requests库是最简单和常见的选择，因为它提供了简洁的API来处理HTTP请求。以下将详细介绍使用requests库在Python中实现cURL的操作。

一、requests库的使用

requests库是一个非常流行的HTTP库，能够方便地进行HTTP请求。它比直接使用cURL命令更加Pythonic，并且易于使用。

1、安装requests库

你需要先安装requests库，可以使用pip命令来安装：

pip install requests

2、基本用法

以下是一个简单的GET请求示例：

import requests
response = requests.get('https://api.github.com')
print(response.status_code)
print(response.text)

详细描述： 在上面的示例中，我们使用requests.get()方法发起了一个GET请求。response对象包含了服务器的响应，response.status_code返回HTTP状态码，response.text返回响应内容。

3、发送POST请求

你可以使用requests.post()方法发送POST请求：

import requests
url = 'https://httpbin.org/post'
data = {'key': 'value'}
response = requests.post(url, data=data)
print(response.status_code)
print(response.json())

二、pycurl库的使用

pycurl是一个基于libcurl的Python库，可以更灵活地处理HTTP请求。尽管它的使用稍微复杂一些，但在需要细粒度控制的情况下非常有用。

1、安装pycurl库

你可以使用pip命令来安装pycurl库：

pip install pycurl

2、基本用法

以下是一个简单的GET请求示例：

import pycurl
from io import BytesIO
buffer = BytesIO()
c = pycurl.Curl()
c.setopt(c.URL, 'https://api.github.com')
c.setopt(c.WRITEDATA, buffer)
c.perform()
c.close()
body = buffer.getvalue()
print(body.decode('utf-8'))

在上面的示例中，我们使用pycurl.Curl()来创建一个Curl对象，然后设置URL和WRITEDATA选项，最后使用perform()方法执行请求。

三、subprocess库的使用

subprocess库允许你直接调用系统命令，因此你可以使用它来执行cURL命令。

1、基本用法

以下是一个使用subprocess库执行cURL命令的示例：

import subprocess
result = subprocess.run(['curl', 'https://api.github.com'], capture_output=True, text=True)
print(result.stdout)

在上面的示例中，我们使用subprocess.run()来执行cURL命令，并捕获输出。

四、requests库的高级用法

requests库不仅能够处理基本的GET和POST请求，还支持其他HTTP方法、文件上传、会话处理和SSL证书验证等高级功能。

1、发送带有参数的GET请求

你可以使用params参数来发送带有查询参数的GET请求：

import requests
url = 'https://httpbin.org/get'
params = {'key1': 'value1', 'key2': 'value2'}
response = requests.get(url, params=params)
print(response.url)
print(response.json())

2、发送带有JSON数据的POST请求

你可以使用json参数来发送带有JSON数据的POST请求：

import requests
url = 'https://httpbin.org/post'
json_data = {'key': 'value'}
response = requests.post(url, json=json_data)
print(response.json())

3、文件上传

你可以使用files参数来上传文件：

import requests
url = 'https://httpbin.org/post'
files = {'file': open('report.txt', 'rb')}
response = requests.post(url, files=files)
print(response.json())

4、会话处理

使用requests.Session()可以在多个请求之间保持会话：

import requests
session = requests.Session()
session.get('https://httpbin.org/cookies/set/sessioncookie/123456789')
response = session.get('https://httpbin.org/cookies')
print(response.json())

5、SSL证书验证

默认情况下，requests库会验证SSL证书。你可以使用verify参数来控制这一行为：

import requests
response = requests.get('https://api.github.com', verify=True)
print(response.status_code)

五、pycurl库的高级用法

pycurl库提供了更加灵活和细粒度的控制，适用于复杂的HTTP请求场景。

1、设置HTTP头

你可以使用HTTPHEADER选项来设置自定义HTTP头：

import pycurl
from io import BytesIO
buffer = BytesIO()
c = pycurl.Curl()
c.setopt(c.URL, 'https://httpbin.org/headers')
c.setopt(c.HTTPHEADER, ['User-Agent: pycurl'])
c.setopt(c.WRITEDATA, buffer)
c.perform()
c.close()
body = buffer.getvalue()
print(body.decode('utf-8'))

2、发送POST请求

你可以使用POSTFIELDS选项来发送POST请求：

import pycurl
from io import BytesIO
buffer = BytesIO()
c = pycurl.Curl()
c.setopt(c.URL, 'https://httpbin.org/post')
c.setopt(c.POSTFIELDS, 'key=value')
c.setopt(c.WRITEDATA, buffer)
c.perform()
c.close()
body = buffer.getvalue()
print(body.decode('utf-8'))

3、文件上传

你可以使用HTTPPOST选项来上传文件：

import pycurl
from io import BytesIO
buffer = BytesIO()
c = pycurl.Curl()
c.setopt(c.URL, 'https://httpbin.org/post')
c.setopt(c.HTTPPOST, [('file', (c.FORM_FILE, 'report.txt'))])
c.setopt(c.WRITEDATA, buffer)
c.perform()
c.close()
body = buffer.getvalue()
print(body.decode('utf-8'))

六、subprocess库的高级用法

subprocess库提供了更多的选项来控制子进程的执行和输出。

1、捕获标准错误输出

你可以使用stderr参数来捕获标准错误输出：

import subprocess
result = subprocess.run(['curl', 'https://api.github.com'], capture_output=True, text=True)
print(result.stdout)
print(result.stderr)

2、传递输入数据

你可以使用input参数来传递输入数据：

import subprocess
result = subprocess.run(['curl', '-d', 'key=value', 'https://httpbin.org/post'], capture_output=True, text=True)
print(result.stdout)

3、设置超时

你可以使用timeout参数来设置超时：

import subprocess
try:
    result = subprocess.run(['curl', 'https://api.github.com'], capture_output=True, text=True, timeout=5)
    print(result.stdout)
except subprocess.TimeoutExpired:
    print('Request timed out')

七、错误处理和调试

无论你使用哪种方法，都需要处理可能出现的错误和进行调试。

1、requests库的错误处理

requests库提供了一些异常类来处理错误：

import requests
try:
    response = requests.get('https://api.github.com')
    response.raise_for_status()
except requests.exceptions.HTTPError as errh:
    print("HTTP Error:", errh)
except requests.exceptions.ConnectionError as errc:
    print("Error Connecting:", errc)
except requests.exceptions.Timeout as errt:
    print("Timeout Error:", errt)
except requests.exceptions.RequestException as err:
    print("OOps: Something Else", err)

2、pycurl库的错误处理

pycurl库提供了一些异常类来处理错误：

import pycurl
from io import BytesIO
buffer = BytesIO()
c = pycurl.Curl()
c.setopt(c.URL, 'https://api.github.com')
c.setopt(c.WRITEDATA, buffer)
try:
    c.perform()
except pycurl.error as e:
    errno, errstr = e.args
    print('An error occurred:', errstr)
finally:
    c.close()
body = buffer.getvalue()
print(body.decode('utf-8'))

3、subprocess库的错误处理

subprocess库提供了一些异常类来处理错误：

import subprocess
try:
    result = subprocess.run(['curl', 'https://api.github.com'], capture_output=True, text=True, check=True)
    print(result.stdout)
except subprocess.CalledProcessError as e:
    print('An error occurred:', e)

八、总结

在Python中使用cURL有多种方法，包括requests库、pycurl库和subprocess库。requests库是最常见和简单的选择，适用于大多数HTTP请求场景。pycurl库提供了更灵活和细粒度的控制，适用于复杂的HTTP请求。subprocess库允许你直接调用系统命令，适用于需要执行cURL命令的场景。

无论你选择哪种方法，都需要注意错误处理和调试，以确保程序的健壮性和可靠性。