一、网页分析
打开谷歌翻译链接:https://translate.google.com/
按F12,点击network。在左侧输入"who are you"
可以看到,请求的链接为:
代码语言:javascript复制https://translate.google.com/_/TranslateWebserverUi/data/batchexecute?rpcids=MkEWBc&f.sid=-2609060161424095358&bl=boq_translate-webserver_20201203.07_p0&hl=zh-CN&soc-app=1&soc-platform=1&soc-device=1&_reqid=359373&rt=c
发送的数据为:
这里面的who are you表示,需要翻译的文字
ja 表示日本的简称。
二、代码演示
代码语言:javascript复制# !/usr/bin/python3
# -*- coding: utf-8 -*-
import requests
import re
def translated_content(text, target_language):
headers = {
"accept": "text/html,application/xhtml xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.9",
# "accept-language": "en,zh-CN;q=0.9,zh;q=0.8",
"content-type": "application/x-www-form-urlencoded;charset=UTF-8",
"user-agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/87.0.4280.66 Safari/537.36"
}
# 请求url
url = "https://translate.google.com/_/TranslateWebserverUi/data/batchexecute?rpcids=MkEWBc&f.sid=-2609060161424095358&bl=boq_translate-webserver_20201203.07_p0&hl=zh-CN&soc-app=1&soc-platform=1&soc-device=1&_reqid=359373&rt=c"
# 数据参数
from_data = {
"f.req": r"""[[["MkEWBc","[["{}","auto","{}",true],[null]]",null,"generic"]]]""".format(text, target_language)
}
try:
r = requests.post(url, headers=headers, data=from_data, timeout=60)
if r.status_code == 200:
# 正则匹配结果
response = re.findall(r',[[\"(.*?)\",[\', r.text)
if response:
response = response[0]
else:
response = re.findall(r',[[\"(.*?)\"]', r.text)
if response:
response = response[0]
return response
except Exception as e:
print(e)
return False
# 翻译各个国家语言
for i in ['en', 'zh', 'fr', 'ja', 'de']:
response = translated_content("who are you", i)
print(response)
执行输出:
代码语言:javascript复制who are you
你是谁
qui êtes vous
あなたは誰
Wer bist du