熊猫pd.read_csvs3_path失败,出现“ TypeError:'协程'对象不可下标”

如何解决熊猫pd.read_csvs3_path失败,出现“ TypeError:'协程'对象不可下标”

我正在Amazon EMR Cluster中运行一个spark应用程序,并且从几天前开始,每当尝试使用熊猫从S3读取文件时,都会出现以下错误。我添加了引导操作来安装pandas,fsspec和s3fs。

代码:

import pandas as pd
df = pd.read_csv(s3_path)

错误日志:

Traceback (most recent call last):
  File "spark.py",line 84,in <module>
    df=pd.read_csv('s3://<bucketname>/<filename>.csv')
  File "/usr/local/lib64/python3.7/site-packages/pandas/io/parsers.py",line 686,in read_csv
    return _read(filepath_or_buffer,kwds)
  File "/usr/local/lib64/python3.7/site-packages/pandas/io/parsers.py",line 435,in _read
    filepath_or_buffer,encoding,compression
  File "/usr/local/lib64/python3.7/site-packages/pandas/io/common.py",line 222,in get_filepath_or_buffer
    filepath_or_buffer,mode=mode or "rb",**(storage_options or {})
  File "/usr/local/lib/python3.7/site-packages/fsspec/core.py",line 133,in open
    out = self.__enter__()
  File "/usr/local/lib/python3.7/site-packages/fsspec/core.py",line 101,in __enter__
    f = self.fs.open(self.path,mode=mode)
  File "/usr/local/lib/python3.7/site-packages/fsspec/spec.py",line 844,in open
    **kwargs
  File "/usr/local/lib/python3.7/site-packages/s3fs/core.py",line 394,in _open
    autocommit=autocommit,requester_pays=requester_pays)
  File "/usr/local/lib/python3.7/site-packages/s3fs/core.py",line 1276,in __init__
    cache_type=cache_type)
  File "/usr/local/lib/python3.7/site-packages/fsspec/spec.py",line 1134,in __init__
    self.details = fs.info(path)
  File "/usr/local/lib/python3.7/site-packages/s3fs/core.py",line 719,in info
    return sync(self.loop,self._info,path,bucket,key,kwargs,version_id)
  File "/usr/local/lib/python3.7/site-packages/fsspec/asyn.py",line 51,in sync
    raise exc.with_traceback(tb)
  File "/usr/local/lib/python3.7/site-packages/fsspec/asyn.py",line 35,in f
    result[0] = await future
  File "/usr/local/lib/python3.7/site-packages/s3fs/core.py",line 660,in _info
    Key=key,**version_id_kw(version_id),**self.req_kw)
  File "/usr/local/lib/python3.7/site-packages/s3fs/core.py",line 214,in _call_s3
    raise translate_boto_error(err)
  File "/usr/local/lib/python3.7/site-packages/s3fs/core.py",line 207,in _call_s3
    return await method(**additional_kwargs)
  File "/usr/local/lib/python3.7/site-packages/aiobotocore/client.py",line 121,in _make_api_call
    operation_model,request_dict,request_context)
  File "/usr/local/lib/python3.7/site-packages/aiobotocore/client.py",line 140,in _make_request
    return await self._endpoint.make_request(operation_model,request_dict)
  File "/usr/local/lib/python3.7/site-packages/aiobotocore/endpoint.py",line 90,in _send_request
    exception):
  File "/usr/local/lib/python3.7/site-packages/aiobotocore/endpoint.py",line 199,in _needs_retry
    caught_exception=caught_exception,request_dict=request_dict)
  File "/usr/local/lib/python3.7/site-packages/aiobotocore/hooks.py",line 29,in _emit
    response = handler(**kwargs)
  File "/usr/local/lib/python3.7/site-packages/botocore/utils.py",line 1225,in redirect_from_error
    new_region = self.get_bucket_region(bucket,response)
  File "/usr/local/lib/python3.7/site-packages/botocore/utils.py",line 1283,in get_bucket_region
    headers = response['ResponseMetadata']['HTTPHeaders']
TypeError: 'coroutine' object is not subscriptable
sys:1: RuntimeWarning: coroutine 'AioBaseClient._make_api_call' was never awaited

s3fs可能有问题,因为这似乎是唯一收到更新的软件包,但是我在熊猫的更新日志中找不到与此相关的任何东西吗?

解决方法

Dask/s3fs团队已确认这是一个错误。此Github issue建议aiobotocore无法获得S3存储桶的 region_name

如果您遇到相同的问题,请考虑将s3fs降级为0.4.2,或者尝试将环境变量AWS_DEFAULT_REGION设置为解决方法。

编辑:aiobotocore=1.1.1的最新版本已对其进行了修复。如果您遇到相同的问题,请升级aiobotocore和s3fs。

版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。

相关推荐


依赖报错 idea导入项目后依赖报错,解决方案:https://blog.csdn.net/weixin_42420249/article/details/81191861 依赖版本报错:更换其他版本 无法下载依赖可参考:https://blog.csdn.net/weixin_42628809/a
错误1:代码生成器依赖和mybatis依赖冲突 启动项目时报错如下 2021-12-03 13:33:33.927 ERROR 7228 [ main] o.s.b.d.LoggingFailureAnalysisReporter : *************************** APPL
错误1:gradle项目控制台输出为乱码 # 解决方案:https://blog.csdn.net/weixin_43501566/article/details/112482302 # 在gradle-wrapper.properties 添加以下内容 org.gradle.jvmargs=-Df
错误还原:在查询的过程中,传入的workType为0时,该条件不起作用 &lt;select id=&quot;xxx&quot;&gt; SELECT di.id, di.name, di.work_type, di.updated... &lt;where&gt; &lt;if test=&qu
报错如下,gcc版本太低 ^ server.c:5346:31: 错误:‘struct redisServer’没有名为‘server_cpulist’的成员 redisSetCpuAffinity(server.server_cpulist); ^ server.c: 在函数‘hasActiveC
解决方案1 1、改项目中.idea/workspace.xml配置文件,增加dynamic.classpath参数 2、搜索PropertiesComponent,添加如下 &lt;property name=&quot;dynamic.classpath&quot; value=&quot;tru
删除根组件app.vue中的默认代码后报错:Module Error (from ./node_modules/eslint-loader/index.js): 解决方案:关闭ESlint代码检测,在项目根目录创建vue.config.js,在文件中添加 module.exports = { lin
查看spark默认的python版本 [root@master day27]# pyspark /home/software/spark-2.3.4-bin-hadoop2.7/conf/spark-env.sh: line 2: /usr/local/hadoop/bin/hadoop: No s
使用本地python环境可以成功执行 import pandas as pd import matplotlib.pyplot as plt # 设置字体 plt.rcParams[&#39;font.sans-serif&#39;] = [&#39;SimHei&#39;] # 能正确显示负号 p
错误1:Request method ‘DELETE‘ not supported 错误还原:controller层有一个接口,访问该接口时报错:Request method ‘DELETE‘ not supported 错误原因:没有接收到前端传入的参数,修改为如下 参考 错误2:cannot r
错误1:启动docker镜像时报错:Error response from daemon: driver failed programming external connectivity on endpoint quirky_allen 解决方法:重启docker -&gt; systemctl r
错误1:private field ‘xxx‘ is never assigned 按Altʾnter快捷键,选择第2项 参考:https://blog.csdn.net/shi_hong_fei_hei/article/details/88814070 错误2:启动时报错,不能找到主启动类 #
报错如下,通过源不能下载,最后警告pip需升级版本 Requirement already satisfied: pip in c:\users\ychen\appdata\local\programs\python\python310\lib\site-packages (22.0.4) Coll
错误1:maven打包报错 错误还原:使用maven打包项目时报错如下 [ERROR] Failed to execute goal org.apache.maven.plugins:maven-resources-plugin:3.2.0:resources (default-resources)
错误1:服务调用时报错 服务消费者模块assess通过openFeign调用服务提供者模块hires 如下为服务提供者模块hires的控制层接口 @RestController @RequestMapping(&quot;/hires&quot;) public class FeignControl
错误1:运行项目后报如下错误 解决方案 报错2:Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.8.1:compile (default-compile) on project sb 解决方案:在pom.
参考 错误原因 过滤器或拦截器在生效时,redisTemplate还没有注入 解决方案:在注入容器时就生效 @Component //项目运行时就注入Spring容器 public class RedisBean { @Resource private RedisTemplate&lt;String
使用vite构建项目报错 C:\Users\ychen\work&gt;npm init @vitejs/app @vitejs/create-app is deprecated, use npm init vite instead C:\Users\ychen\AppData\Local\npm-