如何解决如何从雅虎财务中提取表格?
我正在尝试从Yahoo Finance中提取财务表。我正在使用thimport熊猫作为pd
import requests
from bs4 import BeautifulSoup
url="https://finance.yahoo.com/quote/FB/financials?p=FB"
headers={"User-Agent":"Mozilla/5.0"}
r=requests.get(url,headers=headers)
soup=BeautifulSoup(r.content,"html.parser")
stattable=soup.findAll('div',class_="M(0) Whs(n) BdEnd Bdc($seperatorColor) D(itb)")
stattable=stattable[0]
breakdown=[]
for row in stattable.findAll("div"):
for cell in row.findAll(class_="D(ib) Va(m) Ell Mt(-3px) W(215px)--mv2 W(200px) undefined"):
breakdown.append(cell.text)
正在提取的数据不正确,并且正在复制自身。 这是数据的一小部分:
'Breakdown','ttm','12/31/2019','12/31/2018','12/31/2017','12/31/20
16','Breakdown','12/3
1/2016','
12/31/2016','Total Revenue75,157,00070,697,00055,838,00040,653,000-','
','75,000','70,'55,'40,'','Cost of
Revenue13,935,00012,770,0009,355,0005,454,'13,'12,0
00','9,'5,'Gross Profit61,222,00057,927,00046,4
83,00035,199,'61,'57,'46,483,'35,00
0','Operating Expense33,323,00033,941,00021,570,00014,996,'33,'21,'14,'Operating
Income27,899,00023,986,00024,913,00020,203,'27,'23,'24,'20,'Net Non Operating Interest Incom
e Expense877,000904,000652,000392,'877,'904,'652,'392,'Other Income Expense-286,000-78,000-204,000-1,000-
','-286,'-78,'-204,'-1,'Pretax Income28,490,812,00025,361,594,'28,'2
5,'Tax Provision4,969,0006,327,0003,249,0004,660,'4,'6,'3,'Net Income Common Stockholders23,521,00018,485,00022,111,00015,920,'18,'22,'15,'Average Dilution Earnings-01,'0','1,''
我的目标是将其提取到熊猫数据框中。有人可以帮我吗,谢谢。
解决方法
这是利用yahooquery的解决方案:
from yahooquery import Ticker
fb = Ticker('fb')
fb.income_statement()
披露:我是yahooquery的作者
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。