如何解决Scrapy 提取不同列数
我正在尝试使用 scrapy 从这个 HTML 文件中提取值:
<div class="property-secondary-vl__detail">
<div class="row">
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tipe Properti </div>
<div class="property-secondary-vl__detail__value">
Rumah </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tipe Iklan </div>
<div class="property-secondary-vl__detail__value">
Dijual </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tipe Market </div>
<div class="property-secondary-vl__detail__value">
Secondary </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Umur Bangunan </div>
<div class="property-secondary-vl__detail__value">
Lama </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Harga </div>
<div class="property-secondary-vl__detail__value">
Rp1.800.000.000 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Sertifikasi </div>
<div class="property-secondary-vl__detail__value">
SHM </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Kamar Tidur </div>
<div class="property-secondary-vl__detail__value">
3 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Kamar Mandi </div>
<div class="property-secondary-vl__detail__value">
3 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Luas Tanah </div>
<div class="property-secondary-vl__detail__value">
104m² </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Luas Bangunan </div>
<div class="property-secondary-vl__detail__value">
125m² </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Jumlah Lantai </div>
<div class="property-secondary-vl__detail__value">
3 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Carports </div>
<div class="property-secondary-vl__detail__value">
2 </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Daya Listrik </div>
<div class="property-secondary-vl__detail__value">
5500 watt </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Interior </div>
<div class="property-secondary-vl__detail__value">
Semi Furnished </div>
</div>
<div class="property-secondary-vl__detail__column">
<div class="property-secondary-vl__detail__key">
Tahun Dibangun </div>
<div class="property-secondary-vl__detail__value">
2005 </div>
</div>
</div>
</div>
我想要“__detail__key”作为列标题,而“__detail__value”是值本身。
另一个问题是,在这个特定的房产列表网站上,每个列表都有不同数量的项目。
示例: 在此列表 https://www.99.co/id/properti/rumah-dijual-1-8mily-ciputat-timur-520409782 上,“__detail__key”有 15 列,而在此列表 https://www.99.co/id/properti/rumah-dijual-1-65mily-pamulang-410793202 中,有 16 列。
是否可以使用scrapy来做到这一点?
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。