如何解决类型 uuid 的无效输入语法:PosgreSQL 复制命令中的“”或“null”
我已经从 SELECT 查询创建了 csv-backup,现在尝试将它导入回数据库。但我收到此错误:
COPY doc FROM '/tmp/doc.csv' DELIMITER ',' CSV HEADER;
ERROR: invalid input syntax for type uuid: "null"
如您所见,我的文件中的 "null"
为 NULL。
这发生在之前为空的可选字段上。
我找到了这个解决方案:https://stackoverflow.com/a/40428667/8443131
但它对我不起作用:
COPY doc FROM '/tmp/doc.csv' DELIMITER ',' CSV HEADER QUOTE '"null"' NULL '';
ERROR: COPY quote must be a single one-byte character
我如何导入这个文件?
UPD:我尝试用空引号替换空值。
尝试过的命令:
COPY doc FROM '/tmp/null.csv' DELIMITER ',' CSV HEADER QUOTE '"' NULL '';
ERROR: invalid input syntax for type uuid: ""
文件的简短版本:
"id","removed","modified_at","root_id","parent_id","acl","properties","data","file_meta"
"f6a16ff7-4a31-11eb-be7b-8344edc8f36b","false","2021-01-04 00:00:12.347988","","IS_PUBLIC",""
"2fdd0b8b-4a70-11eb-99fd-ad786a821574","2021-01-04 00:00:06.87298",""
"2c6d5fd1-4a70-11eb-99fd-ad786a821574","2021-01-04 00:00:07.536212",""
"fd645c21-4a6f-11eb-99fd-ad786a821574","2021-01-04 00:00:11.892367",""
"35c1fc53-4a70-11eb-99fd-ad786a821574","2021-01-04 00:00:05.517109",""
"35d165a4-4a70-11eb-99fd-ad786a821574","2021-01-04 00:00:01.72546",""
"fd40806d-4a6f-11eb-99fd-ad786a821574","2021-01-04 00:00:09.173726",""
"30ba4b45-4a70-11eb-99fd-ad786a821574","2021-01-04 00:00:04.655073",""
表创建:
-- Dumped from database version 13.0 (Debian 13.0-1.pgdg100+1)
-- Dumped by pg_dump version 13.0 (Debian 13.0-1.pgdg100+1)
CREATE TABLE public.doc (
id uuid NOT NULL,removed boolean,modified_at timestamp without time zone,root_id uuid,parent_id uuid,acl jsonb,properties jsonb,data jsonb,file_meta jsonb
);
ALTER TABLE ONLY public.doc
ADD CONSTRAINT doc_pkey PRIMARY KEY (id);
ALTER TABLE ONLY public.doc
ADD CONSTRAINT fk_document_entity FOREIGN KEY (id) REFERENCES public.main_table(id);
ALTER TABLE ONLY public.doc
ADD CONSTRAINT fk_document_parent FOREIGN KEY (parent_id) REFERENCES public.doc(id);
解决方法
我使用以下内容复制了您的案例,假设第二列是 boolean
,第三列是 timestamp
create table test (col1 varchar,col2 boolean,col3 timestamp,col4 varchar,col5 varchar,col6 varchar,col7 varchar,col8 varchar,col9 varchar) ;
如果我现在使用
copy test from STDIN delimiter ',' CSV QUOTE '"' NULL 'null';
并传递您提到的字符串
"f6a16ff7-4a31-11eb-be7b-8344edc8f36b","false","2021-01-04 00:00:12.347988","null","IS_PUBLIC","null"
数据解析正确
COPY 1
表格的输出看起来是正确的。
defaultdb=> select * from test;
col1 | col2 | col3 | col4 | col5 | col6 | col7 | col8 | col9
--------------------------------------+------+----------------------------+------+------+-----------+------+------+------
f6a16ff7-4a31-11eb-be7b-8344edc8f36b | f | 2021-01-04 00:00:12.347988 | null | null | IS_PUBLIC | null | null | null
(1 row)
,
您无法使用 COPY
加载此文件,因为 "null"
用双引号引起来,因此不能用作 NULL 占位符 - 它始终被解释为字符串。
您能做的最好的事情是将文件加载到一个表中,其中相应的列定义为 text
,然后执行类似的操作
ALTER TABLE doc ALTER uuidcol TYPE uuid USING CAST(nullif(uuidcol,'null') AS uuid);
,
即使@Abelisto 命令正在工作,我仍然无法上传一些 jsonb 行。
但我也有一个 .json 替代我的文件,如下所示:
[
{
"c0": "f6a16ff7-4a31-11eb-be7b-8344edc8f36b","c1": false,"c2": "2021-01-04 00:00:12.347988","c3": null,"c4": null,"c5": "IS_PUBLIC","c6": null,"c7": null,"c8": null
},...
]
所以我最终编写了这个对我有用的python脚本:
import json
import psycopg2
from datetime import datetime
import uuid
connection = psycopg2.connect(user="admin",password="admin",host="127.0.0.1",port="5432",database="postgres")
cursor = connection.cursor()
def insertLine(line):
id = uuid.UUID(line['c0']).hex
removed = bool(line['c1'])
modified_at = datetime.strptime(line['c2'],'%Y-%m-%d %H:%M:%S.%f')
root_id = uuid.UUID(line['c3']).hex if line['c3'] else None
parent_id = uuid.UUID(line['c4']).hex if line['c4'] else None
acl = json.dumps(line['c5']) if line['c5'] else None
properties = json.dumps(line['c6']) if line['c6'] else None
data = json.dumps(line['c7']) if line['c7'] else None
file_meta = json.dumps(line['c8']) if line['c8'] else None
record_to_insert = (id,removed,modified_at,root_id,parent_id,acl,properties,data,file_meta)
try:
postgres_insert_query = """INSERT INTO doc (id,file_meta) VALUES (%s,%s,%s)"""
cursor.execute(postgres_insert_query,record_to_insert)
connection.commit()
count = cursor.rowcount
except psycopg2.Error as error:
print("ERROR:" + str(error))
file = 'table.json'
with open(file) as json_file:
data = json.load(json_file)
for p in data:
insertLine(p)
if connection:
cursor.close()
connection.close()
print("PostgreSQL connection is closed")
所以我想在 csv 中备份 jsonb 字段是不好的做法。
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。