How to map the contents of one CSV file onto a second CSV file and write the result to another CSV using Unix
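After writing a few Unix scripts I managed to get the data out of several XML files in CSV format, but now I am stuck on the following problem.
file1.csv contains:
1,5,6,7,8
2,3,4,9
1,10,11,12
1,12
file2.csv contains: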
1,Mango,Tuna,Webby,Through,Franky,Sam,Sumo
2,Franky
3,Sam
4,Sumo
5,Webby
6,Through
7,Sumo
8,Nothing
9,Sumo
10,Sumo,Tuna
11,Through
12,Franky
The output I want is:
1,8
Mango,Sumo
Mango,Webby
Tuna,Through
Through,Sumo
Nothing
Common word:None
2,9
Franky
Sam
Sumo
Mango,Webby
Sam,Sumo
Common Word:None
1,12
Mango,Sumo
Tuna,Through
Sumo,Tuna
Mango,Through
Mango,Franky
Common word: Tuna
1,Webby
Mango,Franky
Common word: Mango,Webby
I appreciate your help.
Thanks
I have a partial solution, but it is not complete yet:
#!/bin/bash
count=1
for i in `cat file1.csv`
do
    # Write one ID per row straight into $count.txt. Saving the line to
    # $count.txt first and then redirecting tr back into the same file
    # would truncate it before it is read.
    echo "$i" | tr "," "\n" > "$count.txt"
    count=`expr $count + 1`
done
# This code creates a separate file (1.txt, 2.txt, ...) for each line in file1.csv
bash file3_search.sh
##########################
file3_search.sh
================
#!/bin/bash
# Drop empty lines and trailing spaces from file2.csv
sed -e '/^$/d' -e 's/[ ]*$//' file2.csv > trim.txt
# Strip Windows line endings from the per-line ID files
dos2unix -q 1.txt 2.txt 3.txt
echo "1st Combination results"
for i in `cat 1.txt`
do
    grep -Ew "$i" trim.txt
done > Combination1.txt
echo "2nd Combination results"
for i in `cat 2.txt`
do
    grep -Ew "$i" trim.txt
done > Combination2.txt
echo "3rd Combination results"
for i in `cat 3.txt`
do
    grep -Ew "$i" trim.txt
done > Combination3.txt
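Guys, I am not good at programming (I am a software tester). Could someone please refactor my code, and also tell me how to get the common words from these Combination.txt files?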
Solution
IMHO, this works:
for line in $(cat file1.csv) ; do
    echo "$line"
    # Turn "1,5,6" into the anchored regex "^(1,|5,|6,)" so that only the leading ID column can match
    grepline=$(echo "$line" | sed 's/ \+//g;s/,/,|/g;s/^\(.*\)$/^(\1,)/')
    grep -E "$grepline" file2.csv
    grep -E "$grepline" file2.csv | \
    awk -F "," '
    { for (i=2;i<=NF;i++)
        {s[$i]+=1}                  # count the rows in which each word occurs
    }
    END { for (key in s)
            {if (s[key]==NR) { tp = tp key "," }   # string concatenation; += would add numbers
            }
          if (tp!="") { sub(/,$/,"",tp); print "Common word(s): " tp }
          else {print "Common word: None"}}'
    echo
done
HTH
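To make the pattern-building step in the loop above easier to follow, here is a small sketch that isolates it. The helper name `build_regex` is hypothetical and not part of the answer; it applies the same three substitutions, with the space removal written as a plain `s/ //g` (with the `g` flag this removes every space, just like the GNU-specific ` \+` form).

```shell
#!/bin/bash
# build_regex is a hypothetical helper name used only for illustration.
# It turns a comma-separated ID list into an anchored alternation for grep -E.
build_regex() {
    # 1) remove spaces, 2) put "|" after every comma, 3) wrap the result as ^( ... ,)
    printf '%s\n' "$1" | sed 's/ //g;s/,/,|/g;s/^\(.*\)$/^(\1,)/'
}

build_regex "1,5,6,7,8"    # prints ^(1,|5,|6,|7,|8,)
```

The leading `^` and the comma after each ID are what keep `1` from also matching rows whose ID is `10`, `11` or `12`.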
Here is an answer. It relies on the associative array feature of bash version 4:
IFS=,
declare -a words
# read and store the words in file2, indexed by the ID in the first column
while read -r line; do
    set -- $line
    n=$1
    shift
    words[$n]="$*"      # "$*" joins the remaining fields with the first character of IFS (a comma)
done < file2.csv
# read file1 and process
while read -r line; do
    echo "$line"
    set -- $line
    indexes=( "$@" )
    NF=${#indexes[@]}
    declare -A common
    for (( i=0; i<NF; i++ )); do
        echo "${words[${indexes[$i]}]}"
        set -- ${words[${indexes[$i]}]}
        for word; do
            common[$word]=$(( ${common[$word]:-0} + 1 ))
        done
    done
    printf "Common words: "
    n=0
    # a word is common when it was counted once for every ID on the line
    for word in "${!common[@]}"; do
        if [[ ${common[$word]} -eq $NF ]]; then
            printf "%s " "$word"
            (( n++ ))
        fi
    done
    [[ $n -eq 0 ]] && printf "None"
    unset common
    printf "\n\n"
done < file1.csv
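For comparison, the same lookup can probably be done in a single awk pass, which avoids the bash 4 requirement. The following is only a sketch under some assumptions, not what either answer wrote: the function name `map_common` is hypothetical, the files are assumed to have Unix line endings and no blank lines, the first column of file2.csv is assumed to be a unique ID, and a word repeated within one row is counted once.

```shell
#!/bin/sh
# map_common is a hypothetical name; call it as: map_common file1.csv file2.csv
map_common() {
    awk -F, '
    NR == FNR {                      # first file read (file2.csv): remember each row by its ID
        row = $0
        sub(/^[^,]*,?/, "", row)     # strip the leading ID column
        words[$1] = row
        next
    }
    {                                # second file read (file1.csv): one ID list per line
        print $0
        split("", seen_in)           # portable way to empty an array
        for (i = 1; i <= NF; i++) {
            print words[$i]
            n = split(words[$i], w, ",")
            split("", dup)
            for (j = 1; j <= n; j++)
                if (!(w[j] in dup)) { dup[w[j]] = 1; seen_in[w[j]]++ }
        }
        # a common word must occur in the first referenced row, so scan only that row
        common = ""
        n = split(words[$1], w, ",")
        for (j = 1; j <= n; j++)
            if (seen_in[w[j]] == NF && index("," common ",", "," w[j] ",") == 0)
                common = (common == "" ? w[j] : common "," w[j])
        print "Common word: " (common == "" ? "None" : common)
        print ""
    }' "$2" "$1"
}
```

Running `map_common file1.csv file2.csv > result.csv` would write the mapped rows and the common-word line for each ID list into a third file. Common words are listed in the order in which they appear in the first referenced row.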