Find value from one csv in another one (like vlookup) in bash (Linux)

前端未结

关注

 4  1979

猫巷女王i 2021-01-03 16:38

I have already tried all options that I found online to solve my issue but without good result.

Basically I have two csv files (pipe separated):

file1.

4条回答

刺人心 (楼主)

2021-01-03 17:21

This will work, but since the input files must be sorted, the output order will be affected:

join -t '|' -1 7 -2 1 -o 2.3 <(sort -t '|' -k7,7 file1.csv) <(sort -t '|' -k1,1 file2.csv)

The output would look like:

2200 2200 2400

which is useless. In order to have a useful output, include the key value:

join -t '|' -1 7 -2 1 -o 0,2.3 <(sort -t '|' -k7,7 file1.csv) <(sort -t '|' -k1,1 file2.csv)

The output then looks like this:

CORKCOR|2200 CORKKIN|2200 MAYOBAN|2400

Edit:

Here's an AWK version:

awk -F '|' 'FNR == NR {keys[$7]; next} {if ($1 in keys) print $3}' file1.csv file2.csv

This loops through file1.csv and creates array entries for each value of field 7. Simply referring to an array element creates it (with a null value). FNR is the record number in the current file and NR is the record number across all files. When they're equal, the first file is being processed. The next instruction reads the next record, creating a loop. When FNR == NR is no longer true, the subsequent file(s) are processed.

So file2.csv is now processed and if it has a field 1 that exists in the array, then its field 3 is printed.

0 讨论(0)

查看其它4个回答

发布评论:

提交评论

加载中...

验证码

看不清?

提交回复