シェルスクリプトを使用して2つの異なるファイルの1対1行を比較する

Question 1

きれいではないかもしれませんが、次のようなものが始まるかもしれません。

# 1. Read lines from file1 as string, and file2 as comma-separated array.
while read -r a && IFS=, read -ra b <&3; do
    # 2. If both empty lines, continue.
    if [[ "$a" == "" && ${#b[@]} == 0 ]]; then
        continue
    fi
    # 3. Start assuming diff.
    diff=1
    # 4. Loop fields in $b.
    for e in ${b[@]}; do
        # Compare field in $b with $a, if match then abort.
        if [[ "$e" == "$a" ]]; then
            diff=0
            break
        fi
    done
    # 5. If no match found, print line from $b.
    if [[ $diff == 1 ]]; then
        # Join array with <space>comma.
        line=$(printf ", %s" "${b[@]}")
        # Print line, excluding leading <space>comma.
        printf "%s\n" "${line:2}"
    fi
# Input argument one as file 1 to stdin, and argument two as file 2 to
# file descriptor 3.
done < "$1" 3<"$2"

通常、次のように使用されます。

$ ./myscript file1 file2

今、Python、Perl、awkなどを使用する方が良いでしょう。

Answer

きれいではないかもしれませんが、次のようなものが始まるかもしれません。

# 1. Read lines from file1 as string, and file2 as comma-separated array.
while read -r a && IFS=, read -ra b <&3; do
    # 2. If both empty lines, continue.
    if [[ "$a" == "" && ${#b[@]} == 0 ]]; then
        continue
    fi
    # 3. Start assuming diff.
    diff=1
    # 4. Loop fields in $b.
    for e in ${b[@]}; do
        # Compare field in $b with $a, if match then abort.
        if [[ "$e" == "$a" ]]; then
            diff=0
            break
        fi
    done
    # 5. If no match found, print line from $b.
    if [[ $diff == 1 ]]; then
        # Join array with <space>comma.
        line=$(printf ", %s" "${b[@]}")
        # Print line, excluding leading <space>comma.
        printf "%s\n" "${line:2}"
    fi
# Input argument one as file 1 to stdin, and argument two as file 2 to
# file descriptor 3.
done < "$1" 3<"$2"

通常、次のように使用されます。

$ ./myscript file1 file2

今、Python、Perl、awkなどを使用する方が良いでしょう。

Question 2

おそらく、このスタックオーバーフローの答えは正しい方向に導くでしょう。

ほとんどの場合、各ファイルの各行循環リストまたは大量に、最初の提案を使用してください。その後、同時に繰り返し、2番目の提案を使用して文字列を比較します。

Answer

おそらく、このスタックオーバーフローの答えは正しい方向に導くでしょう。

ほとんどの場合、各ファイルの各行循環リストまたは大量に、最初の提案を使用してください。その後、同時に繰り返し、2番目の提案を使用して文字列を比較します。

Question 3

努力する：

paste file1 file2 | grep -vP '^(.*)\t.*\1.*'

また、状況に合わせて正規表現を調整することもできます。

Answer

努力する：

paste file1 file2 | grep -vP '^(.*)\t.*\1.*'

また、状況に合わせて正規表現を調整することもできます。

Question 4

GNU awkを使用すると、1行にできます。

awk '{a=$0;getline <File2;if($0 ~ a)print "OK"; else print a,$0}' File1

Answer

GNU awkを使用すると、1行にできます。

awk '{a=$0;getline <File2;if($0 ~ a)print "OK"; else print a,$0}' File1

シェルスクリプトを使用して2つの異なるファイルの1対1行を比較する

答え1

答え2

答え3

答え4

関連情報