How to extract a subset of fields from CSV text lines

Answer 1

You can try the following:

grep -o "^[0-9]*\|,tran.*$" file | sed 'N;s/\n,/,/'

Output:

391,translate_hits=4399,untranslate_hits=4413
431,translate_hits=284903,untranslate_hits=8472
432,translate_hits=0,untranslate_hits=0
436,translate_hits=1966,untranslate_hits=1966
437,translate_hits=84908,untranslate_hits=1965
440,translate_hits=18970,untranslate_hits=18970
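
To make the two stages of that pipeline easier to follow, here is a minimal sketch that runs them separately. The input file and its middle fields (alpha, beta, gamma) are hypothetical, since the original input is not shown, and the `\|` alternation inside a basic regular expression assumes GNU grep.

```shell
# Hypothetical sample input; the middle fields are made up for illustration.
tmp=$(mktemp -d)
cat > "$tmp/file" <<'EOF'
391,alpha,beta,translate_hits=4399,untranslate_hits=4413
432,gamma,translate_hits=0,untranslate_hits=0
EOF

# Stage 1: grep -o prints each match on its own line, so every input line
# yields two output lines: the leading number and the ",tran..." tail.
stage1=$(grep -o "^[0-9]*\|,tran.*$" "$tmp/file")
printf '%s\n' "$stage1"

# Stage 2: sed's N appends the next line to the pattern space, and the
# substitution turns the embedded newline-plus-comma into a single comma.
result=$(printf '%s\n' "$stage1" | sed 'N;s/\n,/,/')
printf '%s\n' "$result"

rm -r "$tmp"
```

Because the sed step joins lines strictly in pairs, this approach relies on every input line producing exactly two matches; a line without a leading number or without a ",tran" tail would shift the pairing.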

Answer 2

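Assuming the file has no fields that contain embedded commas or newlines (that is, it is a "simple CSV file"), you can get the first field and the last two fields of each line like this: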

$ awk -F , 'BEGIN { OFS=FS } { print $1, $(NF-1), $NF }' file.csv
391,translate_hits=4399,untranslate_hits=4413
431,translate_hits=284903,untranslate_hits=8472
432,translate_hits=0,untranslate_hits=0
436,translate_hits=1966,untranslate_hits=1966
437,translate_hits=84908,untranslate_hits=1965
440,translate_hits=18970,untranslate_hits=18970

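NF is a special variable that holds the number of fields on the current line, and we set both the input and output field separators to a comma. In the main block, print outputs only the fields we are interested in.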
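
As a small sketch of why addressing fields relative to NF is useful, the following runs the same awk program on lines with different field counts. The sample file and its middle fields are hypothetical and only serve as an illustration.

```shell
# Hypothetical sample input with a different number of fields per line.
tmp=$(mktemp -d)
cat > "$tmp/file.csv" <<'EOF'
391,alpha,beta,translate_hits=4399,untranslate_hits=4413
432,gamma,translate_hits=0,untranslate_hits=0
EOF

# NF differs per line (5, then 4), yet $(NF-1) and $NF always address
# the last two fields.
counts=$(awk -F , '{ print NF }' "$tmp/file.csv")
printf '%s\n' "$counts"

# OFS=FS makes print join the selected fields with commas again.
result=$(awk -F , 'BEGIN { OFS=FS } { print $1, $(NF-1), $NF }' "$tmp/file.csv")
printf '%s\n' "$result"

rm -r "$tmp"
```

Without OFS=FS, print would join the fields with awk's default output separator, a single space, so the result would no longer be comma-separated.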
