How do I remove a specific pattern from a string using awk or Perl?

Answer 1

For a simple substitution, sed is sufficient:

sed -E 's/\[gene=[a-z]{3}[A-Z]\] *//' file

Output:

>lcl|NZ_CP018664.1_gene_628 [locus_tag=AUO97_RS03160] [location=complement(694895..695326)]

To modify the file in place, add the -i option: sed -i ....
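
A minimal sketch of the in-place edit, assuming a sed that accepts a backup suffix attached to -i (GNU and BSD sed both do). The file name headers.fa and the .bak suffix are hypothetical, chosen only for illustration:

```shell
# Work in a scratch directory so no real file is touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical sample file containing the header line from the question.
printf '%s\n' '>lcl|NZ_CP018664.1_gene_628 [gene=mscL] [locus_tag=AUO97_RS03160] [location=complement(694895..695326)]' > headers.fa

# -i.bak rewrites headers.fa in place and keeps the original as headers.fa.bak.
sed -i.bak -E 's/\[gene=[a-z]{3}[A-Z]\] *//' headers.fa

cat headers.fa
```

Keeping a backup makes it easy to compare the two files before deleting the original.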

Answer 2

And with GNU awk, using gensub to delete the second whitespace-separated field:

$ echo '>lcl|NZ_CP018664.1_gene_628 [gene=mscL] [locus_tag=AUO97_RS03160] [location=complement(694895..695326)]'  | awk '{$0=gensub(/\s*\S+/,"",2)}1'
>lcl|NZ_CP018664.1_gene_628 [locus_tag=AUO97_RS03160] [location=complement(694895..695326)]

This can also be done with cut:

$ echo '>lcl|NZ_CP018664.1_gene_628 [gene=mscL] [locus_tag=AUO97_RS03160] [location=complement(694895..695326)]'  | cut -d' ' -f1,3-
>lcl|NZ_CP018664.1_gene_628 [locus_tag=AUO97_RS03160] [location=complement(694895..695326)]
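
Note that the gensub and cut commands remove the second space-separated field by position, whatever it contains, whereas a sed substitution on \[gene=...\] removes only a gene tag. A small sketch of the difference, using a hypothetical header that has no gene tag (the header text is made up for illustration):

```shell
# Hypothetical header with no [gene=...] field (not from the original question).
header='>lcl|NZ_CP018664.1_gene_629 [locus_tag=AUO97_RS03165] [location=1..300]'

# Position-based: drops field 2, which here is the locus_tag.
by_position=$(printf '%s\n' "$header" | cut -d' ' -f1,3-)

# Pattern-based: nothing matches, so the header is left unchanged.
by_pattern=$(printf '%s\n' "$header" | sed -E 's/\[gene=[a-z]{3}[A-Z]\] *//')

printf '%s\n' "$by_position" "$by_pattern"
```

If some headers may lack the gene field, the pattern-based form is the safer choice; if every header has it in second position, the positional forms are fine.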
