sedを使用した文字の削除

Question 1

現在のロケールがすでにUTF-8を文字セットとして使用し、ファイルがその文字セットを使用して作成されている場合：

<file LC_ALL=C sed 's/[^ -~]//g'

または、AIX sed に制御文字を含めるには、次のようにします。

<file LC_ALL=C sed "$(printf "s/[^[:print:]\t\r]//g")"

Answer

現在のロケールがすでにUTF-8を文字セットとして使用し、ファイルがその文字セットを使用して作成されている場合：

<file LC_ALL=C sed 's/[^ -~]//g'

または、AIX sed に制御文字を含めるには、次のようにします。

<file LC_ALL=C sed "$(printf "s/[^[:print:]\t\r]//g")"

Question 2

次のようにコマンドを使用できますtr。

tr -cd '[:print:]\t\r\n'

説明する：

`[:print:]'
Any character from the `[:space:]' class, and any character that is not in the `[:graph:]' class
\r -- return
\t -- horizontal tab

はいbased on Centos 7:tris GNU and UTF-8 encoding

$ echo "fiancÃÂÃÂÃÂÃÂÃÂ" | tr -cd '[:print:]\t\r\n'
fianc

$ echo "get ^▒▒^▒▒^▒▒^▒▒^▒▒^▒▒ " | tr -cd '[:print:]\t\r\n'
get ^^^^^^

echo " Caucasian male lives in Arizona w/ fianc▒^▒▒^▒▒^▒▒^▒▒^▒▒^▒^▒▒^▒▒^▒▒^▒▒^▒▒^▒"  | tr -cd '[:print:]\t\r\n'
 Caucasian male lives in Arizona w/ fianc^^^^^^^^^^^^

Answer

次のようにコマンドを使用できますtr。

tr -cd '[:print:]\t\r\n'

説明する：

`[:print:]'
Any character from the `[:space:]' class, and any character that is not in the `[:graph:]' class
\r -- return
\t -- horizontal tab

はいbased on Centos 7:tris GNU and UTF-8 encoding

$ echo "fiancÃÂÃÂÃÂÃÂÃÂ" | tr -cd '[:print:]\t\r\n'
fianc

$ echo "get ^▒▒^▒▒^▒▒^▒▒^▒▒^▒▒ " | tr -cd '[:print:]\t\r\n'
get ^^^^^^

echo " Caucasian male lives in Arizona w/ fianc▒^▒▒^▒▒^▒▒^▒▒^▒▒^▒^▒▒^▒▒^▒▒^▒▒^▒▒^▒"  | tr -cd '[:print:]\t\r\n'
 Caucasian male lives in Arizona w/ fianc^^^^^^^^^^^^

sedを使用した文字の削除

答え1

答え2

関連情報