Sorting the sections of a text file

Answer 1

Using gawk:

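gawk -v RS="" '
  # RS="" is paragraph mode: each blank-line-separated stanza is one record.
  # The 3-argument match() (a gawk extension) stores the captured index value in m[1].
  match($0, /index = ([^[:space:]]+)/, m) {
    stanzas[m[1]] = $0
  }
  END {
    # Traverse the array in ascending string order of its keys.
    PROCINFO["sorted_in"] = "@ind_str_asc"
    ORS = "\n\n"
    for (indx in stanzas) print stanzas[indx]
  }
' file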

Let's add another stanza to the file:

[monitor:///..]
disabled = true
index = xyz
sourcetype= ...

[monitor:///..]
disabled = true
index = abc
sourcetype= ...

[monitor:///...]
disabled = true
index = def
sourcetype= ...

The gawk command then produces:

[monitor:///..]
disabled = true
index = abc
sourcetype= ...

[monitor:///...]
disabled = true
index = def
sourcetype= ...

[monitor:///..]
disabled = true
index = xyz
sourcetype= ...

(with a blank line at the end)

References:

Built-in string functions, for the 3-argument form of match()
Using Predefined Array Scanning Orders with gawk

Answer 2

A Bash function written from dirkt's explanation:

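function sort_stanzas() {
    local file_path="$1"
    # Requires GNU sed: -z makes sed read the whole file as one record.
    # First sed: newline -> tab, then a double tab (blank line) -> newline,
    # so each stanza ends up on a single line.
    # sort then orders the stanzas by their full text, header line first.
    # Second sed reverses the joining: newline -> double tab, tab -> newline.
    # Assumes the file itself contains no tab characters.
    sed -z \
        -e 's/\n/\t/g' \
        -e 's/\t\t/\n/g' \
        -- "$file_path" \
        | sort \
        | sed -z \
            -e 's/\n/\t\t/g' \
            -e 's/\t/\n/g'
}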

Usage: sort_stanzas <file>
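
If GNU sed's `-z` is not available, the same join, sort, split idea can be sketched with awk's paragraph mode instead. This is only a rough alternative under the same assumption that the file contains no tabs; the name `sort_stanzas_portable` is made up for this example. Paragraph mode drops the blank lines between stanzas, so the second awk adds one back after each stanza.

```shell
# Hypothetical alternative to sort_stanzas that avoids sed -z.
sort_stanzas_portable() {
    # Join: RS="" reads one stanza per record; inner newlines become tabs,
    # so every stanza is printed as a single line.
    awk -v RS="" '{ gsub(/\n/, "\t"); print }' "$1" \
        | sort \
        | awk '{ gsub(/\t/, "\n"); print; print "" }'
    # Split: tabs go back to newlines, and each stanza is followed
    # by one blank line.
}
```

As with the function above, the stanzas are ordered by their full text, so the header line decides the order first.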
