Grep multiple lines from a text file

Question 1

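You can use grep's -z option:

-z, --null-data: a data line ends in a 0 byte instead of a newline.

With -z, a file that contains no NUL bytes is read as one single "line", so the pattern can match across newlines.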

$ grep -zo -- '---.start[^-]*---' file
---
start
a
b
c
d
---
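To see what -z changes in practice, here is a minimal sketch that feeds grep a small input like the one above and runs the same command. The sample contents and the variable name block are assumptions for illustration, and GNU grep is assumed. Under -z, each match printed by -o ends in a NUL byte rather than a newline, so the sketch pipes the result through tr before displaying it.

```shell
# Hypothetical sample input: one "--- start ... ---" section surrounded by other lines.
# -z: the whole input is one record, so "." and "[^-]*" can cross newlines.
# -o: print only the matched text; under -z each match is terminated by a NUL byte.
# tr turns that NUL into a newline so the result displays cleanly.
block=$(printf '%s\n' x --- start a b c d --- y |
    grep -zo -- '---.start[^-]*---' |
    tr '\0' '\n')

printf '%s\n' "$block"
```

Note that [^-]* stops at the first "-" anywhere in the section body, so this pattern only works when the body contains no dashes.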

Question 2

If you are sure the text contains no "tricky" parts, that is, a --- line is always followed by a start line (as in the example), you can reduce the section header to --- and use:

sed -n '/---/,//p' text
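The range works because an empty regex (//) reuses the last regex, so the address reads "from a --- line to the next --- line". The sketch below runs it on a made-up input (the sample lines and the variable name out are assumptions) that deliberately includes a section that does not begin with start, to show why the assumption above matters.

```shell
# Hypothetical input: a "start" section, a stray line, then a section titled "other".
out=$(printf '%s\n' --- start a --- y --- other z --- |
    sed -n '/---/,//p')   # // repeats /---/, so each range runs from one --- to the next

printf '%s\n' "$out"
```

The stray line y is dropped, but the "other" section is printed as well, because the range only looks at the --- delimiters.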

If you cannot be sure of the above:

sed -n '/---/{n;/start/{:a H;n;/---/!ba;x;G;s/^/---/p;s/.*/\n---/;D}}' test


sed : /bin/sed executable
-n : sed option to avoid auto line printing
/---/ : Match a pattern of 3 "-"
n: Get the next line of input
/start/: Match a line containing "start"
:a : Define a label called "a" (for the loop)
H: Append the line to the hold space (save it)
n: Get the next line
/---/!: Test if the current line **is not** "---"
ba: Jump to the label "a" if the test succeeds
x: Swap the hold space and the pattern space
G: Get the content of the hold space and append it to the pattern space
s/^/---/p: Prepend "---" to the pattern space and print it
s/.*/\n---/: Replace the pattern space with a newline followed by "---"
D: Delete the pattern space up to the first newline and start the next cycle with what remains, without reading new input
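The H, x and G steps are easier to follow on a smaller problem. The following is only an illustrative sketch, not the script above: it joins lines with commas using the same hold-space commands. The input and the variable name joined are assumptions. It shows that H always inserts a newline before the appended line, which is why the hold space in the script above begins with an empty first line.

```shell
# H appends "\n" + current line to the hold space on every line.
# On the last line ($), x swaps the accumulated text into the pattern space,
# the newlines are turned into commas, and the leading comma left by the
# first H is removed before printing.
joined=$(printf '%s\n' a b c |
    sed -n 'H;${x;s/\n/,/g;s/^,//;p;}')

printf '%s\n' "$joined"
```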

With awk, in short-circuit mode:

awk -v h="---" -v h2="start" '                     
    f == 2
    $0 == h {f=1}
    f == 1 && h2 == $0 {print h;print;f++}
' test
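The awk program is a small state machine driven by the flag f. As a sketch of how it behaves, the same program is run here on a made-up input (the sample lines and the variable name out are assumptions) that contains both a start section and a section with a different title.

```shell
# f == 0: outside any section, f == 1: a delimiter was seen, f == 2: inside a wanted section.
out=$(printf '%s\n' x --- start a b --- y --- other z --- |
    awk -v h="---" -v h2="start" '
        f == 2                                    # inside a wanted section: print the line
        $0 == h {f=1}                             # delimiter: the section ends or a new one may begin
        f == 1 && h2 == $0 {print h; print; f++}  # header found: print delimiter and header, enter section
    ')

printf '%s\n' "$out"
```

Only the start section is printed. The closing --- is emitted by the first rule before the second rule resets f to 1.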

Question 3

Based on @schrodigerscatcuriosity's answer, you can do:

grep -zoP -- '(?s)\n---\nstart\n.*?\n---\n' file

-P enables the PCRE extensions, and (?s) turns on PCRE_DOTALL.

For the optional empty lines between --- and start that you mentioned:

grep -zoP -- '(?s)\n---\n[\n\s]*start\n.*?\n---\n' file
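As a rough check of the second pattern, the sketch below runs it on a made-up input with one empty line between --- and start. The sample contents and the variable name out are assumptions, and a GNU grep built with PCRE support is assumed. The match includes the newline before the opening --- and the one after the closing ---, and -z adds a NUL terminator, which tr removes.

```shell
# Hypothetical input: "x", then a section with a blank line before "start", then "y".
out=$(printf 'x\n---\n\nstart\na\n---\ny\n' |
    grep -zoP -- '(?s)\n---\n[\n\s]*start\n.*?\n---\n' |
    tr -d '\0')   # drop the NUL that terminates each match under -z

printf '%s\n' "$out"
```

Because the pattern begins with \n, a section that starts on the very first line of the file is not matched.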

Question 4

Using GNU awk for multi-character RS and RT, and assuming ---\n only appears in the input as a record separator (for example, you can't have something like b---\n in the middle of a record):

$ awk -v RS='---\n' -v ORS= '/^start/ && RT{print RT $0 RT}' file
---
start
a
b
c
d
---
