grep regex pattern for expression 1 followed by expression 2

Question 1

Something like the following seems to do what you are after.

$ cat d2.txt
<h3>Agent</h3>
<p>Blah blah blah </p>
<p>Their agent is XYZ Corp.</p>

$ grep -i 'agent' d2.txt # -i = ignore case; grep prints every line that contains "agent" anywhere, alone or followed by anything
<h3>Agent</h3>
<p>Their agent is XYZ Corp.</p>

$ grep -iE 'agent.*XYZ' d2.txt #match agent followed by XYZ
<p>Their agent is XYZ Corp.</p>
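
As a small sketch of what 'agent.*XYZ' does and does not match: the pattern is order-sensitive and works within a single line. The second input line below is a hypothetical addition, not part of d2.txt, and the alternation with -E is just one way to accept either order.

```shell
# Two sample lines: same words, opposite order (the second line is hypothetical).
lines=$(printf '%s\n' \
  '<p>Their agent is XYZ Corp.</p>' \
  '<p>XYZ Corp. is their agent.</p>')

# 'agent.*XYZ' requires "agent" to appear before "XYZ" on the same line.
ordered=$(printf '%s\n' "$lines" | grep -i 'agent.*XYZ')

# To accept either order, alternate the two orderings (needs -E for |).
either=$(printf '%s\n' "$lines" | grep -iE 'agent.*XYZ|XYZ.*agent')

printf 'ordered:\n%s\n' "$ordered"
printf 'either:\n%s\n' "$either"
```

Note that -i applies to the whole pattern, so "xyz" in lowercase would match as well.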

Answer

Question 2

Assuming the h3 title always appears on a different line from the agent name, sed seems to work as required.

Given the input file

some data
at the top
<h3>Agent</h3>
<p>Blah blah blah </p>
<p>Their agent is XYZ Corp.</p>
some data
at the bottom

the command

sed -n '\#<h3>Agent</h3>#,/XYZ/p' input.html

produces

<h3>Agent</h3>
<p>Blah blah blah </p>
<p>Their agent is XYZ Corp.</p>

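This sed command prints everything between the lines matching the two regular expressions, <h3>Agent</h3> and XYZ, including those two lines themselves. The first regex is written with a custom delimiter, \#...#, so that I don't have to escape the / inside it.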
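
To illustrate the delimiter choice, here is a minimal sketch that runs the same range with the custom delimiter and with the default / delimiter, where the slash in </h3> has to be escaped. The temporary file stands in for input.html and is only an assumption for the demo.

```shell
# Build the sample input in a temporary file (stand-in for input.html).
tmp=$(mktemp)
printf '%s\n' \
  'some data' \
  'at the top' \
  '<h3>Agent</h3>' \
  '<p>Blah blah blah </p>' \
  '<p>Their agent is XYZ Corp.</p>' \
  'some data' \
  'at the bottom' > "$tmp"

# Custom delimiter: \# opens the address and # closes it, so / needs no escaping.
custom=$(sed -n '\#<h3>Agent</h3>#,/XYZ/p' "$tmp")

# Default delimiter: the / in </h3> must be written as \/.
escaped=$(sed -n '/<h3>Agent<\/h3>/,/XYZ/p' "$tmp")

# Both forms select the same range of lines.
[ "$custom" = "$escaped" ] && echo "same output"
printf '%s\n' "$custom"

rm -f "$tmp"
```

Either form works; the custom delimiter is mostly a readability choice when the pattern itself contains slashes.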

Answer