How to join multiple lines when the pattern matches within multiple segments

Author: 庸人自扰1
Posted: 2024-06-28 08:52:11

3 replies

0#
易米烊光 | 2019-08-31 10-32

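<div class="post-text" itemprop="text">
<p>It may not be the shortest, but here is a simple sed version:</p>
<pre><code>sed -E '
:l
/(^|\n)segment[ \t]*$/!{
    N
    s/(^|\n)([^+\n])([^\n]*)((\n[^\n]*)*)\n[+]\2[ \t]+([^\n]*)/\1\2\3 \6\4/
    bl
}
' file
</code></pre>
<ul>
<li>If the pattern space does not end with a segment line:
<ul>
<li>append the next line to the pattern space;</li>
<li>look for a line starting with x and a later line starting with +x, and append the tail of the latter to the former;</li>
<li>jump back to the top.</li>
</ul>
</li>
<li>Otherwise, print implicitly, clear the pattern space, and start the next cycle.</li>
</ul>
</div>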
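The steps in this reply depend on `N` gluing lines together inside the pattern space with an embedded newline, which is why the regex refers to `\n`. As a small illustration only, assuming GNU sed and a hypothetical two-line sample (the thread does not show the input file), the effect can be seen like this:

```shell
# Hypothetical two-line sample; the real input file is not shown in the thread.
sample() { printf '%s\n' 'bob is working' '+b eating'; }

# N appends the next line to the pattern space; l prints the pattern space
# unambiguously, so the embedded newline shows up as \n.
sample | sed -n 'N; l'

# Simplified join for the easy case where the +x line directly follows its
# x line: capture the first character, then require "+" plus that same
# character at the start of the next line.
sample | sed -E 'N; s/^(.)(.*)\n[+]\1[ \t]+(.*)$/\1\2 \3/'
```

The first command should print `bob is working\n+b eating$` and the second `bob is working eating`. The second command is a simplified stand-in for the substitution in the reply: it does not handle other lines sitting between the `x` line and its `+x` line.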

1#
楊♡ | 2019-08-31 10-32

<div class="post-text" itemprop="text">
<p>If <code>perl</code> is an option for you, try the following:</p>
<pre><code>perl -ne '
s/\s+$//;
if (/^segment/) {
    push(@ary, $_);
    print(join("\n", @ary), "\n");
    undef @ary;
} elsif (/^(\S)\S*\s+is/) {
    push(@ary, $_);
    $index{$1} = $#ary;
} elsif (/^\+(\S)\s+(\S+)/) {
    $ary[$index{$1}] .= " $2";
}
' file.txt
</code></pre>
<p>Output:</p>
<pre><code>segment
bob is working eating drinking
linda is studying
john is reading listening
segment
john is driving
linda is cooking washing
bob is sleeping snoring
segment
</code></pre>
<ul>
<li>The <code>-n</code> option tells <code>perl</code> to loop over the input line by line without printing automatically, somewhat like <code>sed -n</code> or <code>awk</code>.</li>
<li><code>s/\s+$//</code> strips the trailing newline and any trailing whitespace.</li>
<li>The <code>if (/^segment/)</code> branch flushes the contents of <code>@ary</code> and resets the array for the next segment.</li>
<li>The <code>elsif (/^(\S)\S*\s+is/)</code> branch matches a line such as <code>bob is working</code>, pushes it onto <code>@ary</code>, and records its index under the initial letter, here "b".</li>
<li>The <code>elsif (/^\+(\S)\s+(\S+)/)</code> branch matches a line such as <code>+b eating</code> and appends the action <code>eating</code> to the element of <code>@ary</code> indexed under "b".</li>
</ul>
<p>I could write the script in <code>awk</code> as well, but it would be longer. I prefer <code>perl</code> for its flexibility (and its quirks).<br/>
Hope this helps.</p>
</div>
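This reply mentions an awk version without showing one. As a rough sketch only, and not the poster's script, the same buffering idea might look like this in awk. The function name `join_segments` is hypothetical, and the sample input is an assumption reconstructed from the output shown in the reply.

```shell
# join_segments: buffer the lines of a segment, fold each "+x word" line into
# the buffered line whose first character is x, and flush on "segment".
join_segments() {
    awk '
    { sub(/[ \t]+$/, "") }              # strip trailing whitespace
    /^segment/ {                        # segment marker: flush the buffer
        for (i = 1; i <= n; i++) print line[i]
        print
        n = 0
        split("", idx)                  # portable way to empty an array
        next
    }
    /^[+]/ {                            # continuation such as "+b eating"
        key = substr($1, 2, 1)
        if (key in idx) line[idx[key]] = line[idx[key]] " " $2
        next
    }
    {                                   # ordinary line: remember its slot
        line[++n] = $0
        idx[substr($0, 1, 1)] = n
    }
    ' "$@"
}

# Assumed sample input, fed on stdin.
join_segments <<'EOF'
segment
bob is working
linda is studying
+b eating
john is reading
+b drinking
+j listening
segment
EOF
```

Like the Perl version, this prints buffered lines only when a `segment` line arrives, so anything after the last marker is dropped. Unlike it, any line that is neither a marker nor a `+x` line is buffered, not just lines containing `is`. A file can be passed directly, as in `join_segments file.txt`.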
