Using lookbehind with grep

Thu, October 31, 2024 - 2 min read

Using lookbehinds with grep (ggrep for MacOs)

When working with text data on the command line, lookbehinds can be a valuable took, especially for extracting specific fields from JSON or logs. While macOS comes with BSD grep (which doesn’t support advanced regex), we can use GNU grep with PCRE (Perl-Compatible Regular Expressions) to access features like lookbehinds by installing ggrep using brew install grep.

Using the following JSON:

examples.json
[
 {
 "id": 1,
 "name": "Item 1",
 "description": "Description for item 1",
 "createdAt": "2024-10-31T00:00:00Z",
 "isActive": true
 },
 {
 "id": 2,
 "name": "Item 2",
 "description": "Description for item 2",
 "createdAt": "2024-10-31T00:00:00Z",
 "isActive": false
 }
]
 

And typing:

ggrep_cmd
ggrep -oP '(?<="name":).*' examples.sh
 

Will print:

ggrep_cmd
 
"item 1",
"item2",
 

• The (?<=…) syntax is a lookbehind that ensures the match only occurs after “name”:. • -o makes it print just the match, so you get a clean result with no extra text. • -P stands for Perl-Compatible Regular Expressions (PCRE). This flag enables more advanced regex features—such as lookbehinds, lookaheads, and non-capturing groups—that are part of the Perl regex syntax but not available in standard grep.

You can also use SED and AWK in Linux for similar results but this, personally has the simplest syntax.

What commands do you use daily to make your life easier?