Changes

Tutorial9: Regular Expressions

9 bytes removed, 09:11, 17 July 2020

INVESTIGATION 2: EXTENDED REGULAR EXPRESSIONS

# Issue the following Linux command to download another data file called '''words.dat''': '''wget <nowiki>https://ict.senecacollege.ca/~murray.saul/uli101/words.dat</nowiki>'''

# View the contents of the '''numbers2.dat''' file using the '''more''' command. You should notice valid and more invalid numbers contained in this file. When finished, exit the '''more''' command.

# Issue the following Linux command to display two or more consecutive occurrences of the word "the": '''egrep -i "(the){2,}" words.dat | more''' You should not see any output, because the pattern has no space after the word "the". Words are usually separated by spaces, so this pattern would match only "thethe", not "the the".
# Reissue the previous command with a space inside the parentheses: '''egrep -i "(the ){2,}" words.dat | more''' The "or" symbol | can be used within the grouping symbols to match alternative groups of characters. Again, it is important to follow each group of characters with a space.
# Issue the following Linux command to search for two or more consecutive occurrences of the word "the" or the word "and": '''egrep -i "(the |and ){2,}" words.dat | more'''
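The effect of the space inside the group, and of the | symbol, can be checked on a small file whose contents are known in advance. The following is a minimal sketch, not part of the original tutorial: the sample lines are made up for illustration and are not the contents of words.dat, and '''grep -E''' is used as the equivalent of '''egrep'''. The '''-c''' option prints the number of matching lines instead of the lines themselves.

```shell
# Hypothetical sample data (NOT the contents of the course's words.dat).
sample=$(mktemp)
printf '%s\n' \
  'the the cat sat' \
  'bathe them' \
  'and the and the end' \
  'The THE dog' \
  'thethe typo' \
  'nothing repeated here' > "$sample"

# No space in the group: only a back-to-back "thethe" matches.
no_space=$(grep -E -i -c '(the){2,}' "$sample" || true)

# Space inside the group: "the the " matches, in any letter case (-i).
with_space=$(grep -E -i -c '(the ){2,}' "$sample" || true)

# Alternation inside the group: two or more of "the " / "and " in a row.
either=$(grep -E -i -c '(the |and ){2,}' "$sample" || true)

echo "no_space=$no_space with_space=$with_space either=$either"
# prints: no_space=1 with_space=2 either=3

rm -f "$sample"
```

Only the line "thethe typo" matches the first pattern. The line "and the and the end" matches only the third pattern, because it never repeats "the " twice in a row but does repeat a mix of "and " and "the ".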

Proceed to Investigation 3

Msaul
