I need a bit of help parsing few large excel files about 8mb each file filled with these variation. I want to capture what in bold only. These are all narrow down variation. Most of the time wording are misspelled or abbreviated. ^(?:.*?)(\d+|\$\d+|\d+.\d+|\$\d+.\d+|\$\s\d+.\d+|\d+) invoice 928.00 paid 880.00 pricing. Invoice $ 35.20 Paid $ 31.12 Paid invoice per system pricing inv 1681.00 pd 1575.00 no pay Invoice $80.00 Paid $79.50 paid per g (2012-10-08:61516 ) Invoice $ 218.50 Paid $ 16