Search the Community
Showing results for tags 'scrape'.
-
Hello everyone, #include <GUIConstantsEx.au3> #include <GUIListBox.au3> #include <WindowsConstants.au3> #cs FILE DATA : <name>John</name> <random>Hello</random> <name>Silly</name> <other>Test</other> <other>World</other> <name>Billy</name> #ce Local $file = "file.txt" $Form1 = GUICreate("Get items from text", 360, 250, -1, -1) $List1 = GUICtrlCreateList("", 8, 8, 160, 235, -1, 0) _getitems() GUISetState(@SW_SHOW, $Form1) While 1 $nMsg = GUIGetMsg() Switch $nMsg Case $GUI_EVENT_CLOSE Exit EndSwitch WEnd Func _getitems() EndFunc ;==>_getitems I have the code above and I try to figure out the next one : It must read a text file with fields like in example below. <name>John</name> <random>Hello</random> <name>Silly</name> <other>Test</other> <other>World</other> <name>Billy</name> How to add into list only what is between <name>*</name>, something like a wildcard tried to applied but not working. Any suggestions? Thank you!
-
Hi all, Ive made a script that scrapes an xml off the web code below -<availability> -<members date="2015-07-18" daytag="Today" count="11" day="8" night="9" ooa="0" s44="" na="0"> <qualification abbrev="2YR" name="2 Years Experience" category="Ability" count="4" day="3" night="3" ooa="0" s44="0"na="0"/> <qualification abbrev="BA" name="Breathing Apparatus Operator" category="Operator" count="4" day="3" night="4" ooa="0"s44="0" na="0"/> </members> -<members date="2015-07-19" daytag="Tomorrow" count="11" day="8" night="11" ooa="0" s44="0" na="0"> <qualification abbrev="2YR" name="2 Years Experience" category="Ability" count="4" day="4" night="4" ooa="0" s44="0"na="0"/> <qualification abbrev="BA" name="Breathing Apparatus Operator" category="Operator" count="6" day="6" night="4" ooa="0"s44="0" na="0"/> </members> <availability> My script is meant to scrape the "today" section. The first part of my script works and picks up the correct "day" count but when its comes to the "breathing Apparatus Operator" it collects the number from "tomorrow" how can I fix this? My code below $sXML = BinaryToString(InetRead($Site)) $day = StringRegExpReplace($sXML, '(?is).*<availability.*?day="([^"]+).*</availability.*', '$1') $BA = StringRegExpReplace($sXML, '(?is).*<members.*? name="Breathing Apparatus Operator".*?day="([^"]+).*</members.*', '$1');this gets the info we need