eschettini Posted February 18, 2019 Share Posted February 18, 2019 Hi all, I'd like to know if it is possible to scrape javascript rendered data from a web page and eventually how. Thanks in advance, Enrico Link to comment Share on other sites More sharing options...
Moderators Melba23 Posted February 18, 2019 Moderators Share Posted February 18, 2019 Moved to the appropriate forum, as the AutoIt Example Scripts forum very clearly states: Quote Share your cool AutoIt scripts, UDFs and applications with others. Do not post general support questions here, instead use the AutoIt Help and Support forums. Moderation Team Any of my own code posted anywhere on the forum is available for use by others without any restriction of any kind Open spoiler to see my UDFs: Spoiler ArrayMultiColSort ---- Sort arrays on multiple columnsChooseFileFolder ---- Single and multiple selections from specified path treeview listingDate_Time_Convert -- Easily convert date/time formats, including the language usedExtMsgBox --------- A highly customisable replacement for MsgBoxGUIExtender -------- Extend and retract multiple sections within a GUIGUIFrame ---------- Subdivide GUIs into many adjustable framesGUIListViewEx ------- Insert, delete, move, drag, sort, edit and colour ListView itemsGUITreeViewEx ------ Check/clear parent and child checkboxes in a TreeViewMarquee ----------- Scrolling tickertape GUIsNoFocusLines ------- Remove the dotted focus lines from buttons, sliders, radios and checkboxesNotify ------------- Small notifications on the edge of the displayScrollbars ----------Automatically sized scrollbars with a single commandStringSize ---------- Automatically size controls to fit textToast -------------- Small GUIs which pop out of the notification area Link to comment Share on other sites More sharing options...
Skeletor Posted February 19, 2019 Share Posted February 19, 2019 Could you give an example of what you trying to accomplish? And how about some code... Kind RegardsSkeletor "Coffee: my defense against going postal." Microsoft Office Splash Screen | Basic Notepad Program (Beginner) | Transparent Splash Screen | Full Screen UI Link to comment Share on other sites More sharing options...
eschettini Posted February 19, 2019 Author Share Posted February 19, 2019 Hi Skeletor, I need to extract javascript rendered data of a web page like this: https://www.hpba.org/Membership/Organization-Search/Organization-Profile/orgcd/111773 but trying with _IEDocReadHTML or _INetGetSource functions I only get the original source of the page without data that browser's javascript engine process as receive the html from the webserver. Here are a couple of examples: 1) #include <IE.au3> #include <MsgBoxConstants.au3> Local $oIE = _IECreate("https://www.hpba.org/Membership/Organization-Search/Organization-Profile/orgcd/111773") Local $sHTML = _IEDocReadHTML($oIE) ConsoleWrite($sHTML) _IEQuit($oIE) 2) #include <Inet.au3> Local $sURL = "https://www.hpba.org/Membership/Organization-Search/Organization-Profile/orgcd/111773" Local $sXML = StringReplace(_INetGetSource($sURL), @CRLF, "") ConsoleWrite($SXML&@CRLF) Is there a way to get these kind of data with Autoit ? Thanks in advance, Enrico Link to comment Share on other sites More sharing options...
Nine Posted February 19, 2019 Share Posted February 19, 2019 (edited) Try this : Opt ("MustDeclareVars", 1) #include <IE.au3> Local $oIE = _IECreate("https://www.hpba.org/Membership/Organization-Search/Organization-Profile/orgcd/111773") Local $cFrames = _IEFrameGetCollection ($oIE) Local $iNumFrames = @extended Local $sTxt = $iNumFrames & " frames found" & @CRLF & "Error = " & @error & @CRLF ; MsgBox (0,"",$sTxt) Local $oFrame = 0 $oFrame = _IEFrameGetCollection($oIE, 0) Local $cTags = _IETagNameGetCollection ($oFrame, "dd") Local $sInfo = "", $sTag, $iS For $oTag in $cTags $sTag = $oTag.innerText $iS = StringInStr ($sTag,"; }",0,-1) if $iS then $sTag = StringMid($sTag,$iS+4) $sInfo &= $sTag & @CRLF Next MsgBox (0,"",$sInfo) _IEQuit($oIE) Edited February 19, 2019 by Nine rm4453 1 “They did not know it was impossible, so they did it” ― Mark Twain Spoiler Block all input without UAC Save/Retrieve Images to/from Text Monitor Management (VCP commands) Tool to search in text (au3) files Date Range Picker Virtual Desktop Manager Sudoku Game 2020 Overlapped Named Pipe IPC HotString 2.0 - Hot keys with string x64 Bitwise Operations Multi-keyboards HotKeySet Recursive Array Display Fast and simple WCD IPC Multiple Folders Selector Printer Manager GIF Animation (cached) Screen Scraping Multi-Threading Made Easy Link to comment Share on other sites More sharing options...
eschettini Posted February 19, 2019 Author Share Posted February 19, 2019 Thanks a lot Nine!!! It works fine. Link to comment Share on other sites More sharing options...
Nine Posted February 19, 2019 Share Posted February 19, 2019 “They did not know it was impossible, so they did it” ― Mark Twain Spoiler Block all input without UAC Save/Retrieve Images to/from Text Monitor Management (VCP commands) Tool to search in text (au3) files Date Range Picker Virtual Desktop Manager Sudoku Game 2020 Overlapped Named Pipe IPC HotString 2.0 - Hot keys with string x64 Bitwise Operations Multi-keyboards HotKeySet Recursive Array Display Fast and simple WCD IPC Multiple Folders Selector Printer Manager GIF Animation (cached) Screen Scraping Multi-Threading Made Easy Link to comment Share on other sites More sharing options...
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now