I have a problem retrieving UTF-8 Chinese textContent from a web page. The element in question is:
<a href="/index/XAU.php">費城金銀</a>
My script:
#Include <FF.au3>
_FFConnect()
$sExpect = "費城金銀" ; the expected result
$sReturn = _FFXPath("//a[@href = '/index/XAU.php']", "textContent", 2)
MsgBox(4096, "", "$sExpect" & @CRLF & $sExpect & @CRLF & @CRLF & _
"$sReturn" & @CRLF & $sReturn & @CRLF & @CRLF & _
"binaryToString(stringToBinary($sReturn,1),4)" & @CRLF & BinaryToString(StringToBinary($sReturn, 1), 4) & @CRLF & @CRLF & _ ; convert it back to UTF-8, but only 1 character comes out correct
"binaryToString(stringToBinary(binaryToString(stringToBinary($sExpect,4),1),1),4)" & @CRLF & BinaryToString(StringToBinary(BinaryToString(StringToBinary($sExpect, 4), 1), 1), 4)) ; simulate the error
Does anybody know a way to retrieve the textContent directly as UTF-8 instead of ANSI? Thanks for your help!