
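When my RPA starts data scraping, it begins to run but takes forever and never finishes the scrape. No error message is output, and the RPA never ends. Why does this happen? I strongly suspect it is caused by the selectors.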

Data Scraping

Selector for "Attach Browser 'Cookingoi' page":

<html app='chrome.exe' htmlwindowname='nameStorage:?wm_referrer=https%3A%2F%2Fredmart.lazada.sg%2F%23home&amp;refer_pv_id=*' title='* - Buy * at Best Price in Singapore | www.lazada.sg' />

Selector for "Extract Structured Data 'DIV'":

<webctrl css-selector='body&gt;div&gt;div&gt;div&gt;div&gt;div' parentid='root' tag='DIV' />

Selector for "NextLinkSelector":

<webctrl idx='7' isleaf='1' parentid='root' tag='A' />

1 Answer


OK, I was able to get this working. You have to select the larger box that encompasses the whole div for an item. This is what my extract metadata looks like:

<extract>
    <row exact='1'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
    </row>
    <column exact='1' name='col1' attr='text'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
        <webctrl tag='a' class='pdp_link' idx='1'/>
    </column>
    <column exact='1' name='Column2' attr='text'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
        <webctrl tag='a' class='pdp_link' idx='1'/>
        <webctrl tag='div' class='product_details_container' idx='1'/>
        <webctrl tag='div' class='title_wrapper' idx='1'/>
        <webctrl tag='div' class='title' idx='1'/>
    </column>
    <column exact='1' name='Column3' attr='text'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
        <webctrl tag='a' class='pdp_link' idx='1'/>
        <webctrl tag='div' class='product_details_container' idx='1'/>
        <webctrl tag='div' class='price_container' idx='1'/>
        <webctrl tag='div' class='product_price' idx='1'/>
    </column>
</extract>
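
In the metadata above, every column's `webctrl` chain begins with the same three steps as the row (the "larger box") and then goes deeper. As a rough sketch only, assuming that prefix relationship is the property you want to verify, the following Python snippet parses a trimmed copy of the metadata and reports whether each column extends the row path. The function name `columns_extend_row` and the helper `step_key` are hypothetical; they are not part of UiPath.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the extract metadata from the answer (row plus two columns).
EXTRACT_XML = """
<extract>
    <row exact='1'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
    </row>
    <column exact='1' name='col1' attr='text'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
        <webctrl tag='a' class='pdp_link' idx='1'/>
    </column>
    <column exact='1' name='Column2' attr='text'>
        <webctrl tag='div' class='shelf_container' idx='1'/>
        <webctrl tag='div' class='shelf_wrapper' idx='1'/>
        <webctrl tag='div' class='product_container'/>
        <webctrl tag='a' class='pdp_link' idx='1'/>
        <webctrl tag='div' class='product_details_container' idx='1'/>
        <webctrl tag='div' class='title_wrapper' idx='1'/>
        <webctrl tag='div' class='title' idx='1'/>
    </column>
</extract>
"""


def step_key(element):
    """Reduce a <webctrl> step to the attributes compared here (hypothetical helper)."""
    return (element.get("tag"), element.get("class"), element.get("idx"))


def columns_extend_row(xml_text):
    """Map each column name to True if its path starts with the row path and goes deeper."""
    root = ET.fromstring(xml_text.strip())
    row_steps = [step_key(w) for w in root.find("row").findall("webctrl")]
    result = {}
    for column in root.findall("column"):
        steps = [step_key(w) for w in column.findall("webctrl")]
        has_row_prefix = steps[: len(row_steps)] == row_steps
        result[column.get("name")] = has_row_prefix and len(steps) > len(row_steps)
    return result


if __name__ == "__main__":
    for name, ok in columns_extend_row(EXTRACT_XML).items():
        print(name, "extends row path" if ok else "does NOT extend row path")
```

A column reported as not extending the row path would point at a selector that targets something outside the per-item container, which is worth inspecting by hand in the scraping wizard.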
answered 2020-05-18T22:09:42.827