如果有以下html:
</br></td>, <td class="first">TEXT_1a<br>TEXT_1b
</br></td>, <td class="first">TEXT_2a<br>TEXT_2b
</br></td>, <td class="first">TEXT_3a<br>TEXT_3b
</br></td>, <td class="first">TEXT_4a<br>TEXT_4b
</br></td>, <td class="first">TEXT_5a<br>TEXT_5b
</br></td>, <td class="first">TEXT_6a<br>TEXT_6b
我用了
[i.text.strip() for i in soup.select('td.first')]
但是,然后我得到了
['TEXT_1aTEXT_1b', 'TEXT_2aTEXT_2b', 'TEXT_3aTEXT_3b', 'TEXT_4aTEXT_4b', 'TEXT_5aTEXT_5b', 'TEXT_6aTEXT_6b']
如何删除 TEXT_1b、TEXT_2b...TEXT_nb。另外我不想有一个数组,但 N1 = TEXT_1a, N2 = TEXT_2a...Nn = TEXT_na