Context

I'm new to coding and have been learning through videos and trial and error, but that approach seems to have run its course.

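I was able to use Helium (a simpler wrapper around Selenium) to collect a list of YouTube links. Now I want to iterate over that list and download the transcript for each video.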

# Get the links
from helium import S, find_all

def Get_links():
    # Find every video renderer element on the page
    Lnk = find_all(S('.style-scope ytd-video-renderer'))
    fin = []

    for l in Lnk:
        # Note: this XPath is page-wide, so every pass returns all thumbnails
        ind_links = find_all(S('//*[@id="thumbnail"]'))
        # Collect the href of each thumbnail (may be None)
        href_list = [e.web_element.get_attribute('href') for e in ind_links]
        # Keep only links not already collected, to drop duplicates
        for i in href_list:
            if i not in fin:
                fin.append(i)

    print(fin)
    return fin

The output is a list of links:

 ['https://www.youtube.com/watch?v=eHnXgh0j500', None, 
  'https://www.youtube.com/watch?v=wDHtXXApfbc', 
  'https://www.youtube.com/watch?v=CJhOGDU636k', 
  'https://www.youtube.com/watch?v=xIB6uNsgFb8', 
  'https://www.youtube.com/watch?v=u7Ckt6A6du8', 
  'https://www.youtube.com/watch?v=PnSC2BY4e7c', 
  'https://www.youtube.com/watch?v=UkIAsYWgciQ', 
  'https://www.youtube.com/watch?v=MqC_k2WxZro', 
  'https://www.youtube.com/watch?v=B0BpL20QHPU', 
  'https://www.youtube.com/watch?v=UujbkSBzuI0', 
  'https://www.youtube.com/watch?v=7Q8ZvFDyjhA', 
  'https://www.youtube.com/watch?v=Z8pVlfulkcw', 
  'https://www.youtube.com/watch?v=fy0clsby3v8', 
  'https://www.youtube.com/watch?v=oYJaLgJL0Ok', 
  'https://www.youtube.com/watch?v=rampRBuDIIQ', 
  'https://www.youtube.com/watch?v=BuhUXD0KH8k', 
  'https://www.youtube.com/watch?v=27mtHjDTgvQ', 
  'https://www.youtube.com/watch?v=kebonpz4bD0', 
  'https://www.youtube.com/watch?v=2KgH0UpiRiw', 
  'https://www.youtube.com/watch?v=TA-P5ilI_Vg', 
  'https://www.youtube.com/watch?v=TOTmOToM6zQ', 
  'https://www.youtube.com/watch?v=CRVYXC2OH7U', 
  'https://www.youtube.com/watch?v=g4TrGD2tDek', 
  'https://www.youtube.com/watch?v=tAO-Ff7_4CE', 
  'https://www.youtube.com/watch?v=fwe-PjrX23o', 
  'https://www.youtube.com/watch?v=Gu7-vlVFUnw', 
  'https://www.youtube.com/watch?v=oXOqExfdKNg', 
  'https://www.youtube.com/watch?v=zrh7P9fgga8', 
  'https://www.youtube.com/watch?v=HVdZ-ccwkj8', 
  'https://www.youtube.com/watch?v=vCdTLteTPtM']

Question

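Is there a way to open these links in the browser with Helium (or Selenium) and download the transcripts, without manually copying and pasting each one into a variable and then into a list?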

1 Answer

Example

Your list of URLs:

fin = ['https://www.youtube.com/watch?v=eHnXgh0j500', None, 
  'https://www.youtube.com/watch?v=wDHtXXApfbc', 
  'https://www.youtube.com/watch?v=CJhOGDU636k'
  ]

Loop over the list and do something with each URL:

for url in fin:
    if url:  # skip the None values
        # do something in Selenium, e.g. driver.get(url)
        print(url)  # or just print
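
If it helps, the loop above could be wrapped in small helpers so the list is cleaned once and the browser call is passed in. This is only a sketch: `clean_links`, `video_id`, and `visit_all` are hypothetical names, not part of Helium or Selenium, and `print` stands in for the real browser call (for example Helium's `go_to` or Selenium's `driver.get`) so that it runs without a browser. It does not download any transcript; it only shows how to walk the list and pull out each video ID.

```python
from urllib.parse import urlparse, parse_qs


def clean_links(links):
    """Drop None/empty entries and duplicates, keeping first-seen order."""
    seen = set()
    cleaned = []
    for link in links:
        if link and link not in seen:
            seen.add(link)
            cleaned.append(link)
    return cleaned


def video_id(url):
    """Return the value of the `v` query parameter, or None if absent."""
    values = parse_qs(urlparse(url).query).get("v")
    return values[0] if values else None


def visit_all(links, open_url):
    """Call open_url on each usable link; return the video IDs visited."""
    visited = []
    for url in clean_links(links):
        open_url(url)  # e.g. Helium's go_to or Selenium's driver.get
        visited.append(video_id(url))
    return visited


fin = [
    'https://www.youtube.com/watch?v=eHnXgh0j500', None,
    'https://www.youtube.com/watch?v=wDHtXXApfbc',
    'https://www.youtube.com/watch?v=CJhOGDU636k',
]

# print stands in for the browser call so the sketch runs without a browser
ids = visit_all(fin, print)
```

Passing the browser call in as `open_url` keeps the list handling separate from the browser, so the same loop can be tried with `print` first and switched to the real call later.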
Answered 2020-12-27T20:35:00.923