正则表达式提取满足以下条件的所有子字符串:
前 4 个字符是数字,子字符串以数字或字母结尾
15 或 18 个字符长
如果有 2 个子字符串符合条件,则只返回第一个
df1 = pd.DataFrame(data ={"Messy_IDS":["Looking for ID : 7010M000002N8c5T7A","5634M000002N8c5T7A,7010M000002N8c5T7A","https://website.com/12340000000f5F5"], "Desired_Output":["7010M000002N8c5T7A","5634M000002N8c5T7A","12340000000f5F5"]})
df1
Messy_IDS Desired_Output
0 Looking for ID : 7010M000002N8c5T7A 7010M000002N8c5T7A
1 5634M000002N8c5T7A,7010M000002N8c5T7A 5634M000002N8c5T7A
2 https://website.com/12340000000f5F5 12340000000f5F5