0

参考以下代码

# import
import pandas as pd
import numpy as np
import string 

# create data frame
data = {'Name': ['Jas,on', 'Mo.lly', 'Ti;na', 'J:ake', '!Amy', "Myself"]}
df = pd.DataFrame(data, columns = ['Name'])
df

# get cleanName - Function
def getCleanName(pName):
    vRetVals = pName.translate(str.maketrans(" ", " ", string.punctuation))
    return(vRetVals)  

# clean Name
print("PreClean Good Rows", df.shape[0] - df.Name.map(lambda v:v.isalpha()).sum())
df['Name'] = [getCleanName for n in df.Name]
print("PostClean Good Rows", df.shape[0] - df.Name.map(lambda v: v.isalpha()).sum())

问题

第一次运行以下行时,它运行正常:

print("PreClean Good Rows", df.shape[0] - df.Name.map(lambda v: v.isalpha()).sum())

第二次运行同一行时,会出现以下错误

AttributeError: 'function' object has no attribute 'isalpha'

任何想法,是什么导致了这个问题?

4

1 回答 1

2

你忘了打电话getCleanName,所以你list最终得到了一堆相同的函数引用。将其更改为:

df['Name'] = [getCleanName(n) for n in df.Name]
#                         ^^^ changed

实际调用函数并使用结果。

于 2019-03-22T15:51:02.870 回答