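I have a data frame, df1, with a column University_name that holds university names; it has 500,000 rows. I have a second data frame, df2, with 150 rows and two columns, university_name and university_aliases. I want to match each alias in the university_aliases column against the names in df1 and map every match to the corresponding name in university_name_new.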
Sample of df1$university_name:
university of auckland
the university of auckland
university of warwick - warwick business school
unv of warwick
seneca college of applied arts and technology
seneca college
univ of auckland
Sample of df2:
University_Alias         University_Name_new
univ of auckland         university of auckland
universiry of auckland   university of auckland
auckland university      university of auckland
university of auckland   university of auckland
warwick university       university of warwick
warwick univercity       university of warwick
university of warwick    university of warwick
seneca college           seneca college
unv of warwick           university of warwick
I am expecting output like this:
university of auckland
university of auckland
university of warwick
seneca college
seneca college
I am using the following code, but it does not work:
df$university_name[ grepl(df$university_name,df2$university_alias)] <- df2$university_name_new
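For reference, grepl() takes a single pattern as its first argument and the vector to search as its second, so passing two whole columns cannot pair each alias with each name. One possible approach is to loop over the 150 aliases and test each as a literal substring of the names. This is only a sketch: the df2 column names follow the code above, university_name_clean is a hypothetical output column, and trying longer aliases first is an assumption about how overlapping aliases should be resolved.

```r
# Sample data copied from the question.
df1 <- data.frame(
  university_name = c(
    "university of auckland",
    "the university of auckland",
    "university of warwick - warwick business school",
    "unv of warwick",
    "seneca college of applied arts and technology",
    "seneca college",
    "univ of auckland"
  ),
  stringsAsFactors = FALSE
)

df2 <- data.frame(
  university_alias = c(
    "univ of auckland", "universiry of auckland", "auckland university",
    "university of auckland", "warwick university", "warwick univercity",
    "university of warwick", "seneca college", "unv of warwick"
  ),
  university_name_new = c(
    "university of auckland", "university of auckland", "university of auckland",
    "university of auckland", "university of warwick", "university of warwick",
    "university of warwick", "seneca college", "university of warwick"
  ),
  stringsAsFactors = FALSE
)

# Hypothetical output column; NA means no alias was found in that row.
df1$university_name_clean <- NA_character_

# Assumption: try longer aliases first so the most specific alias wins.
alias_order <- order(nchar(df2$university_alias), decreasing = TRUE)

for (i in alias_order) {
  # fixed = TRUE treats the alias as a literal substring, not a regex.
  hit <- is.na(df1$university_name_clean) &
    grepl(df2$university_alias[i], df1$university_name, fixed = TRUE)
  df1$university_name_clean[hit] <- df2$university_name_new[i]
}

print(df1$university_name_clean)
```

Each grepl() call is vectorised over the whole name column, so the loop only runs once per alias rather than once per row. Substring matching will not catch misspellings that are missing from the alias list; those rows stay NA and would need a fuzzy match or extra aliases.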