8

I want to perform several ordered, successive replaceAll(...,...) calls on a string in a functional way in Scala.

What is the most elegant solution? Scalaz is welcome! ;)

4

5 Answers

15

If it is just a few invocations, simply chain them. Otherwise I guess I would try this:

Seq("a" -> "b", "b" -> "a").foldLeft("abab"){case (z, (s,r)) => z.replaceAll(s, r)}

Or, if you like shorter code with confusing wildcards and extra closures:

Seq("a" -> "b", "b" -> "a").foldLeft("abab"){_.replaceAll _ tupled(_)}
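A minimal sketch of the fold above wrapped in a reusable helper. The names FoldReplace, replaceAllInOrder and replaceLiteralsInOrder are made up for illustration. Keep in mind that replaceAll interprets its first argument as a regular expression, so the second helper assumes you want literal substitution and uses String#replace instead.

```scala
object FoldReplace {
  // Hypothetical helper: applies each (regex, replacement) pair in list order.
  def replaceAllInOrder(input: String, rules: Seq[(String, String)]): String =
    rules.foldLeft(input) { case (acc, (pattern, replacement)) =>
      acc.replaceAll(pattern, replacement)
    }

  // Hypothetical helper: same fold, but treats every target as literal text.
  def replaceLiteralsInOrder(input: String, rules: Seq[(String, String)]): String =
    rules.foldLeft(input) { case (acc, (target, replacement)) =>
      acc.replace(target, replacement)
    }

  def main(args: Array[String]): Unit = {
    println(replaceAllInOrder("abab", Seq("a" -> "b", "b" -> "a")))   // prints "aaaa"
    println(replaceAllInOrder("a.b.c", Seq("." -> "-")))              // prints "-----" ("." is a regex wildcard)
    println(replaceLiteralsInOrder("a.b.c", Seq("." -> "-")))         // prints "a-b-c"
  }
}
```

Because the rules run one after another, a later rule sees the output of an earlier one, which is why "abab" ends up as "aaaa" rather than "baba".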
answered 2012-07-05T18:23:00.650
13

First, let's get a function out of the replaceAll method:

scala> val replace = (from: String, to: String) => (_:String).replaceAll(from, to)
replace: (String, String) => String => java.lang.String = <function2>

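Now you can use the Functor instance for functions, which is defined in scalaz. That way you can compose functions using map (or, to make it look nicer, its unicode alias).

It looks like this: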

scala> replace("from", "to") ∘ replace("to", "from") ∘ replace("some", "none")
res0: String => java.lang.String = <function1>

If you prefer Haskell-style composition (right to left), use contramap:

scala> replace("some", "none") ∙ replace("to", "from") ∙ replace ("from", "to")
res2: String => java.lang.String = <function1>

You can also have some fun with the Category instance:

scala> replace("from", "to") ⋙ replace("to", "from") ⋙ replace("some", "none")
res5: String => java.lang.String = <function1>

scala> replace("some", "none") ⋘ replace("to", "from") ⋘ replace ("from", "to")
res7: String => java.lang.String = <function1>

And apply it:

scala> "somestringfromto" |> res0
res3: java.lang.String = nonestringfromfrom

scala> res2("somestringfromto")
res4: java.lang.String = nonestringfromfrom

scala> "somestringfromto" |> res5
res6: java.lang.String = nonestringfromfrom

scala> res7("somestringfromto")
res8: java.lang.String = nonestringfromfrom
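If scalaz is not on the classpath, roughly the same composition can be sketched with the standard library alone, assuming Function1's andThen (left to right) and compose (right to left) are acceptable stand-ins for the operators above. The object and value names are hypothetical.

```scala
object ComposeReplace {
  // Same curried helper as above: builds a String => String from a pattern pair.
  val replace = (from: String, to: String) => (_: String).replaceAll(from, to)

  // Left to right: the first function listed runs first.
  val leftToRight: String => String =
    replace("from", "to") andThen replace("to", "from") andThen replace("some", "none")

  // Right to left: the last function listed runs first.
  val rightToLeft: String => String =
    replace("some", "none") compose replace("to", "from") compose replace("from", "to")

  def main(args: Array[String]): Unit = {
    println(leftToRight("somestringfromto")) // prints "nonestringfromfrom"
    println(rightToLeft("somestringfromto")) // prints "nonestringfromfrom"
  }
}
```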
answered 2012-07-06T07:40:33.610
4

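Another Scalaz-based solution to this problem is to use the Endo monoid. This monoid captures the identity function (as the monoid's identity element) and function composition (as the monoid's append operation). This solution is especially useful if you have an arbitrarily sized (possibly even empty) list of functions to apply.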

val replace = (from: String, to: String) => (_:String).replaceAll(from, to)

val f: Endo[String] = List(
  replace("some", "none"),
  replace("to", "from"),
  replace("from", "to")    
).foldMap(_.endo)

For example (using one of folone's examples):

scala> f.run("somestringfromto")
res0: String = nonestringfromfrom
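For comparison, a standard-library sketch of the same idea, assuming scala.Function.chain is an acceptable substitute for folding with Endo. Note one difference: Function.chain applies the functions in list order (first element first), so the list below is written in application order, which is the reverse of the Endo list above. An empty list yields a function that returns its input unchanged. The object and value names are hypothetical.

```scala
object ChainReplace {
  val replace = (from: String, to: String) => (_: String).replaceAll(from, to)

  // Function.chain runs the first element first, then the second, and so on.
  val f: String => String = Function.chain(List(
    replace("from", "to"),
    replace("to", "from"),
    replace("some", "none")
  ))

  // With no functions to apply, the chain behaves like identity.
  val noOp: String => String = Function.chain(List.empty[String => String])

  def main(args: Array[String]): Unit = {
    println(f("somestringfromto"))    // prints "nonestringfromfrom"
    println(noOp("somestringfromto")) // prints "somestringfromto"
  }
}
```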
answered 2013-03-11T12:14:36.337
3

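Define a replace function using anonymous parameters, then you can chain successive replacements together. Note that in the chained call, the second replace is parsed as an infix call to String#replace on the result of the first, not as another call to the function defined here.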

scala> val s = "hello world"
s: java.lang.String = hello world

scala> def replace = s.replaceAll(_, _)
replace: (java.lang.String, java.lang.String) => java.lang.String

scala> replace("h", "H") replace("w", "W")
res1: java.lang.String = Hello World
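To make the chained call above easier to follow, here is a sketch with the infix call written in explicit dot notation. It assumes Scala 2 style infix parsing, where the second replace resolves to String#replace (literal replacement, no regex) on the result of the first call. The object name is hypothetical.

```scala
object ChainedCall {
  val s = "hello world"

  // Bound to `s`: always runs replaceAll on this particular string.
  def replace: (String, String) => String = s.replaceAll(_, _)

  // Explicit form of `replace("h", "H") replace("w", "W")`.
  val result: String = replace("h", "H").replace("w", "W")

  def main(args: Array[String]): Unit = {
    println(result) // prints "Hello World"
  }
}
```

Since the function is tied to s, it cannot be reused for other input strings; the fold or composition approaches from the other answers avoid that limitation.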
answered 2012-07-05T17:47:32.707
-1
To replace or remove multiple substrings in a DataFrame string column in Scala:

import org.apache.spark.sql.functions.{lit, udf}

// to find
def isContainingContent(str: String, regexStr: String): Boolean = {
  val regex = new scala.util.matching.Regex(regexStr)
  // true when the regex matches anywhere in the string
  regex.findFirstIn(str).isDefined
}
val colContentPresent = udf((str: String, regex: String) => {
  isContainingContent(str, regex)
})
// to remove
val cleanPayloadOfRemovableContent = udf((str: String, regexStr: String) => {
  val regex = new scala.util.matching.Regex(regexStr)
  // delete every match of the regex
  regex.replaceAllIn(str, "")
})
// to define: one regex with alternatives, so all removals happen in a single pass
val removableContentRegex =
"<log:Logs>[\\s\\S]*?</log:Logs>|\\\\n<![\\s\\S]*?-->|<\\?xml[\\s\\S]*?\\?>"

// to call (dfXMLCheck is an existing DataFrame; the $"..." syntax needs spark.implicits._ in scope)
val dfPayloadLogPresent = dfXMLCheck.withColumn("logsPresentInit", colContentPresent($"payload", lit(removableContentRegex)))
val dfCleanedXML = dfPayloadLogPresent.withColumn("payload", cleanPayloadOfRemovableContent($"payload", lit(removableContentRegex)))
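The same find and remove logic can be tried without Spark. The sketch below applies the regex from above to a plain string; the sample payload and the names RemoveByRegex, containsRemovable and clean are invented for illustration.

```scala
import scala.util.matching.Regex

object RemoveByRegex {
  // Same alternation as above: each branch matches one kind of removable content.
  val removable: Regex =
    "<log:Logs>[\\s\\S]*?</log:Logs>|\\\\n<![\\s\\S]*?-->|<\\?xml[\\s\\S]*?\\?>".r

  // True when at least one branch of the regex matches somewhere in the string.
  def containsRemovable(str: String): Boolean =
    removable.findFirstIn(str).isDefined

  // Deletes every match of any branch.
  def clean(str: String): String =
    removable.replaceAllIn(str, "")

  def main(args: Array[String]): Unit = {
    // Made-up sample payload.
    val payload =
      "<?xml version=\"1.0\"?><root><log:Logs>debug</log:Logs><data>1</data></root>"
    println(containsRemovable(payload)) // prints "true"
    println(clean(payload))             // prints "<root><data>1</data></root>"
  }
}
```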
answered 2019-03-06T17:47:07.543