I have a string that can contain pretty much any character. Inside the string there is the delimiter {{{.

For example: afskjdfakjsdfkjas{{{fasdf.

Using attoparsec, what is the idiomatic way of writing a Parser () that skips all characters before {{{, but without consuming the {{{?

2 Answers

Use attoparsec's lookAhead (which applies a parser without consuming any input) and manyTill to write a parser that consumes everything up to (but excluding) a {{{ delimiter. You're then free to apply that parser and throw its result away.

{-# LANGUAGE OverloadedStrings #-}

import Control.Applicative ( (<|>) )
import Data.Text ( Text )
import qualified Data.Text as T
import Data.Attoparsec.Text
import Data.Attoparsec.Combinator ( lookAhead, manyTill )

myParser :: Parser Text
myParser = T.concat <$> manyTill (nonOpBraceSpan <|> opBraceSpan)
                                 (lookAhead $ string "{{{")
                    <?> "{{{"
  where
    opBraceSpan    = takeWhile1 (== '{')
    nonOpBraceSpan = takeWhile1 (/= '{')

In GHCi:

λ> :set -XOverloadedStrings 
λ> parseTest myParser "{foo{{bar{{{baz"
Done "{{{baz" "{foo{{bar"
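
Since the question asks for a Parser (), here is a minimal sketch of the "throw its result away" step. The names skipToDelim and afterDelim are hypothetical and only serve the illustration; void comes from Data.Functor, and the chunk parser is the same span-based idea as in myParser above.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import Control.Applicative ((<|>))
import Data.Functor (void)
import Data.Text (Text)
import Data.Attoparsec.Text
import Data.Attoparsec.Combinator (lookAhead)

-- Hypothetical name: skips everything before the first "{{{" and
-- leaves the delimiter itself in the input.
skipToDelim :: Parser ()
skipToDelim = void (manyTill chunk (lookAhead (string "{{{")))
  where
    -- Consume either a run of non-brace characters or a run of braces.
    chunk = takeWhile1 (/= '{') <|> takeWhile1 (== '{')

-- Hypothetical follow-up parser: because the delimiter was not consumed,
-- it can still be matched explicitly after the skip.
afterDelim :: Parser Text
afterDelim = skipToDelim *> string "{{{" *> takeText
```

With the example string from the question, parseOnly afterDelim "afskjdfakjsdfkjas{{{fasdf" should yield Right "fasdf", while parseOnly (skipToDelim *> takeText) on the same input should yield Right "{{{fasdf".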
answered 2015-07-02T11:53:15.253

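You can do it the slightly harder way like this. At the delimiter the pattern match on Just fails, which ends the many loop; because attoparsec backtracks on failure, the {{{ is left in the input: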

foo = many $ do
  Just c <- fmap (const Nothing) (try $ string "{{{") <|> fmap Just anyChar
  return c

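Or you could use the helper function manyTill. Wrap the end parser in lookAhead (from Data.Attoparsec.Combinator) so that the {{{ is not consumed; try is unnecessary here because attoparsec always backtracks on failure:

foo = manyTill anyChar (lookAhead $ string "{{{")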
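
To check how much input a manyTill-based parser consumes, the following sketch compares an end parser that matches the delimiter directly with one wrapped in lookAhead. The names restAfter, consuming, and peeking are hypothetical and exist only for this comparison.

```haskell
{-# LANGUAGE OverloadedStrings #-}

import Data.Text (Text)
import Data.Attoparsec.Text
import Data.Attoparsec.Combinator (lookAhead)

-- Hypothetical helper: run a skipping parser, then return whatever
-- input is left over, so the two variants can be compared.
restAfter :: Parser a -> Text -> Either String Text
restAfter skip = parseOnly (skip *> takeText)

-- The end parser matches the delimiter and consumes it.
consuming :: Parser String
consuming = manyTill anyChar (string "{{{")

-- The end parser only peeks at the delimiter and leaves it in place.
peeking :: Parser String
peeking = manyTill anyChar (lookAhead (string "{{{"))
```

On the input "ab{{{cd", restAfter consuming should return Right "cd", whereas restAfter peeking should return Right "{{{cd". Only the second behaviour matches the requirement of not consuming the {{{.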
answered 2015-05-29T01:30:15.387