官术网_书友最值得收藏!

Text parsers

Parsing is something that web apps need to do quite often. Opa features a built-in syntax for building text parsers, which are first class values just as functions. The parser is based on parsing expression grammar (http://en.wikipedia.org/wiki/Parsing_expression_grammar), which may look like regular expressions at first, but do not behave anything like them. One big advantage of text parsers over regular expressions is that you can easily combine parsers. A good example is parsing URLs. Let's start right away with our first Opa parser:

first_parser = parser { 
case "Opa"  : 1 
}

For first_parser, the expressions are just literal strings, which means this parser will succeed only if fed with the string "Opa". Then how to use this parser? The module Parser (http://doc.opalang.org/module/stdlib.core.parser/Parser) has a bunch of functions to deal with parsers. The most important one is:

Parser.try_parse : Parser.general_parser('a), string -> option('a)

It takes a parser and a string as parameters and produces an optional value of some type. Let's see how to use this function:

x = Parser.try_parse(parser1,"Opa")  //x = {some: 1}
y = Parser.try_parse(parser1,"Java") //y = {none}

Now let's consider the following parsers:

digit1 = parser { case x=[0-9]+: x }
digit2 = parser { case x=([0-9]+): x }

Both digit1 and digit2 accept a number string like "5","100", and both will assign the value to the identifier x. If we feed the parser digit1 with the string "100", x will be the parsing result of the string: a list of characters ['1','0','0']. If we feed the string "100" to parser digit2, x will be the input string: 100. So, if we want to get hold of the input string, we need to put the expression in parentheses.

Let's move it a little further; consider the following parser:

abs_parser = parser{
  case x=("+"?[0-9]+): Int.of_string("{x}")
  case x=("-"[0-9]+) : 0 – Int.of_string("{x}")
}
x = Parser.try_parse(abs_parser,"-100") // x = {some: 100}

This parser accepts an integer string and returns the absolute value. You may figure out how it works with the previous knowledge. Note that even if the expression of PEG looks like a regular expression, they are different.

主站蜘蛛池模板: 桦川县| 小金县| 沛县| 开远市| 若羌县| 正宁县| 新泰市| 白城市| 甘谷县| 易门县| 邯郸市| 洛南县| 资溪县| 维西| 翁源县| 安新县| 新丰县| 剑川县| 潞城市| 铜鼓县| 禄丰县| 且末县| 东辽县| 呼玛县| 上饶县| 方山县| 鲁山县| 姚安县| 田阳县| 贡嘎县| 且末县| 工布江达县| 桐乡市| 沁阳市| 芜湖县| 慈溪市| 乌拉特中旗| 汪清县| 东莞市| 佳木斯市| 安徽省|