Optional Group Capture with Lua Pattern Matching

爷,独闯天下 提交于 2019-12-10 15:44:22

问题


I am trying to parse chemical formulas in Lua using simple pattern matching. However, I do not know how to specify a capture group as being optional. Here is the pattern I have come up with:

pattern = "(%u%l*)(%d*)"

The first group captures the atomic symbol (i.e. "H", "He", etc..) and the second group captures the number of that atom in the molecule. This value is usually an integer value, but if it is 1, it is often omitted, such as in:

formula = "C2H6O"

When I attempt to do a global match, if there is no match the result of count is '' instead of what I would anticipate of nil.

compound = {}
for atom,count in string.gmatch(formula, pattern) do
    compound[atom] = count or 1
end

Obviously I could just check if count = ''but I was curious if there was an optional capturing group in Lua.


回答1:


if there was an optional capturing group in Lua.

No; pattern items don't list captures as acceptable options, so you can't have, for example, (%d*)? like you'd do in Perl.




回答2:


There is no optional capturing group in Lua.

count is the empty string instead of nil because the empty string matches %d*.

Try this instead:

compound[atom] = tonumber(count) or 1

Note that tonumber will return nil if count is the empty string, which is what you want to check.



来源:https://stackoverflow.com/questions/26044905/optional-group-capture-with-lua-pattern-matching

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!