2015-01-02 193 views
1

我想弄清楚如何匹配我需要的东西的其他部分,但似乎无法让它工作。PHP - 正则表达式匹配其他正则表达式中的大括号

这是我到目前为止有:

preg_match_all("/^(.*?)(?:.\(([\d]+?)[\/I^\(]*?\))(?:.\((.*?)\))?/m",$data,$r, PREG_SET_ORDER); 

示例文本:

INPUT - Each line represents a line inside a text file. 
------------------------------------------------------------------------------------- 
"!?Text" (1234)           1234-4321 
"#1 Text" (1234)          1234-???? 
#2 Text (1234) {Some text (#1.1)}      1234 
Text (1234)            1234 
Some Other Text: More Text here 1234-4321 (1234) (V) 1234 

我想要做什么:

我也想匹配大括号和东西的东西在大括号的括号内。 我似乎无法得到它的工作考虑到花括号+括号中的内容可能并不总是在行内。

基本上第一个(1234)将是一年,我只想匹配一次,但是在最后一个字符串示例中它也匹配(V),但我不希望它。

理想的输出:

有关使用
Array 
(
    [0] => "!?Text" (1234) 
    [1] => "!?Text" 
    [2] => 1234 
) 
Array 
(
    [0] => "#1 Text" (1234) 
    [1] => "#1 Text" 
    [2] => 1234 
) 
Array 
(
    [0] => "#2 Text" (1234) 
    [1] => "#2 Text" 
    [2] => 1234 
    [3] => Some text (#1.1) // Matches things within curly brackets if there are any. 
    [4] => Some text // Extracts text before brackets 
    [5] => #1.1 // Extracts text within brackets (if any because brackets may not be within curly brackets.) 
) 
Array 
(
    [0] => Text (1234) 
    [1] => Text 
    [2] => 1234 
) 
Array // (My current regular expression gives me a 4th match with value 'V', which it shouldn't do) 
(
    [0] => Some Other Text: More Text here 1234-4321 (1234) (V) 
    [1] => Some Other Text: More Text here 1234-4321 
    [2] => 1234 
) 
+0

所以要捕捉括号之间的所有数字?修改您的示例以显示EXPECTED输出 – Enissay

+0

不是。我想捕捉开始(这是一年前应该是第一个括号(1234))的所有内容,然后我想在大括号内捕捉所有内容,如果字符串中有这样的括号。如果存在大括号,我需要+特别匹配大括号括号内的内容) – CLECode

+0

在所有示例中,“1234”位于括号内,而不是花括号。 – Barmar

回答

1

什么:

^((.*?) *\((\d+)\))(?: *\{((.*?) *\((.+?)\)) *\})? 

DEMO

NODE      EXPLANATION 
-------------------------------------------------------------------------------- 
^      the beginning of the string 
-------------------------------------------------------------------------------- 
    (      group and capture to \1: 
-------------------------------------------------------------------------------- 
    (      group and capture to \2: 
-------------------------------------------------------------------------------- 
     .*?      any character except \n (0 or more 
           times (matching the least amount 
           possible)) 
-------------------------------------------------------------------------------- 
    )      end of \2 
-------------------------------------------------------------------------------- 
    *      ' ' (0 or more times (matching the most 
          amount possible)) 
-------------------------------------------------------------------------------- 
    \(      '(' 
-------------------------------------------------------------------------------- 
    (      group and capture to \3: 
-------------------------------------------------------------------------------- 
     \d      digits (0-9) 
-------------------------------------------------------------------------------- 
           ' ' 
-------------------------------------------------------------------------------- 
    )      end of \3 
-------------------------------------------------------------------------------- 
    \)      ')' 
-------------------------------------------------------------------------------- 
)      end of \1 
-------------------------------------------------------------------------------- 
    (?:      group, but do not capture (optional 
          (matching the most amount possible)): 
-------------------------------------------------------------------------------- 
    *      ' ' (0 or more times (matching the most 
          amount possible)) 
-------------------------------------------------------------------------------- 
    \{      '{' 
-------------------------------------------------------------------------------- 
    (      group and capture to \4: 
-------------------------------------------------------------------------------- 
     (      group and capture to \5: 
-------------------------------------------------------------------------------- 
     .*?      any character except \n (0 or more 
           times (matching the least amount 
           possible)) 
-------------------------------------------------------------------------------- 
    )      end of \5 
-------------------------------------------------------------------------------- 
     *      ' ' (0 or more times (matching the 
           most amount possible)) 
-------------------------------------------------------------------------------- 
     \(      '(' 
-------------------------------------------------------------------------------- 
     (      group and capture to \6: 
-------------------------------------------------------------------------------- 
     .      any character except \n 
-------------------------------------------------------------------------------- 
     ?      ' ' (optional (matching the most 
           amount possible)) 
-------------------------------------------------------------------------------- 
    )      end of \6 
-------------------------------------------------------------------------------- 
     \)      ')' 
-------------------------------------------------------------------------------- 
    )      end of \4 
-------------------------------------------------------------------------------- 
    *      ' ' (0 or more times (matching the most 
          amount possible)) 
-------------------------------------------------------------------------------- 
    \}      '}' 
-------------------------------------------------------------------------------- 
)?      end of grouping 
+0

作品伟大的家伙,谢谢。 – CLECode