?? group.html
字號:
<HTML>
<HEAD>
<META http-equiv="Content-Type" content="text/html; charset=UTF-8">
<TITLE>Capturing Text in a Group in a Regular Expression
(Java Developers Almanac Example)
</TITLE>
<META CONTENT="Patrick Chan" NAME="AUTHOR">
<META CONTENT="Code Examples from The Java Developers Almanac 1.4" NAME="DESCRIPTION">
<META CONTENT="Addison-Wesley/Patrick Chan" NAME="OWNER">
<META CONTENT="3/20/02" NAME="revision">
<STYLE TYPE="text/css">
<!-- BODY CODE {font-family: Courier, Monospace; font-size: 11pt} TABLE, BODY {font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10pt} PRE {font-family: Courier, Monospace; font-size: 10pt} H3 {font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 11pt} A.eglink {text-decoration: none} A:hover.eglink {text-decoration: underline} -->
</STYLE>
</HEAD>
<BODY>
<TABLE CELLSPACING="0" CELLPADDING="0" BORDER="0">
<TR>
<TD rowspan="3"><A HREF="/?l=ex"><IMG BORDER="0" ALIGN="BOTTOM" HSPACE="10" SRC="/egs/almanac14a.jpg"></A></TD><TD VALIGN="top"><font face="Times" size="6"><b>The Java Developers Almanac 1.4</b></font>
<br>
Order this book from <a href="/cgi-bin/scripts/redirect.pl?l=ex&url=http://www.amazon.com/exec/obidos/ASIN/0201752808/xeo">Amazon</a>.
</TD>
</TR>
<TR>
<TD align="right" valign="bottom">
<FORM method="get" action="/cgi-bin/search/find.pl">
<INPUT size="25" name="words" type="text"><INPUT value="Search" type="submit">
</FORM>
</TD>
</TR>
</TABLE>
<HR color="#6666cc">
<DIV ALIGN="LEFT">
<A HREF="/">Home</A>
>
<A HREF="../index.html">List of Packages</A>
>
<B><A HREF="../java.util.regex/pkg.html">java.util.regex</A></B><font color="#666666" SIZE="-2">
[26 examples]
</font>
>
<B><A HREF="../java.util.regex/pkg.html#Groups">Groups</A></B><font color="#666666" SIZE="-2">
[5 examples]
</font>
</DIV><P>
<h3>
e436.
Capturing Text in a Group in a Regular Expression</h3>
A <font color="#0066ff"><i>group</i></font> is a pair of parentheses used to group subpatterns. For
example, <code>h(a|i)t</code> matches <code>hat</code> or <code>hit</code>. A group
also captures the matching text within the parentheses. For example,
<pre>
input: abbc
pattern: a(b*)c
</pre>
causes the substring <code>bb</code> to be captured by the group <code>(b*)</code>.
A pattern can have more than one group and the groups can be nested.
For example,
<pre>
pattern: (a(b*))+(c*)
</pre>
contains three groups:
<pre>
group 1: (a(b*))
group 2: (b*)
group 3: (c*)
</pre>
The groups are numbered from left to right, outside to inside. There
is an implicit group 0, which contains the entire match. Here is an
example of what is captured in groups.
Notice that group 1 was applied twice, once to the input <code>abb</code> and
then to the input <code>ab</code>. Only the most recent match is captured.
Note that when using * on a group and the group matches zero
times, the group will not be cleared. In particular, it will hold the
most recently captured text. For example,
<pre>
input: aba
pattern: (a(b)*)+
group 0: aba
group 1: a
group 2: b
</pre>
Group 1 first matched <code>ab</code> capturing <code>b</code> in group 2. Group 1 then
matched the <code>a</code> with group 2 matching zero <code>b</code>s, therefore leaving
intact the previously captured <code>b</code>.
<P> Note: If it is not necessary for a group to capture text, you
should use a non-capturing group since it is more efficient. For more
information, see <a href="../java.util.regex/NoGroup.html" class="eglink"><font size="-1"><b>e438</b> Using a Non-Capturing Group in a Regular Expression</font></a>.
<P>
This example demonstrates how to retrieve the text in a group.
<pre>
CharSequence inputStr = <font color="#0066ff"><i>"abbabcd"</i></font>;
String patternStr = <font color="#0066ff"><i>"(a(b*))+(c*)"</i></font>;
// Compile and use regular expression
Pattern pattern = Pattern.compile(patternStr);
Matcher matcher = pattern.matcher(inputStr);
boolean matchFound = matcher.find();
if (matchFound) {
// Get all groups for this match
for (int i=0; i<=matcher.groupCount(); i++) {
String groupStr = matcher.group(i);
}
}
</pre>
<P><table width="600" CELLSPACING="0" CELLPADDING="2" BORDER="0">
<tr>
<td bgcolor="#6666cc" align="center"><font color="#ffffff">
Related Examples
</font></td>
</tr>
</table>
e437. <a class="eglink" href="GroupIx.html?l=rel">
Getting the Indices of a Matching Group in a Regular Expression
</a>
<br>
e438. <a class="eglink" href="NoGroup.html?l=rel">
Using a Non-Capturing Group in a Regular Expression
</a>
<br>
e439. <a class="eglink" href="GroupInPat.html?l=rel">
Using the Captured Text of a Group within a Pattern
</a>
<br>
e440. <a class="eglink" href="GroupInRep.html?l=rel">
Using the Captured Text of a Group within a Replacement Pattern
</a>
<br>
<table width="600" CELLSPACING="0" CELLPADDING="2" BORDER="0">
<tr>
<td align="left">
<br>
See also:
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Flags">
Flags
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Lines">
Lines
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Paragraphs">
Paragraphs
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Searching%20and%20Replacing">
Searching and Replacing
</a>
<a class="eglink" href="/egs/java.util.regex/pkg.html?l=rel#Tokenizing">
Tokenizing
</a>
</td>
</tr>
</table>
<br>
<br>
<FONT FACE="Verdana, Arial, Helvetica, sans-serif" SIZE="0">
© 2002 Addison-Wesley.
</FONT>
</BODY>
</HTML>
?? 快捷鍵說明
復制代碼
Ctrl + C
搜索代碼
Ctrl + F
全屏模式
F11
切換主題
Ctrl + Shift + D
顯示快捷鍵
?
增大字號
Ctrl + =
減小字號
Ctrl + -