Technical Notes about PCRE
--------------------------

Many years ago I implemented some regular expression functions to an algorithm
suggested by Martin Richards. These were not Unix-like in form, and were quite
restricted in what they could do by comparison with Perl. The interesting part
about the algorithm was that the amount of space required to hold the compiled
form of an expression was known in advance. The code to apply an expression did
not operate by backtracking, as the Henry Spencer and Perl code does, but
instead checked all possibilities simultaneously by keeping a list of current
states and checking all of them as it advanced through the subject string. (In
the terminology of Jeffrey Friedl's book, it was a "DFA algorithm".) When the
pattern was all used up, all remaining states were possible matches, and the
one matching the longest substring of the subject string was chosen. This did not
necessarily maximize the individual wild portions of the pattern, as is
expected in Unix and Perl-style regular expressions.
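
As a rough illustration of the list-of-states approach described above (this
is not the original code, which is not reproduced in these notes), the
following Python sketch handles a toy pattern syntax assumed purely for the
example: literal characters, '.', and a postfix '*' on a single character. The
names tokenize, closure and longest_match are hypothetical.

```python
def tokenize(pattern):
    """Split a toy pattern into (char, starred) pairs."""
    tokens = []
    i = 0
    while i < len(pattern):
        starred = i + 1 < len(pattern) and pattern[i + 1] == "*"
        tokens.append((pattern[i], starred))
        i += 2 if starred else 1
    return tokens


def closure(tokens, states):
    """Add every state reachable by skipping starred tokens (zero repeats)."""
    result = set(states)
    stack = list(states)
    while stack:
        s = stack.pop()
        if s < len(tokens) and tokens[s][1] and s + 1 not in result:
            result.add(s + 1)
            stack.append(s + 1)
    return result


def longest_match(pattern, subject):
    """Return the length of the longest match at the start of subject,
    or None if there is no match. No backtracking is performed: all
    live states are advanced together, one subject character at a time."""
    tokens = tokenize(pattern)
    final = len(tokens)
    states = closure(tokens, {0})
    best = 0 if final in states else None
    for pos, ch in enumerate(subject):
        advanced = set()
        for s in states:
            if s == final:
                continue  # pattern used up; nothing left to consume
            char, starred = tokens[s]
            if char == "." or char == ch:
                advanced.add(s if starred else s + 1)
        states = closure(tokens, advanced)
        if not states:
            break  # every possibility has died out
        if final in states:
            best = pos + 1  # longest match seen so far
    return best
```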

By contrast, the code originally written by Henry Spencer and subsequently
heavily modified for Perl actually compiles the expression twice: once in a
dummy mode in order to find out how much store will be needed, and then for
real. The execution function operates by backtracking and maximizing (or,
optionally, minimizing in Perl) the amount of the subject that matches
individual wild portions of the pattern. This is an "NFA algorithm" in Friedl's
terminology.

For the set of functions that forms PCRE (which are unrelated to those
mentioned above), I tried at first to invent an algorithm that used an amount
of store bounded by a multiple of the number of characters in the pattern, to
save on compiling time. However, because of the greater complexity in Perl
regular expressions, I couldn't do this. In any case, a first pass through the
pattern is needed, in order to find internal flag settings like (?i) at top
level. So PCRE works by running a very degenerate first pass to calculate a
maximum store size, and then a second pass to do the real compile - which may
use a bit less than the predicted amount of store. The idea is that this is
going to turn out faster because the first pass is degenerate and the second
pass can just store stuff straight into the vector. It does make the compiling
functions bigger, of course, but they have got quite big anyway to handle all
the Perl stuff.
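
The two-pass idea can be sketched as follows. This is a toy model in Python,
not PCRE's compiler: the pattern syntax (literals plus a postfix '*'), the
opcode values, and the names estimate_size and compile_pattern are all
assumptions made for the example, and characters are assumed to fit in one
byte. The first pass does almost no work and only produces an upper bound; the
second pass writes straight into a vector allocated once at that size.

```python
OP_END, OP_CHARS, OP_STAR = 0, 1, 2  # hypothetical opcode values


def estimate_size(pattern):
    """Degenerate first pass: a cheap upper bound on the compiled length.
    Three bytes per pattern character is never too small for this toy
    encoding, plus one byte for OP_END."""
    return 1 + 3 * len(pattern)


def compile_pattern(pattern):
    """Second pass: the real compile, storing directly into the vector.
    Returns (compiled bytes, predicted size)."""
    predicted = estimate_size(pattern)
    code = bytearray(predicted)  # allocated once, never grown
    n = len(pattern)
    out = 0
    i = 0

    def starred(k):
        return k + 1 < n and pattern[k + 1] == "*"

    while i < n:
        if starred(i):
            code[out] = OP_STAR
            code[out + 1] = ord(pattern[i])
            out += 2
            i += 2
            continue
        # Gather a run of plain characters (at most 255 per item).
        start = i
        while i < n and not starred(i) and i - start < 255:
            i += 1
        run = pattern[start:i].encode("latin-1")
        code[out] = OP_CHARS
        code[out + 1] = len(run)
        code[out + 2:out + 2 + len(run)] = run
        out += 2 + len(run)
    code[out] = OP_END
    return bytes(code[:out + 1]), predicted
```

For the toy pattern "abc*d" the prediction is 16 bytes while the real compile
uses 10, which mirrors the remark that the second pass may use less than the
predicted amount of store.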

The compiled form of a pattern is a vector of bytes, containing items of
variable length. The first byte in an item is an opcode, and the length of the
item is either implicit in the opcode or contained in the data bytes which
follow it. A list of all the opcodes follows:

Opcodes with no following data
------------------------------

These items are all just one byte long:

  OP_END                 end of pattern
  OP_ANY                 match any character
  OP_SOD                 match start of data: \A
  OP_CIRC                ^ (start of data, or after \n in multiline)
  OP_NOT_WORD_BOUNDARY   \B
  OP_WORD_BOUNDARY       \b
  OP_NOT_DIGIT           \D
  OP_DIGIT               \d
  OP_NOT_WHITESPACE      \S
  OP_WHITESPACE          \s
  OP_NOT_WORDCHAR        \W
  OP_WORDCHAR            \w
  OP_EODN                match end of data or \n at end: \Z
  OP_EOD                 match end of data: \z
  OP_DOLL                $ (end of data, or before \n in multiline)
  OP_RECURSE             match the pattern recursively


Repeating single characters
---------------------------

The common repeats (*, +, ?) when applied to a single character appear as
two-byte items using the following opcodes:

  OP_STAR
  OP_MINSTAR
  OP_PLUS
  OP_MINPLUS
  OP_QUERY
  OP_MINQUERY

Those with "MIN" in their name are the minimizing versions. Each is followed by
the character that is to be repeated. Other repeats make use of

  OP_UPTO
  OP_MINUPTO
  OP_EXACT

which are followed by a two-byte count (most significant first) and the
repeated character. OP_UPTO matches from 0 to the given number. A repeat with a
non-zero minimum and a fixed maximum is coded as an OP_EXACT followed by an
OP_UPTO (or OP_MINUPTO).
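
A minimal sketch of this encoding in Python, assuming hypothetical opcode
values and a hypothetical helper name (encode_counted_repeat). It also assumes
that the OP_UPTO count is the maximum minus the minimum, which is how the text
above reads but is not stated explicitly.

```python
import struct

OP_UPTO, OP_MINUPTO, OP_EXACT = 10, 11, 12  # hypothetical opcode values


def encode_counted_repeat(char, minimum, maximum, greedy=True):
    """Encode char{minimum,maximum} as OP_EXACT and/or OP_UPTO items.
    Each item is: opcode, two-byte count (most significant first), char."""
    out = bytearray()
    if minimum > 0:
        out += struct.pack(">BHB", OP_EXACT, minimum, ord(char))
    if maximum > minimum:
        opcode = OP_UPTO if greedy else OP_MINUPTO
        out += struct.pack(">BHB", opcode, maximum - minimum, ord(char))
    return bytes(out)
```

Under these assumptions x{2,5} becomes an OP_EXACT item with count 2 followed
by an OP_UPTO item with count 3, eight bytes in all.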


Repeating character types
-------------------------

Repeats of things like \d are done exactly as for single characters, except
that instead of a character, the opcode for the type is stored in the data
byte. The opcodes are:

  OP_TYPESTAR
  OP_TYPEMINSTAR
  OP_TYPEPLUS
  OP_TYPEMINPLUS
  OP_TYPEQUERY
  OP_TYPEMINQUERY
  OP_TYPEUPTO
  OP_TYPEMINUPTO
  OP_TYPEEXACT


Matching a character string
---------------------------

The OP_CHARS opcode is followed by a one-byte count and then that number of
characters. If there are more than 255 characters in sequence, successive
instances of OP_CHARS are used.
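
The splitting rule can be illustrated with a short Python sketch. The opcode
value and the function name encode_literal are assumptions for the example.

```python
OP_CHARS = 1  # hypothetical opcode value


def encode_literal(text):
    """Encode a byte string as one or more OP_CHARS items.
    The count occupies one byte, so each item holds at most 255 characters."""
    out = bytearray()
    for start in range(0, len(text), 255):
        chunk = text[start:start + 255]
        out.append(OP_CHARS)
        out.append(len(chunk))
        out += chunk
    return bytes(out)
```

A 300-character string therefore yields two items, holding 255 and 45
characters respectively.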


Character classes
-----------------

OP_CLASS is used for a character class, provided there are at least two
characters in the class. If there is only one character, OP_CHARS is used for a
positive class, and OP_NOT for a negative one (that is, for something like
[^a]). Another set of repeating opcodes (OP_NOTSTAR etc.) is used for a
repeated, negated, single-character class. The normal ones (OP_STAR etc.) are
used for a repeated positive single-character class.

OP_CLASS is followed by a 32-byte bit map containing a 1 bit for every
character that is acceptable. The bits are counted from the least significant
end of each byte.
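
A sketch of such a bit map in Python, assuming that character c lives in byte
c/8 of the map at bit c%8 (counting from the least significant end). The
function names are hypothetical.

```python
def build_class_bitmap(chars):
    """Build a 32-byte (256-bit) map with a 1 bit for each accepted
    character code in chars (an iterable of integers 0..255)."""
    bitmap = bytearray(32)
    for c in chars:
        bitmap[c >> 3] |= 1 << (c & 7)
    return bytes(bitmap)


def class_matches(bitmap, c):
    """Test whether character code c is in the class."""
    return bool(bitmap[c >> 3] & (1 << (c & 7)))
```

With this layout, testing a character costs one index, one shift and one mask,
regardless of how many characters the class contains.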


Back references
---------------

OP_REF is followed by two bytes containing the reference number.


Repeating character classes and back references
-----------------------------------------------

Single-character classes are handled specially (see above). This section
applies to OP_CLASS and OP_REF. In both cases, the repeat information follows
the base item. The matching code looks at the following opcode to see if it is
one of

  OP_CRSTAR
  OP_CRMINSTAR
  OP_CRPLUS
  OP_CRMINPLUS
  OP_CRQUERY
  OP_CRMINQUERY
  OP_CRRANGE
  OP_CRMINRANGE

All but the last two are just single-byte items. The others are followed by
four bytes of data, comprising the minimum and maximum repeat counts.
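
The look-ahead at the following opcode might be sketched like this in Python.
The opcode values and the name read_repeat are hypothetical, and the four data
bytes are assumed to be two two-byte counts, most significant byte first, by
analogy with the other counts in this document.

```python
import struct

(OP_CRSTAR, OP_CRMINSTAR, OP_CRPLUS, OP_CRMINPLUS,
 OP_CRQUERY, OP_CRMINQUERY, OP_CRRANGE, OP_CRMINRANGE) = range(20, 28)

# Single-byte repeats: (minimum, maximum, greedy); None means no upper limit.
SIMPLE_REPEATS = {
    OP_CRSTAR: (0, None, True),
    OP_CRMINSTAR: (0, None, False),
    OP_CRPLUS: (1, None, True),
    OP_CRMINPLUS: (1, None, False),
    OP_CRQUERY: (0, 1, True),
    OP_CRMINQUERY: (0, 1, False),
}


def read_repeat(code, offset):
    """Inspect the item at offset, just after an OP_CLASS or OP_REF item.
    Returns (minimum, maximum, greedy, next_offset), or None when the
    item is not a repeat and the base item therefore matches once."""
    if offset >= len(code):
        return None
    opcode = code[offset]
    if opcode in SIMPLE_REPEATS:
        minimum, maximum, greedy = SIMPLE_REPEATS[opcode]
        return minimum, maximum, greedy, offset + 1
    if opcode in (OP_CRRANGE, OP_CRMINRANGE):
        minimum, maximum = struct.unpack_from(">HH", code, offset + 1)
        return minimum, maximum, opcode == OP_CRRANGE, offset + 5
    return None
```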


Brackets and alternation
------------------------

A pair of non-capturing (round) brackets is wrapped round each expression at
compile time, so alternation always happens in the context of brackets.

Non-capturing brackets use the opcode OP_BRA, while capturing brackets use
OP_BRA+1, OP_BRA+2, etc. [Note for North Americans: "bracket" to some English
speakers, including myself, can be round, square, curly, or pointy. Hence this
usage.]

Originally PCRE was limited to 99 capturing brackets (so as not to use up all
the opcodes). From release 3.5, there is no limit. What happens is that the
first ones, up to EXTRACT_BASIC_MAX are handled with separate opcodes, as
above. If there are more, the opcode is set to EXTRACT_BASIC_MAX+1, and the
first operation in the bracket is OP_BRANUMBER, followed by a 2-byte bracket
number. This opcode is ignored while matching, but is fished out when handling
the bracket itself. (They could have all been done like this, but I was making
minimal changes.)

A bracket opcode is followed by two bytes which give the offset to the next
alternative OP_ALT or, if there aren't any branches, to the matching KET
opcode. Each OP_ALT is followed by two bytes giving the offset to the next one,
or to the KET opcode.

OP_KET is used for subpatterns that do not repeat indefinitely, while
OP_KETRMIN and OP_KETRMAX are used for indefinite repetitions, minimally or
maximally respectively. All three are followed by two bytes giving (as a
positive number) the offset back to the matching BRA opcode.
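
To make the offset chain concrete, here is a Python sketch that lays out a
group of literal alternatives and then finds each branch by following the
offsets. The opcode values and function names are hypothetical, each
alternative is assumed to be a literal of at most 255 bytes, and every offset
is assumed to be measured from the opcode byte of the item that holds it.

```python
import struct

OP_CHARS, OP_ALT, OP_KET, OP_BRA = 1, 50, 51, 60  # hypothetical values


def encode_group(alternatives):
    """Lay out a non-capturing group such as (ab|cd|e).
    BRA and each ALT carry a two-byte offset forward to the next ALT or
    to the KET; the KET carries the offset back to the BRA."""
    out = bytearray()
    for index, alt in enumerate(alternatives):
        body = bytes([OP_CHARS, len(alt)]) + alt
        opcode = OP_BRA if index == 0 else OP_ALT
        out += struct.pack(">BH", opcode, 3 + len(body)) + body
    out += struct.pack(">BH", OP_KET, len(out))
    return bytes(out)


def branch_starts(code, bra_offset=0):
    """Follow the chain from the BRA to the KET.
    Returns (offsets of the first item in each branch, KET offset)."""
    starts = []
    offset = bra_offset
    while code[offset] != OP_KET:
        starts.append(offset + 3)  # skip the opcode and its two-byte offset
        offset += struct.unpack_from(">H", code, offset + 1)[0]
    return starts, offset
```

For the alternatives "ab", "cd" and "e" this yields branches starting at
offsets 3, 10 and 17, with the KET at offset 20 pointing back 20 bytes to the
BRA.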

If a subpattern is quantified such that it is permitted to match zero times, it
is preceded by one of OP_BRAZERO or OP_BRAMINZERO. These are single-byte
opcodes which tell the matcher that skipping this subpattern entirely is a
valid branch.

A subpattern with an indefinite maximum repetition is replicated in the
compiled data its minimum number of times (or once with a BRAZERO if the
minimum is zero), with the final copy terminating with a KETRMIN or KETRMAX as
appropriate.

A subpattern with a bounded maximum repetition is replicated in a nested
fashion up to the maximum number of times, with BRAZERO or BRAMINZERO before
each replication after the minimum, so that, for example, (abc){2,5} is
compiled as (abc)(abc)((abc)((abc)(abc)?)?)?. The 99 and 200 bracket limits do
not apply to these internally generated brackets.
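
The nesting can be reproduced at the pattern level with a few lines of Python.
This only illustrates the shape of the replication; PCRE performs it on the
compiled data, not on pattern text, and the function name expand_bounded is
hypothetical.

```python
def expand_bounded(group, minimum, maximum):
    """Rewrite group{minimum,maximum} as explicit copies: `minimum`
    mandatory copies followed by nested optional copies."""
    optional = ""
    for _ in range(maximum - minimum):
        if optional:
            optional = "(" + group + optional + ")?"
        else:
            optional = group + "?"  # innermost optional copy
    return group * minimum + optional


expanded = expand_bounded("(abc)", 2, 5)
```

Here expanded is the string (abc)(abc)((abc)((abc)(abc)?)?)?, the same form as
the example in the text.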


Assertions
----------

Forward assertions are just like other subpatterns, but starting with one of
the opcodes OP_ASSERT or OP_ASSERT_NOT. Backward assertions use the opcodes
OP_ASSERTBACK and OP_ASSERTBACK_NOT, and the first opcode inside the assertion
is OP_REVERSE, followed by a two byte count of the number of characters to move
back the pointer in the subject string. When operating in UTF-8 mode, the count
is a character count rather than a byte count. A separate count is present in
each alternative of a lookbehind assertion, allowing them to have different
fixed lengths.


Once-only subpatterns
---------------------

These are also just like other subpatterns, but they start with the opcode
OP_ONCE.


Conditional subpatterns
-----------------------

These are like other subpatterns, but they start with the opcode OP_COND. If
the condition is a back reference, this is stored at the start of the
subpattern using the opcode OP_CREF followed by two bytes containing the
reference number. Otherwise, a conditional subpattern will always start with
one of the assertions.


Changing options
----------------

If any of the /i, /m, or /s options are changed within a parenthesized group,
an OP_OPT opcode is compiled, followed by one byte containing the new settings
of these flags. If there are several alternatives in a group, there is an
occurrence of OP_OPT at the start of all those following the first options
change, to set appropriate options for the start of the alternative.
Immediately after the end of the group there is another such item to reset the
flags to their previous values. Other changes of flag within the pattern can be
handled entirely at compile time, and so do not cause anything to be put into
the compiled data.


Philip Hazel
August 2001
