array.regexp
Use regular expressions to process input
Description
With array.regexp, it's possible to use PERL-compatible regular expressions (PCRE) to match or make substitutions within arrays.
Discussion
The first argument is a symbol that sets the regular expression (RE), as does the re attribute in the left inlet. Subsequent arguments (or the substitute attribute) are treated as array values to be used in substitutions. Any other input in the left inlet is converted to an array object and treated as the subject "string".
The subject array is processed according to the re and substitute values. The middle outlet reports an array of the instances where the RE matched portions of the subject array. If the RE contains parentheses, they are treated as capture groups, and the center-left outlet puts them into a array upon every match within the subject array. If a 'substitute' array has been set, the leftmost output returns the subject array with necessary substitutions.
Note that, unlike other objects supporting regular expression in Max, which expect UTF-8 data, array.regexp will work with any byte buffer, including binary bufferrs containing 0s. Any incoming list or array is converted to a byte array (with 8-bit values from 0-255). If you intend to work with strings, you'll need to convert them to a list of bytes using atoi or similar before using array.regexp.
Arguments
expression[symbol]
optional
A regular expression may be used as an argument to set the regular expression (see above for regular expression formatting and metacharacter information).
substitution[array]
optional
An optional second argument will set the substitution array. As in other regexp objects, substitution arrays may contain capture groups; here they can be in the classic form (e.g. for the first capture group), or encoded as bytes ( = , = ).
Attributes
re[symbol]
The PCRE reference
is the best place to learn more about how regular expressions in Max are built, but here is a quick summary of the basics:
The word , followed by a PERL-compatible regular expression, sets the regular expression rules to be used when parsing or making substitutions within any symbol or list input.
If a regular expression contains spaces, it must be enclosed within double quotes when specified using the message or as a typed-in argument to the array.regexp object.
Regular expressions use the following form and syntax:
defines a 'class' of characters. any of the characters within it may be matched. several special symbols may also appear within it:
specifies a range (within ASCII codes)
specifies a decimal digit ( specifies a non-decimal digit). Note that double backslashes must be used -- Max erases single backslashes.
specifies white space ( specifies non-white space). Note that double backslashes must be used -- Max erases single backslashes.
specifies an alphanumeric ( specifies a nonalphanumeric). Note that double backslashes must be used -- Max erases single backslashes.
specifies a complement of
appears zero times
appears at least once
appears once or not at all
specifies a capture group that may be referred to in a substitution array such as , where is the position of the parenthesis in left-to-right order.
substitute[atom]
The word , followed by a list of integer values between 0 and 255, specifies an array to be used in substitutions. If the word is not followed by any values, the previous substitution symbol is removed. The word can also specify that no substitution should occur. The word indicates that substitution should occur, but that the matched values should be deleted from the final output array.
Note: If you need to output a followed by a number in any substitution array, you should use (encoded: ) , so that the is not interpreted as a capture group.
Common Box Attributes
annotation[symbol]
Sets the text that will be displayed in the Clue window when the user moves the mouse over the object.
background[int]: 0
Adds or removes the object from the patcher's background layer. adds the object to the background layer, removes it. Objects in the background layer are shown behind all objects in the default foreground layer.
color[4 floats]
Sets the color for the object box outline.
fontface[int]
Sets the type style used by the object. The options are:
plain
bold
italic
bold italic
Possible values:
0 = 'regular'
1 = 'bold'
2 = 'italic'
3 = 'bold italic'
fontname[symbol]
Sets the object's font.
fontsize[float]
Sets the object's font size (in points).
Possible values:
'8'
'9'
'10'
'11'
'12'
'13'
'14'
'16'
'18'
'20'
'24'
'30'
'36'
'48'
'64'
'72'
hint[symbol]
Sets the text that will be displayed in as a pop-up hint when the user moves the mouse over the object in a locked patcher.
ignoreclick[int]: 0
Toggles whether an object ignores mouse clicks in a locked patcher.
jspainterfile[symbol]
You can override the default appearance of a user interface object by assigning a JavaScript file with code for painting the object. The file must be in the search path.
patching_rect[4 floats]: 0. 0. 100. 0.
Aliases: patching_position, patching_size
Sets the position and size of the object in the patcher window.
position[2 floats]
write-only
Sets the object's x and y position in both patching and presentation modes (if the object belongs to its patcher's presentation), leaving its size unchanged.
presentation[int]: 0
Sets whether an object belongs to the patcher's presentation.
presentation_rect[4 floats]: 0. 0. 0. 0.
Aliases: presentation_position, presentation_size
Sets the x and y position and width and height of the object in the patcher's presentation, leaving its patching position unchanged.
rect[4 floats]
write-only
Sets the x and y position and width and height of the object in both patching and presentation modes (if the object belongs to its patcher's presentation).
size[2 floats]
write-only
Sets the object's width and height in both patching and presentation modes (if the object belongs to its patcher's presentation), leaving its position unchanged.
textcolor[4 floats]
Sets the color for the object's text in RGBA format.
textjustification[int]
Sets the justification for the object's text.
Possible values:
0 = 'left'
1 = 'center'
2 = 'right'
valuepopup[int]: 0
For objects with single values, enabling valuepopup will display the object's current value in a popup caption when the mouse is over the object or it is being changed with the mouse.
valuepopuplabel[int]: 0
Sets the source of a text label shown in a value popup caption.
Possible values:
0 = 'None'
1 = 'Hint'
2 = 'Scripting Name'
3 = 'Parameter Long Name'
4 = 'Parameter Short Name'
varname[symbol]
Sets the patcher's scripting name, which can be used to address the object by name in pattr, scripting messages to thispatcher, and the js object.
Messages
bang
Reprocess previously received array and trigger output.
int
Convert an incoming integer to an array, then process as described for the message.
- value
[int]
float
Convert an incoming floating-point number to an array, then process as described for the message.
- value
[float]
list
Convert an incoming list to an array, then process as described for the message.
- list-value
[list]
anything
Convert an incoming list to an array, then process as described for the message.
In the middle inlet, a symbol/list can be provided as a new re. In the right inlet, a symbol/list can be provided as a substitute.
- list-value
[list]
array
In the left inlet, an array will be processed as the subject array by the PCRE engine, using any re and substitute provided.
dictionary
TEXT_HERE
string
TEXT_HERE
Output
symbol or list
Out left outlet: If a substitution array has been set using the message, the input list or symbol is sent out the left outlet with any required substitutions).
Out center-left outlet: If the regular expression contains parentheses, they are treated as capture groups. The center-left outlet reports the capture groups upon every match within the subject array, and outputs them as a nested array; one array per subarrays match, each containing any captured arrays.
Out center outlet: The center outlet reports an array of subarrays where the regular expression matched portions of the subject array.
Out center-right outlet: The center-right outlet passes the original input, in the case that no match occurs.
Out right outlet: The rightmost outlet is used to report the offsets (start, end) of any subarray matches; that is, the offsets which can be used by array.slice to derive the subarray(s) from the input array. The offsets are provided as a nested array; one array with start and end offsets for each subarray match.
See Also
| Name | Description |
|---|---|
| fromsymbol | Convert a symbol into numbers/messages |
| key | Report keyboard presses |
| keyup | Report key information on release |
| message | Send any message |
| regexp | Use regular expressions to process input |
| spell | Convert input to UTF-8 (Unicode) codes |
| tosymbol | Convert messages, numbers, or lists to a single symbol |
| array | Create or duplicate an array object |
| string.regexp | Use regular expressions to process input |