The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Math::Formula::Type - variable types for Math::Formula

INHERITANCE

 Math::Formula::Type
   is a Math::Formula::Token

SYNOPSIS

  my $string = MF::STRING->new("example");
  my $answer = MF::INTEGER->new(42);

  # See more details per section

DESCRIPTION

This page describes all Types used by Math::Formula. The types are usually auto-detected. Operations are driven by the data-types.

METHODS

Constructors

Math::Formula::Type->new( $token|undef, [$value] )

The object is a blessed ARRAY. On the first spot is the $token. On the second spot might be the decoded value of the token, in internal Perl representation. When no $token is passed (value undef is explicit), then you MUST provide a $value. The token will be generated on request.

example:

  my $node = MF::BOOLEAN->new('true');    # value will be '1'
  my $node = MF::BOOLEAN->new(undef, 1);  # token will be 'true'
  my $node = MF::DATETIME->new(undef, DateTime->now);

MF::Formula::Type

The following methods and features are supported for any Type defined on this page.

All types can be converted into a string:

  "a" ~ 2          => STRING "a2"

All types may provide attributes (calls) for objects of that type. Those get inherited (of course). For instance:

   02:03:04.hour   => INTEGER 2
$obj->cast($type)

Type-convert a typed object into an object with a different type. Sometimes, the represented value changes a little bit, but usually not.

Any Type can be cast into a MF::STRING. Read the documentation for other types to see what they offer.

$obj->collapsed()

Returns the normalized version of the token(): leading and trailing blanks removed, intermediate sequences of blanks shortened to one blank.

$obj->token()

Returns the token in string form. This may be a piece of text as parsed from the expression string, or generated when the token is computed from the value.

$obj->value()

Where token() returns a string representation of the instance, the value() produces its translation into internal Perl values or objects, ready to be involved in computations.

MF::BOOLEAN, a truth value

Represents a truth value, either true or false.

Booleans implement the prefix operator "+", and infix operators 'and', 'or', and 'xor'.

The weird infix operator -> implements an 'if-then', which can also be used as substitute. When the left-side of the arrow is true, then the right side is returned.

When left of the -> is a possitive regular expression match =~, then the right side may can be a complex expression which may contains $1, 2 etc. The represent the captured values in the regexp.

. Example: for booleans

  true    false     # the only two values
  0                 # will be cast to false in boolean expressions
  42                # any other value is true, in boolean expressions

  not true               => BOOLEAN false
  true and false         => BOOLEAN false
  true  or false         => BOOLEAN true
  true xor false         => BOOLEAN true

  true  -> something        => something
  false -> something        => undef     # rule fails
  "abcd" =~ "^(.*)c" -> $1  => STRING "ab"

  filename =~ "\.([^.]+)$" -> "extension is " ~ $1

MF::STRING, contains text

Represents a sequence of UTF-8 characters, which may be used single and double quoted.

Strings may be cast into regular expressions (MF::REGEXP) when used on the right side of a regular expression match operator ('=~' and '!~').

Strings may be cast into a pattern (MF::PATTERN) when used on the right of a pattern match operator ('like' and 'unlike').

Besides the four match operators, strings can be concatenated using '~'.

Strings also implement textual comparison operators lt, le, eq, ne, ge, gt, and cmp. Read its section in Math::Formula. These comparisons use Unicode::Collate in an attempt to get correct UTF-8 sorting.

Warning! When you create an expression which is only one string, then you will probably forget the double quotes:

  Math::Formula->new(origin => 'Larry');      # wrong!!!
  Math::Formula->new(origin => '"Larry"');    # right
  Math::Formula->new(origin => "'Larry'");    # right
  Math::Formula->new(origin => $string);      # probably wrong
  Math::Formula->new(origin => "'$string'");  # probably wrong: can ' be in string?

  Math::Formula->new(origin => \"Larry");     # right
  Math::Formula->new(origin => \'Larry');     # right
  Math::Formula->new(origin => \$string);     # right

See also Math::Formula::Context::new(lead_expressions), for a different solution.

. Example: of strings

  "double quoted string"
  'single quoted string'   # alternative

String operations:

  "a" + 'b'           => STRING  "ab"
  "a" =~ "regexp"     => BOOLEAN, see MF::REGEXP
  "a" like "pattern"  => BOOLEAN, see MF::PATTERN

  "a" gt "b"          => BOOLEAN
  "a" cmp "b"         => INTEGER -1, 0, 1

Attributes:

   "abc".length       => INTEGER  3
   "".is_empty        => BOOLEAN true   # only white-space
   "ABC".lower        => STRING "abc", lower-case using utf8 folding

MF::INTEGER, a long whole number

Integers contain a sequence of ASCII digits, optionally followed by a multiplier. Numbers may use an underscore ('_') on the thousands, to be more readable. For instance, '42k' is equal to '42_000'.

Supported multipliers are

  • 1000-based k, M, G, T, E, and Z;

  • 1024-based kibi, Mibi, Gibi, Tibi, Eibi, and Zibi;

The current guaranteed value boundaries are ±2⁶³ which is about 9 Zeta, just below C(10¹)>.

Integers can be cast to booleans, where 0 means false and all other numbers are true.

Integers support prefix operators + and -.

Integers support infix operators +, -, *, % (modulo) which result in integers. Infix operator / returns a float. All numeric comparison operators return a boolean.

Integers implement the numeric sort operator <=>, which may be mixed with floats.

. Example: of integers

  42            # the answer to everything
  8T            # how my disk was sold to me
  7451Mibi      # what my system tells me the space is
  -12           # negatives
  1_234_567     # _ on the thousands, more readable

  + 2          => INTEGER   2      # prefix op
  - 2          -> INTEGER   -2     # prefix op
  - -2         -> INTEGER   2      # prefix op, negative int
  
  1 + 2        => INTEGER   3      # infix op
  5 - 9        -> INTEGER   -4     # infix op
  3 * 4        => INTEGER   12
  12 % 5       => INTEGER   2
  12 / 5       => FLOAT     2.4

  1 < 2        => BOOLEAN   true
  1 <=> 2      => INTEGER   -1     # -1, 0, 1

  not 0        => BOOLEAN   true
  not 42       => BOOLEAN   false

Attributes

  (-3).abs     -> INTEGER   3      # -3.abs == -(3.abs)

MF::FLOAT, floating-point numbers

Floating point numbers. Only a limited set of floating point syntaxes is permitted, see examples. Especially: a float SHALL contain a dot or 'e'. When it contains a dot, there must be a digit on both sides.

Floats support prefix operators + and -.

Floats support infix operators +, -, *, % (modulo), and / which result in floats. All numeric comparison operators are supported, also in combination with integers.

. Example: of floats

  0.0        # leading zero obligatory
  0e+0
  0.1234
  3e+10
  -12.345

  3.14 / 4
  2.7 < π        => BOOLEAN true
  2.12 <=> 4.89  => INTEGER -1    # -1, 0, 1

MF::DATETIME, refers to a moment of time

The datetime is a complex value. The ISO8601 specification offers many, many options which make interpretation expensive. To simplify access, one version is chosen. This is an example of that version:

  yyyy-mm-ddThh:mm:ss.sss+hhmm
  2013-02-20T15:04:12.231+0200

Mind the 'T' between the date and the time components. Second fractions are optional.

The time-zone is relative to UTC. The first two digits reflect an hour difference, the latter two are minutes.

DateTimes can be cast to a time or a date, with loss of information.

It is possible to add (+) and subtract (-) durations from a datetime, which result in a new datetime. When you subtract one datetime from another datetime, the result is a duration.

Compare a datetime with an other datetime numerically (implemented as text comparison). When the datetime is compared with a date, it is checked whether the point of time is within the specified date range (from 00:00:00 in the morning upto 23:59:61 at night).

. Example: for datetime

  2023-02-18T01:28:12
  2023-02-18T01:28:12.345
  2023-02-18T01:28:12+0300
  2023-02-18T01:28:12.345+0300

  2023-02-21T11:28:34 + P2Y3DT2H -> DATETIME  2025-02-24T13:28:34
  2023-02-21T11:28:34 - P2Y3DT2H -> DATETIME  2021-02-18T09:28:34
  2023-02-21T11:28:34 - 2021-02-18T09:28:34 -> DURATION P2Y3DT2H

Attributes: (the date and time attributes combined)

  date = 2006-11-21T12:23:34.56+0110
  dt.year     => INTEGER  2006
  dt.month    => INTEGER  11
  dt.day      => INTEGER  21
  dt.hour     => INTEGER  12
  dt.minute   => INTEGER  23
  dt.second   => INTEGER  34
  dt.fracsec  => FLOAT    34.56
  dt.timezone => TIMEZONE +0110
  dt.time     => TIME     12:23:34.56
  dt.date     => DATE     2006-11-21+0110

MF::DATE, refers to a day in some timezone

A date has format 'YYYY-MM-DD+TZ', for instance 2023-02-20+0100.

The date may be cast into a datetime, where the time is set to 00:00:00. This transformation is actually slightly problematic: a date means "anywhere during the day, where a datetime is a specific moment.

You may add (+) and subtract (-) durations from a date, which result in a new date. Those durations cannot contain a time component.

An subtract (-) from a date produces a duration.

You may also add a time to a date, forming a datetime. Those may be in different time-zones. You may also numerically compare dates, but only when they are not in the same timezone, this will return false.

. Example: for date

  1966-12-21        # without timezone, default from context
  1966-12-21+0200   # with explicit timezone info

  2023-02-21+0200 - P3D            -> DATE     2023-02-18+0200
  2012-03-08+0100 + 06:07:08+0200  -> DATETIME 2012-03-08T06:07:08+0300
  2023-02-26 - 2023-01-20          -> DURATION P1M6D
  2023-02-22 < 1966-04-05          -> BOOLEAN  false
  2023-02-22 <=> 1966-04-05        -> INTEGER 1      # -1, 0 1

  4 + 2000-10-20 -> INTEGER 1974  # parser accident repaired

Attributes:

  date = 2006-11-21+0700
  date.year      => INTEGER   2006
  date.month     => INTEGER   11
  date.day       => INTEGER   21
  date.timezone  => TIMEZONE  +0700

MF::TIME, a moment during any day

Useful to indicate a daily repeating event. For instance, start-backup: 04:00:12. Times do no have a timezone: it only gets a meaning when added to a (local) time.

Time supports numeric comparison. When you add (+) a (short) duration to a time, it will result in a new time (modulo 24 hours).

When you subtract (-) one time from another, you will get a duration modulo 24 hours. This does not take possible jumps to and from Daylight Savings time into account. When you care about that, than create a formula involving the actual date:

  bedtime = 23:00:00
  wakeup  = 06:30:00
  now     = #system.now
  sleep   = ((now+P1D).date + wakeup) - (now.date + bedtime)  #DURATION

. Example: for time

  12:00:34      # lunch-time, in context default time-zone
  23:59:61      # with max leap seconds
  09:11:11.111  # precise time (upto nanoseconds)

  12:00:34 + PT30M => TIME 12:30:34   # end of lunch
  12:00:34 - PT15M -> TIME 11:45:34   # round-up coworkers
  23:40:00 + PT7H  => TIME 06:40:00   # early rise
  07:00:00 - 23

  18:00:00 ==  17:00:00 => BOOLEAN
  18:00:00 <=> 17:00:00 => INTEGER

Attributes:

  time = 12:23:34.56
  time.hour        => INTEGER 12
  time.minute      => INTEGER 23
  time.second      => INTEGER 34
  time.fracsec     => FLOAT   34.56

MF::TIMEZONE, time-zone.

Clock difference to UTC (Zulu). The format requires a plus (+) or minus (-) followed by two digits representing hours and two digits for minutes.

Timezone supports numeric comparison. When you add (+) and subtract (-) a (short) duration to a timezone.

When you subtract (-) one timezone from another, you will get a duration.

. Example: for timezone

  +0000
  -0612

  -0600 + PT1H     -> -0500
  +0230 - PT3H30M  -> -0100

Attributes:

  tz = -1236
  tz.in_minutes    => INTEGER -756
  tz.in_seconds    => INTEGER -45360

MF::DURATION, a period of time

Durations are usually added to datetimes, and may be negative. They are formatted in ISO8601 standard format, which is a bit awkward, to say the least.

Durations can be added (+) together, subtracted (-) together, or multiplied by an integer factor. The prefix + and - are also supported.

Be warned that operations may not always lead to the expected results. A sum of 12 months will lead to 1 year, but 40 days will stay 40 days because the day length differs per month. This will only be resolved when the duration is added to an actual datetime.

Two durations can be compared numerically. However: a bit of care must be taken. Sometimes, it is only clear what the ordering is when relative to a specific datetime. For instance, which is longer: P1M or P30D? The current time ("now") is used as reference point. Otherwise, add some other datetime to both durations before comparison.

. Example: for duration

  P1Y2M5D          # duration one year, 2 months, 5 days
  PT1M             # mind the "T" before smaller than day values!
  P3MT5M           # confusing: 3 months + 5 minutes
  PT3H4M8.2S       # duration 3 hours, 4 minutes, just over 8 seconds

  - -P1Y           # prefix + and =
  P3Y2M + P1YT3M5S  => DURATION P4Y2MT3M5S
  P1Y2MT3H5M - P3Y8MT5H13M14S -> DURATION -P2Y6MT2H8M14S
  P1DT2H * 4        => DURATION P4DT8H
  4 * P1DT2H        => DURATION P4DT8H

  P1Y > P1M         => BOOLEAN true
  PT20M <=> PT19M   => INTEGER 1     # -1, 0, 1

Attributes

  P1Y.in_days       => INTEGER  365  (4yrs will have extra day)
  P1H.in_seconds    => INTEGER  3600
$obj->inSeconds()

Returns the duration in seconds, nanoseconds ignored. This is exact when the duration only contains minutes and seconds. When hours are involved, do you care about switching from and to Daylight Savings time? Months are taken as 1/12 of a year. A year is taken as 365.256 year. For exact calculations, add the duration to a DATETIME first.

MF::NAME, refers to something in the context

The Math::Formula::Context object contains translations for names to contextual objects. Names are the usual tokens: Unicode alpha-numeric characters and underscore (_), where the first character cannot be a digit.

On the right-side of a fragment indicator # or method indicator ., the name will be lookup in the features of the object on the left of that operator.

A name which is not right of a hash (#) or dot (.) can be cast into an object from the context.

Names are symbol: are not a value by themselves, so have no values to be ordered. However, they may exist however: test it with prefix operator exists.

The more complicated concept is the 'defaults to' operator (//). When there is no formula with the name on the left side, the right side is taken. This is often stacked with a constant default value at the end.

. Example: of names

  tic
  route66
  the_boss
  _42       # and '_' works as a character
  αβΩ       # Unicode alpha nums allowed

  7eleven   # not allowed: no start with number

  See "Math::Formula::Context" for the following
  #frag     # (prefix #) fragment of default object
  .method   # (prefix .) method on default object
  name#frag # fragment of object 'name'
  file.size # method 'size' on object 'file'

Attributes on names

  exists live     => BOOLEAN    # does formula 'live' exist?
  not exists live => BOOLEAN

  live // dead // false         # pick first rule which exists
Math::Formula::Type->validated($string, $where)

Create a MF::NAME from a $string which needs to be validated for being a valid name. The $where will be used in error messages when the $string is invalid.

MF::PATTERN, pattern matching

This type implements pattern matching for the like and unlike operators. The patterns are similar to file-system patterns. However, there is no special meaning to leading dots and '/'.

Pattern matching constructs are * (zero or more characters), ? (any single character), and [abcA-Z] (one of a, b, c, or capital), [!abc] (not a, b, or c). Besides, it supports curly alternatives like *.{jpg,gif,png}.

. Example: of patterns

  "abc" like "b"     => BOOLEAN false
  "abc" like "*b*"   => BOOLEAN false
  "abc" like "*c"    => BOOLEAN true

  "abc" unlike "b"   => BOOLEAN true
  "abc" unlike "*b*" => BOOLEAN true
  "abc" unlike "*c"  => BOOLEAN false

MF::REGEXP, Regular expression

This type implements regular expressions for the =~ and !~ operators.

The base of a regular expression is a single or double quote string. When the operators are detected, those will automatically be cast into a regexp.

. Example: of regular expressions

  "abc" =~ "b"       => BOOLEAN true
  "abc" =~ "c$"      => BOOLEAN true
  "abc" !~ "b"       => BOOLEAN false
  "abc" !~ "c$"      => BOOLEAN false
$obj->regexp()

Returns the regular expression as compiled object (qr).

MF::FRAGMENT, access to externally provided data

The used of this type is explained in Math::Formula::Context.

SEE ALSO

This module is part of Math-Formula distribution version 0.16, built on March 14, 2023. Website: http://perl.overmeer.net/CPAN/

LICENSE

Copyrights 2023 by [Mark Overmeer <markov@cpan.org>]. For other contributors see ChangeLog.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See http://dev.perl.org/licenses/