Support for the Hanoi Omega Automata (HOA) Format
Table of Contents
The Hanoi Omega-Automata format is a textual representation of ω-automata labeled by Boolean formulas over a set of atomic propositions, and using an arbitrary acceptance condition. The typical acceptance conditions like Büchi, generalized-Büchi, co-Büchi, Rabin, Streett, parity, … are all supported, but the main advantage of this format is that any arbitrary acceptance condition can be defined. The HOA format has support for many features such as non-determinism, alternation, multiple initial states, transition or state-based acceptance, named states, and a range of property flags making it possible to store additional information about the automaton.
The HOA format is already supported in several tools. The goal of this page is to detail the support of this format in Spot. It contains some information that are useful to better understand the behavior of the tools distributed by Spot, and it also looks at some lower-level, discussing details that are interesting when programming with Spot.
Spot can read files written using either version 1 or version 1.1 of
the HOA format. It currently outputs version 1 by default, but
version 1.1 can be requested from the command-line using option
-H1.1
. Future version of Spot are likely to switch to version 1.1
of HOA by default, so version 1 can already be requested explicitly
using -H1
. Note that version 1.1 is not yet published at the time
of writing, and that there are discussion about calling it version 2
instead.
Format, files, and TωA
Some note about the abbreviation first. We usually write "HOA format"
or occasionally HOAF to denote the format (as a specification), and
HOA or "HOA file" to denote an automaton in that format. In most
examples involving HOA files, we use *.hoa
as a filename extension
(even if the actual extension does not matter).
When an HOA file is loaded by Spot, it is stored into the
data-structure used by Spot to represent ω-Automata. This structure
is called Transition-based ω-Automaton, henceforth abbreviated TωA.
Such a TωA can be saved back as an HOA file. If you run a command
such as autfilt input.hoa >output.hoa
this is exactly what
happens: the file input.hoa
is parsed to create a TωA, and this TωA
is then printed in the HOA format into output.hoa
.
Since the TωA structure is not a perfect one-to-one representation of the HOA format, the output may not be exactly the same as the input.
Features of the HOA format with no or limited support in Spot
- Automata using explicit alphabet (introduced in version 1.1 of the
format via
Alphabet:
) are not supported. The maximum number of acceptance sets used is limited to 32.
In the past, this limitation has forced us to improve some of our algorithms to be less wasteful and not introduce useless acceptance sets.
This hard-coded limit can be augmented at configure time using option
--enable-max-accsets=N
, but doing so will consume more memory and time.Fin(!x)
andInf(!x)
are rewritten away.Internally Spot only deals with acceptance conditions involving the primitives
Fin(x)
orInf(x)
. When the parser encounters the variantsFin(!x)
orInf(!x)
, it automatically complements the setx
so that the resulting acceptance uses onlyFin(x)
andInf(x)
. For instanceFin(0)&Inf(!1)
gets rewritten intoFin(0)&Inf(1)
and the membership of all transitions to the set1
is reversed.If
x
was already used without complementation in another primitive, then a new set has to be created. For instance the acceptanceInf(0)&Inf(!0)
can only be fixed by adding a new set,1
, that stores the complement of set0
, and usingInf(0)&Inf(1)
.Multiple (or missing) initial states are emulated.
The internal TωA representation used by Spot supports only a single initial state. To make it possible to still process HOA files that declare multiple initial states, Spot's automaton parser will automatically transform the parsed automaton into an equivalent TωA by merging the initial states into a single initial state that has no incoming edge. This can be achieved in two ways:
- If one of the original initial states has no incoming edge, it can serve as the new initial state. The parse simply has to duplicate the edges leaving the other initial states onto this one. (As an optimization, if the other initial state alsos do not have any incoming edge, the source of its outgoing edges are changed to avoid copying them.)
- If all of the original initial states has incoming edges, a new state is created to serve as the new initial states, with a copy of all the outgoing edges of the original initial states.
This process is done in such a way that the states are not renumbered: states are kept even if they become useless after the above change, and if a new state is added, it is added after all declared states.
This fusing of initial states may have puzzling consequences if you are unaware of it:
- An automaton with multiple initial states is considered
non-deterministic and can be declared as so in the HOA 1.1 format.
However, fusing its initial states could potentially turn in into a
deterministic automaton if the only nondeterminism was in the
choice of the initial state. For this reason Spot's parser will
keep the prop_universal() set to
maybe()
when multiple initial states are declared, regardless of any determinism property specified in the HOA file. - Similarly, fusing all initial states into an existing one can change the prop_complete() property of the automaton.
- Finally, if an automaton that declares state-based acceptance, and uses different acceptance marks on their initial states. Only one of these acceptance marks will be kept on the single initial state. Remember that the new initial state has no incoming edge, so it cannot be part of any loop and its acceptance mark is actually irrelevant.
Similarly, when an automaton with no initial state is loaded (this includes the case where the automaton has no state), a disconnected initial state is added. As a consequence, Spot's HOA output always contains at least one state, even when the input had no state.
Internal representations of some features
In this section we discuss features of the format that are fully supported, but in a way that so people could find unexpected. These design choices do not affect the semantics of the HOA format in any way.
State-based vs. transition-based acceptance
A Transition-based ω-Automaton (TωA), as its name implies, uses transition-based acceptance sets. Each edge is stored as a quadruplet \((s,d,\ell,F)\) where \(s\) and \(d\) are the source and destination state numbers, \(\ell\) is a Binary Decision Diagram (BDD) representing the Boolean function labeling the edge, and \(F\) is a bit-vector representing the membership of the transition to each declared acceptance set.
States are just numbers, and may not belong to any accepting set. When reading a HOA file that use state-based acceptance (or even a mix of state-based and transitions-based acceptance), all the acceptance are pushed onto the outgoing transitions.
So an automaton represented as an HOA file with this transition structure:
/* state-based acceptance */ State: 0 {0 1} [0&!1] 0 [0&1] 1 [!0] 2 /* mixed state- and transition-based acceptance */ State: 1 {0} [0] 1 {1} [0&1] 2 /* transition-based acceptance */ State: 2 [!0] 1 {0} [0] 2 {1}
will always be stored as a TωA with this transition structure:
State: 0 [0&!1] 0 {0 1} [0&1] 1 {0 1} [!0] 2 {0 1} State: 1 [0] 1 {0 1} [0&1] 2 {0} State: 2 [!0] 1 {0} [0] 2 {1}
Even if an input HOA file uses only state-based acceptance, Spot internally stores it using transition-based acceptance. However, in that case the TωA will have a property flag indicating that it actually represents an automaton with the "state-based acceptance" property: this implies that transitions leaving one state all belong to the same acceptance sets. A couple of algorithms in Spot checks for this property, and enable specialized treatments of state-based automata.
Furthermore, even if an automaton does not have the "state-based acceptance" property flag set, the HOA output routine may detect that the automaton satisfies this property. In that case, it outputs the automaton with state-based acceptance.
For instance in the following automaton, the outgoing transitions of each state belong to the same sets:
cat >sba.hoa <<EOF_HOA HOA: v1 States: 3 Start: 0 AP: 2 "a" "b" Acceptance: 2 Inf(0)&Inf(1) --BODY-- State: 0 {0} [0&!1] 0 [0&1] 1 [!0] 2 State: 1 [0] 1 {1} [0&1] 2 {1} State: 2 [!0] 1 {0 1} [0] 2 {0 1} --END-- EOF_HOA autfilt sba.hoa
so the HOA output of autfilt
automatically uses state-based acceptance:
HOA: v1 States: 3 Start: 0 AP: 2 "a" "b" acc-name: generalized-Buchi 2 Acceptance: 2 Inf(0)&Inf(1) properties: trans-labels explicit-labels state-acc --BODY-- State: 0 {0} [0&!1] 0 [0&1] 1 [!0] 2 State: 1 {1} [0] 1 [0&1] 2 State: 2 {0 1} [!0] 1 [0] 2 --END--
The rationale for this automatic switch to state-based acceptance is as follows:
- Tools that support transition-based acceptance can easily see state-based acceptance as syntactic sugar, so they should be able to process state-based or transition-based acceptance indifferently.
- Tools that support only state-based acceptance, cannot easily process automata with transition-based acceptance. So by using state-based acceptance whenever possible, we are making these automata compatible with a larger number of tools.
- Using state-based acceptance is slightly more space efficient, because there is less redundancy in the output file.
Nevertheless, should you really insist on having an output with
transition-based acceptance, you can do so by passing the option t
to the HOA printer:
autfilt -Ht sba.hoa
HOA: v1 States: 3 Start: 0 AP: 2 "a" "b" acc-name: generalized-Buchi 2 Acceptance: 2 Inf(0)&Inf(1) properties: trans-labels explicit-labels trans-acc --BODY-- State: 0 [0&!1] 0 {0} [0&1] 1 {0} [!0] 2 {0} State: 1 [0] 1 {1} [0&1] 2 {1} State: 2 [!0] 1 {0 1} [0] 2 {0 1} --END--
By default, the output uses either state-based acceptance, or
transition-based acceptance. However, there is no restriction in the
format to prevents mixing the two: if you use -Hm
, the decision of
using state or transition-based acceptance will be made for each state
separately. For instance:
ltl2tgba -Hm 'GFa | Fb'
HOA: v1 name: "F(b | GFa)" States: 3 Start: 0 AP: 2 "b" "a" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels complete stutter-invariant --BODY-- State: 0 [!0] 0 [0] 1 [!0&1] 2 State: 1 {0} [t] 1 State: 2 [!1] 2 [1] 2 {0} --END--
So far we have discussed transforming state-based acceptance into transition-based acceptance (this can be seen as removing syntactic sugar), and representing transition-based acceptance into state-based acceptance when this is possible (adding syntactic sugar) to do so without adding states.
It is also possible to transform automata with transition-based
acceptance into automata with state-based acceptance, adding states
when necessary. Most tools have a -S
option (or
--state-based-acceptance
) for this purpose. Compare the following
output with the previous one.
ltl2tgba -S -Hm 'GFa | Fb'
HOA: v1 name: "F(b | GFa)" States: 4 Start: 0 AP: 2 "b" "a" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels complete stutter-invariant --BODY-- State: 0 [!0] 0 [0] 1 [!0&1] 2 State: 1 {0} [t] 1 State: 2 {0} [1] 2 [!1] 3 State: 3 [1] 2 [!1] 3 --END--
Generic acceptance
Currently, Spot's parser for HOA ignores the optional acc-name:
line, and only uses the mandatory Acceptance:
line. As explained
earlier, if this line contains primitives of the form Inf(!x)
or
Fin(!x)
these are rewritten away, because internally Spot only works
with primitives of the form Inf(x)
or Fin(x)
. This also means
that Spot will never produce an acceptance condition containing
Fin(!x)
or Inf(!x)
.
Whenever an HOA file is output, Spot attempts to recognize the
acceptance condition to give it a suitable acc-name:
(even if Spot
does not use this line, it is useful to tools that only deal with one
specific acceptance condition and that do not want to parse the
Acceptance:
line). However, the HOA printer has no idea of
what type of automata you are trying to output: it is only looking at
the acceptance condition and trying to name it as precisely as
possible. This could be a problem when a given condition accepts
multiple names.
For instance according to the canonical encodings specified by the HOA
format the condition Inf(0)
could be called Buchi
, or
generalized-Buchi 1
, or (why not?) parity min even 1
or parity
max even 1
. Spot will always call this acceptance condition Buchi
.
Similarly, the acceptance condition t
is always called all
(not
generalized-Buchi 0
or Rabin 0
, etc.), and while f
is always
named none
.
One of the consequence is that when you run ltl2tgba
with its
default settings (which are to produce automata with transition-based
generalized Büchi acceptance) you actually obtain an output that:
- has an
Acceptance:
line that is a conjunction ofInf(x)
primitives (ort
), because that is what generalized Büchi is; - has an
acc-name:
line that can be eithergeneralized-Buchi n
(for \(n>1\)) orBuchi
(corresponding to \(n=1\)) orall
(corresponding to \(n=0\)).
The use of Buchi
or all
instead of generalized-Buchi n
follow
the same idea as our use of state-based acceptance whenever possible.
By using the name of these inferior acceptance conditions, we hope
that the resulting automaton can be easier to use with tools that only
deal with such inferior acceptance conditions. However, unlike for
state vs. transition-based acceptance, there is currently no means to
request another acceptance name to be used.
The canonical encodings for acceptance conditions are specified quite
strictly in the HOA format. For instance generalized-Buchi 2
corresponds to Inf(0)&Inf(1)
, not to Inf(1)&Inf(0)
, even though
the two formulas are equivalent. Spot's HOA output routine contains
some limited form of equivalence check (based mostly on associativity
and commutativity of the Boolean operators), so that if it detects
such a simple inversion, it will output it in the order required to be
allowed to name the acceptance condition.
In the following example, you can see autfilt
removing the duplicate
Rabin pair, and reordering the remaining pair to fit the syntax
corresponding to Rabin 1
.
autfilt <<EOF HOA: v1 States: 3 Start: 0 AP: 2 "a" "b" Acceptance: 2 (Inf(1) & Fin(0)) | (Inf(1) & Fin(0)) --BODY-- State: 0 {0} [0&!1] 0 [0&1] 1 [!0] 2 State: 1 [0] 1 {1} [0&1] 2 {1} State: 2 [!0] 1 {0 1} [0] 2 {0 1} --END-- EOF
HOA: v1 States: 3 Start: 0 AP: 2 "a" "b" acc-name: Rabin 1 Acceptance: 2 Fin(0) & Inf(1) properties: trans-labels explicit-labels state-acc --BODY-- State: 0 {0} [0&!1] 0 [0&1] 1 [!0] 2 State: 1 {1} [0] 1 [0&1] 2 State: 2 {0 1} [!0] 1 [0] 2 --END--
Internally, the acceptance condition is stored as an array in reverse
polish notation, and the primitives Inf
and Fin
are actually
parametered by bitsets representing multiple sets numbers. For
instance the generalized Büchi acceptance
Inf(0)&Inf(1)&Inf(2)&Inf(3)
is actually stored as a single term
Inf({0,1,2,3})
. Similarly, Fin({1,3,5})
is our internal encoding
for Fin(1)|Fin(3)|Fin(5)
.
A more complex acceptance condition, such as
(Fin(0)&Inf(1))|(Fin(2)&Inf(3)&Inf(4))|Fin(5)
(a generalized-Rabin
acceptance), would be encoded as the following 8-element array.
Fin({5}) Inf({3,4}) Fin({2}) 2& Inf({1}) Fin(0) 2& 3|
This has to be read as a reverse Polish notation where the numbers in
front of the operators &
and |
indicate the number of arguments
they consume (these operators are n-ary).
When you look at an acceptance condition output by Spot, you can
actually spot the terms that have been grouped together internally by
looking at the spacing around operators &
and |
. For instance:
randaut -A"Fin(0)|Fin(1)|Fin(2)&Fin(3)&Inf(4)&Inf(5)" 0 | grep Acceptance:
Acceptance: 6 (Fin(0)|Fin(1)) | (Fin(2) & Fin(3) & (Inf(4)&Inf(5)))
Here Fin(0)|Fin(1)
is actually a single internal term Fin({0,1})
,
and likewise for Inf(4)&Inf(5)
.
State-based vs. transition-based labels
State labels are handled in the same way as state-based acceptance: Spot store labels on transitions internally, so if an input automaton has state labels, those are pushed to all outgoing transitions.
For instance an automaton declared in some HOA file with this body:
State: [0&1] 0 0 1 2 State: [!0&1] 1 {0} 0 1 State: [!1] 2 2 1
will always be stored as an automaton with the following transition structure
State: 0 [0&1] 0 [0&1] 1 [0&1] 2 State: 1 [!0&1] 0 {0} [!0&1] 1 {0} State: 2 [!1] 2 [!1] 1
The HOA printer has an option to output automata using state-based
labels whenever that is possible. The option is named k
(i.e., use
-Hk
with command-line tools) because it is useful when the HOA file
is used to describe a Kripke structure.
Property flags
The HOA
format supports a number of optional property:
tokens.
These properties can be useful to speedup certain algorithms: for
instance it is easier to complement a deterministic automaton that is
known to be inherently weak.
Spot stores the properties that matters to its algorithms as additional bits attached to each automaton. Currently, the HOA parser ignores all the properties that are unused by Spot.
Some of the supported properties are double-checked when the automaton
is parsed; this is for instance the case of deterministic
,
state-based
. The parser will in fact infer these properties from the
body of the file, and then return and error if what has been declared
does not correspond to the reality.
Some supported properties (like weak
, inherently-weak
,
very-weak
, terminal
, unambiguous
, semi-deterministic
, or
stutter-invariant
) are not double-checked, because that would
require more operations. Command-line tools that read HOA files all
take a --trust-hoa=no
option to ignore properties that are not
double-checked by the parser.
It should be noted that each property can take three values: true,
false, or maybe. So actually two bits are used per property. For
instance if in some algorithm you want to know whether an automaton is
complete (the equivalent of calling autfilt -q
--is-complete aut.hoa
from the command-line), you should not
call the method aut->prop_complete()
because that only checks
the property bits, and it might return maybe
even if aut
is
deterministic. Instead, call the function is_complete(aut)
.
This function will first test the property bits, and do the actual
check in case it is unknown.
Algorithms that update a TωA should call the method prop_keep()
and
use the argument to specify which of the properties they preserve.
Algorithms that input a TωA and output a new one may call the method
prop_copy()
to copy over the subset of properties they preserve.
Using these two functions ensure that in the future, whenever a new
property is added to the TωA class, we cannot forget to update all the
calls prop_copy()
or prop_keep()
(because these functions will
take a new argument).
The HOA
printer also tries to not bloat the output with many
redundant and useless properties. For instance deterministic
automata are necessarily unambiguous
, and people interested in
unambiguous automata know that, so Spot only outputs the unambiguous
property if an unambiguous automaton is non-deterministic. Similarly,
while Spot may produce alternating automata, it does not output
the no-univ-branch
property because we cannot think of a situation
where this would be useful. This decision can be overridden by
passing the -Hv
(or --hoa=v
) option to the command-line tools:
this requests "verbose" properties.
The following table summarizes how supported properties are handled. In particular:
- For the parser,
checked
means that the property is always inferred and checked against any declaration (if present),trusted
means that the property will be stored without being checked (unless--trust-hoa=no
is specified). - Stored properties are those represented as bits in the automaton.
- The printer will sometime check some properties when it can do it as part of its initial "survey scan" of the automaton; in that case the stored property is not used. This makes it possible to detect deterministic automata that have been output by algorithms that do not try to output deterministic automata.
property |
parser |
stored |
printer |
notes |
---|---|---|---|---|
state-labels |
checked | no | checked if -Hk |
state labels are converted to transition labels when reading TωA |
trans-labels |
checked | no | always, unless -Hi or -Hk |
|
implicit-labels |
checked | no | if -Hi |
-Hi only works for deterministic automata |
explicit-labels |
checked | no | always, unless -Hi |
|
state-acc |
checked | yes | checked, unless -Ht or -Hm |
|
trans-acc |
checked | no | if not state-acc and not -Hm |
|
no-univ-branch |
ignored | no | only if -Hv |
|
univ-branch |
checked | no | checked | |
deterministic |
checked | yes | checked | |
complete |
checked | yes | checked | |
unambiguous |
trusted | yes | as stored if (-Hv or not deterministic ) |
can be checked with --check=unambiguous |
semi-deterministic |
trusted | yes | as stored if (-Hv or not deterministic ) |
can be checked with --check=semi-deterministic |
stutter-invariant |
trusted | yes | as stored | can be checked with --check=stuttering |
stutter-sensitive |
trusted | yes | as stored (opposite of stutter-invariant ) |
can be checked with --check=stuttering |
terminal |
trusted | yes | as stored | can be checked with --check=strength |
very-weak |
trusted | yes | as stored if (-Hv or not terminal ) |
can be checked with --check=strength |
weak |
trusted | yes | as stored if (-Hv or not (terminal or very-weak )) |
can be checked with --check=strength |
inherently-weak |
trusted | yes | as stored if (-Hv or not weak ) |
can be checked with --check=strength |
colored |
ignored | no | checked |
The above table is for version 1 of the format. When version 1.1 is
selected (using -H1.1
), some negated properties may be output. In
particular, stutter-sensitive
is replaced by !stutter-invariant
.
The logic of not cluttering the output with all of !terminal
,
!weak
, and !inherently-weak
is similar to the positive versions:
!inherently-weak
implies !weak
which in turn implies !terminal
,
so only one of those is emitted unless -Hv
is used.
Named properties
In addition to the bit properties discussed above, a TωA can carry named properties of any type. When attaching a property to a TωA, you only supply a name for the property, a pointer, and an optional destructor function.
There are currently three named properties related to the HOA format.
automaton-name
- Is a string that stores the name of the automaton (the one given after
name:
in the HOA format) state-names
- Is a vector of strings that stores the name of the states (in case states are named in the HOA format)
aliases
- Is a vector of pairs (name, BDD) that declares aliases to use in the HOA format
You can see these properties being preserved when an automaton is read and then immediately output:
cat >hw.hoa <<EOF HOA: v1 name: "hello world!" States: 3 Start: 0 AP: 2 "a" "b" Acceptance: 2 Inf(0)&Inf(1) Alias: @x 0&1 --BODY-- State: 0 {0} [0&!1] 0 [@x] 1 [!0] 2 State: 1 "I am a state" [0] 1 {1} [@x] 2 {1} State: 2 "so am I" [!0] 1 {0 1} [0] 2 {0 1} --END-- EOF autfilt hw.hoa
HOA: v1 name: "hello world!" States: 3 Start: 0 AP: 2 "a" "b" acc-name: generalized-Buchi 2 Acceptance: 2 Inf(0)&Inf(1) properties: trans-labels explicit-labels state-acc Alias: @x 0&1 --BODY-- State: 0 {0} [0&!1] 0 [@x] 1 [!0] 2 State: 1 "I am a state" {1} [@x | 0&!1] 1 [@x] 2 State: 2 "so am I" {0 1} [!0] 1 [@x | 0&!1] 2 --END--
However, when Spot performs some transformation, and actually has to
construct a new automaton, those properties will not be quarried over
to the new automaton. First because it is not obvious that the new
automaton should have the same name, and second because if a new
automaton is created, there might not be clear correspondence between
the old states and the new ones. autfilt
tries to preserve aliases
by reintroducing them to the automaton before it prints it (unless
option --aliases=drop
is used).
Here is for instance the result when autfilt
is instructed to
simplify the automaton:
autfilt --small hw.hoa
HOA: v1 States: 3 Start: 0 AP: 2 "a" "b" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels state-acc deterministic Alias: @x 0&1 --BODY-- State: 0 [0&!1] 0 [@x] 1 [!0] 2 State: 1 [0&!1] 1 [@x] 2 State: 2 {0} [!0] 1 [@x | 0&!1] 2 --END--
Note that if the name of the automaton is important to you, it can be
fixed via the --name
option. For instance --name=%M
will
construct the new name by simply copying the one of the original
automaton.
autfilt --small hw.hoa --name=%M
HOA: v1 name: "hello world!" States: 3 Start: 0 AP: 2 "a" "b" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels state-acc deterministic Alias: @x 0&1 --BODY-- State: 0 [0&!1] 0 [@x] 1 [!0] 2 State: 1 [0&!1] 1 [@x] 2 State: 2 {0} [!0] 1 [@x | 0&!1] 2 --END--
The page about common output options for automata has a section
showing how --name
can be used to construct complex pipelines with
automata that preserve their equivalent LTL formula in the name:
field.
Streaming support
The HOA format has been designed to easily allow multiple automata to be concatenated together (in the same file, or in a pipe) and processed in batch. Spot's parser supports this scenario and can be called repeatedly to read the next automaton from the input stream.
For instance the following creates 3 formulas of the form \(\bigwedge_i
\mathsf{G}\mathsf{F} p_i\), translates those into Büchi automata output
in the HOA format, and then read those automata with autfilt
to
randomize the order of their transitions and states before printing
them in HOA format.
genltl --and-gf=1..3 | ltl2tgba -B | autfilt --randomize
HOA: v1 name: "GFp1" States: 2 Start: 1 AP: 1 "p1" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels state-acc complete properties: deterministic stutter-invariant --BODY-- State: 0 [0] 1 [!0] 0 State: 1 {0} [!0] 0 [0] 1 --END-- HOA: v1 name: "G(Fp1 & Fp2)" States: 3 Start: 1 AP: 2 "p1" "p2" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels state-acc complete properties: deterministic stutter-invariant --BODY-- State: 0 [0] 1 [!0] 0 State: 1 {0} [0&1] 1 [!1] 2 [!0&1] 0 State: 2 [!1] 2 [0&1] 1 [!0&1] 0 --END-- HOA: v1 name: "G(Fp1 & Fp2 & Fp3)" States: 4 Start: 1 AP: 3 "p1" "p2" "p3" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels state-acc complete properties: deterministic stutter-invariant --BODY-- State: 0 [!0] 0 [0] 1 State: 1 {0} [!2] 3 [!1&2] 2 [0&1&2] 1 [!0&1&2] 0 State: 2 [!1] 2 [!0&1] 0 [0&1] 1 State: 3 [0&1&2] 1 [!1&2] 2 [!0&1&2] 0 [!2] 3 --END--
It should be noted that the HOA parser is less efficient when it reads
from a pipe than when it reads from a file. The reason is that if two
commands A | B
exchange automata via a pipe, the command A
may
require some time between the output of two automata, so if B
does a
block read on its input to fill the parser's buffer, it might not be
able to process any automaton before A
has produced enough automata
to fill the input buffer of B
. To avoid this delay, whenever B
detects that the input is a pipe (or a terminal), it switches to an
interactive mode, where the input is read one character at a time.
This way an automaton can be processed by B
as soon as its --END--
has been received.
The HOA format has support for a --ABORT--
token, that can be used
by tools that produce automata in a stream to cancel the current one.
This makes sense for instance when the automaton is constructed
on-the-fly, while it is being output. This scenario does not occur in
Spot (automata are constructed before they are output), so it does not
emit --ABORT--
. However, the input parser is fully aware of this
token. Tools like autfilt
will diagnose aborted automata in the
input, and continue processing with the next automaton. The Python
bindings for the HOA parser can be configured in two modes: skip
aborted automata, or flag them as errors.
Error recovery
The HOA parser does a fair amount of error recovery. It is important
that when parsing a stream of automata, a syntax error in one
automaton does not invalidate the following automata (the parser
should at least be able to ignore everything up to --END--
if it
cannot recover before).
Another scenario where syntax errors are more frequent is when an HOA file is hand-edited. For instance one could edit an HOA file to add a few states and transitions, and forget to update the total number of states in the format. In that case the parser will diagnose the problem, and fix the number of states.
Checked properties
When an automaton is output in HOA format, the property:
lines
includes property registered into the automaton (see the Property
bits section above), and property that are trivial to compute.
Command-line tools with a HOA output all have a --check
option that
can be used to request additional checks such as testing whether the
automaton is stutter-invariant, unambiguous, (inherently) weak, and
terminal.
Extensions
Highlighting states and edges
Spot supports two additional headers that are not part of the standard
HOA format. These are spot.highlight.states
and
spot.highlight.edges
. These are used to decorate states and edges
with colors.
cat >decorate.hoa <<EOF HOA: v1.1 States: 3 Start: 1 AP: 2 "a" "b" Acceptance: 0 t spot.highlight.states: 1 0 2 3 spot.highlight.edges: 1 1 2 2 --BODY-- State: 0 [t] 0 /* edge #1 */ State: 1 [t] 2 /* edge #2 */ State: 2 [1] 0 /* edge #3 */ [0&!1] 2 /* edge #4 */ --END-- EOF autfilt decorate.hoa -d'.#'
On the above example, we call autfilt
with option -d#
to display
edges numbers, which helps identify the edges to highlight. The
headers spot.highlight.states:
and spot.highlight.edges:
are both
followed by a list of alternating state/edges numbers and color numbers.
So in the above file,
spot.highlight.states: 1 0 2 3 spot.highlight.edges: 1 1 2 2
specifies that states #1
should have color 0
, state #2
should have
color 3
, edge #1
should have color 1
, and edge #2
should have
color 2
.
State numbers obviously correspond to the state numbers used in the HOA file, and are 0-based. Edge numbers are 1-based (because that is how they are actually stored in Spot), and numbered in the order they appear in the HOA file.
The color palette is currently the same that is used for coloring acceptance sets. This might change in the future.
The automaton parser will not complain if these headers are used in
some HOA: v1
file, even if v1
disallows dots in header names.
However, the automaton printer is more rigorous and will only output
these lines when version 1.1 is selected.
Compare:
autfilt -H1 decorate.hoa; echo
autfilt -H1.1 decorate.hoa
HOA: v1 States: 3 Start: 1 AP: 2 "a" "b" acc-name: all Acceptance: 0 t properties: trans-labels explicit-labels state-acc deterministic weak --BODY-- State: 0 [t] 0 State: 1 [t] 2 State: 2 [1] 0 [0&!1] 2 --END-- HOA: v1.1 States: 3 Start: 1 AP: 2 "a" "b" acc-name: all Acceptance: 0 t properties: trans-labels explicit-labels state-acc !complete properties: deterministic weak spot.highlight.states: 1 0 2 3 spot.highlight.edges: 1 1 2 2 --BODY-- State: 0 [t] 0 State: 1 [t] 2 State: 2 [1] 0 [0&!1] 2 --END--
Sample words
When the --check=stutter-sensitive-example
option is used, and when
a stutter-sensitive automaton is output, two sample words are added to
the HOA output as a proof that the automaton is stutter-sensitive.
One of these words is accepted, the other is rejected, and the two are
stutter-equivalent (i.e., they differ only by some stuttering).
The headers are called spot.accepted-word
and spot.rejected-word
if HOA v1.1 is selected. However since these are also useful to
third-party tools, we also output them as spot-accepted-word
and
spot-rejected-word
in HOA v1.
ltl2tgba --check=stutter-sensitive-example -H1 Xa; echo
ltl2tgba --check=stutter-sensitive-example -H1.1 Xa
HOA: v1 name: "Xa" States: 3 Start: 1 AP: 1 "a" acc-name: all Acceptance: 0 t properties: trans-labels explicit-labels state-acc deterministic properties: stutter-sensitive terminal very-weak spot-accepted-word: "!a; cycle{a}" spot-rejected-word: "!a; !a; cycle{a}" --BODY-- State: 0 [0] 2 State: 1 [t] 0 State: 2 [t] 2 --END-- HOA: v1.1 name: "Xa" States: 3 Start: 1 AP: 1 "a" acc-name: all Acceptance: 0 t properties: trans-labels explicit-labels state-acc !complete properties: deterministic !stutter-invariant terminal very-weak spot.accepted-word: "!a; cycle{a}" spot.rejected-word: "!a; !a; cycle{a}" --BODY-- State: 0 [0] 2 State: 1 [t] 0 State: 2 [t] 2 --END--
Arenas for two-player games
An automaton can be seen as a two-player game by simply annotating
states with number representing the player that should play in this
state. We use an extension to the HOA format to transmit this
information: the spot-state-player:
head (or spot.state-player:
when
using the HOA 1.1 version) is followed by a string of 0
or
1
, one per state of the automaton, representing the player's number
associated to each state.
For instance in the following output of ltlsynt
, the sequence
0 0 0 0 1 1 1 1 1 1
indicates the players owning each of the
10 states.
ltlsynt --ins=a --outs=b -f 'Ga <-> Gb' --print-game-hoa
HOA: v1 States: 10 Start: 3 AP: 2 "a" "b" acc-name: Buchi Acceptance: 1 Inf(0) properties: trans-labels explicit-labels state-acc complete properties: deterministic spot-state-player: 0 0 0 0 1 1 1 1 1 1 controllable-AP: 1 --BODY-- State: 0 {0} [t] 4 State: 1 [t] 5 State: 2 [!0] 6 [0] 7 State: 3 {0} [!0] 8 [0] 9 State: 4 {0} [t] 0 State: 5 [!1] 0 [1] 1 State: 6 [t] 0 State: 7 [t] 2 State: 8 {0} [!1] 0 [1] 1 State: 9 {0} [!1] 2 [1] 3 --END--
When converted to dot, states from player 1 have a diamond shape:
ltlsynt --ins=a --outs=b -f 'Ga <-> Gb' --print-game-hoa | autfilt --dot