This is a straightforward generalization of the classical Pumping Lemma for regular languages. It was proved independently by the Warsaw team and by Filippo Bonchi, Daniela Petrisan and Alexandra Silva.
For a definition of an automaton with atoms, see our paper Automata theory in nominal sets.
Lemma. For any language recognizable by an equivariant nondeterministic automaton with atoms, there exists a number
such that for every word
of length at least
there exist:
- a decomposition
, and
- an atom bijection
,
such that
,
,
- for every
, the word
is in
.
Proof. The argument is completely analogous to the classical case. Let be a nondeterministic automaton that recognizing
, and let
be the number of orbits of states in
. For a fixed
with
, take any accepting run of
on
. After some
steps, the run visits a state
which shares the orbit with some state visited before (say, after
steps, and denote the state
). Decompose
so that and
.
Pick an atom bijection so that
; it exists since
and
are in the same orbit of states. Since
on the word
can move from the state
to the state
, by equivariance, on the word
it can move from
to
. By induction, on the word
can move from
to
. Again by equivariance,
on
can move from
to an accepting state, since it can move to an accepting state from
.
In this way we have constructed an accepting run of on
Remark 1. The lemma works for any alphabet and any kind of atoms, or even for a straightforward generalization of automata to arbitrary -sets for a group
, without any assumptions regarding finite supports.
Remark 2. The lemma is not particularly strong. For example, over the alphabet of equality atoms, the language of words where every letter appears at most once:
satisfies the lemma, but is not recognized by a nondeterministic automaton. A few basic questions arise:
- Does the lemma work for alternating automata?
- Is there an easy pumping lemma for nondeterministic automata that would exclude
above?
- Is there an easy pumping lemma that would hold for deterministic automata and exclude standard examples of nondeterministic automata that do not determinize?
Edit 1 [April 5, 2014]: The answer to Question 1 above is negative. A counterexample is the language of those words of even length, where
- all atoms in the first half are equal, and
- all atoms in the second half are distinct.
This looks a bit like the language, but is actually recognizable by an alternating automaton with atoms. (This was shown by Tomasz Wysocki in his MSc thesis, unfortunately written in Polish.) Clearly, our pumping pemma fails for this language.