This seems to also be tied to problems in having
the TaskManager register. I have to repeatedly
restart the TaskManager until it finally connects to
the Job Manager. Most times it doesn't connect and
doesn't complain making the determination of the
root cause more difficult. The cluster is not busy
and I have tried both with IP addresses and host
names to determine if name resolution issues were
the cause, but both situations are the same.
I have also noticed that if 2 job managers are
launched on different nodes in the same cluster,
they both come back with logging indicating that
they are the leader so they are not talking to
each other effectively and the logging is not even
indicating that they are even attempting to talk
with one another.