python-driver tests/integration: set `skip_wait_for_gossip_to

To speed up the boot sequence of the Scylla nodes, we set skip_wait_for_gossip_to_settle=0, the same setting we have been using for quite a while in dtest on almost all tests.

Also introduced wait_other_notice=True in the places where the cluster is started, because without it a test can start while the cluster isn't fully up and ready.

This change shaves 1h off the integration test run, which now finishes in 28min.

Feb 22 '24 15:02 fruch
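For readers unfamiliar with the two settings named in the description, here is a minimal sketch of how they might be applied when starting a test cluster. It assumes the `cluster` argument exposes a ccmlib-style interface (`set_configuration_options(values=...)` and `start(...)`); the helper name `start_cluster_fast` is hypothetical and is not taken from this PR's diff.

```python
def start_cluster_fast(cluster):
    """Start a ccm-style cluster without waiting for gossip to settle.

    Assumption: `cluster` behaves like ccmlib's Cluster object.
    `start_cluster_fast` is an illustrative name, not part of the PR.
    """
    # Skip the gossip-settle wait during node boot (the option this PR sets).
    cluster.set_configuration_options(
        values={"skip_wait_for_gossip_to_settle": 0}
    )
    # Block until the other nodes have noticed each started node, so a test
    # does not begin against a cluster that is only partially up.
    cluster.start(wait_other_notice=True)
    return cluster
```

The two settings work as a pair in this sketch: the first removes a fixed wait at boot, and the second replaces it with an explicit readiness check.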

Interesting, I remember that I did try to do this at one point, but got a lot of failures. Maybe I just made some mistake when running the tests.

Feb 23 '24 21:02 Lorak-mmk

> Interesting, I remember that I did try to do this at one point, but got a lot of failures. Maybe I just made some mistake when running the tests.

It depends on when you tried it; we (mostly @nyh) did a lot of fine-tuning in ccm to support this case correctly. While trying to figure out why that UDT test was failing, it was annoying to wait that long for cluster creation.

Feb 24 '24 19:02 fruch

I think we can merge it after CI passes

Feb 27 '24 17:02 Lorak-mmk

> I think we can merge it after CI passes

One of the integration suites was stuck for 5h, so I'm running it all again:

tests/integration/standard/test_metadata.py ss...s.............x...s.s.. [ 15%]
s...s.ss.s...x.s.x.....sssssssssss...ss.s....s.s...ss                    [ 20%]
Error: The operation was canceled.

I'm not sure whether it's connected to this change or not. We'll need more reruns, and maybe more debug output enabled in CI, to figure this one out.

Feb 28 '24 06:02 fruch
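As one possible form of the "more debug in CI" mentioned above, the standard library's `faulthandler` module can dump every thread's traceback when a run exceeds a time budget, which shows where a stuck test is blocked. This is a minimal sketch; the helper names and the timeout value are hypothetical and not part of this PR.

```python
import faulthandler
import sys


def arm_hang_watchdog(timeout_s, stream=sys.stderr):
    """Dump all thread tracebacks to `stream` if still running after timeout_s.

    `stream` must be backed by a real file descriptor. The process is not
    killed (exit=False); the dump is purely diagnostic.
    """
    faulthandler.dump_traceback_later(
        timeout_s, repeat=False, file=stream, exit=False
    )


def disarm_hang_watchdog():
    """Cancel a pending dump, e.g. once the test has finished in time."""
    faulthandler.cancel_dump_traceback_later()
```

A test fixture could call `arm_hang_watchdog(600)` in setup and `disarm_hang_watchdog()` in teardown, so only tests that overrun leave a traceback in the CI log.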

> > I think we can merge it after CI passes
>
> One of the integration suites was stuck for 5h, so I'm running it all again:
> tests/integration/standard/test_metadata.py ss...s.............x...s.s.. [ 15%]
> s...s.ss.s...x.s.x.....sssssssssss...ss.s....s.s...ss                    [ 20%]
> Error: The operation was canceled.
> I'm not sure whether it's connected to this change or not. We'll need more reruns, and maybe more debug output enabled in CI, to figure this one out.

It is also getting stuck in other runs that are not from this PR: https://github.com/scylladb/python-driver/actions/runs/8076169015/job/22064206623

tests/integration/standard/test_metadata.py ss...s.............x...s.s.. [ 15%]
s...s.ss.s...x.s.x.....sssssssssss...ss.s....s.s...ss                    [ 20%]
Error: The operation was canceled.

Feb 28 '24 13:02 fruch

From the logs it's clear that test_connection_error is the one getting stuck; it's still not clear why.

I've also seen that test_connection_honor_cluster_port leaves a trail of sessions behind, which keep trying to reconnect to a cluster that isn't there anymore.

Feb 28 '24 22:02 fruch
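For context on the leftover sessions described above, one common way to keep a test from leaving reconnecting sessions behind is to always shut the driver's cluster object down when the test ends. This is a minimal sketch, assuming `cluster` exposes `connect()` and `shutdown()` as the driver's `Cluster` class does; the `connected` helper is hypothetical and is not the fix applied in this PR.

```python
from contextlib import contextmanager


@contextmanager
def connected(cluster):
    """Yield a session and always shut the cluster object down afterwards.

    Assumption: `cluster` has connect() and shutdown() methods.
    shutdown() runs even if connect() or the test body raises.
    """
    try:
        yield cluster.connect()
    finally:
        cluster.shutdown()
```

A test would then use `with connected(cluster) as session:` so that the shutdown runs on both the success and the failure path.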

> From the logs it's clear that test_connection_error is the one getting stuck; it's still not clear why.
>
> I've also seen that test_connection_honor_cluster_port leaves a trail of sessions behind, which keep trying to reconnect to a cluster that isn't there anymore.

Are the problems in those tests caused by this PR? If not then I think we can merge this

Apr 29 '24 13:04 Lorak-mmk

> > From the logs it's clear that test_connection_error is the one getting stuck; it's still not clear why.
> >
> > I've also seen that test_connection_honor_cluster_port leaves a trail of sessions behind, which keep trying to reconnect to a cluster that isn't there anymore.
>
> Are the problems in those tests caused by this PR? If not then I think we can merge this

I didn't find any connection to this change

Apr 30 '24 06:04 fruch

Looks like all tests are passing now, aren't they?

Apr 30 '24 14:04 roydahan

tests/integration: set `skip_wait_for_gossip_to_settle=0`