Daniel Sanche issues

Results 34 issues of


                                            Daniel Sanche

Add alert on dependency updates

We [recently had an issue where a terraform dependency updated and broke our released code](https://github.com/GoogleCloudPlatform/cloud-ops-sandbox/pull/864). One of the brainstormed solutions to this problem would be to create an alert to...

type: feature request

priority: p2

Ensure dependency bot is tracking terraform dependencies

[We recently had an issue where a terraform dependency updated and broke our released code](https://github.com/GoogleCloudPlatform/cloud-ops-sandbox/pull/864). I believe this repo should have a bot to catch these issues and auto-update dependencies...

priority: p2

type: process

pin terraform provider versions

Make sure we pin specific versions, so upstream updates don't break sandboxes

type: bug

priority: p1

lang: terraform

Add test that deploys through the Cloud Shell UI

We have E2E tests that build and test our custom cloud shell image in Docker. Real users use the Cloud Shell UI to deploy our image, which often has errors...

type: feature request

priority: p1

Automate checks against cloud shell

We sometimes run into issues where the Cloud Shell environment can't be provisioned (https://github.com/GoogleCloudPlatform/cloud-ops-sandbox/issues/583). It would be useful to have checks in place so we can catch these early, instead...

type: feature request

priority: p3

Connecting to existing project creates duplicate monitoring resources

If the ` Test Using Existing Project` e2e test is moved above monitoring tests, it will fail, due to multiple resources being created. If we can integrate the tfstate files,...

type: feature request

priority: p3

feat: add an authentication layer in front of public services

There have been reports of the Loadgenerator being hit by bots, trying to modify the url field to find a potential log4j opening. The lodgenerator is not vulnerable to log4j,...

type: feature request

priority: p3

define and track uptime SLOs for sandbox project

Cloud Ops Sandbox is a live service that users are starting to depend on. We should consider tracking up time metrics, and possibly define SLOs around them

type: process

priority: p3

Upgrading sandbox doesn't update cluster code

When we install sandbox over top of an existing sandbox, we expect the old sandbox to be updated to use the new code. Instead, the cluster seems to continue to...

type: bug

priority: p3

Add tests to make sure services are healthy

We want to make sure fresh sandboxes don't have any errors. Either error logs, or failing SLOs

type: feature request

priority: p3