Skip to content

Fill in us-west-1 H100 capacity reservation IDs#582

Open
huydhn wants to merge 1 commit into
gh/huydhn/26/basefrom
gh/huydhn/26/head
Open

Fill in us-west-1 H100 capacity reservation IDs#582
huydhn wants to merge 1 commit into
gh/huydhn/26/basefrom
gh/huydhn/26/head

Conversation

@huydhn
Copy link
Copy Markdown
Contributor

@huydhn huydhn commented May 16, 2026

Stack from ghstack (oldest at bottom):

Replaces the empty placeholder with the actual reservations:

cr-04d3d1d84e127a562 — 2 × p5.48xlarge (16 H100 GPUs)
cr-09a53051589034fb8 — 4 × p5.48xlarge (32 H100 GPUs)

Total: 6 reserved nodes, 48 H100 GPUs in us-west-1.

Still pending before deploy: the nodepools generator change that
reads this override from clusters.yaml. Until that lands, the def
file's hardcoded us-east-2 reservation IDs would take effect, which
is wrong for us-west-1.

[ghstack-poisoned]
@github-actions
Copy link
Copy Markdown

tofu plan — arc-cbr-production

❌ Plan failed · commit 17750a2d · run log

Plan output
Installed 1 package in 2ms
{
    "BucketArn": "arn:aws:s3:::ciforge-tfstate-arc-cbr-prod",
    "BucketRegion": "us-west-2",
    "AccessPointAlias": false
}
━━━ PLAN: Base (arc-cbr-production) ━━━
There are some problems with the CLI configuration:
╷
│ Error: The specified plugin cache dir /home/runner/work/ci-infra/ci-infra/osdc/.terraform.d/plugin-cache cannot be opened: stat /home/runner/work/ci-infra/ci-infra/osdc/.terraform.d/plugin-cache: no such file or directory
│
╵

As a result of the above problems, OpenTofu may not behave as intended.



Error: Error acquiring the state lock

Error message: operation error DynamoDB: PutItem, https response error
StatusCode: 400, RequestID:
1HNK2G9FRMK7GOTG7KOOAN0RSNVV4KQNSO5AEMVJF66Q9ASUAAJG,
ConditionalCheckFailedException: The conditional request failed
Lock Info:
  ID:        9afa6aa7-2b21-0927-795a-dc94ac249b14
  Path:      ciforge-tfstate-arc-cbr-prod/arc-cbr-production/base/terraform.tfstate
  Operation: OperationTypePlan
  Who:       runner@runnervmrw5os
  Version:   1.7.10
  Created:   2026-05-16 02:44:09.95301431 +0000 UTC
  Info:      


OpenTofu acquires a state lock to protect the state from being written
by multiple users at the same time. Please resolve the issue above and try
again. For most commands, you can disable locking with the "-lock=false"
flag, but this is not recommended.
error: recipe `plan` failed with exit code 1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant