x/build/cmd/makemac: Mac VMs are failing to start due to "Invalid configuration" #23859

Closed
@bradfitz

Description

@bradfitz

Half of our Mac VMs aren't running.

On our Mac VMware cluster, on the Linux VM that runs x/build/cmd/makemac in a systemd unit:

# journalctl -f -u makemac
...
Feb 15 14:19:06 godns makemac[24540]: $ govc device.usb.add -vm mac_10_11_host10a
Feb 15 14:19:07 godns makemac[24540]: $ govc vm.disk.attach -vm mac_10_11_host10a -link=true -persist=false -ds=Pure1-1 -disk osx_11_frozen/osx_11_frozen.vmdk
Feb 15 14:19:07 godns makemac[24540]: $ govc vm.destroy mac_10_11_host10a
Feb 15 14:19:08 godns makemac[24540]: 2018/02/15 14:19:08 Error creating 10.11: govc vm.disk.attach ...: exit status 1, govc: Invalid configuration for device '0'.
Feb 15 14:19:13 godns makemac[24540]: 2018/02/15 14:19:13 Have capacity for 8 more Mac VMs; creating requested 10.10 ...
Feb 15 14:19:14 godns makemac[24540]: $ govc vm.create -m 4096 -c 6 -on=false -net dvPortGroup-Private -g darwin14_64Guest -ds BOOT_8 mac_10_10_host08a
Feb 15 14:19:16 godns makemac[24540]: $ govc vm.change -e smc.present=TRUE -e ich7m.present=TRUE -e firmware=efi -e guestinfo.key-darwin-amd64-10_10=xx -e guestinfo.name=mac_10_10_host08a -vm mac_10_10_host08a
Feb 15 14:19:17 godns makemac[24540]: $ govc device.usb.add -vm mac_10_10_host08a
Feb 15 14:19:18 godns makemac[24540]: $ govc vm.disk.attach -vm mac_10_10_host08a -link=true -persist=false -ds=Pure1-1 -disk osx_10_frozen/osx_10_frozen.vmdk
Feb 15 14:19:18 godns makemac[24540]: $ govc vm.destroy mac_10_10_host08a
Feb 15 14:19:18 godns makemac[24540]: 2018/02/15 14:19:18 Error creating 10.10: govc vm.disk.attach ...: exit status 1, govc: Invalid configuration for device '0'.
...

Notice all the "govc: Invalid configuration for device '0'." errors.

Why did this start failing? This has been running unmodified for about 18 months.

Investigate.
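As a starting point for that investigation, a small helper can tally the "Error creating" lines by macOS version and failing govc subcommand, to confirm whether every failure is the same vm.disk.attach error. This is only a sketch: tallyFailures and the regular expression are hypothetical and assume the log format shown above.

```go
package main

import (
	"bufio"
	"fmt"
	"os"
	"regexp"
)

// errLine matches makemac failure lines such as
// "Error creating 10.11: govc vm.disk.attach ...: exit status 1, ...".
var errLine = regexp.MustCompile(`Error creating (\S+): govc (\S+) `)

// tallyFailures counts failures per "version subcommand" pair,
// for example "10.11 vm.disk.attach".
func tallyFailures(lines []string) map[string]int {
	counts := make(map[string]int)
	for _, line := range lines {
		if m := errLine.FindStringSubmatch(line); m != nil {
			counts[m[1]+" "+m[2]]++
		}
	}
	return counts
}

func main() {
	// Reads log lines from standard input, e.g. piped from journalctl -u makemac.
	var lines []string
	sc := bufio.NewScanner(os.Stdin)
	for sc.Scan() {
		lines = append(lines, sc.Text())
	}
	if err := sc.Err(); err != nil {
		fmt.Fprintln(os.Stderr, "reading input:", err)
		os.Exit(1)
	}
	for key, n := range tallyFailures(lines) {
		fmt.Printf("%s: %d\n", key, n)
	}
}
```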

/cc @andybons @mdempsky @aclements

Activity

added this to the Unreleased milestone on Feb 15, 2018
added the Builders label (x/build issues: builders, bots, dashboards) on Feb 15, 2018
self-assigned this on Feb 15, 2018
bradfitz

bradfitz commented on Feb 15, 2018

@bradfitz
Contributor, Author

Original bug report I should've used was #23856.

gopherbot

gopherbot commented on Feb 15, 2018

@gopherbot
Contributor

Change https://golang.org/cl/94601 mentions this issue: dashboard: disable Mac trybots for now

gopherbot

gopherbot commented on Feb 21, 2018

@gopherbot
Contributor

Change https://golang.org/cl/95735 mentions this issue: dashboard: adjust how many Mac VMs we expect

bradfitz

bradfitz commented on Feb 21, 2018

@bradfitz
Contributor, Author

Logged in and poked around. It seems our vSphere/vCenter/vWhatever crapped itself and ran out of disk space for something and then went downhill fast into a weird state.

The MacStadium folk are cleaning it up.

bradfitz

bradfitz commented on Feb 21, 2018

@bradfitz
Contributor, Author

MacStadium said they fixed something, but I still see 5 alerts.

But upon poking around more, I found that 4 of our 10 physical nodes had lost their connections to the shared NFS datastore. I had to manually remount those:

[Screenshot: 2018-02-21 at 11:23:30 AM]

No clue why they became unmounted or why manual action was required to repair it.

But it all seems to be working again, even with VMware still alerting about stuff:

[Screenshot: 2018-02-21 at 11:27:40 AM]

I'm following up with MacStadium about that. (https://portal.macstadium.com/tickets/47331)

/cc @andybons

bradfitz

bradfitz commented on Feb 21, 2018

@bradfitz
Contributor, Author

And I see all 20 back up & connected.

I'll re-enable trybots.

ianlancetaylor

ianlancetaylor commented on Nov 29, 2018

@ianlancetaylor
Contributor

Seems like this issue is fixed, so closing.

locked and limited conversation to collaborators on Nov 29, 2019