x/build: s390x builder not receiving/detecting new builds #27182

Closed
@billotosyr

Description

Please answer these questions before submitting your issue. Thanks!

What version of Go are you using (go version)?

master tip (1.12)

Does this issue reproduce with the latest release?

yes

What operating system and processor architecture are you using (go env)?

GOARCH="s390x"
GOBIN=""
GOEXE=""
GOHOSTARCH="s390x"
GOHOSTOS="linux"
GOOS="linux"
GOPATH=""
GORACE=""
GOROOT="/data/golang/go"
GOTOOLDIR="/data/golang/go/pkg/tool/linux_s390x"
CC="gcc"
GOGCCFLAGS="-fPIC -m64 -march=z196 -pthread -fmessage-length=0 -fdebug-prefix-map=/tmp/go-build637051903=/tmp/go-build -gno-record-gcc-switches"
CXX="g++"
CGO_ENABLED="1"

What did you do?

loaded the build dashboard

What did you expect to see?

s390x building with all the new CLs

What did you see instead?

Since yesterday (Aug. 22) it's been silent. The buildbot is still running normally (as far as I can tell) but no builds are happening. Suggestions?

Activity

billotosyr

billotosyr commented on Aug 24, 2018

@billotosyr
Author
changed the title from "build: s390x builder not receiving/detecting new builds" to "x/build: s390x builder not receiving/detecting new builds" on Aug 24, 2018
added this to the Unreleased milestone on Aug 24, 2018
added the Builders label (x/build issues (builders, bots, dashboards)) on Aug 24, 2018
added the NeedsInvestigation label (Someone must examine and confirm this is a valid issue and not a duplicate of an existing one.) on Aug 24, 2018
dmitshur

dmitshur commented on Aug 24, 2018

@dmitshur
Member

I see 23 entries of linux-s390x-ibm under active builds. They all started between 46 and 51 hours ago and are stuck on this step:

waiting_machine_in_use 

That might (or might not) be related.

dmitshur

dmitshur commented on Aug 24, 2018

@dmitshur
Member

> The buildbot is still running normally (as far as I can tell) but no builds are happening. Suggestions?

@billotosyr @mundaym Can you try restarting it (the buildlet process)? Perhaps it's running but disconnected, and hasn't detected that.

You can also try updating to the latest version (i.e., go get -u golang.org/x/build/cmd/buildlet).

billotosyr

billotosyr commented on Aug 24, 2018

@billotosyr
Author

Restarted the buildlet process. It doesn't seem to have had any effect.

billotosyr

billotosyr commented on Aug 24, 2018

@billotosyr
Author

I'm pretty sure it's not a version thing, as the buildlet was working up until a benign, unrelated CL yesterday ("skip TestGcSys on Windows").
Our machine had run out of space, we fixed it, and it started building again, but somehow the queue is too big perhaps? It just seems clogged. Any thoughts on getting it unclogged?

dmitshur

dmitshur commented on Aug 24, 2018

@dmitshur
Member

> Our machine had run out of space, we fixed it, and it started building again, but somehow the queue is too big perhaps? It just seems clogged. Any thoughts on getting it unclogged?

Great to hear that it's running again!

Do you know if the machine ran out of space due to the Go builder work, or was it for unrelated reasons?

About the queue, after talking to @bradfitz about it, I don't think there's an existing way to clear it out. Sorry about that. We have an open bug for a scheduler (#19178) and that would be helpful here (e.g., prioritize more recent commits). Let's keep an eye on how the builder runs, and I'll investigate what else can be done to help.

billotosyr

billotosyr commented on Aug 27, 2018

@billotosyr
Author

Specifically, it's the build cache that is filling up the drive. My current workaround is to run
'go clean -cache'
as root, frequently. Perhaps I will have to turn off the build cache.

bradfitz

bradfitz commented on Aug 27, 2018

@bradfitz
Contributor

I thought the buildlet wiped its work directory environment per run?

billotosyr

billotosyr commented on Aug 29, 2018

@billotosyr
Author

Our buildlet doesn't do that. Maybe because it's an older version.

bradfitz

bradfitz commented on Aug 29, 2018

@bradfitz
Contributor

No, I was just misremembering. But if that would help your environment and the wrapper process that keeps the buildlet alive, we could add it to the buildlet.

billotosyr

billotosyr commented on Aug 29, 2018

@billotosyr
Author

Well, I think it would help us, yes. The build cache is consuming about 2GB per day, and the drive it's on is small enough that I am now cleaning it daily. (This is a new thing, as I never had to clean it before.)

gopherbot

gopherbot commented on Oct 25, 2018

@gopherbot
Contributor

Change https://golang.org/cl/144637 mentions this issue: cmd/buildlet: set up & clean TMPDIR and GOCACHE for child processes

bradfitz

bradfitz commented on Oct 25, 2018

@bradfitz
Contributor

Okay, the new s390x buildlet binary is pushed.

I'll watch it for any new problems.

It'll keep itself cleaned for new stuff, but you might have to clean legacy messes.

billotosyr

billotosyr commented on Oct 25, 2018

@billotosyr
Author
bradfitz

bradfitz commented on Oct 25, 2018

@bradfitz
Contributor

Hmm, weird.

I'll debug in a few hours.

gopherbot

gopherbot commented on Oct 26, 2018

@gopherbot
Contributor

Change https://golang.org/cl/144858 mentions this issue: cmd/buildlet: restore tmp/gocache dirs after recursive delete of world

added a commit that references this issue on Oct 26, 2018
57258c5
locked and limited conversation to collaborators on Oct 26, 2019
Metadata

Assignees: No one assigned
Labels: Builders, FrozenDueToAge, NeedsInvestigation
Participants: @bradfitz, @dmitshur, @gopherbot, @billotosyr