building on illumos gets stuck occasionally #35261

Closed
@jclulow

While looking into #35085, I've noticed the builder get stuck a couple of times. Most recently, I dug in to see where it was stuck.

The last build action:

2019/10/31 01:59:08 [0xc0000b8580] Running /var/tmp/workdir-host-illumos-amd64-jclulow/go/src/make.bash with args ["/var/tmp/workdir-host-illumos-amd64-jclulow/go/src/make.bash"] and env ["GOMAXPROCS=4" "TMPDIR=/var/tmp" "USER=gobuild" "GOROOT_BOOTSTRAP=/var/tmp/workdir-host-illumos-amd64-jclulow/go1.4" "SMF_FMRI=svc:/site/buildlet:default" "A__z=\"*SHLVL" "PATH=/usr/bin:/usr/sbin:/sbin:/opt/local/bin:/opt/local/sbin:/opt/go/bootstrap/bin" "GO_BUILDER_ENV=host-illumos-amd64-jclulow" "SMF_RESTARTER=svc:/system/svc/restarter:default" "PWD=/home/gobuild" "LANG=en_US.UTF-8" "TZ=UTC" "SMF_ZONENAME=09a56fd3-94eb-ee7b-d03c-9bfd8c9b7619" "SHLVL=2" "HOME=/home/gobuild" "LOGNAME=gobuild" "SMF_METHOD=start" "_=/usr/bin/ctrun" "GO_STAGE0_NET_DELAY=400ms" "GO_STAGE0_DL_DELAY=200ms" "WORKDIR=/var/tmp/workdir-host-illumos-amd64-jclulow" "GO_BUILDER_NAME=illumos-amd64" "GOBIN=" "TMPDIR=/var/tmp/workdir-host-illumos-amd64-jclulow/tmp" "GOCACHE=/var/tmp/workdir-host-illumos-amd64-jclulow/gocache" "GOROOT_BOOTSTRAP=/opt/go/bootstrap"] in dir /var/tmp/workdir-host-illumos-amd64-jclulow/go/src
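
Note that the logged environment lists TMPDIR and GOROOT_BOOTSTRAP twice, with different values. The os/exec documentation says that when Cmd.Env contains duplicate keys, only the last value for each key is used. Assuming the buildlet hands this list to os/exec unchanged (an assumption, not something the log shows), the later entries are the ones make.bash sees. A minimal sketch of that last-wins rule, where `effectiveEnv` is a hypothetical helper and not part of the buildlet:

```go
package main

import (
	"fmt"
	"strings"
)

// effectiveEnv collapses a KEY=VALUE list so that later entries override
// earlier ones, mirroring the rule os/exec documents for duplicate keys.
// Hypothetical helper for reading the log; not buildlet code.
func effectiveEnv(env []string) map[string]string {
	out := make(map[string]string)
	for _, kv := range env {
		i := strings.IndexByte(kv, '=')
		if i < 0 {
			continue // not in KEY=VALUE form; skipped in this sketch
		}
		out[kv[:i]] = kv[i+1:]
	}
	return out
}

func main() {
	// The duplicated entries, in the order they appear in the log line.
	env := []string{
		"TMPDIR=/var/tmp",
		"GOROOT_BOOTSTRAP=/var/tmp/workdir-host-illumos-amd64-jclulow/go1.4",
		"TMPDIR=/var/tmp/workdir-host-illumos-amd64-jclulow/tmp",
		"GOROOT_BOOTSTRAP=/opt/go/bootstrap",
	}
	eff := effectiveEnv(env)
	fmt.Println("TMPDIR =", eff["TMPDIR"])
	fmt.Println("GOROOT_BOOTSTRAP =", eff["GOROOT_BOOTSTRAP"])
}
```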

This has been running for some time, roughly 51 minutes going by the log timestamp above:

[root@gobuild3 ~]# date -uR
Thu, 31 Oct 2019 02:50:26 +0000
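
For reference, the gap between the log line and the `date -uR` output can be worked out as below. This is a sketch that assumes the log timestamp is in UTC (the logged environment has TZ=UTC, but the log line itself carries no zone); `elapsed` and `logLayout` are names made up for the example.

```go
package main

import (
	"fmt"
	"time"
)

// logLayout matches the "2019/10/31 01:59:08" prefix of the build log line.
const logLayout = "2006/01/02 15:04:05"

// elapsed returns the time between a log timestamp (ASSUMED to be UTC)
// and a timestamp in the format printed by `date -uR`.
func elapsed(logStamp, dateOutput string) (time.Duration, error) {
	start, err := time.ParseInLocation(logLayout, logStamp, time.UTC)
	if err != nil {
		return 0, err
	}
	now, err := time.Parse(time.RFC1123Z, dateOutput)
	if err != nil {
		return 0, err
	}
	return now.Sub(start), nil
}

func main() {
	d, err := elapsed("2019/10/31 01:59:08", "Thu, 31 Oct 2019 02:50:26 +0000")
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Println(d) // prints 51m18s
}
```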

The process tree:

      77885 /opt/go/build/bin/stage0
        77886 ./buildlet.exe --halt=false --reverse-type=host-illumos-amd64-jclulow --coo
          77887 bash /var/tmp/workdir-host-illumos-amd64-jclulow/go/src/make.bash
            78001 ./cmd/dist/dist bootstrap -a
              78401 /var/tmp/workdir-host-illumos-amd64-jclulow/go/pkg/tool/illumos_amd64
[root@gobuild3 ~]# pargs 78401
78401:	/var/tmp/workdir-host-illumos-amd64-jclulow/go/pkg/tool/illumos_amd64/go_bootst
argv[0]: /var/tmp/workdir-host-illumos-amd64-jclulow/go/pkg/tool/illumos_amd64/go_bootstrap
argv[1]: install
argv[2]: -gcflags=all=
argv[3]: -ldflags=all=
argv[4]: -i
argv[5]: cmd/asm
argv[6]: cmd/cgo
argv[7]: cmd/compile
argv[8]: cmd/link

One OS thread (thread 1 below) is possibly stuck in the allocator:

--------------------- thread# 1 / lwp# 1 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affbb0) + 13
 fffffc7fef275ef8 sem_wait (affbb0) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 0000000000438881 runtime.exitsyscall0 () + 111
 000000000045b5e4 runtime.mcall () + 64
 0000000000416c05 runtime.(*mcache).refill () + 85
 000000000040b7e7 runtime.(*mcache).nextFree () + 87
 000000000040c123 runtime.mallocgc () + 793
 0000000000444fdc runtime.makeslice () + 6c
 00000000004eacc3 bytes.makeSlice () + 73
 00000000004ea60b bytes.(*Buffer).grow () + 15b
 00000000004eaab8 bytes.(*Buffer).ReadFrom () + 48
 000000000049a87c io.copyBuffer () + 2fc
 00000000004fa653 os/exec.(*Cmd).writerDescriptor.func1 () + 63
 00000000004fa6d7 os/exec.(*Cmd).Start.func1 () + 27
 000000000045d5e1 runtime.goexit () + 1
--------------------- thread# 2 / lwp# 2 ---------------------
 fffffc7fef2905f7 lwp_park (0, fffffc7fee7ffdc0, 0)
 fffffc7fef2828b8 sema_reltimedwait (affdb0, c000038358) + 28
 fffffc7fef275fdb sem_reltimedwait_np (affdb0, c000038358) + 4b
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a84d runtime.notetsleep_internal () + 10d
 000000000040aa08 runtime.notetsleep () + 58
 000000000043b93f runtime.sysmon () + 3bf
 00000000004338d3 runtime.mstart1 () + c3
 00000000004337f6 runtime.mstart () + 66
 000000000045efa2 runtime.tstart_sysvicall () + 42
 fffffc7fef2905b0 _lwp_start ()
--------------------- thread# 3 / lwp# 3 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affc30) + 13
 fffffc7fef275ef8 sem_wait (affc30) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 000000000041b7ff runtime.gcBgMarkWorker () + ff
 000000000045d5e1 runtime.goexit () + 1
--------------------- thread# 4 / lwp# 4 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affbf0) + 13
 fffffc7fef275ef8 sem_wait (affbf0) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 000000000041b7ff runtime.gcBgMarkWorker () + ff
 000000000045d5e1 runtime.goexit () + 1
--------------------- thread# 5 / lwp# 5 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affc70) + 13
 fffffc7fef275ef8 sem_wait (affc70) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
--------------------- thread# 6 / lwp# 6 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affcb0) + 13
 fffffc7fef275ef8 sem_wait (affcb0) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 0000000000421753 runtime.bgscavenge () + 3b3
 000000000045d5e1 runtime.goexit () + 1
--------------------- thread# 7 / lwp# 7 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affcf0) + 13
 fffffc7fef275ef8 sem_wait (affcf0) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 0000000000421e81 runtime.bgsweep () + 131
 000000000045d5e1 runtime.goexit () + 1
--------------------- thread# 8 / lwp# 8 ---------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affd30) + 13
 fffffc7fef275ef8 sem_wait (affd30) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 0000000000438881 runtime.exitsyscall0 () + 111
 000000000045b5e4 runtime.mcall () + 64
 0000000000000000 ???????? ()
 0000c000142e5700 ???????? ()
--------------------- thread# 9 / lwp# 9 ---------------------
 fffffc7fef296a6a portfs   (6, 4, fffffc7fea1ff200, 80, 1, 0)
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000042ad55 runtime.netpoll () + c5
 0000000000435f8b runtime.findrunnable () + 72b
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 0000000000421753 runtime.bgscavenge () + 3b3
 000000000045d5e1 runtime.goexit () + 1
-------------------- thread# 10 / lwp# 10 --------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affdf0) + 13
 fffffc7fef275ef8 sem_wait (affdf0) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 000000000042136d runtime.scavengeSleep () + ed
 0000000000421753 runtime.bgscavenge () + 3b3
 000000000045d5e1 runtime.goexit () + 1
-------------------- thread# 11 / lwp# 11 --------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affe30) + 13
 fffffc7fef275ef8 sem_wait (affe30) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
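
In the dump above, nine of the eleven threads are parked under runtime.stopm; the other two are thread 2 (runtime.sysmon) and thread 9 (runtime.netpoll). For longer dumps, a tally like that can be scripted. The following is a rough sketch that assumes the pstack output format shown above; `parseStacks` and `countWithFrame` are hypothetical helpers, and the sample input is just threads 9 and 11 copied from the dump.

```go
package main

import (
	"bufio"
	"fmt"
	"regexp"
	"strings"
)

var (
	// Matches the "---- thread# N / lwp# N ----" separator lines.
	headerRE = regexp.MustCompile(`thread# (\d+) / lwp# \d+`)
	// Matches a frame line: 16 hex digits, then the symbol, then "(".
	frameRE = regexp.MustCompile(`^\s*[0-9a-f]{16}\s+(\S+)\s*\(`)
)

// parseStacks maps each thread number to its frame symbols, innermost first.
func parseStacks(dump string) map[string][]string {
	stacks := make(map[string][]string)
	cur := ""
	sc := bufio.NewScanner(strings.NewReader(dump))
	for sc.Scan() {
		line := sc.Text()
		if m := headerRE.FindStringSubmatch(line); m != nil {
			cur = m[1]
			stacks[cur] = nil
			continue
		}
		if m := frameRE.FindStringSubmatch(line); m != nil && cur != "" {
			stacks[cur] = append(stacks[cur], m[1])
		}
	}
	return stacks
}

// countWithFrame counts the threads whose stack contains the symbol fn.
func countWithFrame(stacks map[string][]string, fn string) int {
	n := 0
	for _, frames := range stacks {
		for _, f := range frames {
			if f == fn {
				n++
				break
			}
		}
	}
	return n
}

// sample holds threads 9 and 11 from the dump in this report.
const sample = `--------------------- thread# 9 / lwp# 9 ---------------------
 fffffc7fef296a6a portfs   (6, 4, fffffc7fea1ff200, 80, 1, 0)
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000042ad55 runtime.netpoll () + c5
 0000000000435f8b runtime.findrunnable () + 72b
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
 0000000000421753 runtime.bgscavenge () + 3b3
 000000000045d5e1 runtime.goexit () + 1
-------------------- thread# 11 / lwp# 11 --------------------
 fffffc7fef2905f7 lwp_park (0, 0, 0)
 fffffc7fef282883 sema_wait (affe30) + 13
 fffffc7fef275ef8 sem_wait (affe30) + 38
 000000000045ef1a runtime.asmsysvicall6 () + 5a
 000000000040a6e0 runtime.notesleep () + e0
 0000000000434ca0 runtime.stopm () + c0
 000000000043626d runtime.findrunnable () + a0d
 0000000000436da5 runtime.schedule () + 2f5
 00000000004371ed runtime.park_m () + 9d
 000000000045b5e4 runtime.mcall () + 64
`

func main() {
	stacks := parseStacks(sample)
	fmt.Println("threads parsed:", len(stacks))
	fmt.Println("threads in runtime.stopm:", countWithFrame(stacks, "runtime.stopm"))
}
```

Feeding it the full `pstack` output instead of `sample` would give the per-process tally; the regular expressions are only known to fit the lines shown in this report.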
