Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate flaky test-stream-pipe-same-destination-twice on SmartOS #13227

Closed
Trott opened this issue May 25, 2017 · 10 comments
Closed

Investigate flaky test-stream-pipe-same-destination-twice on SmartOS #13227

Trott opened this issue May 25, 2017 · 10 comments
Labels
smartos Issues and PRs related to the SmartOS platform. stream Issues and PRs related to the stream subsystem. test Issues and PRs related to the tests.

Comments

@Trott
Copy link
Member

Trott commented May 25, 2017

  • Version: v8.0.0-pre
  • Platform: smartos16-64
  • Subsystem: test, stream

From https://ci.nodejs.org/job/node-test-commit-smartos/9202/nodes=smartos16-64/console:

not ok 1006 parallel/test-stream-pipe-same-destination-twice
  ---
  duration_ms: 0.592
  severity: crashed
  stack: |-
    oh no!
    exit code: CRASHED (Signal: 4)

/cc @nodejs/platform-smartos

@Trott Trott added smartos Issues and PRs related to the SmartOS platform. stream Issues and PRs related to the stream subsystem. test Issues and PRs related to the tests. labels May 25, 2017
@misterdjules
Copy link

@nodejs/build The process terminating due to receiving SIGILL should have made it generate a core dump. Is it possible for you to grab that core file from the test-joyent-smartos16-x64-1 VM (it should be somewhere in the /home/iojs/cores/ directory) and upload it somewhere?

@refack
Copy link
Contributor

refack commented May 25, 2017

@misterdjules AFAIK the fs state is ephemeral, the next run would have cleared the tree 😞
I started a stress test, might reproduce: https://ci.nodejs.org/job/node-stress-single-test/1233/nodes=smartos15-64/

@misterdjules
Copy link

@refack

AFAIK the fs state is ephemeral

What do you mean by "the fs state is ephemeral"? Is /home/iojs/cores/ cleaned up before and/or after every tests run? That was not my understanding.

@refack
Copy link
Contributor

refack commented May 26, 2017

@refack

AFAIK the fs state is ephemeral
What do you mean by "the fs state is ephemeral"? Is /home/iojs/cores/ cleaned up before and/or after every tests run? That was not my understanding.

Yes you are right. It's just the build dir that gets cleaned up. I didn't read the fine print.

@gibfahn
Copy link
Member

gibfahn commented May 26, 2017

@misterdjules I'm not seeing that folder on test-joyent-smartos16-x64-1 (the machine that failed):

~ ❯ s test-joyent-smartos16-x64-1 ls /home/iojs                                                                                                                                                        ~
build
jenkins_diagnostics.txt
jenkins_manifest.xml
node-tmp
slave.jar

@misterdjules
Copy link

@gibfahn

I'm not seeing that folder on test-joyent-smartos16-x64-1

In this case core dumps won't be written to that directory. It needs to be created.

@gibfahn
Copy link
Member

gibfahn commented May 26, 2017

Okay, I've added the folder to all the test machines. This probably needs something in our ansible scripts to automate it, but this works for now.

~for i in `cat ~/.ssh/config | grep smartos | grep test | awk '{print $2}'`; do echo ">>> $i"; ssh $i ls -al /home/iojs/cores; done
>>> test-joyent-smartos14-x64-1
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:18 .
drwxr-xr-x 11 iojs other 26 May 26 09:18 ..
>>> test-joyent-smartos14-x64-2
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:18 .
drwxr-xr-x 10 iojs other 22 May 26 09:18 ..
>>> test-joyent-smartos14-x86-1
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:19 .
drwxr-xr-x 10 iojs other 26 May 26 09:19 ..
>>> test-joyent-smartos14-x86-2
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:19 .
drwxr-xr-x 10 iojs other 22 May 26 09:19 ..
>>> test-joyent-smartos15-x64-1
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:19 .
drwxr-xr-x 12 iojs other 26 May 26 09:19 ..
>>> test-joyent-smartos15-x64-2
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:20 .
drwxr-xr-x 12 iojs other 26 May 26 09:20 ..
>>> test-joyent-smartos16-x64-1
total 2
drwxr-xr-x  2 iojs iojs   2 May 26 09:10 .
drwxr-xr-x 11 iojs other 25 May 26 09:13 ..
>>> test-joyent-smartos16-x64-2
total 2
drwxr-xr-x  2 iojs iojs  2 May 26 09:20 .
drwxr-xr-x 11 iojs iojs 24 May 26 09:20 ..

If it fails again LMK and I'll get the coredump.

@misterdjules
Copy link

@gibfahn Thank you for doing that!

@refack
Copy link
Contributor

refack commented May 30, 2017

After nodejs/build#736 has been addressed, Stressing smartOS16: https://ci.nodejs.org/job/node-stress-single-test/1247/nodes=smartos16-64/

@refack refack mentioned this issue Jun 30, 2017
2 tasks
@Trott
Copy link
Member Author

Trott commented Dec 24, 2017

More than six months without a re-occurrence. Going to close this...

@Trott Trott closed this as completed Dec 24, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
smartos Issues and PRs related to the SmartOS platform. stream Issues and PRs related to the stream subsystem. test Issues and PRs related to the tests.
Projects
None yet
Development

No branches or pull requests

4 participants