[Balloon] White Rabbit Reliability #6

Closed
alexazon opened this issue Sep 16, 2016 · 6 comments

@alexazon
Collaborator

alexazon commented Sep 16, 2016

Dusan and I noticed that unplugging and replugging the fibre cable has a strange effect on our timing receivers. Every time a node synchronizes again, all of its outputs are “shifted” by up to ±4 ns (compared to a node which wasn't unplugged).

Test setup:

  • Scope
  • PMC Card (based on proposed_master) => Connected to the scope (red)
  • EXPLODER5A (based on balloon, plugged in and out) => Connected to the scope (green)

Start condition:

We restarted both receivers and used only the clock generator and the ECA to output pulses. The pulses are perfectly aligned, as you can see in the following picture:

[Image: pmc 89286 vs exploder - pps - 0]

Reproducing the error:

As already mentioned, just unplug the fibre cable and plug it in again. The clock/output pulse will be shifted by up to ±4 ns. We also captured a picture of this behaviour:

[Image: pmc 89286 vs exploder - pps - 6]

This happens on PMC and EXPLODER.
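
One way to quantify the effect over repeated replug cycles would be to note the scope-measured offset between the two pps edges after each relock and look at the spread. The following is only a minimal sketch and was not part of our test setup: the function name, the `tolerance_ns` parameter and the input list are assumptions for illustration, and the offsets themselves would have to be read from the scope.

```python
from statistics import mean

def summarize_relock_offsets(offsets_ns, tolerance_ns=0.5):
    """Summarize pps offsets (node under test minus reference), one per relock.

    offsets_ns   -- measured offsets in nanoseconds (hypothetical input)
    tolerance_ns -- assumed acceptable deviation from the first measurement
    """
    if not offsets_ns:
        raise ValueError("need at least one measurement")
    baseline = offsets_ns[0]  # offset measured before the first unplug
    # Indices of the cycles whose offset moved away from the baseline
    shifted = [i for i, off in enumerate(offsets_ns)
               if abs(off - baseline) > tolerance_ns]
    return {
        "baseline_ns": baseline,
        "mean_ns": mean(offsets_ns),
        "spread_ns": max(offsets_ns) - min(offsets_ns),
        "shifted_cycles": shifted,
    }
```

A non-empty `shifted_cycles` list would indicate that the node locked to a different point after at least one replug.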

Exploder bitstream information:

[root@scuxl0095 ~]# eb-info dev/wbm0
Project : exploder5_csco_tr
Platform : exploder5 +db[12] +wrex2
FPGA model : Arria V (5AGXMA3D4F27I3)
Source info : balloon-1267
Build type : developer preview
Build date : Thu Sep 15 03:17:22 CEST 2016
Prepared by : Jenkins Nightly Build csco-tg@gsi.de
Prepared on : tsl002.acc.gsi.de
OS version : Debian GNU/Linux 8.5 (jessie), kernel 3.16.0-4-amd64
Quartus : Version 16.0.0 Build 211 04/27/2016 SJ Standard Edition

1ff7194 ftm: fixed EBM/PQ...

@alexazon alexazon added the bug label Sep 16, 2016
@alexazon alexazon changed the title from "White Rabbit Reliability on Balloon" to "[Balloon] White Rabbit Reliability" Sep 26, 2016
@alexazon
Collaborator Author

Info:
Unable to reproduce with the latest version of balloon.

@miree
Contributor

miree commented Nov 24, 2016

After tests with SCU3, EXPLODER5A, VETAR2, SPECv4, this issue could be reproduced on EXPLODER5A only. It seems to appear on units with Arria V FPGA.

@alexazon
Collaborator Author

It seems to appear on units with Arria V FPGA.

Did you test this with a pexaria5?

@bradomyn
Contributor

bradomyn commented Nov 24, 2016 via email

@miree
Contributor

miree commented Dec 5, 2016

Alessandro reproduced the issue on a pexaria5 while monitoring the WR parameters "bitslide" and "setpoint". Neither was related to the observed shift (±4 ns) in the pps signal. After a discussion between Alessandro, Alex, Cesar and me, we suspect that the problem might be in the SERDES transmitter/receiver of the Arria V FPGA. Perhaps the tuning of the 125 MHz clock phase has some feedback on the SERDES?

It was decided to do the following:

  • Study SERDES (datasheet & VHDL).
  • Check if there were changes in the SERDES configuration in the past.
  • Check if it is possible to reset the SERDES after each WR synchronization.
  • Slow down the WR synchronization and check if this avoids the problem.

In addition, we sometimes observed the pps signal varying by 1 ns while in TRACK PHASE (without unplugging the WR link).
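
For scale, the figures above can be related to the period of the 125 MHz clock. This is only arithmetic on numbers already mentioned in this thread, not an explanation of the cause; the function name is made up for illustration.

```python
def shift_in_clock_periods(shift_ns, clock_hz=125e6):
    """Express a time shift as a fraction of one clock period."""
    period_ns = 1e9 / clock_hz  # 125 MHz -> 8 ns
    return shift_ns / period_ns

# The observed +-4 ns shift is half of a 125 MHz period,
# and the 1 ns variation in TRACK PHASE is one eighth of it.
print(shift_in_clock_periods(4.0))  # 0.5
print(shift_in_clock_periods(1.0))  # 0.125
```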

bradomyn pushed a commit that referenced this issue Mar 31, 2017
Every time White Rabbit went down and up, the pps pulses were getting
locked to a different point +/- 8ns in steps of 1ns. For more information
about the bug and how this commit fixes the issue, read:
#6
@alexazon
Collaborator Author

alexazon commented Aug 2, 2017

/closed

@alexazon alexazon closed this as completed Aug 2, 2017