Packet formats from the LWA352 OVRO bifrost branch #206

jack-h · 2023-06-01T10:06:24Z

LWA352 OVRO bifrost branch, including SNAP2 F-Engine packet format

This allows the DP4A library to be used, which is way faster

I.e., call xGPU passing pointers to already transferred data on the device. This gives up xGPU's pipelining abilities, but makes it easier to use the xGPU kernel alongside other consumers also using the same GPU input buffer.

Add some checking for proper pointer spaces. More checking required

Inspired by the CUBLAS usage in https://github.com/devincody/DSAbeamformer Operates in 3 steps -- 1. Tranpose data and promote to float 2. Compute beams 3. Compute beam dynamic spectra, and sum to (in LWA352's case) 1ms Assumes no polarization ordering of input, but relies on user to upload beamforming coefficients which create X-pol and Y-pol beams. This is an easy way to deal with the arbitrary input ordering at runtime, but isn't very efficient (half the beamforming coeffs are zero). The kernel assumes the beams are constructed like this and uses the fact to generate averaged dynamic spectra (XX, YY, XY_r, XY_i). May well have synchronization bugs which make the benchmarks meaningless, but currently obtains ~50 Gbps throughput (~9MHz bandwidth for 4-bit inputs) with NANTS = 352 NPOLS = 2 NCHANS = 192 (4.4 MHz for LWA352) NBEAMS = 32 (16 x 2-pols) NTIMES = 480 NTIMES_SUM = 24 (1ms)

Move IB verbs receiving class to a dedicated C file. When using the hashpipe_ibverbs library within packet_capture.hpp directly something in that file messes up the compatibility of the ibverbs structs (their sizes are different) to those interpretted by hashpipe. Odd, but working around for now.

Evidently, this is the trigger to make `like_bmon` work its magic

Increase RX packet depth of IB verbs interface to 32k (this seems to be the maximum). Make packet handler use AVX stream store instructions. 1. The receiver is currently hard coded for 64 pols per packet. It would be trivial to parameterize this, but it may have some small performance implication. 2. Code loads 64-bit values into a 256-bit AVX register before writing to memory. If the IBV interface can be tweaked to enforce alignment (talk to DM about this) the first stage won't be necessary. 3. 64 pols per packet = 512 bits per memory write (1 freq channel of data). Newer machines supporting AVX512 could probably run faster than the current code by using _mm512_stream_si512 in place of _mm256_stream_si256

The behaviour of the traceback library has changed in py3, so remove the now nonexistent call. Tweak error handling to properly pass an exception to the cleanup print Fix missing decode()

Time metrics for processing / waiting for input/output data are helpful for figuringout the bottlenecks in the pipeline, but aren't particularly intuitive (IMO) measures of whether things are "fast enough"

This is a gross thing to hardcode, so FIXME. But, having new sequences periodically means that the header timestamps can be used as actual timestamps, rather than just counting bytes in some infinite data stream (which doesn't seem like a good idea when the input stream is from a network, and could conceivably behave strangely). Having timestamps derived from actual packet headers periodically seems sensible(?)

Lwa352 ibv

Allow an option to beamform and integrate in one hit by passing ntime_blocks>0 when initializing the library. Otherwise don't transpose or integrate the data. This change allows multiple downstream processes to use raw beamformer data for their own, different purposes -- (eg) one generating integrated dynamic spectra, and one generating VLBI voltage beams

…o lwa352-ibv

Reaches 27Gbps on LWA352 pipeline

Replace JH's libhashpipeibverbs IBV capture code with JD's dedicated bifrost source. Remove the philosophical quirk of having bifrost depend on hashpipe.

Sequence only changes if out-of-order packets indicate the upstream transmitters have reset

codecov-commenter · 2023-06-01T20:47:34Z

Codecov Report

Attention: Patch coverage is 33.33333% with 2 lines in your changes are missing coverage. Please review.

Project coverage is 67.77%. Comparing base (96bc19a) to head (e32a32b).
Report is 1 commits behind head on ibverb-support.

Files	Patch %	Lines
python/bifrost/packet_capture.py	33.33%	2 Missing ⚠️

Additional details and impacted files

@@                Coverage Diff                 @@
##           ibverb-support     #206      +/-   ##
==================================================
- Coverage           67.79%   67.77%   -0.02%     
==================================================
  Files                  66       66              
  Lines                5744     5747       +3     
==================================================
+ Hits                 3894     3895       +1     
- Misses               1850     1852       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…nerated vs received.

Fix off-by-one error in the power beam header

jack-h and others added 30 commits June 11, 2020 03:32

Merge remote-tracking branch 'jayce/disk-readers' into py3-disk-reader

4db60f2

Add LWA352 packet format

357283e

Add xGPU bindings

9fb5a09

This allows the DP4A library to be used, which is way faster

remove print

1cf8cdb

Add Kernel-only xgpu functions

6d088a2

I.e., call xGPU passing pointers to already transferred data on the device. This gives up xGPU's pipelining abilities, but makes it easier to use the xGPU kernel alongside other consumers also using the same GPU input buffer.

add xgpuSubSelect; fix xgpuKernel

9310406

Add some checking for proper pointer spaces. More checking required

gitignore ctags files

056d98c

Update GPU arch

55b2d6f

Add embryonic ibverbs packet RX support

0812937

Add flag to use ibverbs

324bb2c

Change function names to match bfFunctionName convention

30217af

Change IB verbs packet capture method name to "udp_capture"

faca6c0

Evidently, this is the trigger to make `like_bmon` work its magic

Py2->3 decode; tweak exception raising

42c696a

The behaviour of the traceback library has changed in py3, so remove the now nonexistent call. Tweak error handling to properly pass an exception to the cleanup print Fix missing decode()

Add gbps throughput to like_top

2ec319c

Time metrics for processing / waiting for input/output data are helpful for figuringout the bottlenecks in the pipeline, but aren't particularly intuitive (IMO) measures of whether things are "fast enough"

Merge pull request #1 from realtimeradio/lwa352-ibv

dfdfef8

Lwa352 ibv

Merge branch 'lwa352' of https://github.com/realtimeradio/bifrost int…

ce28180

…o lwa352-ibv

Add missing ifdefs for IBV code

9f077f2

Merge remote-tracking branch 'jayce/ibverb-support' into lwa352-ibv

6192bc6

Remove remnants of JH's hashpipe IBV code

d0a907a

Remove remnants of JH's hashpipe IBV code

76da1eb

Default to buffer 32k packets

951b741

Reaches 27Gbps on LWA352 pipeline

Merge branch 'lwa352-ibv' into lwa352

5448539

Replace JH's libhashpipeibverbs IBV capture code with JD's dedicated bifrost source. Remove the philosophical quirk of having bifrost depend on hashpipe.

Remove some unused code

d863296

Change how bifrost sequences are defined

35e819d

Sequence only changes if out-of-order packets indicate the upstream transmitters have reset

Merge commit '35e819d' into HEAD

6d42e90

jaycedowell added 13 commits June 1, 2023 09:22

Need to actually save files...

8a953cc

Revert to the ibverb-support version.

9088671

Focus on packet formats for now.

aa064b1

Focus on packet formats for now.

4e1d318

Move verbs buffer control into configure.

4972230

lwa352_vbeam_* -> vbeam_*

d07b153

Remove some debugging.

b37bf9b

Catch here as well.

9d67edb

Attempt to add a non-AVX version of the snap2 packet processor.

bc6add7

Now in formats/base.hpp.

7432067

Revert to the ibverb-support version.

d9736b4

Ugh.

70bf291

This block seems to be causing problems in CI.

21210ba

jaycedowell changed the title ~~Lwa352~~ Packet formats from the LWA352 OVRO bifrost branch Jun 1, 2023

jaycedowell and others added 14 commits July 20, 2023 15:09

Merge branch 'ibverb-support' into lwa352

f352ab7

Clean up header filler.

02e2e94

Re-enable missing source blanking.

e783a24

Ugh.

ae61dd8

Fix an off-by-one error from a header mis-match in how packets are ge…

75c74f6

…nerated vs received.

Merge pull request #3 from jaycedowell/caltech-bifrost-dsp

fae28f7

Fix off-by-one error in the power beam header

Merge remote-tracking branch 'upstream/ibverb-support' into lwa352

84cde94

I thought this was in configure.ac already.

b0254e8

Fix a bad merge.

b16dc15

Give up on packet pacing for now.

749f0ba

Merge remote-tracking branch 'upstream/ibverb-support' into lwa352

53e4189

Merge remote-tracking branch 'upstream/ibverb-support' into lwa352

a2aea31

Merge remote-tracking branch 'upstream/ibverb-support' into lwa352

4562c9d

Nice.

e32a32b

jaycedowell merged commit 9d18f89 into ledatelescope:ibverb-support Apr 19, 2024
12 of 14 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Packet formats from the LWA352 OVRO bifrost branch #206

Packet formats from the LWA352 OVRO bifrost branch #206

jack-h commented Jun 1, 2023

codecov-commenter commented Jun 1, 2023 •

edited by codecov bot

Loading

Packet formats from the LWA352 OVRO bifrost branch #206

Packet formats from the LWA352 OVRO bifrost branch #206

Conversation

jack-h commented Jun 1, 2023

codecov-commenter commented Jun 1, 2023 • edited by codecov bot Loading

Codecov Report

codecov-commenter commented Jun 1, 2023 •

edited by codecov bot

Loading