exception vectors go to .data #129

agrobman · 2019-09-06T17:05:24Z

seems that exception code goes to .data section:

.word 0x3a7db1e6, 0xd412cc8d, 0x735721c1, 0x881570db, 0x2f9b3971, 0x825922ad, 0xb5f05d35, 0xc1ae9a38
.align 4;
.align 5
_user_stack_start:
.rept 4999
.4byte 0x0
.endr
_user_stack_end:
.4byte 0x0
_kernel_start: .align 12
smode_program: slt t2, t0, t1
c.beqz a4, smode_program_stack_p
smode_program_stack_p:addi sp, sp, -52
sll s3, s2, s7
sw ra, 4(sp)
sw t0, 8(sp)
sw s9, 12(sp)
mulhsu s5, a2, ra
sw a2, 16(sp)
sw t3, 20(sp)
sw t1, 24(sp)
c.andi a3, 16

diag.exe: file format elf32-littleriscv

Sections:
Idx Name Size VMA LMA File off Algn
0 .text 00011a38 00000000 00000000 00001000 21
CONTENTS, ALLOC, LOAD, READONLY, CODE
1 .text.init 000000da 00011a38 00011a38 00012a38 20
CONTENTS, ALLOC, LOAD, READONLY, CODE
2 .data 00039e22 00012000 00012000 00013000 212
CONTENTS, ALLOC, LOAD, DATA
3 .tohost 00000048 0004be40 0004be40 0004ce40 26
CONTENTS, ALLOC, LOAD, DATA

It should be nice to get things in correct sections to be able to move them around with linker:

.text, .data. .stack .init, .handlers

Also what is the mechanism to define multiple data sections and their addresses? ( we have several memory regions with different characteristics - internal data, instructions, I/O, external normal memories and I/O regions,)

We would like to have several text and data sections be placed in these "RTL/TB defined" addresses.

taoliug · 2019-09-06T18:41:57Z

This issue should be gone for the bare program mode.
For the data section address, the generated program is intended to be used for core level verification rater than SOC level, so it doesn't have any notion of memory space partition. I think this can be added by implement custom load/store instruction stream. For example, you can add label for each memory region, and use load/store to access these regions based on the region label + region address offset.

agrobman · 2019-09-06T18:58:16Z

We do core level verification , but the core includes internal memories, memory mapped interrupt controller we need to verify. L/S unit needs stimuli with mixed targets ...

agrobman · 2019-09-06T19:05:38Z

we also have a minimum MMU/MPU logic which defines memory characteristics based on address -
the LS transaction types differ depending it's I/O or just regular memory and we need to test these characteristics. The CPU RTL has various parameters, which define where these internal memories reside, so we have to tune code/data locations accordingly.

taoliug · 2019-09-06T21:26:51Z

Does the core support virtual address translation?

agrobman · 2019-09-06T21:31:20Z

It doesn’t, but we have plans for future versions. We’re also planning multithreaded cores with amo instructions for synchronization. From: taoliug <notifications@github.com> Sent: Friday, September 6, 2019 4:27 PM To: google/riscv-dv <riscv-dv@noreply.github.com> Cc: Alexander Grobman <Alexander.Grobman@wdc.com>; Author <author@noreply.github.com> Subject: Re: [google/riscv-dv] exception vectors go to .data (#129) CAUTION: This email originated from outside of Western Digital. Do not click on links or open attachments unless you recognize the sender and know that the content is safe. Does the core support virtual address translation? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<#129>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AK5G544DXNOGHPODWFYEEC3QILDJ5ANCNFSM4IULECIA>.

taoliug · 2019-09-06T21:35:43Z

Right now the the program generate many data pages with label : data_page_{idx}. The default setting is 40 4KB data pages. One easy way to achieve your goal is to link these pages to the various regions in your memory map, so that existing load/store instruction can access different regions naturally. Another way to do it to name the data pages in a more meaningful way. Like
{
region_0_name : 4KB
region_1_name: 16KB
region_2_name: 512B
....
}
In your link script, you can link each region to corresponding address. This will complicate the page table setup though.

agrobman · 2019-09-06T21:44:45Z

Don’t they all belong to the same .data section? As far as I know linker operates with sections, not labels. If you prepend your data pages with “.section dataN”, these pages can be relocated by linker to any address independently – Interesting useful feature could be to use the generator to select physical addresses for these pages … ( I think, there is special .section format to define start address, like .org XXXX command) From: taoliug <notifications@github.com> Sent: Friday, September 6, 2019 4:36 PM To: google/riscv-dv <riscv-dv@noreply.github.com> Cc: Alexander Grobman <Alexander.Grobman@wdc.com>; Author <author@noreply.github.com> Subject: Re: [google/riscv-dv] exception vectors go to .data (#129) CAUTION: This email originated from outside of Western Digital. Do not click on links or open attachments unless you recognize the sender and know that the content is safe. Right now the the program generate many data pages with label : data_page_{idx}. The default setting is 40 4KB data pages. One easy way to achieve your goal is to link these pages to the various regions in your memory map, so that existing load/store instruction can access different regions naturally. Another way to do it to name the data pages in a more meaningful way. Like { region_0_name : 4KB region_1_name: 16KB region_2_name: 512B .... } In your link script, you can link each region to corresponding address. This will complicate the page table setup though. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<#129>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AK5G547T2IJVFOYEFFAVLO3QILELFANCNFSM4IULECIA>.

taoliug · 2019-09-06T21:53:04Z

Let me give a try for the custom region setting. Use .org might be a better idea as it reduce the complexity of the link script.
{
data_section_0_name ,0x0001_0000, 4KB, instruction
data_section_1_name ,0x0002_0000, 16KB, data
data_section_2_name ,0x0003_F000, 16KB, data
....
}

agrobman · 2019-09-06T22:10:40Z

Link script can use something like {data* } => .data to put all dataN sections to one output .data section to emulate current behavior ( I don’t remember exact syntax). From: taoliug <notifications@github.com> Sent: Friday, September 6, 2019 4:53 PM To: google/riscv-dv <riscv-dv@noreply.github.com> Cc: Alexander Grobman <Alexander.Grobman@wdc.com>; Author <author@noreply.github.com> Subject: Re: [google/riscv-dv] exception vectors go to .data (#129) CAUTION: This email originated from outside of Western Digital. Do not click on links or open attachments unless you recognize the sender and know that the content is safe. Let me give a try for the custom region setting. Use .org might be a better idea as it reduce the complexity of the link script. { data_section_0_name ,0x0001_0000, 4KB, instruction data_section_1_name ,0x0002_0000, 16KB, data data_section_2_name ,0x0003_F000, 16KB, data .... } — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<#129>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AK5G544YSIHTMWCDZ64IKEDQILGMFANCNFSM4IULECIA>.

taoliug · 2019-09-06T22:27:51Z

The main complexity comes from setting up page tables for these scattered data pages. Especially if we want to inject error in the page table entry and do error recovery. I need to think about a clean way to do it. Right now the assumption is that all pages are equal size and continuously allocated in memory space. I think it makes sense to allow user customize the page location/size/access_type.

agrobman · 2019-09-06T22:45:47Z

I agree it’s not simple .. we have 4 data pointers in our tests – one for internal data memory, another for external “normal” , one – for I/O type external and one for our internal I/O. Our test gen mixes L/S instructions with all these pointers in the instruction streams . Limitation is that we had to reserve 4 GPRs for these pointers and accesses can go to 4KB windows only. To check address aliasing effects it may be not sufficient, especially if the CPU includes data cache . ( we don’t have DC, though) Other L/Ss with random addresses can be created with a sequence, setting up data pointer, but then it’s too deterministic, one more possibility is to get data pointers from memory tables – to check L/L or L/S dependencies From: taoliug <notifications@github.com> Sent: Friday, September 6, 2019 5:28 PM To: google/riscv-dv <riscv-dv@noreply.github.com> Cc: Alexander Grobman <Alexander.Grobman@wdc.com>; Author <author@noreply.github.com> Subject: Re: [google/riscv-dv] exception vectors go to .data (#129) CAUTION: This email originated from outside of Western Digital. Do not click on links or open attachments unless you recognize the sender and know that the content is safe. The main complexity comes from setting up page tables for these scattered data pages. Especially if we want to inject error in the page table entry and do error recovery. I need to think about a clean way to do it. Right now the assumption is that all pages are equal size and continuously allocated in memory space. I think it makes sense to allow user customize the page location/size/access_type. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<#129>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AK5G54YLD2IKQMIZ5ZNV5DTQILKOTANCNFSM4IULECIA>.

agrobman · 2019-09-06T23:18:14Z

to my original complain - when you place code(exception handlers in this case) to data sections it confuses some tools
objdump for ex - it won't disassemble data sections. You can always place it in the code, by prepending it with .text - assembler collects all .text code from source to one .text section in obj file

you can have sections in the source file as layered cake:
.text
some code
.data
some data
.text
another code
.data
another data

at output all .text will be collected together and and all .data together too, forming one .text and one .data

taoliug · 2019-09-06T23:22:15Z

yes, this issue should have been fixed by this change.
68243cc#diff-f4e7ea3a932515dd6de98542c6c9754bR177

.text
user program
.data
user data
.text << this is missing
kernel program
.data
kernel data

agrobman · 2019-09-06T23:27:00Z

why it is section text.init and not just .text?

.macro init
.endm
.section .text.init <<<< ????
.globl _start
_start:

taoliug · 2019-09-06T23:30:56Z

I think this is inherited from riscv-tests, it's used by link script to link this section to beginning of the program. Maybe for your use case, you want it to be .text so that you can have your own init section?

agrobman · 2019-09-06T23:39:31Z

Yes, I had to change manually this one to .text .. From: taoliug <notifications@github.com> Sent: Friday, September 6, 2019 6:31 PM To: google/riscv-dv <riscv-dv@noreply.github.com> Cc: Alexander Grobman <Alexander.Grobman@wdc.com>; Author <author@noreply.github.com> Subject: Re: [google/riscv-dv] exception vectors go to .data (#129) CAUTION: This email originated from outside of Western Digital. Do not click on links or open attachments unless you recognize the sender and know that the content is safe. I think this is inherited from riscv-tests, it's used by link script to link this section to beginning of the program. Maybe for your use case, you want it to be .text so that you can have your known init section? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub<#129>, or mute the thread<https://github.com/notifications/unsubscribe-auth/AK5G54YSH32SBLVR7K2AQLDQILR3FANCNFSM4IULECIA>.

taoliug · 2019-09-08T19:47:09Z

With above changes, now each memory region as it's own section, You can link them to match the memory map. Memory region can be configured in the config class:
https://github.com/google/riscv-dv/blob/master/src/riscv_instr_gen_config.sv#L79
For the external memory, I'd suggest to specify a small range for simulation purpose, otherwise it might take a long time to randomly init the memory region.

[Nr] Name Type Address Offset
Size EntSize Flags Link Info Align
[ 0] NULL 0000000000000000 00000000
0000000000000000 0000000000000000 0 0 0
[ 1] .text PROGBITS 0000000080000000 00001000
000000000000a2ac 0000000000000000 AX 0 0 4096
[ 2] .tohost PROGBITS 000000008000b000 0000c000
0000000000000048 0000000000000000 WA 0 0 64
[ 3] .region_0 PROGBITS 000000008000c000 0000d000
0000000000001000 0000000000000000 WA 0 0 1
[ 4] .region_1 PROGBITS 000000008000d000 0000e000
0000000000004000 0000000000000000 WA 0 0 1
[ 5] .region_2 PROGBITS 0000000080011000 00012000
0000000000002000 0000000000000000 WA 0 0 1
[ 6] .region_3 PROGBITS 0000000080013000 00014000
0000000000000200 0000000000000000 WA 0 0 1
[ 7] .region_4 PROGBITS 0000000080013200 00014200
0000000000001000 0000000000000000 WA 0 0 1
[ 8] .user_stack PROGBITS 0000000080014200 00015200
0000000000009c40 0000000000000000 WA 0 0 64
[ 9] .kernel_stack PROGBITS 000000008001de40 0001ee40

agrobman · 2019-09-09T16:30:39Z

any idea, why am I getting region sections with zero data?
.pushsection .region_0,"aw",@progbits;
region_0:
.word 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000
.word 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000
.word 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000, 0x00000000

taoliug · 2019-09-09T16:47:34Z

Because there are three data patterns for the data section
https://github.com/google/riscv-dv/blob/master/src/riscv_instr_pkg.sv#L616

typedef enum bit [1:0] {
RAND_DATA = 0,
ALL_ZERO,
INCR_VAL
} data_pattern_t;

agrobman · 2019-09-09T16:49:28Z

Is the data pattern randomly selected?

agrobman · 2019-09-09T16:52:13Z

One more thing

can you add _finish: label to the _exit, something like this:

_exit:
_finish:
j write_tohost

agrobman · 2019-09-09T16:56:52Z

one more idea:
print a sequence name before and after it is injected as a comment into the generated source

eroom1966 · 2019-09-09T17:20:27Z

Hi All
I was following this thread, and wondering if the best method is the tohost/fromhost idea.
I cannot recall whom, but I am pretty sure there has been talk regarding dropping the whole tohost/fromhost interface, in which case the ecall would seem the best approach.
If we were to simply follow the linux syscall style, we could test the syscall number, eg exit()
it would be trivial then to write a traphandler which interrogates the syscall number and does the appropriate action
@agrobman can I ask what is your execution environment which need to have the specific labels in the elf file to detect an END-OF-TEST scenario ?
Thx
Lee

taoliug · 2019-09-09T18:43:55Z

One more thing

can you add _finish: label to the _exit, something like this:

_exit:
_finish:
j write_tohost

Can you extend riscv_asm_program_gen::gen_program_end to add this? I don't want to add TB specific logic in the upstream code.

taoliug · 2019-09-09T21:17:10Z

I am closing this issue for now as I believe the original issue is solved. Please file a different issue for other feature requests. Thanks.

taoliug · 2019-09-09T21:54:00Z

one more idea:
print a sequence name before and after it is injected as a comment into the generated source

This is already implemented:

                  slt        a4, a6, a3
                  lbu        a6, 33(ra) #end riscv_hazard_instr_stream_5
                  la         s9, region_4+319 #start riscv_hazard_instr_stream_11
                  lh         t2, 59(s9)

agrobman · 2019-09-09T23:10:11Z

@eroom1966 , _finish label is used by our RTL verification environment .
we don't like ecall/syscall as end of test because we need to verify this instruction in randoms by itself .
Also some benchmarks use syscall to interact with host - we utilize this interface when run these benchmarks on the RTL model ..

taoliug mentioned this issue Sep 6, 2019

Add a bare program mode #130

Merged

This was referenced Sep 7, 2019

Re-organize text and data section #134

Merged

Re-organize data page generation #135

Merged

taoliug mentioned this issue Sep 8, 2019

Update README #137

Merged

taoliug closed this as completed Sep 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

exception vectors go to .data #129

exception vectors go to .data #129

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019 •

edited

Loading

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 8, 2019

agrobman commented Sep 9, 2019

taoliug commented Sep 9, 2019

agrobman commented Sep 9, 2019

agrobman commented Sep 9, 2019

agrobman commented Sep 9, 2019

eroom1966 commented Sep 9, 2019

taoliug commented Sep 9, 2019 •

edited

Loading

taoliug commented Sep 9, 2019

taoliug commented Sep 9, 2019

agrobman commented Sep 9, 2019

exception vectors go to .data #129

exception vectors go to .data #129

Comments

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019 via email

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019

agrobman commented Sep 6, 2019

taoliug commented Sep 6, 2019 • edited Loading

agrobman commented Sep 6, 2019 via email

taoliug commented Sep 8, 2019

agrobman commented Sep 9, 2019

taoliug commented Sep 9, 2019

agrobman commented Sep 9, 2019

agrobman commented Sep 9, 2019

agrobman commented Sep 9, 2019

eroom1966 commented Sep 9, 2019

taoliug commented Sep 9, 2019 • edited Loading

taoliug commented Sep 9, 2019

taoliug commented Sep 9, 2019

agrobman commented Sep 9, 2019

taoliug commented Sep 6, 2019 •

edited

Loading

taoliug commented Sep 9, 2019 •

edited

Loading