Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add RFC for structured VCD output #74

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
243 changes: 243 additions & 0 deletions text/0074-structured-vcd.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,243 @@
- Start Date: (fill in with date at which the RFC is merged, YYYY-MM-DD)
- RFC PR: [amaranth-lang/rfcs#0074](https://github.com/amaranth-lang/rfcs/pull/0074)
- Amaranth Issue: [amaranth-lang/amaranth#0000](https://github.com/amaranth-lang/amaranth/issues/0000)

# Structured VCD Output

## Summary
[summary]: #summary

When generating VCD files, use the `scope` keyword to distinguish aggregate signals.

## Motivation
[motivation]: #motivation

These changes are intended to make it easier for waveform viewers to recover information about aggregate signals from VCD files.
This is partially motivated by an issue with VCD parsing that occurs in Surfer (see [ekiwi/wellen#36](https://github.com/ekiwi/wellen/issues/36)).
Ideally, this leads to a better user experience when inspecting simulated Amaranth designs with a waveform viewer.

## Guide-level explanation
[guide-level-explanation]: #guide-level-explanation

This RFC describes changes to the way Amaranth generates a VCD file.
When reading a VCD file, waveform viewers like [Surfer](https://surfer-project.org/) and [GTKWave](https://gtkwave.sourceforge.net/) support use of the `scope` keyword for organizing variables into groups.
Currently, when writing a VCD file, Amaranth uses the `module` scope to distinguish between signals belonging to different modules.
For instance:

```
$scope module top $end # top
$var wire 1 <id> clk $end # top.clk
$var wire 1 <id> rst $end # top.rst
$var wire 32 <id> in $end # top.in
$var wire 32 <id> out $end # top.out
$scope module submodule $end # top.submodule
$var wire 1 <id> clk $end # top.submodule.clk
$var wire 1 <id> rst $end # top.submodule.rst
$var wire 5 <id> in $end # top.submodule.in
$var wire 5 <id> out $end # top.submodule.out
$upscope $end
$upscope $end
```

However, it does *not* attempt to use scopes for organizing the members of aggregate signals with array-like or struct-like datatypes.
Instead, when creating VCD variables for aggregate signals, members are distinguished only by appending to the name of the parent signal.
For instance, a signal `top.submodule.my_array` with the type `ArrayLayout(unsigned(32), 4)` is currently represented as:

```
$scope module top $end # top
...
$scope module submodule $end # top.submodule
...
$var wire 128 <id> my_array $end # top.submodule.my_array
$var wire 32 <id> my_array[0] $end # top.submodule.my_array[0]
$var wire 32 <id> my_array[1] $end # top.submodule.my_array[1]
$var wire 32 <id> my_array[2] $end # top.submodule.my_array[2]
$var wire 32 <id> my_array[3] $end # top.submodule.my_array[3]
$upscope $end
$upscope $end
```

This is not sufficient for explicitly conveying the relationship between aggregate signals and their members to a waveform viewer.
In this case, `my_array` does not explicitly *contain* its members, and waveform viewers may only *infer* this relationship by attempting to recover it from the names.

This RFC proposes the use of `scope vhdl_array` and `scope vhdl_record` to pass information about these relationships to a waveform viewer.
After this change, the VCD output above would become:

```
$scope module top $end # top
...
$scope module submodule $end # top.submodule
...
$comment Flattened representation of 'my_array' $end
$var wire 128 <id> my_array $end # top.submodule.my_array

$comment Hierarchical representation of 'my_array' members $end
$scope vhdl_array my_array $end # top.submodule.my_array
$var wire 32 <id> 0 $end # top.submodule.my_array.0
$var wire 32 <id> 1 $end # top.submodule.my_array.1
$var wire 32 <id> 2 $end # top.submodule.my_array.2
$var wire 32 <id> 3 $end # top.submodule.my_array.3
$upscope $end

$upscope $end
$upscope $end
```

These images demonstrate the difference in GTKWave output:

<figure>
<img align="center" src="./0074-structured-vcd/gtkwave_pre_rfc.png" alt="a screenshot of array-like signals in GTKWave (current behavior)">
<figcaption><i>Figure 1. A screenshot of array-like signals in GTKWave (current behavior)</i></figcaption>
</figure>

<figure>
<img align="center" src="./0074-structured-vcd/gtkwave_post_rfc.png" alt="a screenshot of array-like signals in GTKWave (proposed behavior)">
<figcaption><i>Figure 2. A screenshot of array-like signals in GTKWave (proposed behavior)</i></figcaption>
</figure>

... and the difference in Surfer output:

<figure>
<img align="center" src="./0074-structured-vcd/surfer_pre_rfc.png" alt="a screenshot of array-like signals in Surfer (current behavior)">
<figcaption><i>Figure 3. A screenshot of array-like signals in Surfer (current behavior)</i></figcaption>
</figure>

<figure>
<img align="center" src="./0074-structured-vcd/surfer_post_rfc.png" alt="a screenshot of array-like signals in Surfer (proposed behavior)">
<figcaption><i>Figure 4. A screenshot of array-like signals in Surfer (proposed behavior)</i></figcaption>
</figure>

## Reference-level explanation
[reference-level-explanation]: #reference-level-explanation

Currently, Amaranth represents aggregate signals in the VCD by appending the names of elements/members to the name of the signal.
When simulator output is written to a VCD file, we intend that signals with aggregate datatypes are explicitly given their own scope and split into multiple VCD variables.
We propose the following conventions:

- The `module` scope defines a group of signals belonging to the same `Module`
- The `vhdl_array` scope defines a group of signals belonging to an array (such as a signal with `ArrayLayout`)
- The `vhdl_record` scope defines a structured group of signals (such as a signal with `StructLayout`)

> **NOTE**: (Implementation details pending)
> ...


### Expected Output: Array-like

```python
from amaranth import *
from amaranth.lib.data import *

foo = Signal(ArrayLayout(unsigned(32), 4))
```

For `ArrayLayout`, each element in the array should become a VCD variable whose name is integer array index.
In this case, the signal `foo` is split into `foo.0`, `foo.1`, `foo.2`, and `foo.3`:

```
$scope vhdl_array foo $end
$var wire 32 <id> 0 $end
$var wire 32 <id> 1 $end
$var wire 32 <id> 2 $end
$var wire 32 <id> 3 $end
$upscope $end
```

### Expected Output: Struct-like

```python
from amaranth import *
from amaranth.lib.data import *

foo = Signal(StructLayout({
"a": unsigned(1),
"b": unsigned(4),
"c": unsigned(32),
}))
```

For `StructLayout`, each member should become a VCD variable with the member name.
In this case, the signal `bar` is split into `bar.a`, `bar.b`, and `bar.c`:

```
$scope vhdl_record bar $end
$var wire 1 id a $end
$var wire 4 id b $end
$var wire 32 id c $end
$upscope $end
```

### Expected Output: Nested Aggregates

```python
from amaranth import *
from amaranth.lib.data import *

class MyStruct(Struct):
a: unsigned(1)
b: unsigned(4)
c: ArrayLayout(unsigned(32), 4),

# A struct-like signal
foo = Signal(MyStruct)

```

For signals with nested aggregate types, the scopes are nested in the same way that `scope module` is already used for nested modules.
In this case, the signal `foo` is split like this:

```
$scope vhdl_record bar $end
$var wire 1 id a $end
$var wire 4 id b $end
$scope vhdl_array c $end
$var wire 32 id 0 $end
$var wire 32 id 1 $end
$var wire 32 id 2 $end
$var wire 32 id 3 $end
$upscope $end
$upscope $end
```

## Drawbacks
[drawbacks]: #drawbacks

- VCD is not a particularly efficient format, and this adds even more bytes to generated VCD files.

- This breaks any existing compatibility with waveform viewers that do not handle the `vhdl_record` and `vhdl_array` scope types.
Since these are not part of the VCD specification, some waveform viewers may fail to handle this.

## Rationale and alternatives
[rationale-and-alternatives]: #rationale-and-alternatives

- The choice of `scope vhdl_array` and `scope vhdl_record` are suggested due to known compatibility with Surfer and GTKWave.
However, note that these are not part of the VCD specification (which is somewhat old and under-defined).

### Alternatives

- Expose an environment variable (or a parameter in `write_vcd()`) allowing users to opt-in/opt-out of structured VCD output
- Include support for a different waveform format that has better-defined support for variables with composite datatypes.

## Prior art
[prior-art]: #prior-art

- As far as I can tell, use of the `vhdl_record` and `vhdl_array` scope types comes from the [ghdl/ghdl](https://github.com/ghdl/ghdl) simulator.
- [verilator/verilator](https://github.com/verilator/verilator) makes no attempt to use these scope types
- [steveicarus/iverilog](https://github.com/steveicarus/iverilog) makes no attempt to use these scopes types

## Unresolved questions
[unresolved-questions]: #unresolved-questions

- Should we continue to include the "flattened" [pure bit-vector] representation of aggregate signals in the VCD?
- Does this feature need to be gated/opt-in by default?

- The simulator currently depends on [westerndigitalcorporation/pyvcd](https://github.com/westerndigitalcorporation/pyvcd) when writing VCD files.
However, `pyvcd` only emits scope types defined by the VCD specification (and does not include the `vhdl_record`/`vhdl_array` scopes).
When implementing this RFC, should we continue relying on `pyvcd`, or does this warrant the addition of our own code in the simulator for writing VCD files?
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One reason to use pyvcd (VCD itself is not hard to emit) is its support for .gtkw files. I really don't want to have more GTKWave specific code in Amaranth...


## Future possibilities
[future-possibilities]: #future-possibilities

Since this adds more overhead to VCD output, it's worth mentioning that VCD may be unsuitable for testing very large designs.
This RFC can also serve as a stepping stone for supporting alternative waveform formats in the future.

Binary file added text/0074-structured-vcd/gtkwave_post_rfc.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added text/0074-structured-vcd/gtkwave_pre_rfc.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added text/0074-structured-vcd/surfer_post_rfc.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added text/0074-structured-vcd/surfer_pre_rfc.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.