Project: FM Demodulator

1) Introduction

The project develops a mono FM receiver that can be used to listen to the radio through your Pynq board. We use the RTL2832 software defined as to sample RF signals.

2) Project Goal

Your overall goal is to perform real-time demodulation. This means that the demodulation of 1 second of sampled data takes less than one second including the I/O time to transfer data to/from the demodulator on the programmable logic.

The first goal is to use your knowledge from previous projects to implement an FM demodulator IP core. The second goal is to develop an interface between the hardware-accelerated FM demodulator and the Jupyter notebook. The end goal is to perform real-time FM demodulation which requires careful optimization in both the hardware implementation of the demodulator and the interfacing between the processing system (PS) and programmable logic (PL).

The project is divided into two parts.

In the first part, you develop a hardware-accelerated version of the scikit-dsp-comm mono_FM demodulator function using HLS. This FM demodulator consists of a linear filter, downsampler, and a discriminator.
The second part integrates the demodulator IP core onto the Pynq Board. You should be able to listen to local FM radio channel using your M implementation in programmable logic.

3) Installation Instructions

For this project you will need to install rtlsdr and scikit-dsp-comm onto your Pynq. If you have the Pynq connected to the internet:

Go to Juptyer Notebook and on the New dropdown (on the right) select Terminal.
In the new terminal window, run the following:

pip install pyrtlsdr

This will install the libsdr driver on the Pynq needed to connect the RTLSDR2832.

Once installation completes run

pip3 install scikit-dsp-comm

This library is needed for demodulation.

If the Pynq is not connected to the internet:

Go to pyrtlsdr and download the source code as a pyrtlsdr-master.zip file.
Open Jupyter on the Pynq and upload this .zip. Open a Terminal using the above instrucvtions and run the following:

unzip pyrtlsdr-master.zip
cd pyrtlsdr-master
python setup.py install

Setup should now have installed rtlsdr. Repeat the same for scikit-dsp-comm downloading the .zip, and installing to the Pynq.
Since this is a workaround, make sure that the import paths are properly specified.

4) Materials

The starter files for the project can be found at in fm-demodulation.

This contains two Python notebooks:

mono-fm-demodulation-prerecorded-file.ipynb explains how the Mono FM Demodulator. It uses a prerecorded set of samples from the RTL SDR and can be run on any computer.
rtl2832.ipynb provides a more comphrehensive discussion of the project. It includes code that can interface directly with the RTL SDR to sample live data and change the radio settings. It also provides some hints and discussion on how to develop the FM demodulator and interfacing it with the notebook.

We will not provide skeleton, code, testbenches, or any other scaffolding as you have gotten in the previous projects. A major learning outcome for this project is for you to completely develop a hardware accelerated system given only functionally correct Python. You must develop testbenches, correct functionally correct HLS code, decide on the I/O strategy, and optimize the system to achieve real-time performance.

5) Design Instructions

The Python code provides a functionally correct implementation of the FM demodulator written in Python. The mono_fm function taken directly from the ``scikit-dsp-comm’’ library is:

The FM Demodulator has three primary functions: downsample, a linear filter lfilter, and a frequency discriminator discrim.

downsample

downsample is a very straightforward operation. The function is given a factor N, which indicates that the downsampler should pass on every Nth sample. The implementation of downsample can be found here.

linear filter

lfilter is implemented as a direct II transposed structure.

This means that it can compute filters in the form:

\[a[0]*y[n] = b[0]*x[n] + b[1]*x[n-1] + ... + b[M]*x[n-M] - a[1]*y[n-1] - ... - a[N]*y[n-N]\]

lfilter takes three arguments, the a array, b array, and the input data to filter. a and b are fixed in this implementation. The hardware design can assume these fixed values, i.e., must only implement the exact required low pass filters and not the entire functionality of the lfilter function. More information about the linear filter implementation can be found here.

discriminator

To demodulate FM we require a discriminator circuit, which gives an output which is proportional to the input frequency deviation.

def discrim(x):
 """
 function disdata = discrim(x)
 where x is an angle modulated signal in complex baseband form.

 Mark Wickert
 """
 X=np.real(x)        # X is the real part of the received signal
 Y=np.imag(x)        # Y is the imaginary part of the received signal
 b=np.array([1, -1]) # filter coefficients for discrete derivative
 a=np.array([1, 0])  # filter coefficients for discrete derivative
 derY=signal.lfilter(b,a,Y)  # derivative of Y,
 derX=signal.lfilter(b,a,X)  #    "          X,
 disdata=(X*derY-Y*derX)/(X**2+Y**2)
 return disdata

The above code is the scikit-dsp-comm implementation of an FM baseband discriminator.

A frequency discriminator computes the derivative of the modulated phase.

The phase for a given complex sample S is calculated as

\[\phi(t) = \tan^{-1}\frac{S_I(t)}{S_R(t)}\]

where \(S\) is a sample that is a complex data type consisting of a real \(S_R(t)\) and imaginary \(S_I(t)\) values (or equivalently I and Q).

The time derivative of \(\tan^{-1}[x(t)]\) is the trigonometric identity

\[\frac{\partial \tan^{-1} x(t)}{\partial t} = \frac{1}{1+x^2(t)} \frac{\partial x(t)}{\partial t}\]

Here \(x(t) = \frac{S_I(t)}{S_R(t)}\). Substituting that into the equation gives

\[\frac{\partial\tan^{-1}\frac{S_I(t)}{S_R(t)}}{\partial t} = \frac{1}{1+(\frac{S_I(t)}{S_R(t)})^2} \frac{\partial \frac{S_I(t)}{S_R(t)}}{\partial t}\]

\(\frac{\partial \frac{S_I(t)}{S_R(t)}}{\partial t}\) can be rewritten using the calculus identity for the derivative of a ratio as

\[\frac{\partial\frac{S_I(t)}{S_R(t)}}{\partial t} = \frac{S_R(t)\frac{\partial S_I(t)}{\partial t}-S_I(t)\frac{\partial S_R(t)}{\partial t}}{S_R(t)^2}\]

Substituting that into \(\frac{\partial\tan^{-1}\frac{S_I(t)}{S_R(t)}}{\partial t}\) gives

\[\frac{\partial\tan^{-1}\frac{S_I(t)}{S_R(t)}}{\partial t} = \frac{1}{1+(\frac{S_I(t)}{S_R(t)})^2} \frac{S_R(t)\frac{\partial S_I(t)}{\partial t}-S_I(t)\frac{\partial S_R(t)}{\partial t}}{S_R(t)^2}\]

Multiplying the numerator and denominator of the first ratio by \(S_R(t)^2\)

\[\phi'(t) = \frac{S_R(t)S_I'(t)-S_R'(t)S_I(t)}{S_R^2(t)+S_I^2(t)}\]

mono_fm uses a simple approximation for the derivative which simply calculates the difference between the current and previous samples.

\[S_R'(t) = S_R(t) - S_R(t-1) S_I'(t) = S_I(t) - S_I(t-1)\]

This is done for both the real and imaginary parts. This computation is done using the lfilter function in the Python code. The b array is used to calculate the differences between consecutive elements.

Substituting these equations yields the final computation that is done for the discriminator.

\[\phi'(t) = \frac{S_R(t)(S_I(t) - S_I(t-1))-(S_R(t) - S_R(t-1))S_I(t)}{S_R^2(t)+S_I^2(t)}\]

Despite the somewhat involved derivation, the end result is a fairly simple calculation that can be performed efficiently in hardware.

6) PYNQ Demo

You should integrate your HLS code into a Jupyter notebook and demonstrate real-time performance, i.e., the total processing time less is less than the sampling time.

The provided Jupyter notebooks provide methods to sample real-data using the RTL SDR. There is also a prerecorded file. Your system should be able to take data in this format, process it in real-time, output it in the expected format, and play correctly using the provided functions. This must be done in real-time. For example, your system must be able to process 1 second of data in under one second.

You can use any method of interfacing as long as it performs the processing in real-time. This will require that the I/O between the PS and PL is efficient. This can very easily become a bottleneck and limit your ability to perform real-time processing. A real-time HLS IP core is useless if your cannot use it to the best of its performance. You are highly encouraged to modify the code to achieve a better performance and observe the throughput by changing the way you transmit data between PS and PL. Make use of the “RTL 2832” USB tuner in-order to receive the input RF Samples.

7) Optimization Guidelines

Hardware / Software Interface: An important consideration in your overall design will be the way that you exchange data between the mono_fm IP core on the programmable logic (PL) and the Jupyter notebook running on the processing system (PS). This interface will define the interface to your IP core, so think carefully about the entire system design before diving into the HLS design process.
Testing: You must develop a strategy for testing your HLS code. Perform incremental testing. Design and test the subfunctions before integrating them together.
Real-time Performance Constraints: Your ultimate goal is to perform real-time demodulation. The SDR sampling rate is set at 2.4 million samples per second (MSPS). That requires a real-time mono_fm to be have an initiation interval less than 417 ns. Using a clock period of 10 ns, that allows for approximately 40 cycles per sample. Given the fairly simplistic nature of the computations, this should be feasible. Do not forget about the potential effects of the HW/SW interface on the overall design.
System Design: You are responsible for understanding the system constraints and delivering a system that meets those constraints. There are many ways to build a system. The best systems are the ones that work reliably. Always work first on developing a functional baseline implementation. Then optimize it to achieve real-time performance.

8) Submission Procedure

You have to submit your code (and only your code, not other files, not HLS project files). Your code should have everything in it so that we can synthesize it directly. This means that you should use pragmas in your code, and not use the GUI to insert optimization directives. We must be able to only import your source file and directly synthesize it.

You must follow the file structure below. We use automated scripts to pull your data, so DOUBLE CHECK your file/folder names to make sure it corresponds to the instructions.

Your repo must contain a folder named “mono_fm” at the top-level. This folder must be organized as follows (similar to previous projects):

Contents:

Report.pdf
Folder fm-demodulator
- Source code (*.cpp, *.h, *.tcl) and reports (.rpt and .xml).
Folder Demo
- .bit and .hwh files
- FM.ipynb host file

Report: For this project, you must submit a report that describes the implementation of your design. You may add figures, diagrams, tables, or charts to describe your architectures with a short paragraph explaining them. There are no specific questions to answer. Just explain your design. We will check if (1) your final FM Demodulation functions are functionally correct (they pass their test benches) and (2) achieves target latency. The report will help us to understand your design. You also can use this report to explain your work for bonus part (check the grading section).

9) Grading Rubric

30 points: Functionally correct HLS design. You will get full credit if we are able to build and test your HLS code without any effort. You must provide testbenches that demonstrate the correct functionality of your code. You need to report the throughput of the final design.

30 points: Basic Pynq Demo. Demonstrate the ability to demodulate the prerecorded file through your HLS code. You will get full credit for clear audio output and an RMSE less than 1e-6.

10 points: Real-time Pynq Demo. Demonstrate the ability to perform real-time demodulation. For example, show that your system can demodulate the 1 second prerecorded file in under 1 second.

10 points: Interface with RTL SDR. Demonstrate the ability to directly acquire real-time samples using the RTL SDR.

20 points: Report. Detail the system design decisions, HLS code, testbenches, HW/SW interfaces, and other aspects of the system. Talk about challenges. Report should be well formatted, use figures and diagrams to aid explanations, and free of spelling and grammar errors.

Bonus:

30 Points: Output audio on Pynq board. Modify your design to listen to the audio using the Pynq audio. The Pynq base overlay has the ability to use of audio instance on the Pynq Board. One approach would be to add your system into the base overlay. Another approach would extract the important parts of the base overall into your designs. More information about the audio module on Pynq Z2 can be found here and here. Detailed description of the BaseOverlay can be found here.