
A 6bpp run length encoded image or video is read from a W25Q128JV or similar QSPI flash, and output to 640x480 VGA.
This is perfect for displaying the Bad Apple music video.

The encoding uses 16-bit words. Most words are a run length in the top 10 bits, and a colour in the bottom 6 bits. A run must come to the end at the end of each row.
A run must be at least 2 pixels, and any group of 3 consecutive runs within a row must be at least 12 pixels, otherwise the data buffer will empty.
8-bit mono audio data can be interleaved into the video stream.  The PWM output value is updated by the value 0xC000 + sample, these must be at the end of a row, but do not have to be present on every row.  With a 24MHz project clock the row clock is exactly 30kHz.
To compress the audio slightly, sample deltas can also be used, packing 2, 3 or 4 samples into one command. These add a signed offset to the current sample value at the end of the next 2, 3 or 4 rows:
0xD000 + (offset1 << 6) + offset2 with 2 6-bit signed offsets0xE000 + (offset1 << 8) + (offset2 << 4) + offset3 with 3 4-bit signed offsets0xF000 + (offset1 << 9) + (offset2 << 6) + (offset3 << 3) + offset4 with 4 3-bit signed offsetsThis means that quieter audio takes less space!
Note that row and frame repeat, which were supported on the TT07 and TT IHP 0p2 versions are not supported here because audio data is interleaved into the video data.
The data is read starting at address 0.  The special word 0xBFC0 causes the player to stop and restart from address 0 at the beginning of the next frame, restarting the video.  This could also be used to display a still image.
Create a RLE binary file (docs/scripts to do this TBD) and load onto the flash. The pinout matches the QSPI Pmod. This should be plugged into the audio Pmod, and then the audio Pmod plugged into the bidir pins. Note the flash must support the h6B Fast Read Quad Output command, with 8 dummy cycles between address and data.
Connect the Tiny VGA PMOD to the output pins.
Inputs 2-0 set the read latency for the SPI in half clock cycles, it's likely that will need to be set to 2 (set input 1 high and inputs 0 and 2 low). This latency depends on the total round trip time through the mux and out to the flash and back. Valid values are 1 to 4.
Run with a 24MHz clock.
The 16MB flash is only enough for the first minute of Bad Apple.  But because the flash read is just one very long read it would be straightforward to supply the data stream from the RP2040 or other external source.  To make it easier to do this from the demo board RP2040, the QSPI pin configuration can be modified by setting in3 high so that the 4 data pins are contiguous.
| # | Input | Output | Bidirectional | 
|---|---|---|---|
| 0 | SPI latency[0] | R[1] | CS | 
| 1 | SPI latency[1] | G[1] | SD0 / SCK | 
| 2 | SPI latency[2] | B[1] | SD1 / SD0 | 
| 3 | Select QSPI pinout | vsync | SCK / SD1 | 
| 4 | R[0] | SD2 | |
| 5 | G[0] | SD3 | |
| 6 | B[0] | Unused CS | |
| 7 | hsync | PWM audio |