Soyogi

Floating-point/Fixed-point Ogg/Vorbis decoder for embedded platforms

Overview

Soyogi (そよぎ, "tremor" in Japanese) is a new implementation of an Ogg/Vorbis decoder that prioritizes a small memory footprint over speed. Soyogi is around two times slower than other decoders such as Tremor, but empirically it can decode most common Vorbis files with just 48KiB of RAM.

Its salient features include:

50% space saving by computing the MDCT via an "in-place" DCT-IV
Fully static memory use, in the sense that neither a heap area nor a dynamic stack is required
Variable-length CAR (Compressed Array Representation) for Huffman trees, which saves around 25% of codebook RAM
Most stereo Ogg/Vorbis files with blocksizes 256 and 2048 can be decoded with only 24KiB of codec RAM on average
A fixed-point implementation for environments without an FPU is also available as a build option
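
To illustrate the idea of storing a Huffman tree as a flat array, here is a minimal fixed-width sketch. It is not Soyogi's actual variable-length CAR layout, which this README does not describe; the names `huff_decode` and `HUFF_LEAF` and the 16-bit slot format are assumptions made purely for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical node format: node i occupies slots tree[2*i] (bit 0) and
 * tree[2*i + 1] (bit 1). A slot with the top bit set is a leaf carrying a
 * 15-bit entry number; otherwise the slot is the index of the next node. */
#define HUFF_LEAF 0x8000u

/* Walks the tree using bits[*pos], bits[*pos + 1], ... (one bit per byte,
 * for clarity only). Returns the decoded entry, or -1 if the bits run out
 * before a leaf is reached. */
static int huff_decode(const uint16_t *tree, const uint8_t *bits,
                       size_t nbits, size_t *pos)
{
    uint16_t node = 0;
    while (*pos < nbits) {
        uint16_t slot = tree[2u * node + (bits[(*pos)++] & 1u)];
        if (slot & HUFF_LEAF)
            return (int)(slot & 0x7FFFu);
        node = slot;
    }
    return -1;
}
```

With the codes A=0, B=10, C=11, the whole tree fits in four 16-bit slots: `{ HUFF_LEAF | 0, 1, HUFF_LEAF | 1, HUFF_LEAF | 2 }`. A variable-length representation would presumably shrink this further by spending fewer bits on small indices.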

Inverse MDCT via DCT-IV

Instead of expanding each block into 2N time-domain samples, Soyogi performs a DCT-IV on the N frequency-domain coefficients and keeps the N-sized output until the "Overlap-Add" step. TDAC (time-domain aliasing cancellation) is performed at the same time as the two vectors are overlapped, which can be regarded as an additional butterfly stage.

Fully static in terms of memory use

All setup components are allocated on a dedicated stack whose capacity is configurable up to 64KiB. 36KiB should be enough to decode a stereo Ogg/Vorbis file with blocksizes N and 4096. Note that a PCM buffer that stores the decoder output frame by frame is still required; when the output is 100% buffered, the average RAM consumption should be around 48KiB.
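
A dedicated setup stack of this kind can be sketched as a bump allocator over a statically sized array. This is a minimal illustration, not Soyogi's actual code: `SETUP_STACK_SIZE`, `setup_alloc`, and `setup_reset` are hypothetical names, and the 4-byte alignment is an assumption.

```c
#include <stddef.h>
#include <stdint.h>

#define SETUP_STACK_SIZE 36864u  /* hypothetical: 36KiB, must not exceed 65536 */

/* Backed by a word array so the base address is 4-byte aligned. */
static uint32_t setup_stack[SETUP_STACK_SIZE / 4u];
static uint32_t setup_top;       /* offset of the next free byte */

/* Returns a 4-byte-aligned block, or NULL when the stack is exhausted.
 * Blocks are never freed individually. */
static void *setup_alloc(size_t size)
{
    uint32_t start = (setup_top + 3u) & ~3u;
    if (size > SETUP_STACK_SIZE || start > SETUP_STACK_SIZE - (uint32_t)size)
        return NULL;
    setup_top = start + (uint32_t)size;
    return (uint8_t *)setup_stack + start;
}

/* Discards every allocation at once, e.g. before parsing a new stream. */
static void setup_reset(void)
{
    setup_top = 0;
}
```

Because everything lives in one fixed array, the worst-case memory use is known at link time and all setup data sits in a single address range.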

Hardware failsafe assumed

Soyogi performs almost no error checking during packet decode. Instead, the MCU hardware is expected to guard against illegal accesses with its MPU (Memory Protection Unit). Using a single dedicated stack for codec setup makes it easier to confine where memory errors can occur. Errors are accumulated and checked once at the end of each packet.
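
The "accumulate, then check once" pattern can be sketched with a bit reader that carries a sticky error flag. The type and function names below are hypothetical and the code is only an illustration of the pattern, not Soyogi's reader; the LSB-first bit order follows the Vorbis bit-packing convention.

```c
#include <stddef.h>
#include <stdint.h>

typedef struct {
    const uint8_t *data;  /* packet bytes */
    size_t nbits;         /* total number of valid bits in the packet */
    size_t pos;           /* index of the next bit to read */
    int error;            /* sticky: set on overrun, never cleared by reads */
} bitreader_t;

/* Reads n (<= 32) bits, least significant bit first. On overrun the missing
 * bits read as zero and the error flag is set; the caller does not need to
 * test anything until the whole packet has been decoded. */
static uint32_t br_read(bitreader_t *br, unsigned n)
{
    uint32_t v = 0;
    for (unsigned i = 0; i < n; i++) {
        if (br->pos >= br->nbits) {
            br->error = 1;
            break;
        }
        uint32_t bit = (uint32_t)(br->data[br->pos >> 3] >> (br->pos & 7u)) & 1u;
        v |= bit << i;
        br->pos++;
    }
    return v;
}
```

A decoder built this way runs the packet decode straight through and inspects `error` once at the end, discarding the frame if it is set.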

Fixed-point version

The fixed-point version of Soyogi is preferable if your CPU supports 32x32=64 multiplication, that is, multiplying two 32-bit operands into a 64-bit result.
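
As a minimal sketch of why the widening multiply matters, a fixed-point product can be written as below. The Q1.31 format and the name `mul_q31` are assumptions for illustration; the README does not state which fixed-point format Soyogi uses.

```c
#include <stdint.h>

/* Multiplies two Q1.31 fixed-point values (assumed format). The 64-bit
 * intermediate keeps all product bits before scaling back down. Right
 * shift of a negative value is implementation-defined in C; common
 * compilers perform an arithmetic shift. */
static inline int32_t mul_q31(int32_t a, int32_t b)
{
    return (int32_t)(((int64_t)a * (int64_t)b) >> 31);
}
```

On a CPU with a 32x32=64 multiply instruction the cast-and-multiply typically compiles to a single instruction, whereas without one the compiler has to synthesize the 64-bit product from several narrower multiplies.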

Limitations

The maximum number of (sparse) codebook entries is 32767.
- Practical bound is at around 7000.
The maximum number of codebooks is 255.
- Practical bound is at 50.
The maximum size of setup stack is 65536 bytes.
- Practical bound for blocksizes N/4096 is at 49152.
Blocksizes N/8192 are not supported.
Floor 0 is not supported.
Submaps are not supported.
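
The hard limits above could be checked up front with something like the following sketch. The struct and its field names are hypothetical, and reading "submaps are not supported" as "exactly one submap per mapping" is an assumption.

```c
#include <stdint.h>

/* Hypothetical summary of a parsed Vorbis setup; field names are illustrative. */
typedef struct {
    uint32_t codebook_count;
    uint32_t max_codebook_entries;  /* largest (sparse) entry count of any codebook */
    uint32_t long_blocksize;
    uint32_t floor_type;            /* 0 or 1 in Vorbis I */
    uint32_t submap_count;          /* per mapping */
    uint32_t setup_bytes;           /* setup stack space required */
} stream_summary_t;

/* Returns 1 if the stream stays within the hard limits listed above. */
static int stream_is_supported(const stream_summary_t *s)
{
    return s->codebook_count <= 255u
        && s->max_codebook_entries <= 32767u
        && s->long_blocksize <= 4096u
        && s->floor_type == 1u
        && s->submap_count == 1u   /* assumption: one submap only */
        && s->setup_bytes <= 65536u;
}
```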

Software SRC capability

An FIR-based sampling rate converter from 44.1kHz to 48kHz is implemented so that both 11.025/22.05/44.1kHz and 12/24/48kHz audio can be played with a single I2S master clock source.

Development status

As of December 2015, the decoder appears to be functional and its feature set is frozen, although bugs may still surface when an irregular stream is given. Several Vorbis files have been decoded successfully to clean PCM samples. There is currently no plan to support Floor 0 or submaps.
