Add libsamplerate for resampling decoder output

author: jacqueline <me@jacqueline.id.au> 2023-08-01 14:00:31 +1000
committer: jacqueline <me@jacqueline.id.au> 2023-08-01 14:00:31 +1000
commit: fbebc525117f18d5751e6951bc4ffcc51f70dcc4 (patch)
tree: 5725146701b816060fdd1f0979b2ff83fc4f7e24 /lib/libsamplerate/libsamplerate-0.2.2/docs/faq.md
parent: 55429fa6231cb576a79bbc7d6b0bf0732f5ea7a4 (diff)
download: tangara-fw-fbebc525117f18d5751e6951bc4ffcc51f70dcc4.tar.gz
1 files changed, 193 insertions, 0 deletions
diff --git a/lib/libsamplerate/libsamplerate-0.2.2/docs/faq.md b/lib/libsamplerate/libsamplerate-0.2.2/docs/faq.md
new file mode 100755
index 00000000..da3d87a9
--- /dev/null
+++ b/lib/libsamplerate/libsamplerate-0.2.2/docs/faq.md
@@ -0,0 +1,193 @@
+---
+layout: default
+---
+
+# Frequently Asked Questions
+
+1. [Q1 : Is it normal for the output of libsamplerate to be louder than its input?](#Q001)
+2. [Q2 : On Unix/Linux/MacOSX, what is the best way of detecting the presence and location of libsamplerate and its header file using autoconf?](#Q002)
+3. [Q3 : If I upsample and downsample to the original rate, for example 44.1-\>96-\>44.1, do I get an identical signal as the one before the up/down resampling?](#Q003)
+4. [Q4 : If I ran src\_simple (libsamplerate) on small chunks (160 frames) would that sound bad?](#Q004)
+5. [Q5 : I\'m using libsamplerate but the high quality settings sound worse than the SRC\_LINEAR converter. Why?](#Q005)
+6. [Q6 : I\'m use the SRC\_SINC\_\* converters and up-sampling by a ratio of 2. I reset the converter and put in 1000 samples and I expect to get 2000 samples out, but I\'m getting less than that. Why?](#Q006)
+7. [Q7 : I have input and output sample rates that are integer values, but the API wants me to divide one by the other and put the result in a floating point number. Won\'t this case problems for long running conversions?](#Q007)
+
+## Q1 : Is it normal for the output of libsamplerate to be louder than its input? {#Q001}
+
+The output of libsamplerate will be roughly the same volume as the input.
+However, even if the input is strictly in the range (-1.0, 1.0), it is still
+possible for the output to contain peak values outside this range.
+
+Consider four consecutive samples of [0.5 0.999 0.999 0.5]. If we are up
+sampling by a factor of two we need to insert samples between each of the
+existing samples. Its pretty obvious then, that the sample between the two 0.999
+values should and will be bigger than 0.999.
+
+This means that anyone using libsamplerate should normalize its output before
+doing things like saving the audio to a 16 bit WAV file.
+
+## Q2 : On Unix/Linux/MacOSX, what is the best way of detecting the presence and location of libsamplerate and its header file using autoconf? {#Q002}
+
+libsamplerate uses the pkg-config (man pkg-config) method of registering itself
+with the host system. The best way of detecting its presence is using something
+like this in configure.ac (or configure.in):
+
+    PKG_CHECK_MODULES(SAMPLERATE, samplerate >= 0.1.3,
+            ac_cv_samplerate=1, ac_cv_samplerate=0)
+
+    AC_DEFINE_UNQUOTED([HAVE_SAMPLERATE],${ac_cv_samplerate},
+            [Set to 1 if you have libsamplerate.])
+
+    AC_SUBST(SAMPLERATE_CFLAGS)
+    AC_SUBST(SAMPLERATE_LIBS)
+
+This will automatically set the **SAMPLERATE_CFLAGS** and **SAMPLERATE_LIBS**
+variables which can be used in Makefile.am or Makefile.in like this:
+
+    SAMPLERATE_CFLAGS = @SAMPLERATE_CFLAGS@
+    SAMPLERATE_LIBS = @SAMPLERATE_LIBS@
+
+If you install libsamplerate from source, you will probably need to set the
+**PKG_CONFIG_PATH** environment variable's suggested at the end of the
+libsamplerate configure process. For instance on my system I get this:
+
+    -=-=-=-=-=-=-=-=-=-= Configuration Complete =-=-=-=-=-=-=-=-=-=-=-
+
+        Configuration summary :
+
+        Version : ..................... 0.1.3
+        Enable debugging : ............ no
+
+        Tools :
+
+        Compiler is GCC : ............. yes
+        GCC major version : ........... 3
+
+        Extra tools required for testing and examples :
+
+        Have FFTW : ................... yes
+        Have libsndfile : ............. yes
+        Have libefence : .............. no
+
+        Installation directories :
+
+        Library directory : ........... /usr/local/lib
+        Program directory : ........... /usr/local/bin
+        Pkgconfig directory : ......... /usr/local/lib/pkgconfig
+
+## Q3 : If I upsample and downsample to the original rate, for example 44.1->96->44.1, do I get an identical signal as the one before the up/down resampling? {#Q003}
+
+The short answer is that for the general case, no, you don't. The long answer is
+that for some signals, with some converters, you will get very, very close.
+
+In order to resample correctly (ie using the **SRC_SINC_*** converters),
+filtering needs to be applied, regardless of whether its upsampling or
+downsampling. This filter needs to attenuate all frequencies above 0.5 times the
+minimum of the source and destination sample rate (call this fshmin). Since the
+filter needed to achieve full attenuation at this point, it has to start rolling
+off a some frequency below this point. It is this rolloff of the very highest
+frequencies which causes some of the loss.
+
+The other factor is that the filter itself can introduce transient artifacts
+which causes the output to be different to the input.
+
+## Q4 : If I ran src_simple on small chunks (say 160 frames) would that sound bad? {#Q004}
+
+Well if you are after odd sound effects, it might sound OK. If you are after
+high quality sample rate conversion you will be disappointed.
+
+The src_simple() was designed to provide a simple to use interface for people
+who wanted to do sample rate conversion on say, a whole file all at once.
+
+## Q5 : I'm using libsamplerate but the high quality settings sound worse than the SRC_LINEAR converter. Why? {#Q005}
+
+There are two possible problems. Firstly, if you are using the src_simple()
+function on successive blocks of a stream of samples, you will get bad results.
+The src_simple() function is designed for use on a whole sound file, all at
+once, not on contiguous segments of the same sound file. To fix the problem, you
+need to move to the src_process() API or the callback based API.
+
+If you are already using the src_process() API or the callback based API and the
+high quality settings sound worse than SRC_LINEAR, then you have other problems.
+Read on for more debugging hints.
+
+All of the higher quality converters need to keep state while doing conversions
+on segments of a large chunk of audio. This state information is kept inside the
+private data pointed to by the SRC_STATE pointer returned by the src_new()
+function. This means, that when you want to start doing sample rate conversion
+on a stream of data, you should call src_new() to get a new SRC_STATE pointer
+(or alternatively, call src_reset() on an existing SRC_STATE pointer). You
+should then pass this SRC_STATE pointer to the src_process() function with each
+new block of audio data. When you have completed the conversion, you can then
+call src_delete() on the SRC_STATE pointer.
+
+If you are doing all of the above correctly, you need to examine your usage of
+the values passed to src\_process() in the [SRC_DATA](api_misc.md#src_data)
+struct. Specifically:
+
+- Check that input_frames and output_frames fields are being set in terms of
+  frames (number of sample values times channels) instead of just the number of
+  samples.
+- Check that you are using the return values input_frames_used and
+  output_frames_gen to update your source and destination pointers correctly.
+- Check that you are updating the data_in and data_out pointers correctly for
+  each successive call.
+
+While doing the above, it is probably useful to compare what you are doing to
+what is done in the example programs in the examples/ directory of the source
+code tarball.
+
+If you have done all of the above and are still having problems then its
+probably time to email the author with the smallest chunk of code that
+adequately demonstrates your problem. This chunk should not need to be any more
+than 100 lines of code.
+
+## Q6 : I'm use the SRC_SINC_* converters and up-sampling by a ratio of 2. I reset the converter and put in 1000 samples and I expect to get 2000 samples out, but I'm getting less than that. Why? {#Q006}
+
+The short answer is that there is a transport delay inside the converter itself.
+Long answer follows.
+
+By way of example, the first time you call src_process() you might only get 1900
+samples out. However, after that first call all subsequent calls will probably
+get you about 2000 samples out for every 1000 samples you put in.
+
+The main problems people have with this transport delay is that they need to
+read out an exact number of samples and the transport delay scews this up. The
+best way to overcome this problem is to always supply more samples on the input
+than is actually needed to create the required number of output samples. With
+reference to the example above, if you always supply 1500 samples at the input,
+you will always get 2000 samples at the output. You will always need to keep
+track of the number of input frames used on each call to src_process() and deal
+with these values appropriately.
+
+## Q7 : I have input and output sample rates that are integer values, but the API wants me to divide one by the other and put the result in a floating point number. Won't this case problems for long running conversions? {#Q007}
+
+The short answer is no, the precision of the ratio is many orders of magnitude
+more than is really needed.
+
+For the long answer, lets do come calculations. Firstly, the `src_ratio` field
+is double precision floating point number which has [53 bits of precision](http://en.wikipedia.org/wiki/Double_precision).
+
+That means that the maximum error in your ratio converted to a double is one bit
+in 2^53 which means the double float value would be wrong by one sample
+after 9007199254740992 samples have passed or wrong by more than half a sample
+wrong after half that many (4503599627370496 samples) have passed.
+
+Now if for example our output sample rate is 96kHz then
+
+    4503599627370496 samples at 96kHz is 46912496118 seconds
+    46912496118 seconds is 781874935 minutes
+    781874935 minutes is 13031248 hours
+    13031248 hours is 542968 days
+    542968 days is 1486 years
+
+So, after 1486 years, the input will be wrong by more than half of one sampling
+period.
+
+All this assumes that the crystal oscillators uses to sample the audio stream is
+perfect. This is not the case. According to [this web site](http://www.ieee-uffc.org/freqcontrol/quartz/vig/vigcomp.htm),
+the accuracy of standard crystal oscillators (XO, TCXO, OCXO) is at best 1 in
+100 million. The `src_ratio` is therefore 45035996 times more accurate than the
+crystal clock source used to sample the original audio signal and any potential
+problem with the `src_ratio` being a floating point number will be completely
+swamped by sampling inaccuracies.
author	jacqueline <me@jacqueline.id.au>	2023-08-01 14:00:31 +1000
committer	jacqueline <me@jacqueline.id.au>	2023-08-01 14:00:31 +1000
commit	fbebc525117f18d5751e6951bc4ffcc51f70dcc4 (patch)
tree	5725146701b816060fdd1f0979b2ff83fc4f7e24 /lib/libsamplerate/libsamplerate-0.2.2/docs/faq.md
parent	55429fa6231cb576a79bbc7d6b0bf0732f5ea7a4 (diff)
download	tangara-fw-fbebc525117f18d5751e6951bc4ffcc51f70dcc4.tar.gz