Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information
The code in this repo aims to help reproduce the results in the work:
Rong Gong, Jordi Pons, and Xavier Serra. 2017. Audio to Score Matching by Combining Phonetic and Duration Information. In 18th International Society for Music Information Retrieval Conference. Suzhou, China
The objective of this research task to find the corresponding score for its singing query audio. By pre-segmenting both the singing audios and the music scores into the phrase units, we restrict this research to the “matching” scope. The matched scores could facilitate several lower-level MIR tasks, such as the score-informed automatic syllable or phoneme segmentation for singing voice.
The related code only situated in phoneticSimilarity folder. Other folders are used to test other matching methods.
dataset_path
variable in general/filePath.py
to locate the above datasetCythonModule
, in terminal type python setup.py build_ext --inplace
requirements.txt
class_name
in general/filePath.py
to 'danAll'
or 'laosheng'
to experiment on either dan or laosheng role-typeam
in general/parameters.py
to 'gmm'
or 'cnn'
to select acoustic modelpython runHMM.py
to produce the experiment results for HMM and post-processor duration modelling matchingpython runHSMM.py
to produce the experiment results for HSMM duration modelling matchingpython acousticModelTraining.py
numpy scipy matplotlib essentia vamp scikit-learn pinyin cython keras theano unicodecsv
Affero GNU General Public License version 3