Project author: nick-monto

Project description:
A neural network featuring convolutional layers designed to classify eight different languages. Class activation maps are used to investigate similarities in activation patterns across and within languages.
Primary language: Python
Repository: git://github.com/nick-monto/LangNet_CNN.git
Created: 2017-04-01T23:22:42Z
Project community: https://github.com/nick-monto/LangNet_CNN

License: MIT License


LangNet_CNN

Hrayr Harutyunyan showed that CNNs are very good at identifying which language is being spoken, given multiple candidate languages.

This repository is for my exploration of CNNs in language recognition. I am currently using files from several Shtooka databases spanning eight languages. I am still searching for other freely available spoken-word databases that contain recordings across many languages, with multiple speakers within each language.

The purpose of this work is to examine the features that the CNN architecture deems important for distinguishing between languages.

Preprocessing uses two other folders: one containing the FLAC files from Shtooka, and one containing the WAV files converted from them. I have not uploaded these, as they each contain over 6000 files.
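
Since those two folders are not in the repository, here is a minimal sketch of how the FLAC-to-WAV step could be reproduced locally. It is not necessarily the script used for this project. It assumes the ffmpeg command-line tool is installed, and the function names and folder arguments are hypothetical placeholders.

```python
from pathlib import Path
import subprocess


def plan_conversions(flac_dir, wav_dir):
    """Pair every .flac file in flac_dir with a same-named .wav path in wav_dir."""
    flac_dir, wav_dir = Path(flac_dir), Path(wav_dir)
    return [(src, wav_dir / (src.stem + ".wav"))
            for src in sorted(flac_dir.glob("*.flac"))]


def convert_all(flac_dir, wav_dir):
    """Convert each planned pair by calling ffmpeg (assumed to be on PATH)."""
    Path(wav_dir).mkdir(parents=True, exist_ok=True)
    for src, dst in plan_conversions(flac_dir, wav_dir):
        # -y overwrites an existing output file; -i names the input file.
        subprocess.run(["ffmpeg", "-y", "-i", str(src), str(dst)], check=True)
```

Calling `convert_all("flac", "wav")` with your own folder names would fill the second folder. `plan_conversions` is kept separate so the file pairing can be checked without running ffmpeg.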