Spoken dialogue system open resources

OpenDial – java

https://github.com/plison/opendial

https://pdfs.semanticscholar.org/7981/324bcad5812ccf789d2091414e19138047dc.pdf

DeepPavlov – Python

https://github.com/deepmipt/DeepPavlov

Jindigo – Java

http://www.speech.kth.se/jindigo/

jVoiceXML – JAVA

https://github.com/JVoiceXML/JVoiceXML

CMU RavenClaw – C++/Perl

https://www.cs.cmu.edu/~dbohus/ravenclaw-olympus/index-dan.html

PED – prolog

http://planeffdia.sourceforge.net/main/

OwlSpeak – Java

https://sourceforge.net/projects/owlspeak/

IrisTK – java

http://www.iristk.net/index.html

InproTK – java python

https://bitbucket.org/inpro/inprotk

Rivr – Java – voiceXML

https://github.com/nuecho/rivr/#overview

 

summary ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

https://github.com/EllaVator/EllaVator/wiki/Open-source-dialog-frameworks

Cloud Commercial like the FB wit.ai, Microsoft LUIS, Nuance and google api.ai

Advertisements

Speech Recognition and Speech Synthesis open resources

CMU-Sphinx  C/C++/JAVA

Kaldi

HTK

Julius

RWTH

simon

iATROS-speech

SHoUT

Zanzibar

OpenIVR

MSDN-SAPI:http://msdn.microsoft.com/zh-cn/library/ms723627.aspx

CMU-Sphinx: http://en.wikipedia.org/wiki/CMU_Sphinx

HTK Toolkit:http://htk.eng.cam.ac.uk/

Julius:http://en.wikipedia.org/wiki/Julius_(software)

RWTH ASR:http://en.wikipedia.org/wiki/RWTH_ASR

http://en.wikipedia.org/wiki/List_of_speech_recognition_software

 

http://ibillxia.github.io/blog/2012/11/24/several-plantforms-on-audio-and-speech-signal-processing/

http://zh.wikipedia.org/wiki/语音识别
http://baike.baidu.com/view/549184.htm

 

 

Speaker Recognition and Diarization open resources

Speaker Recognition

ALIZE/LIA_RAL – C++

https://github.com/ALIZE-Speaker-Recognition/LIA_RAL

SIDEKIT  – python
MSR Identity Toolbox – matlab
Kaldi – scripting
Examples
===========================================================================

Discussion

http://habla.dc.uba.ar/gravano/ith-2014/presentaciones/Dehak_et_al_2010.pdf

 

GMM-UBM i-Vector

http://cslt.riit.tsinghua.edu.cn/mediawiki/images/c/cb/131104-ivector-microsoft-wj.pdf

https://people.csail.mit.edu/sshum/talks/ivector_tutorial_interspeech_27Aug2011.pdf

https://speechlab.sjtu.edu.cn/pages/sw121/homepage/2016/05/20/ivector-tutorial/

https://blog.csdn.net/xmu_jupiter/article/details/47209961

https://blog.csdn.net/zhangxueyang1/article/details/66971997

 

Speaker Diarization

LIUM – JAVA

http://www-lium.univ-lemans.fr/diarization/doku.php/welcome

https://github.com/StevenLOL/LIUM

kaldi CALLHOME_diarization – scripting

https://github.com/kaldi-asr/kaldi/tree/master/egs/callhome_diarization

https://github.com/Jamiroquai88/VBDiarization

Pyannote – python

https://github.com/pyannote/pyannote-audio

aalto speech – python for segment

https://github.com/aalto-speech/speaker-diarization