Create synthetic voice for a target language, using festival.
http://transfer.arcadia.co.jp/openupload/www/index.php?action=d&step=2
NET119 introduction: https://www.youtube.com/watch?v=PYb4p2wqP00
Playlist: https://www.youtube.com/playlist?list=PLD78C4Y4xMTfqC411fB4w_GKJDQXW58d8
git checkout RELEASE-4.0 <== Now using this!
git branch
git branch -a
git log --oneline --decorate --graph --all
git remote show origin
mkdir ~/arcadia/smtdevenv; cd ~/arcadia/smtdevenv
git clone https://github.com/ivansetiawantky/smtdevenv.git .
mkdir ~/arcadia/smtdevenv/sharedwks
docker search ivansetiawantky
sudo apt-get install jq
(Command line JSON processor)docker pull ivansetiawantky/smtdevenv:latest ; docker images -a ; docker inspect ivansetiawantky/smtdevenv:latest | jq '.[0].Config.Labels'
docker container ls -a
docker run --rm -ti --name mysmtdev -v $HOME/arcadia/smtdevenv/sharedwks:/home/smtdev/sharedwks ivansetiawantky/smtdevenv:latest byobu new
docker container attach mysmtdev
cd /home/smtdev/localwks/sample-models/
/home/smtdev/localwks/mosesdecoder/bin/moses -f phrase-model/moses.ini < phrase-model/in > out2
diff out*
: same!!!!First, try the Getting Started - Baseline System (fr2en)
This part is the Getting Started - Baseline System, where the example of building machine translation (mosesdecoder) based on fr→en (fr2en) parallel corpus to do SMT is shown.
Minimum s/w requirements are:
Base directory is $HOME in the Baseline system tutorial, but here we use below base directory.
The corpus and moses.ini (machine translation model from training result) must be located outside the container, i.e., to be put in the volume shared by the host and the container (need to confirm). This is to hide the proprietary corpus from the public domain Docker image.
basedir=~/localwks; echo $basedir
modeldir=~/sharedwks/fr2en
Prepare word-alignment tools, GIZA++:
Second, try from below page (en2jp)
Syntax Based Machine Translation (jp2en)
To analyze the text morphology as part-of-speech, i.e., convert text to a part-of-speech.
MeCab: http://taku910.github.io/mecab/
Fonem (音素) bahasa Indonesia berdasarkan IPA (International Phonetic and Alphabet):