Links & FAQ

Q1: How do I access TACC (the Texas Advanced Computing Center, a very powerful university supercomputing facility)?

A1:

Step 1: Go to the following link to register your account:

https://portal.tacc.utexas.edu/account-request

Step 2: Ask your boss to go to the following link and register his or her own account:

https://portal.tacc.utexas.edu/account-request

Step 3: Ask your boss to add you under his or her sponsorship. You can then read the directions and begin using the server.

Q2: Is there a demo of denoising?

A2:

http://www.youtube.com/watch?v=kkWeNsunocg

Q3: ISCA Archive

A3:

www.isca-speech.org/archive

Q4: Where can I find code for HMMs?

A4:

MATLAB-based: https://code.google.com/p/pmtk3/

C-based: http://htk.eng.cam.ac.uk/

Q5: How do I find special characters for Word/PowerPoint?

A5:

http://www.theworldofstuff.com/characters/

Q6: Do you know of any scholarships for speech-related PhD study?

A6:

http://cerg1.ugc.edu.hk/hkpfs/Leaflet2013-14HKPFS.pdf

http://www.eie.polyu.edu.hk/~mwmak/mypubl.htm

Q7: QR code generator

A7: http://www.the-qrcode-generator.com

Q8: Deadlines of signal processing conferences

A8: http://www.signalprocessingsociety.org/conferences/upcoming-conferences/

Q9: Can speaker verification help in this legal case: Evidence of Who Was Yelling: George Zimmerman? Trayvon Martin? Both?

A9:

http://en.wikipedia.org/wiki/Shooting_of_Trayvon_Martin

http://www.youtube.com/watch?v=BpH8M6tATU4

http://www.youtube.com/watch?v=73tPKeKddfE

George Doddington refuted analysis from a state witness who testified that he detected Martin screaming “I’m begging you” on a 911 call.

http://www.theblaze.com/stories/2013/06/09/speech-expert-trayvon-martin-screaming-for-help-in-911-call-is-imaginary-stuff/

Q10: E-book for NLP?

A10:

Natural Language Processing with Python http://nltk.org/

http://nltk.googlecode.com/svn/trunk/doc/book/book.html

Q11: Where can I find the lists of best student papers for INTERSPEECH and ICASSP?

A11:

For INTERSPEECH:

http://www.isca-speech.org/iscaweb/index.php/honors/awards

For ICASSP, it looks like you have to check the website of each individual conference. For example, for ICASSP 2013:

http://www.icassp2013.com/GrantsAwards.asp

Q12: Are there any ASR games?

A12: http://iambrandonn.github.io/FlashCards/

Q13: Toolkit for VAD.

A13:

http://www-lium.univ-lemans.fr/diarization/doku.php/welcome

http://cs.joensuu.fi/pages/tkinnu/VQVAD/VQVAD.zip

Q14: Scholarship or Fellowship

A14:

Google scholarship: (Deadline: Dec.)

http://googleforstudents.blogspot.com/2013/09/applications-now-open-for-google.html

IBM student Scholarship: (No link can be found at this time)

IBM Ph.D. fellowship: (Deadline: Oct.)

http://www.research.ibm.com/university/phdfellowship/

Microsoft Research PhD Fellowship: (Deadline: Sep. or Oct.)

http://research.microsoft.com/en-us/collaboration/awards/apply-us.aspx

As for timing, refer to the following letter:

I would like to announce the 2011 Microsoft Research PhD Fellowship Program. Each fellowship includes a $28,000 stipend, payment of tuition/fees, and conference travel for two years. In addition, we offer each Fellow the opportunity to be a 12-week, salaried intern at Microsoft Research. The Microsoft Research PhD Fellowship Program for 2011 has started accepting applications beginning today and the deadline for submission of applications is Friday, October 8, 2010, at 5:00 P.M. Pacific Time. Applicants must be enrolled in a full-time PhD program in an accredited university in the US or Canada with a research focus in Computer Science, Mathematics, or Electrical Engineering. For additional details of eligibility requirements and the application procedure, please see: http://research.microsoft.com/en-us/collaboration/awards/apply-us.aspx

Microsoft Research is also pleased to begin accepting applications for the 2011 Microsoft Research Graduate Women's Scholarship Program, today, August 17, 2010. Each scholarship includes a $15,000 stipend, payment of tuition/fees, and conference travel for two years. The deadline for submission of applications is Friday, October 8, 2010, at 5:00 P.M. Pacific Time. Applicants must be full-time, female students, enrolled in their first year of graduate program in an accredited university in the US or Canada with a research focus in Computer Science, Mathematics, or Electrical Engineering. For qualifications and application procedures, please see http://research.microsoft.com/en-us/collaboration/awards/fellows-women.aspx.

Q15: What is the difference between training, test, and development data sets?

A15: Data split:

1) Training data set (also called the enrollment data set). For example, you may have 3 classes, each with 2 samples. In speaker verification, we would say that each enrollment speaker has two sessions.

2) Test data set

3) Development data set (or background data set): the goal of this set is to find the optimal parameters, which are then applied to the "Test data set".

A good data split should make sure that:

there is no overlap among the above 3 data sets.

the development data set closely matches what we encounter in the "Test data set", if possible.
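The no-overlap rule above can be sketched in a few lines of Python. This is only an illustration under assumptions: the sample IDs, the 20% fractions, and the helper names split_samples and check_no_overlap are made up for this example, and the right unit to split on (utterance, session, or speaker) depends on your task.

```python
import random


def split_samples(sample_ids, dev_fraction=0.2, test_fraction=0.2, seed=0):
    """Partition sample IDs into disjoint train, dev, and test lists.

    The fractions are illustrative defaults, not recommended values.
    """
    samples = sorted(set(sample_ids))   # drop duplicates, fix the order
    rng = random.Random(seed)           # fixed seed for a repeatable split
    rng.shuffle(samples)
    n_dev = int(len(samples) * dev_fraction)
    n_test = int(len(samples) * test_fraction)
    dev = samples[:n_dev]
    test = samples[n_dev:n_dev + n_test]
    train = samples[n_dev + n_test:]
    return train, dev, test


def check_no_overlap(train, dev, test):
    """Return True when no item appears in more than one set."""
    a, b, c = set(train), set(dev), set(test)
    return not (a & b or a & c or b & c)


# Hypothetical sample IDs, for illustration only.
samples = ["utt%02d" % i for i in range(10)]
train, dev, test = split_samples(samples)
print(len(train), len(dev), len(test))       # prints: 6 2 2
print(check_no_overlap(train, dev, test))    # prints: True
```

The sketch only enforces the first requirement (no overlap). Whether the development set matches the test conditions still has to be checked by hand, for example by comparing channel, noise, or language between the two sets.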

Q16: What are the impact factors of the major speech conferences?

A16:

For conference impact factors, http://citescholar.org gives the following indication (2010 impact factor, conference):

1.16 ICASSP

1.05 INTERSPEECH

Q17: Where can I find a public corpus for speech enhancement/speaker identification?

A17: Here is one example from MITLL (some researchers have also used it for speaker identification):

https://www.ll.mit.edu/mission/cybersec/HLT/corpora/SpeechCorpora.html

  • Lincoln Laboratory Speech Enhancement Corpus (LLSEC) - The LLSEC was collected under DARPA funding to further research in the area of speech enhancement. The LLSEC is a multi-microphone speech corpus collected under different scenarios to realistically reflect a wide variety of degraded speech conditions.

Q18: Where can I find the file lists for NIST SRE?

A18:

http://cls.ru.nl/~saeidi/file_library/I4U.tgz

http://code.google.com/p/prism-set/

Q19: Is there any toolkit for telephone speech enhancement?

A19:

“Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm; compression algorithms,” in ETSI standard document ETSI ES 202 050 v1.1.1, 2002.

http://www.etsi.org/deliver/etsi_es/202000_202099/202050/01.01.05_60/es_202050v010105p.pdf

http://www.etsi.org/deliver/etsi_es/202000_202099/202050/01.01.05_60/es_202050v010105p0.zip

Q20: Upcoming Signal Processing Conferences

A20:

http://www.signalprocessingsociety.org/conferences/upcoming-conferences/

Q21: How to write a great research paper?

A21: Please refer to "How to write a great research paper" by Simon Peyton Jones:

1. Don’t wait: write

2. Identify your key idea

3. Tell a story

4. Nail your contributions

5. Related work: later

6. Put your readers first (examples)

7. Listen to your readers

More: http://research.microsoft.com/~simonpj

Q22: Where can I find a free accent corpus?

A22:

http://accent.gmu.edu/about.php