Sources:
- https://cmloegcmluin.wordpress.com/2012/11/10/relative-frequencies-of-english-phonemes/
- http://www.speech.cs.cmu.edu/cgi-bin/cmudict
- http://www.kilgarriff.co.uk/bnc-readme.html
To do:
. use more than just the first pronunciation in cmudict
. <er> is one phoneme etc
. phone or phoneme?
. manual error checking
. transcribe some of the uncorrelateds
. reread cmloegcmluin
x ARPAbet -> IPA
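
A minimal sketch of the ARPAbet -> IPA step marked done above, assuming cmudict-style symbols with an optional trailing stress digit (e.g. ER1). The mapping table is deliberately partial, and the names ARPABET_TO_IPA and arpabet_to_ipa are hypothetical, not taken from this repository's code.

```python
import re

# Partial, illustrative table only; a real one would cover every cmudict symbol.
ARPABET_TO_IPA = {
    "AA": "ɑ", "AE": "æ", "ER": "ɝ", "IY": "i", "UW": "u",
    "D": "d", "K": "k", "R": "ɹ", "S": "s", "T": "t", "W": "w",
}


def arpabet_to_ipa(symbols):
    """Translate a list of ARPAbet symbols to IPA, ignoring stress digits."""
    out = []
    for sym in symbols:
        base = re.sub(r"\d$", "", sym)  # strip a trailing stress marker (0/1/2)
        # Symbols missing from the table are kept visibly, in angle brackets,
        # rather than guessed at.
        out.append(ARPABET_TO_IPA.get(base, f"<{base}>"))
    return out
```

Note that ER maps to a single IPA symbol here, which is one possible reading of the "<er> is one phoneme" item.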
Changelog:
- v0.2.0: Translate ARPAbet to IPA
- v0.1.0: First steps; Frequencies generally and post-/w/
  - Processing:
    - Use only the first pronunciation in cmudict
    - Discard uncorrelateds entirely
    - No manual error checking etc
  - Results:
    - Q1: Frequencies of phonemes generally
    - Q2: Frequencies of phonemes post-/w/
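
The v0.1.0 processing and the two questions can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's actual script: it assumes cmudict lines of the form "WORD  PH1 PH2 ...", alternate pronunciations marked as "WORD(1)", and comment lines starting with ";;;". The optional word_freq mapping stands in for corpus counts (e.g. from the BNC list), with keys assumed to match the cmudict spelling of each word. Reading "uncorrelateds" as words that appear in only one of the two sources is also an assumption. The function names are hypothetical.

```python
import re
from collections import Counter


def first_pronunciations(lines):
    """Yield (word, phones) for the first listed pronunciation of each word."""
    for line in lines:
        line = line.strip()
        if not line or line.startswith(";;;"):  # skip blanks and comments
            continue
        word, *phones = line.split()
        if re.search(r"\(\d+\)$", word):  # assumed marker for alternates
            continue
        # Drop stress digits so AH0 and AH1 count as the same symbol.
        yield word, [p.rstrip("012") for p in phones]


def phoneme_counts(entries, word_freq=None):
    """Return (general, post_w) Counters of phoneme occurrences.

    If word_freq is given, each word is weighted by its corpus count and
    words missing from word_freq are discarded entirely.
    """
    general, post_w = Counter(), Counter()
    for word, phones in entries:
        if word_freq is not None and word not in word_freq:
            continue
        weight = 1 if word_freq is None else word_freq[word]
        # Pair each phoneme with its predecessor (None at word start).
        for prev, cur in zip([None] + phones, phones):
            general[cur] += weight          # Q1: frequencies generally
            if prev == "W":
                post_w[cur] += weight       # Q2: frequencies post-/w/
    return general, post_w
```

Without word_freq every dictionary entry counts once; passing corpus counts turns the result into token frequencies instead of type frequencies.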