Versions of local copies:
- cmudict: 0.7b. Retrieved May 28, 2018.
- kilgarriff: Retrieved May 28, 2018.
Sources:
- https://cmloegcmluin.wordpress.com/2012/11/10/relative-frequencies-of-english-phonemes/
- http://www.speech.cs.cmu.edu/cgi-bin/cmudict
- http://www.kilgarriff.co.uk/bnc-readme.html
To do:
. use more than just first pronunciation in cmudict
. <er> is one phoneme etc
. phone or phoneme?
. manual error checking
. transcribe some of the uncorrelateds
. reread cmloegcmluin
x ARPAbet -> IPA
Changelog:
- v0.2.0: Translate ARPAbet to IPA
- v0.1.0: First steps; Frequencies generally and post-/w/
  - Processing:
    - Use only the first pronunciation in cmudict
    - Discard uncorrelateds entirely
    - No manual error checking etc
  - Results:
    - Q1: Frequencies of phonemes generally
    - Q2: Frequencies of phonemes post-/w/
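
Sketch of the v0.1.0 processing:
The following is a minimal Python sketch of the steps listed above, not the code actually used in this repository. It assumes the cmudict 0.7b line format (comment lines starting with ";;;", alternate pronunciations marked as WORD(1), WORD(2), and stress digits on vowels). The sample entries and the function names first_pronunciations and phoneme_counts are hypothetical, chosen only for illustration. Each dictionary entry is counted once; weighting by the kilgarriff word frequencies is not shown.

```python
import re
from collections import Counter

# Toy excerpt in the assumed cmudict 0.7b line format (illustrative entries only).
SAMPLE = """\
;;; comment lines start with three semicolons
WATER  W AO1 T ER0
WATER(1)  W AA1 T ER0
QUEEN  K W IY1 N
WIN  W IH1 N
"""

# Alternate pronunciations are assumed to be marked with a numeric suffix, e.g. WATER(1).
ALTERNATE = re.compile(r"\(\d+\)$")


def first_pronunciations(text):
    """Yield (word, phonemes) for the first listed pronunciation of each word."""
    for line in text.splitlines():
        if not line.strip() or line.startswith(";;;"):
            continue
        word, *phones = line.split()
        if ALTERNATE.search(word):
            continue  # keep only the first pronunciation
        # Drop stress digits (AO1 -> AO) so vowels are counted regardless of stress.
        yield word, [p.rstrip("012") for p in phones]


def phoneme_counts(text):
    """Return (overall counts, counts of phonemes directly following W)."""
    overall, after_w = Counter(), Counter()
    for _, phones in first_pronunciations(text):
        overall.update(phones)  # Q1: frequencies generally
        for prev, cur in zip(phones, phones[1:]):
            if prev == "W":  # Q2: frequencies post-/w/
                after_w[cur] += 1
    return overall, after_w


overall, after_w = phoneme_counts(SAMPLE)
```

Here overall holds the Q1 counts and after_w the Q2 counts, still in ARPAbet symbols; the ARPAbet -> IPA step from v0.2.0 would be a separate lookup applied afterwards.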