[espeak-ng:master] new issue: manual phoneme input peculiarities #github


espeak-ng@groups.io Integration <espeak-ng@...>
 

[espeak-ng:master] New Issue Created by cmrdt:
#227 manual phoneme input peculiarities

when doing manual phoneme input:

  • [[___]]’s possessive case (e.g. for names) isn't parsed (space-separated ˈɛs) (rather do s|z|ɪz)

  • stress markers',: if don't manually input or via infer from lookup/similar word, lack of (dictionary?) heuristics at times make the word sound worse is there way to specify degree of manual/auto?

  • when changing language e.g. to use other vowel/phoneme sets via SSML tags: e.g. <voice xml:lang="es"> stuff <voice xml:lang="en-us"> it does a grammar/parsing reset therefore pitch and intonation is reset as though it were a new sentence, . [[_^_esstuff_^_en-us]] or [[_^_es]] stuff [[_^_en-us]] bypasses this, as long it is not interrupted by a newline

  • unexpected/confusing [[Q"Ql# _^_ipa GRK]] as ʁɣɬ (͡i͡p͡a) ɣʁɬ deviates from Kirshenbaum, - ok, but should be noted

  • no space before _^_ produces/mangles

    • [[a_^_ipa a]] as 'a(͡g͡n)yS;E a ipa: ˈa(͡g͡n)yɕɛ a
    • [[a_^_abc a]] as 'a(͡m͡t)J^Qu͡o a ipa: ˈa(͡m͡t)ʝɣu͡o a  
  • when invalid _^_ typed, it repeats previous word

    • should it be better documented? (what is base1 base2? havent figured it out yet)
invalid actual
bs hbs hr
en-029 en-wi
en-gb
en-GB-scotland en-sc
en-GB-x-gbclan en-n
en-GB-x-gbcwmd en-wm
en-GB-x-rp en-rp
es-mx es-419 es-la
fa-Latn fa
fonipa ipa
fr-be fr?
hy-arevela hy?
lfn base2
ms id
nb no
pap base2
pt pt_pt !
pt-br pt !
vi-VN-x-central vi-hue
vi-VN-x-south vi-sgn
zh-cmn cmn zh
zh-yue zhy
  rw
  prs
  nso
  nci
  kk
  hi_base
  fr-ca
  consonants
  bo
  base
  base1
  base2
  ak
  wo
  • at least allows combining from multiple non-tonal languages into one synth- word [[_^_ipa ts) _^_fr-ca y _^_ar H _^_es own _^_en-us Iz]]

also question regarding loanwords which often differ from pronunciation norm's should they use  some of other language's phoneme set?   by how much ? the degree it has been normalized, integrated into common-use


espeak-ng@groups.io Integration <espeak-ng@...>
 

[espeak-ng:master] New Issue Created by cmrdt:
#227 manual phoneme input peculiarities

when doing manual phoneme input:

  • [[___]]’s possessive case (e.g. for names) isn't parsed (space-separated ˈɛs) (rather do s|z|ɪz)

  • stress markers',: if don't manually input or via infer from lookup/similar word, lack of (dictionary?) heuristics at times make the word sound worse is there way to specify degree of manual/auto?

  • when changing language e.g. to use other vowel/phoneme sets via SSML tags: e.g. <voice xml:lang="es"> stuff <voice xml:lang="en-us"> it does a grammar/parsing reset therefore pitch and intonation is reset as though it were a new sentence, [[_^_esstuff_^_en-us]] or [[_^_es]] stuff [[_^_en-us]] bypasses this, as long it is not interrupted by a newline

  • ~unexpected/confusing [[Q"Ql# _^_ipa GRK]] as ʁɣɬ (͡i͡p͡a) ɣʁɬ~ , no space before _^_ produces [[a_^_ipa a]] as ˈa(͡g͡n)yɕɛ a

  • when invalid _^_ typed, it repeats previous word

    • should it be better documented? (what is base1 base2? havent figured it out yet)
invalid actual
bs hbs hr
en-029 en-wi
en-gb
en-GB-scotland en-sc
en-GB-x-gbclan en-n
en-GB-x-gbcwmd en-wm
en-GB-x-rp en-rp
es-mx es-419 es-la
fa-Latn fa
fonipa ipa
fr-be fr?
hy-arevela hy?
lfn base2
ms id
nb no
pap base2
pt pt_pt !
pt-br pt !
vi-VN-x-central vi-hue
vi-VN-x-south vi-sgn
zh-cmn cmn zh
zh-yue zhy
  rw
  prs
  nso
  nci
  kk
  hi_base
  fr-ca
  consonants
  bo
  base
  base1
  base2
  ak
  wo
  • at least allows combining from multiple non-tonal languages into one synth- word [[_^_ipa ts) _^_fr-ca y _^_ar H _^_es own _^_en-us Iz]]

also question regarding loanwords which often differ from pronunciation norm's should they use  some of other language's phoneme set?   by how much ? the degree it has been normalized, integrated into common-use


espeak-ng@groups.io Integration <espeak-ng@...>
 

[espeak-ng:master] New Issue Created by cmrdt:
#227 manual phoneme input peculiarities

when doing manual phoneme input:

  • [[___]]’s possessive case (e.g. for names) isn't parsed (space-separated ˈɛs) (rather do s|z|ɪz)

  • stress markers',: if don't manually input or via infer from lookup/similar word, lack of (dictionary?) heuristics at times make the word sound worse is there way to specify degree of manual/auto?

  • when changing language e.g. to use other vowel/phoneme sets via SSML tags: e.g. <voice xml:lang="es"> stuff <voice xml:lang="en-us"> it does a grammar/parsing reset therefore pitch and intonation is reset as though it were a new sentence, [[_^_esstuff_^_en-us]] or [[_^_es]] stuff [[_^_en-us]] bypasses this, as long it is not interrupted by a newline

  • ~unexpected/confusing [[Q"Ql# _^_ipa GRK]] as ʁɣɬ (͡i͡p͡a) ɣʁɬ~ , no space before _^_ produces [[a_^_ipa a]] as ˈa(͡g͡n)yɕɛ a

  • when invalid _^_ typed, it repeats previous word

    • should it be better documented? (what is base1 base2? havent figured it out yet)
invalid actual
bs hbs hr
en-029 en-wi
en-gb
en-GB-scotland en-sc
en-GB-x-gbclan en-n
en-GB-x-gbcwmd en-wm
en-GB-x-rp en-rp
es-mx es-419 es-la
fa-Latn fa
fonipa ipa
fr-be fr?
hy-arevela hy?
lfn base2
ms id
nb no
pap base2
pt pt_pt !
pt-br pt !
vi-VN-x-central vi-hue
vi-VN-x-south vi-sgn
zh-cmn cmn zh
zh-yue zhy
  rw
  prs
  nso
  nci
  kk
  hi_base
  fr-ca
  consonants
  bo
  base
  base1
  base2
  ak
  wo
  • at least allows combining from multiple non-tonal languages into one synth- word [[_^_ipa ts) _^_fr-ca y _^_ar H _^_es own _^_en-us Iz]]

also question regarding loanwords which often differ from pronunciation norm's should they use  some of other language's phoneme set?   by how much ? the degree it has been normalized, integrated into common-use