#227 manual phoneme input peculiarities

when doing manual phoneme input:

  • [[___]]’s: the possessive case (e.g. for names) isn't parsed. The ’s is read as a separate, space-separated ˈɛs; it should rather come out as s|z|ɪz

  • stress markers: if they aren't entered manually or inferred from a lookup/similar word, the lack of (dictionary?) heuristics at times makes the word sound worse. Is there a way to specify the degree of manual/auto?

  • when changing language (e.g. to use other vowel/phoneme sets) via SSML tags such as <voice xml:lang="es"> stuff <voice xml:lang="en-us">, it does a grammar/parsing reset, so pitch and intonation are reset as though it were a new sentence. [[_^_esstuff_^_en-us]] or [[_^_es]] stuff [[_^_en-us]] bypasses this, as long as it is not interrupted by a newline

  • ~unexpected/confusing [[Q"Ql# _^_ipa GRK]] as ʁɣɬ (͡i͡p͡a) ɣʁɬ~, a typo produces [[a_^_ipa a]] as ˈa(͡g͡n)yɕɛ a

  • ~not all languages/variants seem reachable via _^_~ when an invalid _^_ code is typed, it repeats the previous word

    • should it be better documented? (what are base1 and base2? haven't figured it out yet)
      invalid           actual
      bs                hbs hr
      en-029            en-wi
      en-GB-scotland    en-sc
      en-GB-x-gbclan    en-n
      en-GB-x-gbcwmd    en-wm
      en-GB-x-rp        en-rp
      es-mx             es-419 es-la
      fa-Latn           fa
      fonipa            ipa
      fr-be             fr?
      hy-arevela        hy?
      lfn               base2
      ms                id
      nb                no
      pap               base2
      pt                pt_pt
      pt-br             pt
      vi-VN-x-central   vi-hue
      vi-VN-x-south     vi-sgn
      zh-cmn            cmn zh
      zh-yue            zhy

  • at least it allows combining phonemes from multiple non-tonal languages into one synthesized word: [[_^_ipa ts) _^_fr-ca y _^_ar H _^_es own _^_en-us Iz]]
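
a rough sketch of how the [[_^_code phonemes]] strings above could be assembled, in case it helps anyone reproduce this. It only builds the text; it does not call the synthesizer. The names ALIASES and build_phoneme_block are made up for illustration, the alias map is just a subset of the invalid/actual table above (where several accepted codes are listed it takes the first one, and the uncertain rows fr-be, hy-arevela, lfn, pap and the two pt rows are left out), and the newline check reflects the observation that a newline interrupts the block:

```python
# Hypothetical helper: nothing here is part of the synthesizer itself.
# Rejected codes mapped to a code from the table above that was accepted.
ALIASES = {
    "bs": "hbs",
    "en-029": "en-wi",
    "en-GB-scotland": "en-sc",
    "en-GB-x-rp": "en-rp",
    "es-mx": "es-419",
    "fa-Latn": "fa",
    "fonipa": "ipa",
    "ms": "id",
    "nb": "no",
    "zh-cmn": "cmn",
    "zh-yue": "zhy",
}


def build_phoneme_block(segments):
    """Join (language code, phonemes) pairs into one [[...]] block."""
    parts = []
    for lang, phonemes in segments:
        if "\n" in phonemes:
            # Observed above: a newline interrupts the phoneme block.
            raise ValueError("a newline would interrupt the phoneme block")
        code = ALIASES.get(lang, lang)  # unknown codes pass through as typed
        parts.append(f"_^_{code} {phonemes}")
    return "[[" + " ".join(parts) + "]]"


block = build_phoneme_block(
    [("fonipa", "ts)"), ("fr-ca", "y"), ("ar", "H"), ("es", "own"), ("en-us", "Iz")]
)
print(block)
```

codes not in the map are passed through unchanged, so a mistyped code would still hit the "repeats previous word" behaviour described above.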

also a question regarding loanwords, which often differ from the pronunciation norms: should they use some of the other language's phoneme set, and by how much, depending on how far the word has been normalized and integrated into common use?
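
going back to the possessive point at the top of the list, the s|z|ɪz choice is the usual English rule: ɪz after a sibilant, s after any other voiceless consonant, z otherwise. A minimal sketch of that rule, assuming the final phoneme of the [[...]] word is available as an IPA string (possessive_suffix is a made-up name, and this is not how the synthesizer does it internally):

```python
# Hypothetical helper working on IPA strings, not on the synthesizer's
# own phoneme mnemonics.
SIBILANTS = {"s", "z", "ʃ", "ʒ", "tʃ", "dʒ"}
VOICELESS = {"p", "t", "k", "f", "θ"}


def possessive_suffix(last_phoneme):
    """Pick the possessive ending from the final phoneme of the word."""
    if last_phoneme in SIBILANTS:
        return "ɪz"
    if last_phoneme in VOICELESS:
        return "s"
    # Vowels and voiced non-sibilant consonants.
    return "z"


for final in ("t", "n", "dʒ"):
    print(final, possessive_suffix(final))
```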

