Topics

[espeak-ng:master] reported: Language analysis improvements #github


espeak-ng@groups.io Integration <espeak-ng@...>
 

[espeak-ng:master] New Comment on Issue #199 Language analysis improvements
By valdisvi:

Yes, because in compiledict.c bytes are compressed into integer with utf8_in function, and then only these 4 bytes are written with Write4Bytes. That produces wrong result, if there are too many "meaningful" bytes in from or to part of replacement. So, universal .replace implementation requires to replace arbitrary number of from bytes to arbitrary number of to bytes. To test it, just add rule e.g. .replace æ are are usi ša ra //etc. with even more bytes in from or to part of replacement