Speech technologies for under-resourced languages of the world

 
PIIS0373658X0011291-9-1
Publication type Article
Status Published
Authors
Affiliation:
St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences (SPIIRAS)
ITMO University
Address: St. Petersburg, 199178, Russia
Affiliation: St. Petersburg Institute for Informatics and Automation, Russian Academy of Sciences
Address: St. Petersburg, 199178, Russia
Journal name: Voprosy Jazykoznanija
Edition: Issue 2
Pages: 117-135
Abstract

Over the past decade, computer speech processing of under-resourced and minority languages has made significant progress. In this paper, we present an analytical review of existing problems and approaches in the field of speech recognition for the many spoken languages that lack speech and text resources, including languages of the Russian Federation. We present the definition and characteristics of under-resourced languages and the challenges connected with their automatic processing, and we describe projects and investigations dealing with the analysis and preservation of under-resourced languages of the world.

Keywords: under-resourced languages, speech technology, speech recognition, language models
Received: 20.03.2015
Publication date: 20.03.2015
Number of characters: 647
