*Speech Standards: Lessons Learnt DOI: http://dx.doi.org/10.5772/intechopen.93134*

*Human 4.0 - From Biology to Cybernetic*

version 1.0. W3C Note; 05 May 2000. Available from: http://www.w3.org/TR/2000/NOTE-voicexml-20000505 [Accessed: 26 April 2020]

[19] McGlashan S, Burnett DC, Carter J, Danielsen P, Ferrans J, Hunt A, et al. Voice Extensible Markup Language (VoiceXML) version 2.0. W3C Recommendation; 16 March 2004. Available from: https://www.w3.org/TR/voicexml20/ [Accessed: 26 April 2020]

[20] Hunt A, McGlashan S. Speech recognition grammar specification version 1.0. W3C Recommendation; 16 March 2004. Available from: https://www.w3.org/TR/speech-grammar/ [Accessed: 26 April 2020]

[21] Burnett DC, Walker MR, Hunt A. Speech Synthesis Markup Language (SSML) version 1.0. W3C Recommendation; 16 March 2004. Available from: https://www.w3.org/TR/speech-synthesis/ [Accessed: 26 April 2020]

[22] Oshry M, Auburn RJ, Baggia P, Bodell M, Burke D, Burnett DC, et al. Voice Extensible Markup Language (VoiceXML) 2.1. W3C Recommendation; 19 June 2007. Available from: https://www.w3.org/TR/voicexml21/ [Accessed: 26 April 2020]

[23] Van Tichelen L, Burke D. Semantic interpretation for speech recognition (SISR) version 1.0. W3C Recommendation; 05 April 2007. Available from: https://www.w3.org/TR/semantic-interpretation/ [Accessed: 26 April 2020]

[24] Burnett DC, Shuang ZW. Speech Synthesis Markup Language (SSML) version 1.1. W3C Recommendation; 07 September 2010. Available from: https://www.w3.org/TR/speech-synthesis11/ [Accessed: 26 April 2020]

[25] Baggia P. Pronunciation Lexicon Specification (PLS) version 1.0. W3C Recommendation; 14 October 2008. Available from: https://www.w3.org/TR/pronunciation-lexicon/ [Accessed: 26 April 2020]

[26] Auburn RJ. Voice browser call control: CCXML version 1.0. W3C Recommendation; 05 July 2011. Available from: https://www.w3.org/TR/ccxml/ [Accessed: 26 April 2020]

[27] Barnett J, Akolkar R, Auburn RJ, Bodell M, Carter J, McGlashan S, et al. State Chart XML (SCXML): State machine notation for control abstraction. W3C Recommendation; 01 September 2015. Available from: https://www.w3.org/TR/scxml/ [Accessed: 26 April 2020]

[28] Barnett J. Introduction to SCXML. In: Dahl DA, editor. Multimodal Interaction with W3C Standards. Cham, Switzerland: Springer International Publishing; 2017. pp. 81-107

[29] Larson J. VoiceXML: Introduction to Developing Speech Applications. Upper Saddle River, New Jersey: Prentice Hall; 2003

[30] Dahl D. Practical Spoken Dialog Systems. Berlin, Heidelberg: Springer-Verlag; 2005

[31] Jokinen K, McTear M. Spoken Dialogue Systems. Princeton, NJ: Morgan & Claypool; 2009

[32] Brown MK, Kellner A, Raggett D. Stochastic language models (N-Gram) specification. W3C Working Draft; 02 January 2001. Available from: http://www.w3.org/TR/ngram-spec [Accessed: 26 April 2020]

[33] Standard ECMA-327. ECMAScript 3rd Edition Compact Profile; June 2001. Available from: http://www.ecma-international.org/publications/files/ECMA-ST-WITHDRAWN/Ecma-327.pdf [Accessed: 26 April 2020]

[34] IPA. Handbook of the International Phonetic Association. Cambridge, UK: Cambridge University Press; 1999


[35] Harel D. Statecharts: A visual formalism for complex systems. Science of Computer Programming. 1987;**8**(3):231-274

[36] The Internet Engineering Task Force (IETF). Available from: https://www.ietf.org/ [Accessed: 26 April 2020]

[37] Shanmugham S, Monaco P, Eberman B. A Media Resource Control Protocol (MRCP). RFC 4463; Informational; April 2006. Available from: https://tools.ietf.org/html/rfc4463 [Accessed: 26 April 2020]

[38] Burnett D, Shanmugham S. Media Resource Control Protocol Version 2 (MRCPv2). RFC 6787; Internet Standard; November 2012. Available from: https://tools.ietf.org/html/rfc6787 [Accessed: 26 April 2020]

[39] Burke D. Speech Processing for IP Networks: Media Resource Control Protocol (MRCP). Chichester, UK: Wiley; 2007

[40] Watt SM, Underhill T. Ink Markup Language (InkML). W3C Recommendation; 20 September 2011. Available from: http://www.w3.org/TR/InkML [Accessed: 26 April 2020]

[41] Johnston M. EMMA: Extensible MultiModal Annotation markup language. W3C Recommendation; 10 February 2009. Available from: http://www.w3.org/TR/emma/ [Accessed: 26 April 2020]

[42] Johnston M. Extensible multimodal annotation for intelligent interactive systems. In: Dahl DA, editor. Multimodal Interaction with W3C Standards. Cham, Switzerland: Springer International Publishing; 2017. pp. 37-64

[43] Dahl DA. Natural language semantics markup language for the speech interface framework. W3C Working Draft; 20 November 2000. Available from: https://www.w3.org/TR/nl-spec/ [Accessed: 26 April 2020]

[44] Burkhardt F, Schröder M. Emotion Markup Language (EmotionML) 1.0. W3C Recommendation; 22 May 2014. Available from: https://www.w3.org/TR/emotionml/ [Accessed: 26 April 2020]

[45] Burkhardt F, Pelachaud C, Schuller BW, Zovato E. EmotionML. In: Dahl DA, editor. Multimodal Interaction with W3C Standards. Cham, Switzerland: Springer International Publishing; 2017. pp. 65-80

[46] Barnett J, Bodell M, Dahl D, Kliche I, Larson J, Porter B, et al. Multimodal architecture and interfaces. W3C Recommendation; 25 October 2012. Available from: https://www.w3.org/TR/mmi-arch/ [Accessed: 26 April 2020]

[47] Dahl DA, editor. Multimodal Interaction with W3C Standards. Cham, Switzerland: Springer International Publishing; 2017

[48] Burnett DC. ALL: Thoughts and thanks as the VBWG comes to a close. W3C Mailing List Archive; 26 September 2015. Available from: https://lists.w3.org/Archives/Public/www-voice/2015JulSep/0029.html [Accessed: 26 April 2020]

[49] McGlashan S, Burnett D, Akolkar R, Auburn RJ, Baggia P, Barnett J, et al. Voice Extensible Markup Language (VoiceXML) 3.0. W3C Working Draft; 16 December 2010. Available from: http://www.w3.org/TR/voicexml30/ [Accessed: 26 April 2020]

[50] Natal A, Shires G, Cáceres M, Jägenstedt P. Web speech API. Draft Community Group Report; 21 January 2021. Available from: https://wicg.github.io/speech-api/ [Accessed: 26 April 2020]

[51] Johnston M, Dahl DA, Denney T, Kharidi N. EMMA: Extensible MultiModal Annotation markup language version 2.0. W3C Working Draft; 08 September 2015. Available from: https://www.w3.org/TR/2015/WD-emma20-20150908/ [Accessed: 26 April 2020]


Section 3

Cognitive Processing

