**7. Fusion of face and speech traits**

Based on the purpose of the robot, a unimodal or a multimodal recognition system could be selected to be used for human-robot interaction. For example, a military purpose robot should be more accurate than home purpose robot. As mentioned in Section 5.2, the common trait that can be used for human identification by robot in both remote and proximate interaction is voice biometric trait. On the other hand, the face is the most realistic biometric trait in case of proximate interaction.

**References**

DOI: 10.1007/978-0-387-773261

cviu.2007.08.005

TPAMI.2003.1227981

10.5815/ijmecs.2012.11.06

B:MESC.0000005857.89878.a9

10.1109/IJCB.2011.6117584

1.1795335. PACS

[1] Jain AK, Ross AA, Nandakumar K. Introduction to Biometrics. Springer; 2011. 312 p.

Person Identification Using Multimodal Biometrics under Different Challenges

http://dx.doi.org/10.5772/intechopen.71667

97

[2] Takeo K. Picture Processing by Computer Complex and Recognition of Human Faces [Thesis]. Kyoto, Japan: Dept. of Science, Kyoto University; 1974. 143 p . DOI: 10.14989/

[3] Bowyer KW, Hollingsworth K, Flynn PJ. Image understanding for iris biometrics: A survey. Computer Vision and Image Understanding. 2008;**110**(2):281-307. DOI: 10.1016/j.

[4] Zhang D, Kong WK, You J, Wong M. Online palmprint identification. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2003;**25**(9):1041-1050. DOI: 10.1109/

[5] Raut SD, Humbe VT. Biometric palm prints feature matching for person identification. International Journal of Modern Education and Computer Science. 2012;**4**(11):61. DOI:

[6] Kong A, Zhang D, Kamel M. A survey of palmprint recognition. Pattern Recognition.

[7] Berry JS, David A. The history and development of fingerprinting. In: Lee HC, Ramotowski R, Gaensslen RE, editors. Advances in fingerprint Technology. 2nd ed. CRC press;

[8] Cole S, Col A. Suspect Identities: A History of Fingerprinting and Criminal Identification. Cambridge, Mass /London: Harvard University Press; 30-10-2009.38 p. DOI: 10.1023/

[9] Jain AK, Prabhakar S, Hong L, Pankanti S. Filterbank-based fingerprint matching. IEEE

[10] Mina I. Mark N. Sasan M. The effect of time on ear biometrics. In: International Joint Conference on Biometrics (IJCB), 2011; Washington, DC, USA. IEEE; 2011. p. 1-6. DOI:

[11] Pflug A, Busch C. Ear biometrics: A survey of detection, feature extraction and recognition methods. IET Biometrics. 2012;**1**(2):114-129. DOI: 10.1049/iet-bmt.2011.0003

[12] Pruzansky S, Mathews MV. Talker-recognition procedure based on analysis of variance. The Journal of the Acoustical Society of America. 1964;**36**(11):2041-2047. DOI: 10.1121/

[13] Peskin B et al. Topic and speaker identification via large vocabulary continuous speech recognition. In: Proceedings of the workshop on Human Language Technology. Association

Transactions on Image Processing. 2000;**9**(5):846-859. DOI: 10.1109/83.841531

doctor.k1486Available from: http://hdl.handle.net/2433/162079

2009;**42**(7):1408-1418. DOI: 10.1016/j.patcog.2009.01.018

2001:13-52. DOI: 10.1201/9781420041347.ch1

It will be appropriate to fuse face and voice in human-robot interaction, since both of these traits are noncontacted and the user is unaware that recognition is being performed. Many studies proved that the fusion of face and speech is appropriate for many purposes [37–39], where face and speech are the best choices since both of them do not need physical or direct contact with sensors [40, 41]. Another advantage of speech over face is that speech can be recognized even when a human and robot are not found in the same physical place. This is useful for voice recognition purposes by mobile phone or when a user and robot are in two different rooms in the same place. Consequently, a realistic human-robot interaction system is achieved, either HRI is conducted by face-to-face, blind, or invisible interaction.
