Section 3 Data Interaction

*Data Integrity and Quality*

amiajnl-2011-000681

Journal of the American Medical Informatics Association, 2013, vol. 20(1), p. 44-151. doi:10.1136/

Wesley, 2013. ISBN-0133970779

[15] Codd E.F. A Relational Model of Data for Large Shared Data Banks. In: Software Pioneers (Broy M., Denert E. (eds)). Springer Verlag, 2002 https:// doi.org/10.1007/978-3-642-59412-0\_16

[16] Chen, P.P-S. The entity-relationship model—toward a unified view of data. ACM Transactions on Database Systems, 1976, vol. 1(1), p. 9-36. Doi:10.1145/320434.320440

[17] Calderwood, A.H. and Jacobson, B.C. Comprehensive Validation of the Boston Bowel Preparation Scale. Gastrointestinal Endoscopy, 2010 vol. 72(4) p. 686-692. Doi: 10.1016/j.

[18] Dama International. Dama-DMBOOK: Data Management Body of Knowledge. Technics Publications, LLC,

Doi: 10.1145/1629175.1629210

[20] Wieten, E., Schreuders, E.H., Nieuwenburg, S.AV., Hansen, B.E., Lansdorp-Vogelaar, I., Kuipers, E.H., Bruno, M.J. and Spaander, M.C.W. Effects of increasing screening age and fecal hemoglobin cutoff concentrations in a colo-rectal cancer screening

[19] Khatri, V. and Brown, C.V. Designing data governance. Communications of the ACM, 2010, vol. 53, no 1, p. 148-152.

program. Clinical Gastroenterology and Hepatology, 2016, vol. 14, no 12, p. 1771- 1777. Doi:10.1016/j.cgh.2016.08.016

[21] Kreimeyer, K., Foster, M., Pandey, A., Arya, N., Halford, G., Jones, S.F.,

2017 ISBN-1634622340

gie.2010.06.068.

[14] Elmasri, R. and Navathe, S.B. (eds) The relational data model and relational database constraints. In Fundamentals of Database Systems, Pearson AddisonForshee, R., Walderhaug, M. and Botsis, T. Natural language processing systems for capturing and standardizing unstructured clinical information: a systematic review. Journal of biomedical informatics, 2017, vol. 73, p. 14-29. Doi:

10.1016/j.jbi.2017.07.012

[22] Llop, E.S., Cano del Pozo, M., García Montero, J.I., Carrera-Lasfuentes, P. and Lanas A. Colo-rectal cancer screening program in Aragon (Spain): preliminary results Gaceta sanitaria, 2018, vol. 32, no 6, p. 559-562.

doi: 10.1016/j.gaceta.2017.05.014

**40**

**43**

**Chapter 4**

**Abstract**

in organizations.

**1. Introduction**

artificial intelligence (AI)

Analysis

*Sreekantha Desai Karanam,* 

Big Data Integration Solutions in

*Rajani Sudhir Kamath, Raja Vittal Rao Kulkarni* 

*and Bantwal Hebbal Sinakatte Karthik Pai*

Organizations: A Domain-Specific

Big Data Integration (BDI) process integrates the big data arising from many diverse

data sources, data formats presents a unified, valuable, customized, holistic view of data. BDI process is essential to build confidence, facilitate high-quality insights and trends for intelligent decision making in organizations. Integration of big data is a very complex process with many challenges. The data sources for BDI are traditional data warehouses, social networks, Internet of Things (IoT) and online transactions. BDI solutions are deployed on Master Data Management (MDM) systems to support collecting, aggregating and delivering reliable information across the organization. This chapter has conducted an exhaustive review of BDI literature and classified BDI applications based on their domain. The methods, applications, advantages and disadvantage of the research in each paper are tabulated. Taxonomy of concepts, table of acronyms and the organization of the chapter are presented. The number of papers reviewed industry-wise is depicted as a pie chart. A comparative analysis of curated survey papers with specific parameters to discover the research gaps were also tabulated. The research issues, implementation challenges and future trends are highlighted. A case study of BDI solutions implemented in various organizations was also discussed. This chapter concludes with a holistic view of BDI concepts and solutions implemented

**Keywords:** master data management (MDM), internet of things (IoT),

business intelligence (BI), software as a service (SAAS), machine learning (ML),

Accenture company has conducted a survey on the implementation of BDI solutions in organizations. The survey outcome revealed that 92% of managers are happy with the results obtained from BDI solutions and 89% of managers agree that big data integration and analytics is very vital for their business planning to leverage competition. The Internet Trends Report from KPCB's by Mary Meeker discovered the decreasing trends in the cost of hardware technology in the past twenty years, the cost of computing has been reduced by 33%, 38% storage cost reduction
