Are big data analytics helpful in caring for multimorbid patients in general practice? - A scoping review
BMC Family Practice, volume 20, Article number: 37 (2019)
The treatment of multimorbid patients is a crucial task in general practice because multimorbidity is highly prevalent in this setting. However, there is little evidence on how to treat these patients, and consequently only a few guidelines focus primarily on multimorbidity. Big data analytics are defined as methods that obtain results from high-volume data with high variety generated at high velocity. Yet the explanatory power of these results is not completely understood. Nevertheless, multimorbidity, as a complex condition, might be a promising field for big data analytics.
The aim of this scoping review was to evaluate whether applying big data analytics to patient data already contributes to the treatment of multimorbid patients in general practice.
In January 2018, the databases PubMed, The Cochrane Library, and Web of Science were searched using defined search terms for “big data analytics” and “multimorbidity”, supplemented by a search of grey literature with Google Scholar. Studies were not filtered by type of study, publication year or language. Validity of studies was evaluated independently by two researchers.
In total, 2392 records were identified for screening. After screening of titles, abstracts, and full texts, six articles were included in the analysis. Of those articles, one reported on a model generated with big data techniques to help care for one group of multimorbid patients. The other five articles dealt with the analysis of multimorbidity clusters. No article defined big data analytics explicitly.
Although the usage of the phrase “Big Data” is growing rapidly, there is as yet almost no practical use case for big data analysis techniques in the treatment of multimorbidity in general practice. Furthermore, in publications addressing big data analytics, the term is rarely defined.
However, possible models and algorithms to address multimorbidity in the future have already been published.
Patients with more than one chronic disease are commonly defined as multimorbid. Multimorbidity is a highly prevalent condition. In fact, in the United States of America, 48% of patients older than 65 years suffer from more than three chronic diseases. This population accounts for 89% of Medicare’s annual budget in the USA. When multimorbidity is defined as suffering from at least three chronic diseases, 62% of German patients in general practice older than 65 years are multimorbid. Therefore, multimorbid patients are prevalent in general practice.
Treating the individual diseases of multimorbid patients in accordance with the specific guidelines for each single disease is the most common way to deliver care. This approach risks an overall deterioration of the health status of multimorbid patients [5, 6]. These challenges intensify with the number of diseases to treat.
Limitations in treating these patients that were described back in 2005, such as drug interactions or guideline recommendations that contradict each other, are still relevant today. To date, very few guidelines focus primarily on multimorbidity [9,10,11]. Therefore, optimization of care for this population is a high-priority task for health care. Currently, new recommendations for improved treatment of multimorbid patients using ehealth, e.g. decision support systems, are being published [6, 12].
The term ehealth describes the general use of electronic devices or systems in medical care. One aspect of ehealth is the application of big data analysis techniques. The term “Big Data” was introduced in 1997. Big data analytics are commonly defined by the “3Vs”: increasing volume of data, high velocity of data, and variety of data [14,15,16]. It is hypothesized that big data analytics can reveal patterns in patient data that could not be identified with more traditional methods of data analysis.
However, there are no cut-off values that clearly determine the point at which data starts being big. Still, big data analytics might be a useful addition to the treatment of multimorbid patients.
The aim of this review was to evaluate to what degree the application of big data analytics could assist general practitioners in treating multimorbid patients.
Two of the authors (AW, DW) conducted a systematic computerized literature search for studies that utilized big data analytics of patient data in order to treat multimorbid patients. This review followed the guidelines of the PRISMA Extension for Scoping Reviews (PRISMA-ScR).
In January 2018, the databases PubMed, The Cochrane Library, and Web of Science were searched. In order to detect all areas of Big Data Analytics, a complex search strategy was developed, using the terms “big data”, “health analytics”, “healthcare informatics”, “electronic health records”, “databases”, “data collection system”, “electronic data capture”, “data management system”, “deep learning”, “electronic medical record”, “machine learning”, “medical data”, “huge data”, “electronic patient record”, “datamining”, “data analysis”, “reinforcement learning”, “decision support system”, “predictive analytics”, “reasoning” and “inference”. In order to identify studies dealing with big data analytics in the context of multimorbidity, search terms were combined with the terms “multimorb*” and “multi-morb*”.
The terms “general practitioner” or “general practice” were deliberately not included in the search terms to ensure that as many articles as possible on the topic of multimorbidity were included. Relevance to general practice was individually assessed in the screening process.
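The combination of search terms described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `or_group` and `build_query` are hypothetical, the quoting rules are generic, and the exact syntax the authors used in each database is the one documented in Additional file 1, not this sketch.

```python
# Illustrative sketch only: generic boolean syntax, not the authors' exact queries.
ANALYTICS_TERMS = [
    "big data", "health analytics", "healthcare informatics",
    "electronic health records", "databases", "data collection system",
    "electronic data capture", "data management system", "deep learning",
    "electronic medical record", "machine learning", "medical data",
    "huge data", "electronic patient record", "datamining", "data analysis",
    "reinforcement learning", "decision support system",
    "predictive analytics", "reasoning", "inference",
]
MULTIMORBIDITY_TERMS = ["multimorb*", "multi-morb*"]


def or_group(terms):
    """Join terms with OR, putting multi-word phrases in double quotes."""
    quoted = [f'"{term}"' if " " in term else term for term in terms]
    return "(" + " OR ".join(quoted) + ")"


def build_query(analytics_terms, multimorbidity_terms):
    """Combine the two term groups with AND (hypothetical helper)."""
    return or_group(analytics_terms) + " AND " + or_group(multimorbidity_terms)


query = build_query(ANALYTICS_TERMS, MULTIMORBIDITY_TERMS)
print(query)
```

Keeping the two term groups in separate lists makes it easy to see that setting-specific terms such as “general practice” are absent from the database query, so relevance to general practice has to be judged during screening.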
Additionally, a search of grey literature with Google Scholar was conducted. For this search, the query “multimorbidity AND “big data” AND (“general practice” OR “gp”)” was agreed upon within the research group, on the advice of an expert in computer science, and was used to keep the number of results manageable. Patents and citations were excluded. The search in Google Scholar was performed using the “Private-Setting” in order to produce replicable results.
The results of the searches were imported into the web service Covidence (www.covidence.org) which was used in the further review process. The complete search strategy is available in the supplemental material (Additional file 1).
No review protocol was registered for this scoping review.
After the exclusion of duplicates, 2392 articles were included in the review process. Two reviewers (AW, DW) independently screened the titles, abstracts, and subsequently the full-text articles. Discrepancies during the screening process were discussed during regular consensus meetings. A third reviewer (JS) was consulted as needed.
The authors are members of the “Center for Open Innovation in Connected Health (COPICOH)” at the University of Lübeck. In this center, computer scientists and researchers from a variety of health disciplines work together. The authors held consensus meetings with this research group in order to define which articles were to be included. It was decided that studies using standard statistical methods, e.g. large cohort studies that examined data from electronic health records, were not eligible.
After the title and abstract screening, 29 full texts were screened. Of those, 23 articles were excluded after discussion within the research team because they did not conduct big data analytics as defined by the authors, did not focus on multimorbidity, or were not relevant for general practice.
Finally, six articles were included in the data extraction. For better traceability, the entire screening process is visualized using the PRISMA flow chart (see Fig. 1).
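The screening counts reported above can be checked with simple arithmetic. The sketch below uses only the numbers stated in the text; the variable names are hypothetical, and the number of records excluded at title and abstract screening is derived here rather than reported by the authors.

```python
# Counts taken from the text; variable names are illustrative.
records_screened = 2392      # records after exclusion of duplicates
full_texts_screened = 29     # articles passing title and abstract screening
full_texts_excluded = 23     # articles excluded at the full-text stage

# Derived values (not stated explicitly in the text for the first one).
excluded_at_title_abstract = records_screened - full_texts_screened
included = full_texts_screened - full_texts_excluded

print("Excluded at title/abstract screening:", excluded_at_title_abstract)
print("Included in data extraction:", included)
```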
Studies were not filtered by type of study, publication year or language. Validity of studies was evaluated based on the judgement of two independent researchers (AW, DW). The data extraction from the included studies was done by AW and a researcher in the field of computer science and was agreed upon by all authors.
The extracted data comprised the publication year, the country of origin, the aim of the study, the number of examined datasets, the method of data analysis used, and the outcome. The results of the data extraction are summarized in Table 1.
None of the 220 results from the grey literature search were deemed eligible. All of the included articles were published in English. None of these articles used the keyword or term “Big Data”. However, after discussion within the research group, six papers were included. They originated from Greece, Germany, Hungary, the Netherlands, the United States, and Canada [19,20,21,22,23,24]. The oldest article was published in 2013, the most recent in 2017.
Four of the articles dealt with the analysis of large data sets of multimorbid patients to analyze or find patterns in the combinations of diseases in these samples [20, 21, 23, 24]. Although the main objectives of these articles may sound similar, the specific focus of each differed. One article proposed a framework for the management of treatment for multimorbid patients who suffered from COPD. The sixth article presented a new dynamic modelling approach that predicts the gain in Disability Adjusted Life Years obtained by eliminating exposure to a risk factor more precisely than other models.
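To make the idea of finding patterns in combinations of diseases more concrete, the following minimal sketch counts how often pairs of chronic conditions co-occur across patient records. It is not the method of any of the included studies: the toy records, the function name `pair_counts`, and the threshold parameter are assumptions chosen purely for illustration.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: each patient is represented by a set of chronic conditions.
patients = [
    {"COPD", "hypertension", "diabetes"},
    {"hypertension", "diabetes"},
    {"COPD", "hypertension", "heart failure"},
    {"diabetes", "hypertension", "heart failure"},
]


def pair_counts(records, min_conditions=2):
    """Count co-occurring condition pairs among patients with at least
    `min_conditions` conditions (the threshold is an assumption)."""
    counts = Counter()
    for conditions in records:
        if len(conditions) < min_conditions:
            continue
        # Sort so that each pair is always stored in the same order.
        for pair in combinations(sorted(conditions), 2):
            counts[pair] += 1
    return counts


counts = pair_counts(patients)
for pair, n in counts.most_common(3):
    print(pair, n)
```

Real analyses of this kind work on far larger datasets and typically use more elaborate techniques, for example clustering or network models, but the underlying step of tallying which conditions appear together is the same.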
Another result of this review was that there were no precise definitions for “Big Data” in the screened articles. Furthermore, there were no defined cut-off values to specify at which point the levels of volume, velocity or variety of data are sufficiently high to clearly define them as “Big”.
The aim of this review was to evaluate to what degree big data analytics are already supporting general practitioners in treating multimorbid patients.
Altogether, we identified only one article that addressed improving the treatment of multimorbid patients with COPD by using techniques related to big data analytics. However, the approach proposed in this paper has to be validated further with more patients and a broader variety of diseases.
The other five included articles did not present direct recommendations for the treatment of multimorbid patients. However, they utilized methods and techniques to develop models that could, upon further investigation, improve the understanding of the patterns underlying multimorbidity. It would be reasonable to verify these models by analyzing multimorbidity clusters in other datasets and then to integrate them into the medical decision framework for treating patients. This missing step of integration is also commonly found in reviews addressing big data analytics in health care [27,28,29].
Furthermore, we found a lack of clear definitions for the “3 V’s” or the term “Big Data” in general. In other reviews on big data analytics in health care settings, these terms are also only implicitly described, but their definition is usually not included [27, 28].
Although it is commonly expected that big data analytics will have a great variety of applications in the treatment of multimorbid patients in the future, this review, which found only one study with direct implications for treatment, puts this portrayal in perspective.
These findings might suggest an interface problem between the different scientific disciplines that conduct research in the field of big data. For clinicians, finding reliable evidence for the benefit of applying big data analytics to improve treatment is crucial. However, they usually lack the expertise to validate algorithms. Computer scientists may be more interested in developing algorithms for a generic problem than in applying an algorithm in a specific clinical setting. Therefore, there may be a need for an academic discipline that focuses on the implementation of algorithms in practice. One example of a future application for big data analytics in health care could be the integration of big data algorithms into medical apps for mobile devices. A number of studies already investigate the possible benefits of these apps.
Strengths and limitations
To our knowledge, this is the first review that addresses the application of big data analytics in the treatment of multimorbid patients in general practice. Journals commonly included in databases used by health care professionals might not be the ones in which researchers working in the field of big data analytics publish their results, which may have biased our findings.
One study was found that presents an approach for treating a group of multimorbid patients using big data techniques. Terms pertaining to big data analytics are not defined in studies applying these methods. Overall, there seems to be a mismatch between the perceived presence and usage of big data in health care and the existing literature in databases commonly used by health care professionals. It seems highly relevant to form interdisciplinary research environments in which experts in applied computer science and health care professionals work together to evaluate the benefits of big data analysis techniques for the treatment of patients.
COPD: Chronic obstructive pulmonary disease
COPICOH: Center for Open Innovation in Connected Health
PRISMA: Preferred Reporting Items for Systematic reviews and Meta-Analyses
PRISMA-ScR: PRISMA Extension for Scoping Reviews
USA: United States of America
van den Akker M, Buntinx F, Metsemakers JFM, et al. Multimorbidity in general practice: prevalence, incidence, and determinants of co-occurring chronic and recurrent diseases. J Clin Epidemiol. 1998;51(5):367–75.
Institute of Medicine. Crossing the Quality Chasm: A New Health System for the 21st Century. Washington DC: The National Academies Press; 2001. https://doi.org/10.17226/10027.
van den Bussche H, Schäfer I, Koller D, et al. Multimorbidity in the German elderly population - part 1: prevalence in ambulatory medical care. ZFA. 2012;88(9):365–71.
Sturmberg JP, Bennett JM, Martin CM, et al. ‘Multimorbidity’ as the manifestation of network disturbances. J Eval Clin Pract. 2017;23(1):199–208.
Field TS, Gurwitz JH, Harold LR, et al. Risk factors for adverse drug events among older adults in the ambulatory setting. J Am Geriatr Soc. 2004;52:1349–54.
Rijken M, Struckmann V, van der Heide I, et al. (on behalf of the ICARE4EU consortium). How to improve care for people with multimorbidity in Europe? Policy Brief 23. European Observatory on Health Policies and Systems. Denmark 2017. http://www.euro.who.int/en/about-us/partners/observatory/publications/policy-briefs-and-summaries/how-to-improve-care-for-people-with-multimorbidity-in-europe. (accessed 27 Jun 2018).
Tinetti ME, Bogardus ST Jr, Agostini JV. Potential pitfalls of disease-specific guidelines for patients with multiple conditions. N Engl J Med. 2004;351:2870–4.
Boyd CM, Darer J, Boult C, et al. Clinical practice guidelines and quality of care for older patients with multiple comorbid diseases. JAMA. 2005;294(6):716–24.
Multimorbidity: clinical assessment and management. NICE. National Institute for Health and Care Excellence. United Kingdom. https://www.nice.org.uk/guidance/ng56 (accessed 13 Jun 2018).
Clinical Practice Guideline “Multimorbidity”. German College of General Practitioners and Family Physicians. Draft. http://www.awmf.org/leitlinien/detail/ll/053-043.html [Association of the Scientific Medical Societies in Germany]. (accessed 13 Jun 2018).
Mühlhäuser U, Goetz K, Weinmayr L-M, et al. DEGAM-guideline “multimorbidity” - a Field test. ZFA. 2018;94(2):64–9.
Martin CM, Vogel C, Grady D, et al. Implementation of complex adaptive chronic care: the patient journey record system (PaJR). J Eval Clin Pract. 2012;18(6):1226–34.
Garapati SL, Garapati S. Application of big data analytics: an innovation in health care. Comput Intell. 2018;14(1):15–27.
McAfee A, Brynjolfsson E. Big data: the management revolution. Harv Bus Rev. 2012;90(10):60–6 68, 128.
Laney D. 3D data management: controlling data volume, velocity and variety. http://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf. (accessed 4 Jun 2018).
Langkafel P. Intro Big Data for Healthcare? In: Langkafel P, editor. Big Data in Medicine und Health Economics. Diagnosis, Therapy, Side effects. Heidelberg: medhochzwei Verlag GmbH; 2014. p. 12.
Roski J, Bo-Linn GW, Andrews TA. Creating value in health care through big data: opportunities and policy implications. Health Aff (Millwood). 2014;33:1115–22.
Tricco AC, Lillie E, Zarin W, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. 2018;169(7):467–73. https://doi.org/10.7326/M18-0850 Epub 2018 Sep 4.
Andriopoulou FG, Birkos KD, Lymberopoulos DK. Ef-Zin: A hybrid framework for ubiquitous management of comorbidity and multimorbidity in chronic diseases. 13th IEEE International Conference on Bioinformatics and Bioengineering, Chania. 2013. pp. 1–4. https://doi.org/10.1109/BIBE.2013.6701581.
Schäfer I, Kaduszkiewicz H, Wagner HO, et al. Reducing complexity: a visualisation of multimorbidity by combining disease clusters and triads. BMC Public Health. 2014;14:1285.
Marx P, Antal P. Decomposition of shared latent factors using Bayesian multi-morbidity dependency maps. Singapore: Springer Singapore; 2015.
Boshuizen HC, Nusselder WJ, Plasmans MHD, et al. Taking multi-morbidity into account when attributing DALYs to risk factors: comparing dynamic modeling with the GBD2010 calculation method. BMC Public Health. 2017;17:197.
Kalgotra P, Sharda R, Croff JM. Examining health disparities by gender: a multimorbidity network analysis of electronic medical record. Int J Med Inform. 2017;108:22–8.
Nicholson K, Bauer M, Terry AL, et al. The multimorbidity cluster analysis tool: identifying combinations and permutations of multiple chronic diseases using a record-level computational analysis. J Innov Health Inform. 2017;24(4):339–43.
Cichosz SL, Johansen MD, Hejlesen O. Toward big data analytics: review of predictive models in management of diabetes and its complications. J Diabetes Sci Technol. 2015;10(1):27–34. https://doi.org/10.1177/1932296815611680.
Zhang R, Simon G, Yu F. Advancing Alzheimer's research: a review of big data promises. Int J Med Inform. 2017;106:48–56. https://doi.org/10.1016/j.ijmedinf.2017.07.002 Epub 2017 Jul 24.
Mehta N, Pandit A. Concurrence of big data analytics and healthcare: a systematic review. Int J Med Inform. 2018;114:57–65. https://doi.org/10.1016/j.ijmedinf.2018.03.013 Epub 2018 Mar 26.
Kruse CS, Goswamy R, Raval Y, et al. Challenges and opportunities of big data in health care: a systematic review. JMIR Med Inform. 2016;4(4):e38.
Islam MS, Hasan MM, Wang X, et al. A Systematic Review on Healthcare Analytics: Application and Theoretical Perspective of Data Mining. Healthcare (Basel). 2018;6(2). https://doi.org/10.3390/healthcare6020054.
Walther P. How big data can revolutionize patient care. By analyzing health data from a wide range of sources, we’re helping health professionals in Germany diagnose and treat patients based on the latest research and evidence. https://www.elsevier.com/connect/how-big-data-can-revolutionize-patient-care (accessed 17.09.2018).
Albrecht, U.-V. (Hrsg.). Chances and Risks of Mobile Health Apps (CHARISMHA), Medizinische Hochschule Hannover, 2016. urn:nbn:de:gbv:084-16040811153. http://www.digibib.tu-bs.de/?docid=00060000. Accessed 17 Jan 2019.
The authors would like to thank Marcel Gehrke for his professional support, and Alexa Waschkau, M.A. for proofreading the manuscript.
All authors are members of the Center for Open Innovation in Connected Health (COPICOH) at the University of Lübeck, which is co-funded by Cisco Systems. The funding body was not involved in the design of the study, nor the collection, analysis, and interpretation of data, nor the writing of the manuscript. The funding body approved the final manuscript.
Availability of data and materials
The complete search strategy is available in the additional material.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Search Strategy.pdf. Detailed information on the search strategy, search results, and exclusion criteria. (PDF 47 kb)