Skip to main content

Are big data analytics helpful in caring for multimorbid patients in general practice? - A scoping review



The treatment of multimorbid patients is one crucial task in general practice as multimorbidity is highly prevalent in this setting. However, there is little evidence how to treat these patients and consequently there are but a few guidelines that focus primarily on multimorbidity. Big data analytics are defined as a method that obtains results for high volume data with high variety generated at high velocity. Yet, the explanatory power of these results is not completely understood. Nevertheless, addressing multimorbidity as a complex condition might be a promising field for big data analytics.

The aim of this scoping review was to evaluate whether applying big data analytics on patient data does already contribute to the treatment of multimorbid patients in general practice.


In January 2018, a review searching the databases PubMed, The Cochrane Library, and Web of Science, using defined search terms for “big data analytics” and “multimorbidity”, supplemented by a search of grey literature with Google Scholar, was conducted. Studies were not filtered by type of study, publication year or language. Validity of studies was evaluated independently by two researchers.


In total, 2392 records were identified for screening. After title and abstract screening, six articles were included in the full-text analysis. Of those articles, one reported on a model generated with big data techniques to help caring for one group of multimorbid patients. The other five articles dealt with the analysis of multimorbidity clusters. No article defined big data analytics explicitly.


Although the usage of the phrase “Big Data” is growing rapidly, there is nearly no practical use case for big data analysis techniques in the treatment of multimorbidity in general practice yet. Furthermore, in publications addressing big data analytics, the term is rarely defined.

However, possible models and algorithms to address multimorbidity in the future are already published.

Peer Review reports


Patients with more than one chronic disease are commonly defined as multimorbid [1]. Multimorbidity is a highly prevalent condition. In fact, in the United States of America 48% of patients older than 65 years, suffer from more than three chronic diseases. This population accounts for 89% of the annual Medicare’s budget in the USA [2]. When multimorbidity is defined as suffering from at least three chronic diseases, 62% of German patients in general practice older than 65 years are multimorbid [3]. Therefore, multimorbid patients are prevalent in general practice.

Treating the individual diseases of multimorbid patients in accordance with the specific guidelines for the single disease is the most common way to deliver care [4]. This approach carries the danger of leading to an overall deterioration of the health status of multimorbid patients [5, 6]. These challenges intensify with the number of diseases to treat [7].

Limitations for treating these patients described back in 2005 such as drug interactions or guideline related recommendations that contradict each other are still relevant today [8]. To date very few guidelines primarily focus on multimorbidity [9,10,11]. Therefore, optimization of care for this population is a high-priority task for health care. Currently, new recommendations for improved treatment of multimorbid patients using ehealth, e.g. decision support systems are being published [6, 12].

The term ehealth describes the general use of electronic devices or systems in medical care. One aspect of ehealth is the application of big data analysis techniques. The term “Big Data” was introduced in 1997 [13]. Commonly big data analytics are defined by the “3Vs”: increasing volume of data, the high velocity of data, and the variety of data [14,15,16]. It is hypothesized that big data analytics have the ability to reveal patterns in patient data that could not be identified with more traditional methods of data analysis [17].

However, there are no cut off values that clearly determine the point at which data starts being big. Still, big data analytics might have the potential to be a useful addition to the treatment of multimorbid patients.

The aim of this review was to evaluate to what degree the application of big data analytics could assist general practitioners in treating multimorbid patients.


Search strategy

Two of the authors (AW, DW) conducted an organized computerized literature search for studies that utilized big data analytics of patient data in order to treat multimorbid patients. This review followed the guidelines of the PRISMA Extension for Scoping Reviews (PRIMSA-ScR) [18].

In January 2018, the databases PubMed, The Cochrane Library, and Web of Science were searched. In order to detect all areas of Big Data Analytics, a complex search strategy was developed, using the terms “big data”, “health analytics”, “healthcare informatics”, “electronic health records”, “databases”, “data collection system”, “electronic data capture”, “data management system”, “deep learning”, “electronic medical record”, “machine learning”, “medical data”, “huge data”, “electronic patient record”, “datamining”, “data analysis”, “reinforcement learning”, “decision support system”, “predictive analytics”, “reasoning” and “inference”. In order to identify studies dealing with big data analytics in the context of multimorbidity, search terms were combined with the terms “multimorb*” and “multi-morb*”.

The terms “general practitioner” or “general practice” were deliberately not included in the search terms to ensure that as many articles as possible on the topic of multimorbidity were included. Relevance to general practice was individually assessed in the screening process.

Additionally, a search of grey literature with Google Scholar was conducted. For this search the terms “multimorbidity AND “big data” AND (“general practice” OR “gp”)” were consented within the research group under the advisement of an expert in computer sciences and used to keep the number of results manageable. Patents and citations were excluded. The search in Google Scholar was performed using the “Private-Setting” in order to produce replicable results.

The results of the searches were imported into the web service Covidence ( which was used in the further review process. The complete search strategy is available in the supplemental material (Additional file 1).

No review protocol was registered for this scoping review.

Study screening

After the exclusion of duplicates, 2392 article were included in the review process. Two reviewers (AW, DW) independently screened the titles, abstracts, and subsequently the full-text articles. Discrepancies during the screening process were discussed during regular consensus meetings. A third reviewer (JS) was consulted as needed.

Eligibility criteria

The authors are members of the “Center for Open Innovation in Connected Health (COPICOH)” at the University of Lübeck. In this center, computer scientists as well as researchers from a variety of health disciplines are working together. The authors held consensus meetings with this research group in order to define, what articles were to be included. It was decided that studies that used standard statistical methods, e.g. large cohort studies that examined data from electronic health records, were not deemed eligible.

After the title and abstract screening, 29 full-texts were screened. Of those full-texts 23 articles were excluded after discussion within the research team because they either did not conduct big data analytics as defined by the authors, did not focus on multimorbidity or were not relevant for general practice.

Finally, six articles were included in the data extraction. For better traceability, the entire screening process is visualized using the PRISMA Flow Chart. (see Fig. 1).

Fig. 1

PRISMA Flow Chart

Data extraction

Studies were not filtered by type of study, publication year or language. Validity of studies was evaluated based on the judgement of two independent researchers (AW, DW). The data extraction from the included studies was done by AW and a scientific researcher in the field of computer sciences and consented with all authors.

Extracted data was the publication year, the country of origin, the aim of the study, the number of examined datasets, the used method of data analyses, and the outcome. The Results of the data extraction are summarized in Table 1.

Table 1 Summarized characteristics of included studies


Of the 220 results in the “grey literature”, none were deemed eligible and consequently were not among the included articles. All of the included articles were published in English. None of these articles used the keyword or term “Big Data”. However, after discussing this within the researcher group, six papers were included. They originated from Greece, Germany, Hungary, the Netherlands, the United States, and Canada [19,20,21,22,23,24]. The oldest article was published 2013 the latest in 2017.

Four of the articles dealt with the analysis of large data sets of multimorbid patients to analyze or find patterns in the combinations of diseases in these samples [20, 21, 23, 24]. Although the main objectives of the articles may sound similar, the specific focus of each of these articles was a different one. One article proposed a framework for the management of treatment for multimorbid patients who suffered from COPD [19]. The sixth article presented a new dynamic modelling approach to predict the gain in Disability Adjusted Life Years obtained by eliminating exposure to a risk factor more precisely than other models [22].

Another result of this review was that there were no precise definitions for “Big Data” in the screened articles. Furthermore, there were no defined cut-off values to specify at which point the levels of volume, velocity or variety of data are sufficiently high to clearly define them as “Big”.


The aim of this review was to evaluate to what degree big data analytics are already supporting general practitioners in treating multimorbid patients.

Altogether, we identified only one article addressing the approach of improving the treatment of multimorbid patients with COPD by using techniques that are related to big data analytics [19]. However, the approach proposed in this paper has to be further validated including more patients and a broader variety of diseases.

These limitations are in line with other reviews that addressed the benefits of big data analytics for Diabetes type 1 and 2 and Alzheimer’s disease [25, 26].

The other five included articles did not present direct recommendations for the treatment of multimorbid patients. However, they utilized methods and techniques to develop models that could, upon further investigation, shed a better light on the understanding of the underlying patterns for multimorbidity. It would be reasonable to verify these models for the analysis of multimorbidity clusters of other datasets to further the understanding of multimorbidity and then integrate these models into the medical decision framework for treating patients. This missing step of integration is also commonly found in reviews addressing big data analytics in health care [27,28,29].

Furthermore, we found a lack of clear definitions for the “3 V’s” or the term “Big Data” in general. In other reviews on big data analytics in health care settings, these terms are also only implicitly described, but their definition is usually not included [27, 28].

Although common expectations are, that big data analytics will have a great variety of applications in the field of multimorbid patient treatment in the future [30], this review that found only one study that has direct implications for treatment puts this portrayal in perspective.

These findings might suggest an interface problem between different scientific disciplines that do research in the field of big data. For the clinician finding reliable evidence for the benefit of applying big data analytics to improve treatment is crucial. However, they usually do not have the competence in validating algorithms. Computer scientists may be more interested in developing algorithms for a more generic problem than to apply an algorithm in a specific clinical setting. Therefore, there may be the need for an academic discipline that focuses on the implementation of algorithms into practice. One example of a future application for big data analytics in health care could be the implementation of big data algorithms into medical apps for mobile devices. There are already a number of studies that investigate the possible benefits of these apps [31].

Strengths and limitations

This is to our knowledge the first review that addresses applying big data analytics in the treatment of multimorbid patients in general practice. Journals commonly included in databases used by health care professionals might not be the ones researcher working in the field of big data analytics are publishing their results in, leading to a bias in our findings.


One study was found that presents an approach for treating a group of multimorbid patients using big data techniques. Terms pertaining to big data analytics are not defined in studies applying these methods. Over all, there seems to be a mismatch between the perceived presence and usage of big data in health care and existing literature in databases commonly used by health care professionals. It seems highly relevant to form interdisciplinary research environments in which experts in implementing computer sciences and health care professionals work together to evaluate the benefits of big data analysis techniques for the treatment of patients.



Chronic obstructive pulmonary disease


Center for Open Innovation in Connected Health


Preferred Reporting Items for Systematic reviews and Meta-Analyses


PRISMA Extension for Scoping Reviews


United States of America


  1. 1.

    van den Akker M, Buntinx F, Metsemakers JFM, et al. Multimorbidity in general practice: prevalence, incidence, and determinants of co-occurring chronic and recurrent diseases. J Clin Epidemiol. 1998;51(5):367–75.

    Article  Google Scholar 

  2. 2.

    Institute of Medicine. Crossing the Quality Chasm: A New Health System for the 21st Century. Washington DC: The National Academies Press; 2001.

  3. 3.

    van den Bussche H, Schäfer I, Koller D, et al. Multimorbidity in the German elderly population - part 1: prevalence in ambulatory medical care. ZFA. 2012;88(9):365–71.

    Google Scholar 

  4. 4.

    Sturmberg JP, Bennett JM, Martin CM, et al. ‘Multimorbidity’ as the manifestation of network disturbances. J Eval Clin Pract. 2017;23(1):199–208.

    Article  Google Scholar 

  5. 5.

    Field TS, Gurwitz JH, Harold LR, et al. Risk factors for adverse drug events among older adults in the ambulatory setting. J Am Geriatr Soc. 2004;52:1349–54.

    Article  Google Scholar 

  6. 6.

    Rijken M, Struckmann V, van der Heide I, et al. (on behalf of the ICARE4EU consortium). How to improve care for people with multimorbidity in Europe? Policy Brief 23. European Observatory on Health Policies and Systems. Denmark 2017. (accessed 27 Jun 2018).

  7. 7.

    Tinetti ME, Bogardus ST Jr, Agostini JV. Potential pitfalls of disease-specific guidelines for patients with multiple conditions. N Engl J Med. 2004;351:2870–4.

    CAS  Article  Google Scholar 

  8. 8.

    Boyd CM, Darer J, Boult C, et al. Clinical practice guidelines and quality of Care for Older Patients with Multiple Comorbid Diseases. JAMA. 2005;8(10):716–24.

    Article  Google Scholar 

  9. 9.

    Multimorbidity: clinical assessment and management. NICE. National Institute for Health and Care Excellence. United Kingdom. (accessed 13 Jun 2018).

  10. 10.

    Clinical Practice Guideline “Multimorbidity”. German College of General Practitioners and Family Physicians. Draft. [Association of the Scientific Medical Societies in Germany]. (accessed 13 Jun 2018).

  11. 11.

    Mühlhäuser U, Goetz K, Weinmayr L-M, et al. DEGAM-guideline “multimorbidity” - a Field test. ZFA. 2018;94(2):64–9.

    Google Scholar 

  12. 12.

    Martin CM, Vogel C, Grady D, et al. Implementation of complex adaptive chronic care: the patient journey record system (PaJR). J Eval Clin Pract. 2012;18(6):1226–34.

    Article  Google Scholar 

  13. 13.

    Garapati SL, Garapati S. Application of big data analytics: an innovation in health care. Comput Intell. 2018;14(1):15–27.

    Google Scholar 

  14. 14.

    McAfee A, Brynjolfsson E. Big data: the management revolution. Harv Bus Rev. 2012;90(10):60–6 68, 128.

    PubMed  Google Scholar 

  15. 15.

    Laney D. 3D data management: controlling data volume, velocity and variety. (accessed 4 Jun 2018).

  16. 16.

    Langkafel P. Intro Big Data for Healthcare? In: Langkafel P, editor. Big Data in Medicine und Health Economics. Diagnosis, Therapy, Side effects. Heidelberg: medhochzwei Verlag GmbH; 2014. p. 12.

  17. 17.

    Roski J, Bo-Linn GW, Andrews TA. Creating value in health care through bog data: opportunities and policy implications. Health Aff (Millwood). 2014;33:1115–22.

    Article  Google Scholar 

  18. 18.

    Tricco AC, Lillie E, Zarin W, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. 2018;169(7):467–73. Epub 2018 Sep 4.

    Article  PubMed  Google Scholar 

  19. 19.

    Andriopoulou FG, Birkos KD, Lymberopoulos DK. Ef-Zin: A hybrid framework for ubiquitous management of comorbidity and multimorbidity in chronic diseases. 13th IEEE International Conference on Bioinformatics and Bioengineering, Chania. 2013. pp. 1–4.

  20. 20.

    Schäfer I, Kaduszkiewicz H, Wagner HO, et al. Reducing complexity: a visualisation of multimorbidity by combining disease clusters and triads. BMC Public Health. 2014;14:1285.

    Article  Google Scholar 

  21. 21.

    Marx P, Antal P. Decomposition of shared latent factors using Bayesian multi-morbidity dependency maps. Singapore: Springer Singapore; 2015.

    Google Scholar 

  22. 22.

    Boshuizen HC, Nusselder WJ, Plasmans MHD, et al. Taking multi-morbidity into account when attributing DALYs to risk factors: comparing dynamic modeling with the GBD2010 calculation method. BMC Public Health. 2017;17:197.

    Article  Google Scholar 

  23. 23.

    Kalgotra P, Sharda R, Croff JM. Examining health disparities by gender: a multimorbidity network analysis of electronic medical record. Int J Med Inform. 2017;108:22–8.

    Article  Google Scholar 

  24. 24.

    Nicholson K, Bauer M, Terry AL, et al. The multimorbidity cluster analysis tool: identifying combinations and permutations of multiple chronic diseases using a record-level computational analysis. J Innov Health Inform. 2017;24(4):339–43.

    Article  Google Scholar 

  25. 25.

    Cichosz SL, Johansen MD, Hejlesen O. Toward big data analytics: review of predictive models in Management of Diabetes and its Complications. J Diabetes Sci Technol. 2015;10(1):27–34.

    Article  PubMed  PubMed Central  Google Scholar 

  26. 26.

    Zhang R, Simon G, Yu F. Advancing Alzheimer's research: a review of big data promises. Int J Med Inform. 2017;106:48–56. Epub 2017 Jul 24.

    Article  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Mehta N, Pandit A. Concurrence of big data analytics and healthcare: a systematic review. Int J Med Inform. 2018;114:57–65. Epub 2018 Mar 26.

    Article  PubMed  Google Scholar 

  28. 28.

    Kruse CS, Goswamy R, Raval Y, et al. Challenges and opportunities of big data in health care: a systematic review. JMIR Med Inform. 2016;4(4):e38.

    Article  Google Scholar 

  29. 29.

    Islam MS, Hasan MM, Wang X, et al. A Systematic Review on Healthcare Analytics: Application and Theoretical Perspective of Data Mining. Healthcare (Basel). 2018;6(2).

  30. 30.

    Walther P. How big data can revolutionize patient care. By analyzing health data from a wide range of sources, we’re helping health professionals in Germany diagnose and treat patients based on the latest research and evidence. (accessed 17.09.2018).

  31. 31.

    Albrecht, U.-V. (Hrsg.). Chances and Risks of Mobile Health Apps (CHARISMHA), Medizinische Hochschule Hannover, 2016. urn:nbn:de:gbv:084-16040811153. Accessed 17 Jan 2019.

Download references


The authors would like to thank Marcel Gehrke for his professional support, and Alexa Waschkau, M.A. for proofreading the manuscript.


All authors are members of the Center for Open Innovation in Connected Health (COPICOH) at the University of Lübeck, which is co-funded by Cisco Systems. The funding body was not involved in the design of the study, nor the collection, analysis, and interpretation of data, nor the writing of the manuscript. The funding body approved the final manuscript.

Availability of data and materials

The complete search strategy is available in the additional material.

Author information




AW conducted the database search, screened the search results, extracted the data, and drafted the manuscript. DW conducted the database search and screened the search results. JS had the idea for the study and coordinated as a senior all steps of the study. All authors revised the manuscript, read, and approved the final manuscript.

Corresponding author

Correspondence to Alexander Waschkau.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Search Strategy.pdf. Detailed Information on search strategy. Information on search strategy, search results, and exclusion criterias. (PDF 47 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Waschkau, A., Wilfling, D. & Steinhäuser, J. Are big data analytics helpful in caring for multimorbid patients in general practice? - A scoping review. BMC Fam Pract 20, 37 (2019).

Download citation


  • Big data analytics
  • General practice
  • Multimorbidity
  • eHealth