Article Text

Download PDFPDF

Original research
Assessing the diagnostic accuracy of postnatal clinical scoring methods and foot length measurement for estimating gestational age and birthweight of newborns in low- and middle-income countries: a systematic review and meta-analysis
  1. Shiyam Sunder Tikmani1,2,
  2. Thomas Mårtensson1,
  3. Sumaira Khalid3,
  4. Muhammad Uzair2,
  5. Qammerulanissa Ali2,
  6. Anum Rahim4,
  7. Andreas Mårtensson1,
  8. Sarah Saleem2,
  9. Nick Brown1
  1. 1Global health and migration unit, Department of Women’s & Children’s Health, Uppsala University, Uppsala, Sweden
  2. 2Population and Reproductive Health Section, Department of Community Health Sciences, Aga Khan University, Karachi, Pakistan
  3. 3Department of Public Health, College of Health Professions Marshall University, Huntington, West Virginia, USA
  4. 4Epidemiology and Biostatistic Section, Department of Community Health Sciences, Aga Khan University, Karachi, Pakistan
  1. Correspondence to Dr Shiyam Sunder Tikmani; shiyam.sunder{at}kbh.uu.se

Abstract

Background This study aimed to update systematic reviews and meta-analyses on the diagnostic accuracy of postnatal clinical scoring (PCS) methods and foot length (FL) measurement for assessing gestational age (GA) and birth weight in low-income and middle-income countries (LMICs). In addition, the quality of reference standards, including antenatal ultrasound (A-US), last menstrual period (LMP), PCS and newborn weighing scales, was also evaluated.

Methods Studies from LMICs published between January 2000 and February 2024 were searched, using databases such as PubMed, Web of Science, Cochrane Library, CINAHL and Scopus. Studies that compared PCS and/or FL with LMP and/or A-US to estimate GA or used calibrated newborn weighing scales for birthweight estimation were included. The risk of bias was assessed using the Quality Assessment of Diagnostic Accuracy Studies-II tool and evaluated the quality of the reference standards. When sufficient data were available, pooled estimates were calculated using random-effects models.

Results A total of 50 studies were included. A-US was a reasonable tool for GA assessment if conducted by physicians using fetal biometry and the Hadlock method for GA estimation. LMP was reasonable when women had regular cycles, knew their LMP, were not using contraceptives and LMP data were collected by healthcare providers. When A-US was used as the reference standard, PCS methods estimated GA with a precision of ±2.8 to ±3.2 weeks. FL measurement <7.5 cm showed a pooled sensitivity of 76.2% and specificity of 36.6% for identifying preterm birth. FL measurement ≤7.6 cm had a pooled sensitivity of 78.6% and specificity of 65.7% for identifying low birth weight (LBW). High heterogeneity across studies was observed.

Conclusion This systematic review and meta-analysis highlights significant variability and methodological inconsistencies in using PCS methods and FL measurement for estimating GA and LBW in LMICs. The observed high heterogeneity across studies suggests a cautious interpretation of the results.

PROSPERO registration number CRD42020209455.

  • Neonatology

Data availability statement

All data relevant to the study are included in the article or uploaded as online supplemental information.

https://creativecommons.org/licenses/by/4.0/

This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

WHAT IS ALREADY KNOWN ON THIS TOPIC

  • One in five newborns in low-income and middle-income countries (LMICs) is born prematurely or with low birth weight (LBW), increasing their susceptibility to neonatal mortality. Early detection and intervention for these infants can be life-saving.

  • Postnatal clinical scoring (PCS) methods and foot length (FL) measurements are commonly used to estimate gestational age (GA) and LBW in LMICs.

WHAT THIS STUDY ADDS

  • PCS methods such as Ballard Score and Dubowitz Score tend to overestimate GA while the Eregie scoring model underestimates it due to high variability across the studies.

  • The diagnostic accuracy of FL measurements for prematurity and LBW shows varying sensitivity and specificity due to significant methodological differences and high heterogeneity across studies.

HOW THIS STUDY MIGHT AFFECT RESEARCH, PRACTICE OR POLICY

  • There is an urgent need for standardised GA and birthweight measurement protocols, as well as consensus on reference standards, to improve the reliability and accuracy of PCS and FL assessments in LMICs.

  • Enhancing these diagnostic tools will lead to better clinical decision-making and improved neonatal outcomes, particularly in diverse and resource-limited healthcare settings.

  • Policies should prioritise skill development, quality assurance and supportive supervision for healthcare providers conducting GA and birthweight assessments.

Introduction

Preterm and low birth weight (LBW) pose significant challenges to neonatal health globally, particularly in low-income and middle-income countries (LMICs).1 2 In 2020, an estimated 13.4 million babies were born preterm,1 and 19.8 million were born with a birth weight <2500 g—LBW, globally.2 Approximately 900 000 preterm newborns die before the age of 5, with the majority of deaths occurring within the first week after birth, particularly in south Asia and sub-Saharan Africa.3 LBW increases the risk of neonatal mortality by nearly 20 times compared with normal-weighted infants.4 The causes of death due to preterm birth and LBW are often preventable, emphasising the importance of early detection and prompt management.5

Antenatal ultrasound (A-US) is the gold-standard method for estimating gestational age (GA).6 7 However, its use in LMICs is limited due to factors such as limited availability, inadequate maintenance of US devices, late presentation of pregnant women for antenatal care (ANC) and high cost.8–10 In settings where access to A-US is limited, the last menstrual period (LMP) is often used to estimate GA, but this method is prone to errors due to inaccurate recall or irregular menstrual cycles or women on contraception 3 months prior to conception or breastfeed at the time of conception.11 Postnatal clinical scoring (PCS) methods and foot length (FL) measurements have been established to identify preterm birth newborns and LBW.12 The Ballard and Dubowitz scores (DS) assess GA via physical and neurological newborn examinations,12 13 and the Eregie scoring model (ESM) determines newborn maturation using physical examination and anthropometric measurements.12 Anthropometric measurements such as mid-upper arm circumference, head and chest circumference and FL were tested to identify preterm and LBW. For this review, we selected FL measurement due to its simplicity, which makes it feasible for scaling up. FL measurement can be performed with locally available, low-cost tools such as a rigid transparent ruler, and it can be done with minimal handling of the baby.14

Two high-quality systematic reviews and meta-analyses, published in 201612 on neonatal clinical examination including BS, DS, ESM and other methods of GA assessment and in 202015 on diagnostic accuracy of FL for identification of preterm and LBW, reported that low quality of studies and high heterogeneity were the major limitations for interpretation. Both reviews also recommended studies with high-quality A-US as reference standard. Additionally, the WHO has emphasised the need for additional research to discover simple, reliable and feasible methods for assessing GA and birth weight in LMICs.16

Therefore, the objectives of this study were (1) to update the existing systematic reviews and meta-analyses on the diagnostic accuracy of PCS and FL for GA and birthweight assessment in a single review in the LMIC context and (2) to assess the quality of evidence related to reference standards of (1) A-US, (2) LMP, (3) PCS and (4) newborn weighing scales.

Materials and methods

This systematic review and meta-analysis was based on original studies building on a previous review that examined studies up to June 2022. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses was used and is available as online supplemental material. The study was registered at the International Prospective Register of Systematic Reviews—PROSPERO CRD42020209455.

Supplemental material

Search strategy

Systematic literature searches were conducted using databases including PubMed (Medline), Web of Science, Cochrane Library, Cumulative Index to Nursing and Allied Health Literature (CINAHL) and Scopus. A librarian (KM) from Aga Khan University, Karachi Pakistan, performed the searches and were exported to EndNote (V.X9, Clarivate Analytics). In this review, ESM, DS and BS were denoted as PCS methods. Detailed search terms are available in online supplemental table 1.

Inclusion criteria

Original studies written in the English language from LMICs, published between 1 January 2000 and 29 February 2024, were included. Studies reported live births and assessed the diagnostic accuracy of PCS and/or FL for determining GA and birth weight, as well as identifying prematurity and LBW were included. Studies using the LMP, A-US, PCS and/or a calibrated newborn weighing scale as reference standard were included. Additionally, studies that used PCS as the reference standard for FL for GA were also included. LMICs were selected due to the significant healthcare challenges in these regions, which have the highest rates of preterm births and LBW. By including studies from the year 2000 onwards, the review aimed to capture contemporary practices and diagnostic standards, reflecting the transition from reliance on LMP to more accurate and widely adopted methods such as A-US and calibrated newborn weighing scales.

Studies reported stillbirths as the study population, reported small for GA as the only outcome, involved children with chromosomal abnormalities or assessed GA on or after day 7 of birth were excluded. Additionally, studies that did not use A-US or LMP as reference standards for GA or did not employ calibrated newborn weighing scales as the reference standard for LBW assessment were excluded. Case reports/series, narrative/scoping reviews, editorials and published abstracts were also excluded.

Case definition

According to the WHO, preterm birth is defined as the birth of a baby <37 weeks of gestation17 and LBW is defined as birth weight <2500 g.18

Data review and extraction procedure

After removing duplicate studies from the EndNote library, two independent reviewers (MU and QA) screened titles and abstracts to identify full-text articles meeting eligibility criteria. We then read full-text articles meeting these criteria and extracted data, including study title, journal, publication year, country, study design, setting (hospital vs community), population characteristics, sampling strategy, sample size, methods of assessing GA, reference standards, descriptive data (preterm birth and LBW frequencies), and diagnostic accuracy and agreement estimates (correlation coefficient, mean difference, SD, diagnostic accuracy measures such as sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV) and Bland Altman’s limits of agreement (LOA)). We entered the data into MS Excel.

Quality assessment of eligible studies

The risk of bias in individual studies was assessed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) tool, which evaluates diagnostic studies in four domains: selection of participants, index test, reference standards and flow and timing. Each domain received a score from 0 to 1, indicating a low to high risk of bias. MU and QA independently evaluated methodological quality, resolving disagreements through mutual discussion. If a consensus was not reached, a third reviewer (SST) reviewed the article for the final decision. In addition to QUADAS-2, we assessed the quality of reference standards, such as A-US and LMP (online supplemental table 2).

Additional calculations

Bland Altman’s LOAs were used to observe any bias in reporting the mean difference between the two compared methods as part of the included studies’ quality and reporting bias assessments. LOA was calculated if studies mentioned either the mean±SD of GA for both test and reference standard methods or the mean difference and SD of the mean difference.19

Embedded Image

The 95% CI was calculated for sensitivity, specificity, PPV, NPV and area under the curves, where applicable.20

Embedded Image

Embedded Image

Standardised effect size: pooled variance

Reported mean differences were transformed into standardised mean differences to facilitate comparison across heterogeneous studies with varying characteristics. Pooled variances and SDs around the pooled estimates were calculated using the formula.21

Embedded Image

Data analysis

Data were summarised and grouped in tables based on methods of GA determination and the reference standard. Data analysis was performed by using STATA V.17 (StataCorp). Meta-analysis was employed when two or more studies had appropriate data for pooled analysis. Individual study-level mean differences between the two GA assessment methods were pooled using the ‘meta esize’ command, providing the pooled mean difference and 95% CI. To account for heterogeneity within the data, a meta-analysis method employing the random effects model (REM) was used, which accommodates variability across studies beyond what would be expected by chance alone. Higgins’s I² was used to quantify the degree of heterogeneity present in the pooled data. Correlation coefficients were pooled if studies reported a Pearson correlation (r) using the ‘metan’ command, providing descriptive summaries as median and range. Sensitivity and specificity were pooled using the ‘metandi’ command and reported all pooled effect sizes alongside their 95% CI. Forest plots for REM meta-analysis models were created using the ‘meta forestplot’ command.

Results

After a comprehensive search across all databases, 667 studies were identified. Following the removal of duplicates, 475 studies underwent screening for eligibility based on titles and abstracts. Subsequently, 101 full-text studies were identified for assessment regarding reporting criteria and reference standards. Ultimately, 50 studies were included in the systematic review (figure 1).

Figure 1

PRISMA flow diagram. GA, gestational age; LMP, last menstrual period; PRISMA, Preferred Reporting Items for Systematic Reviews and Meta-Analyses.

Quality assessment

The QUADAS-2 summary graph indicated a high risk of bias related to patient selection and reference standards. Studies using LMP as a reference standard showed the high risk of bias attributed to recall bias. However, a low risk of bias was observed across other QUADAS-2 domains (online supplemental figure 1).

Characteristics of each study are summarised in online supplemental tables 2–7.

Quality of the reference standards for GA and birth weight

A-US (n=18)

A-US was conducted by trained staff (n=10)10 22–30 within 20 weeks of gestation (n=6)24 26 27 29 31 32 using portable US machines (n=6)10 12 27–29 33 and fetal biometry (n=8)10 12 22 25 27 29 34 with the Hadlock method for GA estimation (n=5).10 12 25 27 29 Quality and reliability were assessed in seven studies10 22 24 26 27 33 35 (table 1).

Table 1

Quality assessment of reference standards. (A) Antenatal ultrasound, (B) Last menstrual period, (C) Postnatal clinical scoring and (D) Newborn weighing scales

LMP (n=11)

All 11 studies included women who were aware of their LMP. Criteria for inclusion were women aware of their LMP (n=11),23 36–44 regular menstrual cycles (n=5),36–38 43 44 no contraception use in the 3 months prior to conception (n=3),37 38 43 no breast feeding after conception (n=2)37 38 and the absence of pregnancy complications (n-2).36 38 LMP data were collected by midwives or nurses (n=2)36 43 in the early trimester (n=2),43 44 with GA assessed through Naegele’s formula (n=4)36 37 39 44 and reliability assessed in two studies23 43 (table 1).

PCS (n=17)

15 studies validated FL against the Ballard Score (BS) as a reference standard, and three studies used the ESM as a reference standard. Of 15 studies, 5 studies described the procedures (partially described: only clinical signs, n=345–47; well described: clinical signs and scoring, n=248 49). BS was performed within 24 hours/1 day after birth (n=7)30 46 49–53 by paediatricians (n=4).48 49 51 54 Three studies described the ESM, which was conducted on day 1 (n=1)55 by paediatricians (n=2)48 55 and midwives (n=1)56 (table 1).

Newborn weighing scales (n=22)

Newborns were measured within 24 hours of birth (n=17),34 40 42 48 49 52 56–65 either naked or with minimal clothing (n=8)34 42 45 46 57 61 62 64 using digital (n=18)28 34 40 42 46 48 49 52 55–57 59–64 66 calibrated weighing scales (n=11),28 34 40 48 49 55 56 60 62 64 65 with weights recorded in grams (n=12).40 43 46 48 49 55 58 60–62 65 66 FL measurements were taken by averaging two or three readings recorded (n=6)28 40 48 49 56 64 (table 1).

Diagnostic accuracy of PCS methods in assessing the GA

Ballard scoring with A-US as a reference standard (n=10)

In seven studies,10 24 26 32 33 35 67 BS resulted in a pooled mean difference of 0.65 weeks (95% CI −0.23 to 1.54, p<0.001) and a pooled SD of 1.6 weeks. Four out of 10 studies reported a correlation coefficient ranging from 0.31 to 0.94.12 26 32 33 Additionally, four studies12 32 33 35 reported a pooled sensitivity and specificity of 67.0% (95% CI 22.0% to 94.0%) and 80.0% (95% CI 73.0% to 85.0%), respectively, for identifying preterm births (table 2 and online supplemental figures 2–4).

Table 2

Validity of postnatal clinical scoring in assessing gestational age keeping antenatal ultrasound and last menstrual period as a reference standard

Ballard scoring with LMP as a reference standard (n=4)

BS resulted in a pooled mean difference of −0.35 weeks (95% CI −0.75 to 0.05, p=0.04) and a pooled SD of 1.5 weeks.38 41 67 Three studies reported a correlation coefficient of 0.94.23 None reported sensitivity and specificity (table 2).

Dubowitz scoring with A-US as a reference standard (n=3)

DS resulted in a pooled mean difference of 0.68 weeks (95% CI 0.52 to 0.84, p=0.35) and a pooled SD of 1.4 weeks.25 67 One out of two studies reported a Pearson correlation coefficient of 0.91. One study reported a sensitivity of 61.0% (95% CI 54.9% to 76.0%) and a specificity of 99.0% (95% CI 97.7% to 100%) for identifying preterm birth neonates.25 (table 2)

Dubowitz scoring with LMP as a reference standard, (n=2)

DS resulted in a pooled mean difference of 0.67 weeks (95% CI 0.45 to 0.89, p=0.52) and a pooled SD of 1.2 weeks.31 38 Two studies reported the Pearson correlation coefficient ranged from 0.81 to 0.94.38 43 None of the studies reported sensitivity and specificity. (table 2)

ESM with A-US as a reference standard (n=2)

ESM resulted in a pooled mean difference of −0.44 weeks (95% CI −0.51 to −0.37, p<0.001) and a pooled SD of 1.4 weeks.24 26 One study reported a sensitivity of 75.0% (95% CI 72.4% to 77.6%) and specificity of 58.0% (95% CI 55.0% to 60.9%) of ESM for identifying preterm birth neonates.24 (table 2)

Eregie scoring with the LMP as a reference standard (n=1)

One study reported Pearson correlation coefficient of 0.93 and a mean difference of 0.26±1.38 weeks.38 (table 2)

Diagnostic accuracy of FL for GA assessment

The FL with A-US as a reference standard (n=7)

Three studies reported positive correlations between FL and GA ranging from 0.37 to 0.89 with a pooled correlation coefficient of 0.72 (95% CI 0.38 to 1.05).30 34 40 44 66 Two studies used an FL cut-off of <7.5 cm to detect preterm birth with FL measurements <7.5 cm (n=2) showed pooled sensitivity of 76.2 (95% CI 70.2 to 81.5) and pooled specificity of 36.6 (95% CI 32.7 to 40.7) for identifying preterm birth compared with A-US.24 32 table 3

Table 3

Comparison of foot length for estimating gestational age with A-US, LMP and postnatal clinical scoring (Ballard and Eregie score model) as reference standards

The FL with the LMP as a reference standard (n=7)

Six studies reported positive correlations between FL and GA ranging from 0.14 to 0.93 with a pooled correlation coefficient of 0.56 (95% CI 0.24 to 0.88).36 37 39 42 One study reported FL cut-off of <7.5 cm to identify preterm birth, with sensitivity of 32.7% and specificity of 83.8%. (table 3)

The FL with PCS as reference standard (n=17)

BS (n=15),39 40 45–47 49–54 59 68 69 ESM (n=2)55 56 and both BS and ESM (n=1)48 were used as reference standards. 11 studies reported correlations ranging from 0.69 to 0.96.30 45–47 50–54 59 68 One study reported FL cut-off of <7.5 cm to identify preterm birth using BS as reference standard, has sensitivity of 81.7% and specificity of 77%48 and another study used ESM as reference standard reported sensitivity of 85.7% and specificity of 90.4% for identification of preterm.56 (table 3)

Diagnostic accuracy of FL in assessing the LBW

Of 22 studies that reported diagnostic accuracy of FL for identification of LBW, 15 studies showed a correlation ranging from 0.21 to 0.97 between FL and birth weight with a pooled correlation coefficient of 0.71 (95% CI 0.60 to 0.82).34 40 45 46 49 52 56 58 61–64 66 70 Pooled sensitivity and specificity for identifying LBW at an FL cut-off of ≤7.4 cm (n=4) were 72.1 (95% CI 68.3 to 75.7) and 84.9 (95% CI 83.2 to 86.5), respectively. At an FL cut-off of ≤7.6 cm (n-4), the pooled sensitivity and specificity were 78.6 (95% CI 73.7 to 83.6) and 65.7 (95% CI 63.3 to 68.1), respectively.28 34 48 61 (table 4)

Table 4

Comparison of diagnostic accuracy of foot length in predicting low birth weight (LBW)

Discussion

Existing reviews on PCS and FL as methods for GA and birthweight assessment within the context of LMICs have been updated. PCS methods such as BS and DS tend to overestimate GA while ESM underestimates it. Additionally, studies investigating the diagnostic accuracy of FL as a proxy for prematurity or LBW showed varying degrees of sensitivity and specificity; however, due to high heterogeneity, one should interpret these results with caution (online supplemental figures 2–4). The significant methodological differences, especially in the standardisation of reference standards like A-US and LMP, largely account for the observed variation and equivocal findings in existing studies on PCS methods and FL measurements.

Several contextual factors contribute to this variability. Many studies relied on secondary data lacking standardised data collection methods, leading to inconsistent findings. Methodological differences included anatomical landmarks and measurement tools for FL, with studies using landmarks such as heel to hallux or longest toe and tools ranging from plastic rulers to callipers to flexible tapes. Various cut-offs for FL as a proxy for prematurity (7.1–7.9 cm) and LBW (<6.9 to <7.9 cm) also led to differences in sensitivity and specificity.28 34 71 This variation in cut-offs may be attributed to the higher frequency of premature newborns in some studies.40 56 Furthermore, most studies evaluating FL and PCS for GA were conducted in hospital settings. Hospital settings, with higher incidences of prematurity, asphyxia, sepsis, growth retardation and maternal complications such as pre-eclampsia/eclampsia, gestational diabetes and anaemia, further added to this variability.72 73 Moreover, differences in healthcare settings, staff training and access to care between urban hospitals and rural areas in LMICs also contributed to this heterogeneity. Hospital-based studies often involve trained medical personnel, whereas rural settings may lack such resources, impacting the accuracy and generalisability of PCS and FL measurements.

South Asia exhibits a high prevalence of premature and LBW neonates, with variations in the diagnostic accuracy and optimal cut-offs for FL measurements when compared with other regions such as sub-Saharan Africa. Studies conducted in Asia show FL cut-offs ranging from <6.842 to <7.75 cm59 for identifying preterm, whereas studies from Africa have cut-offs ranging from <7.140 to <8 cm74 for the same purpose. Similarly, for identifying LBW neonates, Asian studies report FL cut-offs ranging from <7.445 49 to <8.0 cm59 while African studies show cut-offs ranging from <6.940 to <8.0 cm.55 These differences arise due to distinct population characteristics and genetic profiles, necessitating different cut-offs.75 76 This regional variability highlights the inherent complexity of applying a one-size-fits-all approach to neonatal assessments.77 Universal application without adjustments can lead to inaccurate assessments, potentially compromising the quality of care and intervention strategies. Therefore, while these diagnostic tools are valuable, their use must be tailored to regional contexts to achieve precise and reliable outcomes.78

This systematic review and meta-analysis has several limitations. First, relying on binary outcomes. Using categorical outcomes like LBW versus not LBW or preterm versus not preterm in LMICs offers advantages. These endpoints simplify data collection and interpretation, making it more feasible in resource-limited settings. Different cut-offs for continuous variables like FL introduce variability, complicating comparisons. Categorical outcomes provide clear, standardised criteria that facilitate decision-making and policy implementation and second, pooling individual-level data for continuous analysis was challenging due to logistical constraints, variations in data quality and limited access to advanced statistical tools, making categorical outcomes a more straightforward, actionable and accessible approach to addressing public health concerns in LMICs.

A deviation from the PROSPERO protocol in the manuscript regarding the inclusion of quality assessment of reference standards is acknowledged, as these factors contribute to heterogeneity across the studies. However, the overall methodology remains consistent with the PROSPERO protocol.

This study highlights the need for standardised measurement protocols and improved data collection methods. By carefully examining the quality of evidence related to reference standards, we recommend implementing uniform protocols for PCS and FL measurements across LMICs to ensure consistency and reliability. Additionally, it is crucial to invest in robust data collection and management systems to enhance the accuracy and applicability of GA and birthweight assessments. Furthermore, the policies should prioritise skill development, quality assurance and supportive supervision for healthcare providers conducting GA and birthweight assessments. These measures will ultimately lead to better neonatal health outcomes.

Conclusion

In conclusion, this review reveals significant variability and methodological inconsistencies in using PCS methods and FL measurements for estimating GA and LBW in LMICs. The observed high heterogeneity across studies suggests a cautious interpretation of the results and calls for future research to be focused on validating and adapting these tools to better suit the specific contexts of diverse LMIC settings.

Data availability statement

All data relevant to the study are included in the article or uploaded as online supplemental information.

Ethics statements

Patient consent for publication

Acknowledgments

We acknowledge Khawaja Mustafa (KM), the librarian of Aga Khan University, who helped us in developing search strategies. I dedicate this manuscript to my late father Professor Pirbhulal Tikmani and my Late mother Asha Devi (Revti).

References

Footnotes

  • Contributors SST, NB, AM, SS and TM conceptualised the study and developed the methodology. SST drafted the initial version of the paper. NB, AM, SS and TM provided critical feedback on the manuscript. MU, QA and AR screened the studies and reviewed full-text extract data synthesis and interpretation. SK and AH data analysis and interpretation. SST is the guarantor. I used AI for grammar and spelling checks.

  • Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.

  • Competing interests None declared.

  • Patient and public involvement Patients and/or the public were not involved in the design, or conduct, or reporting, or dissemination plans of this research.

  • Provenance and peer review Not commissioned; externally peer reviewed.

  • Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.