Discussion
Various anthropometric measurements can be used to identify LBW and preterm babies. In this systematic review and meta-analysis, we evaluated diagnostic accuracy of four common anthropometric measurements (FL, MUAC, CC and HC) reported by 21 studies conducted in eight African countries.
The pooled measurement agreement between birth weight and CC shows a strong positive correlation. The SROC from this meta-analysis also shows that CC (at 28.8 cm optimal cut-off point) has high diagnostic accuracy for identifying LBW babies, as its curve has the largest AUC. These results are in line with results reported by a previous meta-analysis of similar studies in developing countries.22 The diagnostic accuracy observed in CC is partly due to the fact that chest has a larger cross-section with less chance of systematic or random errors in measurement.28 In addition, while CC has high accuracy with clear nipple line landmarks, its use has lower acceptance due to the hypothermia risk from undressing newborns.8 11 15 However, preterm neonates with hyperinflammation of the chest including meconium aspiration appear to have increased lung volume and heavier weight. Therefore, using their CC could lead misclassification of the actual birth weight.36
The pooled estimate for FL shows that it has moderate to strong positive correlation with gestational age and with birth weight. Another important finding in this meta-analysis was the higher correlation observed between FL and birth weight when the measurement was done by a hard ruler or calliper rather than non-stretchable tape. This meta-analysis also found that FL (at 7.2 cm optimal cut-off point) has high diagnostic performance with AUC of 0.91 for detecting preterm babies. Similarly, a previous systematic review reported that FL (at optimal cut-off point <7.3 and <7.9 cm) had relatively high sensitivity and specificity to classify very LBW and LBW infants.23 Another study in Bangladesh also found that FL measured with a firm ruler has better accuracy than by tape measurement.37 Based on qualitative observations of FL measurement procedures, one of the included studies reported that FL measurement using a ruler was simple to learn and explain to others.24
Different studies also reported that FL has minimum interobserver variability and has comparable diagnostic ability until the fifth day of life12 20 and can be performed with simple and available equipment with minimal exposure to the infant. A study conducted in Uganda reported that CC and FL measurements taken by midwives and community health workers have no significant mean difference.18
A logical conclusion that could be drawn from these findings is that FL measurement could be a candidate for community-level identification of LBW and preterm babies. However, other studies reported that FL measurement requires proper positioning to minimise the measurement bias caused by grasp reflex.12
We also observed the need to standardise landmarks for FL measurement as some of the included studies measured from the heel to the tip of the longest toe (hallux or second toe),10 12–15 17 20 while others measured from the heel to tip of the big toe6 11 20 21 24 26; and from the centre of the heel pad to the middle of the tip of the big toe.16 In addition, the correlation between FL and birth weight measurements has significant differences based on the subgroup analysis of the foot landmarks used in the studies. To this end, we suggest that measurements taken from the heel to the tip of the longest toe will provide the maximum distance and can reduce the chance of systematic or random errors as observed in the CC measurement.28
Furthermore, soft tissues of subcutaneous fat are decreased in infants small for gestational age. As FL is based on the measurement of the soft tissue and the bone size, it might give inaccurate estimation when used to measure GA in this group of newborns.20 38
There is a strong positive correlation between MUAC and birth weight measurements in this analysis. The pooled diagnostic accuracy of MUAC, measured by AUC, was also high for identifying preterm and LBW babies. Similarly strong correlation was observed between birth weight and MUAC by a previous meta-analysis study.22 However, the study conducted in Uganda reported that MUAC measurements taken by midwives and community health workers differ significantly.18 Despite the fact that locating the appropriate landmark for MUAC measurement might contribute to intraobserver and interobserver variability, the measurement is familiar for many community health workers due to its routine application in growth monitoring and nutritional assessments.11 The study conducted in Ethiopia reported that MUAC has comparable sensitivity and specificity to identify LBW babies until fifth day of life, however, it is important to note that for clinical interventions to be most beneficial to these neonates, identification is required as soon as possible after birth.16
There was also a wide CI for the pooled diagnostic accuracy estimate of MUAC to identify preterm babies. Despite the fact that gestational age is best assessed by early ultrasound examination, most of the included studies have used other methods including last menstrual period, fundal height or New Ballard Score.39 Hence, the variation of GA estimation method used across these studies and their level of uncertainty could be the reason for the wide confidence intervals of the poled estimates.
The meta-analysis also shows that HC has a significant positive correlation with birth weight. We also found that it has comparable diagnostic accuracy to detect LBW babies when compared with other anthropometric surrogates. However, in Uganda, it was observed that the measurements differ significantly when performed by midwives and community health workers.18 Furthermore, HC measurement has more challenges with subjective landmarks and high liability of head moulding, especially during prolonged and obstructed labour.6 7 21 25 27
Strengths and limitations
In this study, two reference tests (ie, birth weight and gestational age) were considered. The review included the four main anthropometric measurements and evaluated their diagnostic accuracy for detection of both LBW and preterm babies. In addition, multiple thresholds of the index anthropometric indicators were included in calculating the pooled estimates for the diagnostic accuracy. We also calculated pooled estimates of the correlation between the reference and index tests irrespective of the different cut-off points used in the included studies. This was used to evaluate the performance of the anthropometric measurements in predicting birth weight and gestational age without being influenced by the multiple thresholds used across the studies. However, our study is limited because while birth weight is affected by sex, subgroup analysis on sex differences was not performed as the studies did not report the required information. We were also unable to evaluate the diagnostic performance of HC in detecting preterm babies because only two studies reported the required information. Although efforts were made to identify possible sources of heterogeneity, there is considerable uncertainty in most of the pooled estimates in this meta-analysis. In addition, different anthropometric measurement modalities were used across studies. However, only few studies reported specific FL measurement techniques and landmarks to enable deep comparison across subsets.