We developed 79 QIs for paediatric and perinatal care using a RAND-modified Delphi method based on CPGs and existing QIs. These QIs are designed to be defined based on the Japanese administrative database. They have the advantages of being a cost-effective source of information, reducing the risk of selection bias and being easily adopted by participating hospitals. The use of QIs has the potential to raise the standards of quality in paediatric and perinatal care in Japan. Our single-site practice test showed that most of the proposed QIs were measurable in real-world clinical practice using administrative databases and that there was a wide variation in their performance.
QIs and their analyses are beneficial for identifying gaps and areas for improvement, informing the creation of future best practices, tracking progress in quality improvement and providing insights into the management of each condition. Poor adherence may affect patient outcomes or lead to a substantial worsening of disease, death and increased healthcare costs. Adherence gaps and practice variation persist despite decades of development and endorsement of CPGs, and efforts have been made to close this gap.29 Efforts to enhance the quality of guidelines, measure (monitor) and analyse QIs and improve adherence to guidelines are essential for achieving better health service provision. QIs could provide useful information regarding institutional-level and regional-level disparities in quality of care, which could contribute to updating health policy and improving the health provision system. While most QIs are localised considering the difference in healthcare systems and cultures, it would be helpful to consolidate QIs and related information worldwide considering generalisability; comparison of measurable QIs between countries/regions would also help quality improvement.
Our practice test was based on one institution but included all indicators developed in this study. Three proposed QIs were not calculated, partly because the data were not recorded in healthcare insurance regulations. Some QIs had a smaller number of patients than expected, which is partly because of coding practices (eg, ‘Acute focal bacterial nephritis’ tended to be coded as ‘Acute pyelonephritis’). Results also showed that the Japanese Administrative Database is one of the appropriate sources of information for QIs. Further benchmarking of QIs would be attractive in the Japanese setting, while it seemed hard to compare QIs across countries using administrative databases without a common data structure.30
This study has several implications for future research. First, the feasibility of using these QIs in other hospitals needs to be evaluated. Second, the use of the proposed QIs needs to be evaluated; future studies should assess if and how these QIs contribute to quality improvement, including changes in the behaviour of physicians and the frequency of unwarranted events. Third, implementing these QIs requires continuous updates, evaluations and adaptation.21
Strength and limitations
One of the main strengths of this project is the use of routinely collected administrative data, which enables the monitoring of quality issues throughout the healthcare system. Second, administrative data are relatively inexpensive to collect compared with primary data collection.31 Although QIs may be biased by intrahospital heterogeneity,32 we do not suffer from sampling bias, which avoids non-response and recall bias.33 The final strength of the project is the inclusion of a data collection and calculation phase to assess the feasibility of indicator measurement.
Despite the advantages, the study acknowledges certain limitations. The composition of panels and subcommittees may influence consensus development, potentially leading to a biased selection of QIs. Although a two-step, two-round assessment was implemented to reduce bias, the representativeness of panel members could still impact the validity of the consensus method. Notably, patient perspectives were not adequately included due to the absence of patient representation in the panels. Indeed, patient participation in QI development has been limited.34 35 This may be a general limitation of QI development based on guidelines; however, it needs to be resolved for further studies, such as updating these QIs.
Furthermore, the study focused on QIs applicable to the administrative database, excluding aspects of quality not relevant to this specific data source. The accuracy of the proposed QIs may have some limitations due to the coding process, even though efforts were made to ensure measurability by medical experts familiar with the database. Although the DPC (administrative) database was used for epidemiological studies including QIs9–12 27 28 with efforts on its validation studies,36 differences in coding practices between hospitals could affect QI results, and the limited generalisation of indicator performance due to a single-site practice test also needs to be considered. Further practice tests with more hospitals were also required to strengthen its feasibility and assess its reliability and adaptability. In addition, the proposed QIs did not include timely administration of medications/procedures due to the unavailability of hours-level data.
It would be preferable to a set of QIs where the link between process and outcome is established. However, we have developed QIs without consideration of the linkage between process measures and outcome measures. This is partly because it is not easy to define appropriate outcome measures related to process measures using the administrative database. For example, the validation regarding adverse-related ICD-10 codes (T79-T88) is not established, and the timing of diagnosis is lacking in a Japanese setting. Further efforts focusing on the linkage would be an attractive approach to facilitating quality improvement.
Moreover, evidence-based QIs are only as robust as the underlying evidence they are based on. Existing problems with CPGs, such as redundancy, lack of currency and concerns about the quality of evidence, may systematically under-represent or overrepresent certain aspects in specific clinical areas.
Finally, it is essential to recognise that evidence-based QIs can only be as reliable as the underlying evidence they are based on. CPGs have been extensively studied, revealing issues such as redundancy, outdated information, inconsistent structure and content and overly lengthy documents. Moreover, concerns persist regarding the quality of evidence supporting CPGs, leading to potential under-representation or over-representation of certain aspects in specific clinical areas.
In conclusion, this study offers valuable insights into the development and application of QIs for paediatric and perinatal care in Japan. While the use of administrative data is advantageous, there are notable limitations related to consensus development, patient perspectives, database relevance, coding accuracy, coding practices and evidence-based QIs. Awareness of these limitations helps to ensure the appropriate interpretation and utilisation of the proposed QIs in enhancing healthcare quality.