- Review
- Open access
- Published:
Accuracy of artificial intelligence in caries detection: a systematic review and meta-analysis
Head & Face Medicine volume 21, Article number: 24 (2025)
Abstract
Introduction
Artificial intelligence (AI) has significantly transformed the diagnosis and treatment of dental caries, a prevalent issue in oral health care. Traditional diagnostic procedures such as eye inspection and radiography have limitations in detecting early-stage degradation. Artificial intelligence (AI) provides a viable alternative to improve diagnostic precision and effectiveness. This systematic review examines the diagnostic precision of artificial intelligence systems in identifying dental caries using X-ray images.
Methodology
The literature search utilized electronic web resources such as PubMed, Scopus, Web of Science, IEEE Explore, Google Scholar, Embase, and Cochrane. We conducted the search using specific MeSH key phrases and collected data up to January 2024. The QUADAS-2 assessment method was used to assess the risk of bias using a graph and a heat map. We conducted the statistical analysis using R v 4.3.1 software, which included the “meta,” “metafor,” “metaviz,” and “ggplot2” packages. We displayed the results using odds ratios (OR) and forest plots with a 95% confidence interval (CI).
Results
We used a comprehensive search approach in accordance with the PRISMA guidelines to find appropriate studies. The meta-analysis incorporates fourteen of the 21 articles included in this review. The research mostly uses convolutional neural networks (CNNs) for analyzing images, showing outstanding accuracy, sensitivity, and specificity in detecting caries. Significant variability in study results highlights the need for additional research to comprehend the components affecting AI effectiveness.
Conclusion
Despite challenges in implementation and data availability, this systematic review provides essential information about AI and shows great potential caries detection, improve diagnostic consistency, and ultimately enhance patient care in dentistry.
Introduction
Dentistry is not an exception to the new paradigm in diagnosis and treatment brought about by the introduction of Artificial Intelligence (AI) [1]. The development of AI technology has substantially aided in the diagnosis and treatment of dental caries, a frequent but difficult issue in oral health care [2]. Dental caries is still a common condition throughout the world, affecting a large percentage of people in all age categories [3]. While visual inspection and radiographic interpretation are excellent approaches for detecting caries, they are not always reliable in capturing the early stages of decay, especially when it is hidden or below the cavosurface [1, 3]. These constraints should be overcome by the introduction of AI and machine learning in dentistry, which will provide a new standard for diagnostic precision and effectiveness [1].
In the past, dental professionals' knowledge has been crucial in detecting dental caries through the use of instruments like dental explorers, visual exams, and traditional radiography [4]. Despite their widespread use, these techniques have certain inherent drawbacks. Due to the high degree of subjectivity in the manual procedure, practitioners' diagnosis accuracy varies greatly depending on their background and level of skill [5]. Additionally, even though they are essential, traditional radiographs can occasionally miss early-stage caries, particularly if they are occlusal or interproximal, where visual access is restricted [2, 5].
The invention of laser fluorescence equipment, which may identify changes in tooth structure suggestive of caries, and the introduction of digital radiography, which provides improved imaging capabilities, have both marked key turning points in the evolution of caries detection [6]. Even though these developments strengthen diagnostic capacities, they still need to be interpreted, and artificial intelligence can help improve them even more [5, 6].
AI has played a leading role in revolutionizing caries detection, particularly in the areas of machine learning and deep learning models like convolutional neural networks (CNNs) [7]. With the use of enormous datasets of dental imaging, these systems may learn to recognize patterns and abnormalities that might point to the existence of caries. AI's strength is its ability to process and interpret data at a speed and scale significantly faster than that of humans, which lessens the subjectivity involved in conventional diagnostic techniques [1, 7].
Annotated datasets, which classify dental photos for the presence or absence of caries, are used to train AI models. These models are trained to recognize subtle characteristics of dental caries on radiographs, such as alterations in tooth structure and density that might not be immediately noticeable to the human eye. This skill is especially helpful for managing dental caries early on because it enables treatments that can stop or reverse the spread of decay, protecting tooth structure and enhancing oral health [8].
Recent research has shown that artificial intelligence (AI) is highly sensitive, specific, and accurate at identifying dental caries [8, 9]. These studies evaluate how well AI algorithms perform in comparison to the diagnostic judgments of skilled dental experts. The results frequently show that AI is just as good as or better than humans at identifying dental caries. These findings have significant ramifications since they imply that artificial intelligence (AI) could be a useful supplementary instrument in dental diagnostics, offering a second opinion that improves caries detection accuracy and lowers the possibility of oversight [10, 11].
Furthermore, AI's capacity to reliably interpret radiographs and other diagnostic pictures may standardize the identification of dental caries, lowering practitioner variability and possibly producing more consistent treatment results [12]. Dentists may be able to streamline workflows and concentrate more on patient care and less on diagnostic uncertainty by using AI in dental practices.
AI's precision in detecting dental cavities is more than simply a technical marvel; it also directly improves patient care and clinical results. Early and precise caries detection can result in prompt intervention, stopping the spread of decay, protecting the natural structure of the tooth, and eventually lowering the need for more involved and expensive treatments [13]. AI-enhanced diagnostics can also promote a proactive approach to dental care by enhancing patient education and engagement by enabling patients to view and comprehend their oral health state [1].
There are obstacles to the broad use of AI-assisted caries diagnosis, despite the encouraging developments in this field [14]. These include protecting patient privacy and security, integrating AI tools into current dentistry office operations, and requiring large datasets for the purpose of training AI models [15]. Further research is required to confirm AI's effectiveness in a variety of contexts and demographics, as well as to investigate how it might be used in tandem with other cutting-edge technologies [14, 15].
As time goes on, artificial intelligence has more applications in dentistry than only detecting cavities. AI has the potential to completely transform a number of dental care processes, including individualized treatment planning and predictive analytics for identifying risk factors for oral illnesses [1, 2, 8]. The application of artificial intelligence (AI) in dentistry is a convergence of technology and healthcare that has the potential to improve patient care globally in terms of accuracy, effectiveness, and quality [14, 15]. The objective of this meta-analysis and systematic review is to examine how well deep learning algorithms and convolutional neural networks (CNNs), two cutting-edge AI techniques, can identify dental caries. Through a systematic examination of data from many sources, such as bitewing, panoramic, and periapical radiographs, this systematic review aims to offer a thorough evaluation of AI's diagnostic precision, sensitivity, and specificity in caries discovery. In addition, it looks into the consequences of integrating AI into clinical practice, covering both the potential benefits and potential drawbacks. AI has the ability to completely transform dental diagnoses and treatment planning as it develops, ushering in a new age in dental healthcare.
Methodology
Protocol of the study
The present systematic review followed the PRISMA guidelines presented for systematic review and quantitative analysis. This systematic review is registered in PROSPERO with the registration number: CRD42023482739.
Research questions
Studies regarding the diagnostic accuracy of artificial intelligence for dental X-ray in caries detection were chosen based on the “PICOS” (PRISMA-P 2016) technique with following research question:
-
1.
What is the overall diagnostic accuracy of artificial intelligence systems for dental X-ray images?
-
2.
How does the accuracy of artificial intelligence systems vary depending on the type of dental structure or lesion being assessed?
-
3.
How does the accuracy of artificial intelligence systems vary depending on the type of dental radiograph being used?
-
4.
What are the factors that influence the accuracy of artificial intelligence systems for dental X-ray images?
-
5.
What is the impact of using artificial intelligence systems on the clinical workflow of dental professionals?
PICOS
P (population): Patients undergoing dental X-ray imaging.
I (intervention): AI systems for the detection and segmentation of dental structures and lesions on X-ray images. These systems typically use machine learning algorithms to analyze dental X-ray images and identify dental structures and lesions. Convolutional neural networks (CNNs), Deep neural networks (DNNs), Support vector machines (SVMs), Decision trees, Random forests.
C (comparison): Human dentists or traditional methods of dental X-ray image interpretation.
O (outcome): The accuracy, sensitivity, and specificity of AI in detecting dental caries.
S (study design): Prospective and retrospective cohort studies and case–control studies etc.
Search strategy
We conducted an electronic database search to find papers that discuss the use of artificial intelligence in caries detection, identification, and protocol creation. Up until January 2024, the search encompassed all pertinent research, with no limitations on the year of publication. We searched PubMed, Scopus, Web of Science, and Embase among other databases. We obtained articles using MeSH terms and keywords. We also investigated supplementary databases such as Cochrane, IEEE Xplore, and Google Scholar in addition to the search.
The search strategy employed MeSH terms and relevant keywords combined with Boolean operators “OR” and “AND” to ensure comprehensive coverage. The search used keywords such as “artificial intelligence,” “machine learning,” “deep learning,” “caries,” “dental caries,” “panoramic radiography,” “peripical radiography,” “bitewing radiography,” “diagnosis,” “diagnosis time,” “treatment planning,” “bias,” and “algorithm bias,” among others. These terms were combined using appropriate boolean operators to refine the search, as detailed in Table 1.
Study selection
We imported the search results into EndNote X8 software, where we identified and removed duplicate records. We determined the eligibility of the articles through a two-step screening process, first reviewing the abstracts and then examining the full-text articles. This systematic approach ensured that only studies meeting the predefined inclusion criteria were considered for the review.
Eligibility criteria
Three examiners utilised the PICOS approach to review the entire text of papers and eliminate animal experiments and research published in languages other than English. Figure 1 illustrates the criteria for inclusion and exclusion set by the examiners.
Data extraction
The authors, A.M.L. and N.N.F.R., undertook a thorough data extraction process following an electronic literature search completed on May 19, 2023. The final search included research up to January 2024. Specific criteria guided the selection of the articles, ensuring the inclusion of only relevant studies on the application of artificial intelligence in caries detection.
We deemed the involvement of a third reviewer necessary to address any inconsistencies or misunderstandings that may arise during the selection process. The third reviewer was instrumental in reconciling any discrepancies among the primary reviewers, guaranteeing the inclusion of only studies that fulfilled all eligibility requirements in the final analysis.
Data extracted from the selected studies included key characteristics such as the following:
-
Authors: Names of the study authors.
-
Year: Year of publication.
-
Study Design: Type of study design used (e.g., retrospective, cross-sectional, randomized controlled trial).
-
AI Algorithm: The specific AI algorithm or model employed (e.g., CNNs, DNNs, SVMs etc.).
-
Number of Samples: Total number of samples or participants included in the study.
-
X-ray Type: Type of radiographic images used (e.g., panoramic, periapical, bitewing).
-
Comparator: The reference standard or comparator used in the study (e.g., trained dental professionals, other AI models).
-
Evaluation Metrics: Metrics used to evaluate AI performance (e.g., accuracy, sensitivity, specificity, AUC).
-
Outcomes: The main findings or outcomes of the study.
This systematic and inclusive approach guaranteed the precise and comprehensive retrieval of data, establishing a strong basis for the further examination and integration of the findings.
Risk of bias and quality assessment of research articles
The authors (A.M.L. and N.N.F.R) evaluated the total number of included studies based on the revised version of the earlier published risk of bias assessment tool [16]. This quality assessment was done by the QUADAS-2 Assessment Bar Graph and heat map. This graph provides a visual representation of the risk levels across different assessment domains for each study. The domains evaluated include patient selection, index tests, reference standards, flow and timing, and applicability concerns. The color gradient in the bar graph ranges from green to red, indicating a risk level from low (1) to high (3). This visual format allows for a quick assessment of which areas might be problematic or which studies generally exhibit higher risks of bias.
Statistical analysis
The statistical study was performed using R v 4.3.1 software along with the "meta", 'metafor", "metaviz", and " ggplot2" packages. We presented the findings using odds ratios (OR) and the percentage of forest plots within a 95% confidence interval (CI).
Results
Study selection outcomes
MeSH keywords were utilized to retrieve articles from various databases. The PRISMA flowchart (Fig. 2) illustrates the study selection procedure. Initially, the database search yielded 783 documents. We excluded 562 records before screening because they were duplicates or irrelevant, leaving 221 records available for screening. After conducting a thorough evaluation, we eliminated 124 records that did not meet the inclusion criteria, based on a review of their titles and abstracts. After evaluating 97 full-text papers for suitability, we removed 76 due to factors such as missing data, incorrect study design, or outcomes that did not align with the review's objectives. The qualitative synthesis included 21 studies. Of these, 14 papers met the criteria for inclusion in the quantitative meta-analysis.
Study features
Tables 2 and 3 provide a comprehensive overview of the basic features of the 21 studies included in the systematic review. The studies span from 2017 to 2023 and involve a diverse range of populations with a notable number of studies conducted in Asian and European contexts. Studies conducted across various populations including China, South Korea, Brazil, Turkey, France, USA, UK, Taiwan, Germany, and India. There are 21 studies listed, the majority of which were published between 2021 and 2023.
Journals varied across fields like Bioengineering, Neural Computing, and Clinical Oral Investigations.
The majority of the studies are either retrospective or analyze historical data to assess the efficacy of AI models. The studies are either retrospective or cross-sectional, with the aim of evaluating the current accuracy of AI systems. This implies that the field prefers observational research designs. These study types are conducive to analyzing existing datasets and are commonly used in medical research, where prospective trials may be impractical or unnecessary.
Convolutional Neural Networks (CNNs) are the predominant algorithm used across the studies [18,19,20,21, 24,25,26,27,28,29,30,31,32, 34, 35]. This reflects CNN's strengths in handling image data, making them ideal for radiographic image analysis in dental settings. Studies by Chen et al. [17] and Srivastava et al. [37] talked about similar AI tools, such as the EfficientNet and ResNet models, which showed a preference for strong neural network architectures when dealing with large amounts of complex image data.
In all studies, accuracy is a commonly reported metric, with several studies highlighting AI models achieving accuracy rates of over 90%. High Accuracy Levels: Several studies, such as Chen et al. [17] with an accuracy of 95.44% and Zhu et al. [18] with 93.64%, demonstrate that CNNs consistently achieve high accuracy. Bayraktar et al. [19] and Huang et al. [20] reported similar high accuracy levels, reinforcing the reliability of CNNs in dental diagnostics. This high level of accuracy suggests that AI could play a crucial role in improving diagnostic precision in dental radiography. The reported sensitivity rates vary, but they are generally high, with many studies noting rates above 80%. This implies that AI models excel at accurately identifying patients with the condition under examination. High specificity rates indicate a strong ability of AI models to correctly identify those patients who do not have the condition, reducing the risk of false positives. The sensitivity and specificity metrics were particularly notable in studies such as Mao et al. [21], who achieved a sensitivity of around 94% and a specificity close to 95%. The findings of Lee et al. [24] and Bayraktar et al. [19] closely mirrored these rates, demonstrating consistent detection capabilities of AI across different studies.
Overall prevalence of accuracy of the caries detection
Figure 3 displays a forest plot showing the prevalence of accuracy in AI for caries detection, including data from fourteen studies conducted between 2017 and 2023 (Table 4). The studies had sample sizes ranging from 15 to 11,000 people, with stated accuracy varying from 73.3% to 98.8%. Each study's outcome contains the odds ratio (OR) and the p-value, indicating the statistical significance of the data in comparison to a standard value.
Most of the research has high accuracy rates, with six studies achieving 95% accuracy. Lian et al. [28] and De Araujo Faria et al. [22] both achieved accuracies of 99%. Moran et al. [27] showed the lowest accuracy at 73.3%, with a broad confidence interval (CI) ranging from 65.11% to 81.49%, suggesting a less exact estimate likely due to a smaller sample size.
The odds ratios (ORs) differ significantly among the researchers, suggesting variations in the impact of artificial intelligence (AI) on caries detection relative to a control group or expected standard. Lian et al. [28] found an OR of 1.469, showing a large positive effect, whereas Moran et al. [22] reported an OR of 0.052, suggesting a much smaller effect compared to other studies. The paper presents an odds ratio of around 2.72, with a 95% confidence interval of 14.65 to 14.65, indicating the significant effectiveness of AI in detecting caries in several investigations.
The I2 value of 88.0% indicates significant heterogeneity among the study outcomes, likely stemming from variations in study designs, AI technologies employed, sample sizes, or other study-specific factors. Most research provides statistically significant results with p-values significantly lower than 0.05, except for Lee et al. [24], who reported a p-value of 0.08, showing results that are not statistically significant at the customary 5% level.
Accuracy of caries detection by different Xray technique
Forest plot (Fig. 4) appears to be some variation in the accuracy of AI in caries detection depending on the type of x-ray used. Overall, the studies included in the forest plot suggest that AI can be accurate in caries detection with different x-ray types, but some methods may be more accurate than others. The accuracy for digital bitewing x-rays appears to be high (possibly around 80% based on one study), with a narrow confidence interval, which suggests a precise estimate (Fig. 4a). The accuracy for digital panoramic x-rays appears to be lower than for bitewing x-rays, with one study showing an accuracy around 70% and a wide confidence interval, indicating less precision in the estimate. (Fig. 4b) Periapical x-rays show an accuracy around 76%, but the confidence interval is wide, so the estimate is not very precise. More research is needed to draw conclusions about periapical x-rays (Fig. 4c).
Overall sensitivity of AI accuracy in caries detection
We conducted a forest plot and meta-analysis to evaluate the sensitivity of AI in caries detection across multiple studies (Fig. 5/Table 5). The meta-analysis encompassed nine studies conducted between 2020 and 2023, with a total of 17,190 participants. Sensitivity ranged from 71% to 98.85%, with corresponding odds ratios (OR) ranging from 2.448 to 85.957 across the studies. The pooled analysis yielded an overall OR of 1.258 (95% CI: 0.493—2.540), indicating no significant association between the examined factor and the outcome. However, substantial heterogeneity was observed among the studies (I2 = 86.03%, p < 0.05), suggesting variability in effect sizes beyond chance. These findings underscore the importance of considering factors contributing to heterogeneity in future research and clinical practice.
Figure 6 and Table 6 displays the forest plot, which shows the specificity percentages, odds ratios (OR), and 95% confidence intervals (CI) from seven studies that looked at how well artificial intelligence (AI) systems could find cavities in teeth. The specificity values vary widely, from 0.82% to 98.19%, among different researchers. The weighted mean specificity across studies is around 87.90%, showing the great overall performance of AI in accurately recognising non-caries instances. The heterogeneity analysis shows a high Q statistic of 144,926.65 and an I2 statistic of 96.03%, showing substantial variability among trials beyond chance. Other variables, besides random sampling error, may be influencing the discrepancies in specificity estimates. The p-value of less than 0.001 confirms the existence of statistically significant heterogeneity. It is important to carefully examine study design, AI model properties, and other causes of variability when evaluating and applying results from AI studies to dental caries diagnosis.
Risk of bias
QUADAS-2 assessment bar graph observations (Fig. 7)
-
Patient Selection: Most studies show moderate to high risk, with more studies appearing in the orange and red zones.
-
Index Test: This domain also displays a range of risk, with several studies showing higher risk levels.
-
Reference Standard: Here, the risk seems varied, with some studies showing low risk (green) and others high risk (red).
-
Flow and Timing: Most studies are in the low-risk category for this domain, as indicated by the green color.
-
Applicability Concerns: The risks are generally low in this domain across the studies, with most bars colored green.
QUADAS-2 assessment heatmap (Fig. 8)
The heatmap offers a detailed view of the risk assessment for each study across the same domains. Each cell in the heatmap is colored based on the risk level:
-
1 (Low Risk): Green; 2 (Moderate Risk): Yellow; 3 (High Risk): Red
Observations
-
Patient Selection: Some studies demonstrate a high risk (red cells) due to concerns over patient selection and representativeness [24, 25, 33].
-
Index Test: Some studies [23, 35] have identified a high risk, suggesting that the index test may not have been completed according to a defined methodology or interpreted without knowledge of the reference standard.
-
Reference Standard: Multiple studies [28, 33] indicate a high danger, implying that the reference standard may not have been properly implemented.
-
Flow and Timing: While most studies in this field demonstrate low hazards, some exceptions [21, 22] exhibit moderate risks.
-
Applicability Concerns: Few studies show high concern for applicability [25], suggesting that the results might not apply to the intended patient population.
Discussion
Artificial intelligence (AI) has been used in dental diagnostics, revolutionizing caries detection by enhancing accuracy and efficiency. This systematic review and meta-analysis are intended to assess the sensitivity, specificity, and overall diagnosis accuracy of AI systems on various types of dental X-rays. Our research shows that AI routinely achieves high levels of diagnostic accuracy, frequently outperforming conventional methods and providing substantial promise for improving clinical results.
Advancements in AI and caries detection
AI technologies, namely machine learning models like CNNs, have demonstrated exceptional ability to recognize patterns in intricate datasets that surpass human vision [38]. Even experienced dentists find it challenging to identify minor changes in dental X-rays that could signal the initial phases of dental caries using conventional technologies [39]. Our results correspond with the umbrella review of Dashti et al. (2024), which indicated accuracy rates between 73.3% and 98.6% across several datasets. Both studies show that convolutional neural networks (CNNs) can help with dental diagnoses. However, the umbrella review stresses how unpredictable results can be because of different datasets and different methods used. This variability aligns with our findings of significant heterogeneity (I.2 = 88.0%) among the included studies [40]. The sensitivity and specificity statistics obtained from the multiple trials included in our meta-analysis demonstrate the strong performance of AI. Specificity rates of up to 98.19% and sensitivity rates of up to 98.85% have been found in studies [19, 20]. This shows that AI can cut down on both false negatives and false positives, making dental caries diagnostics more reliable [18].
Clinical implications of AI in dentistry
When it comes to clinical applications, the incorporation of AI into dental practices has significant implications. By enhancing the diagnosis process, artificial intelligence has the potential to lessen the mental burden placed on dental practitioners, enabling them to provide more focused patient engagement and care [41]. Furthermore, the high accuracy rates offered by cost-effective AI systems could result in early detection of dental caries, potentially enabling interventions to halt or even reverse the progression of decay [42]. Alternative technologies such as optical coherence tomography (OCT), laser fluorescence, and transillumination systems have been introduced as alternatives to traditional radiography [43]. These modalities can provide different types of information about caries lesions, such as subsurface structure and demineralization, which radiographs might miss. The application of AI in radiograph interpretation could complement and enhance these technologies, creating a more comprehensive diagnostic toolkit. By integrating AI with these approaches, dental practitioners can benefit from more accurate and early detection of caries, improving overall patient outcomes [42, 43]. This not only helps to maintain the natural structure of the teeth, but it also lessens the likelihood that more, more comprehensive, and more expensive treatments may be required in the future [1, 7, 37, 42]. According to Lee et al. (2021), the capability of artificial intelligence to deliver diagnostic outputs that are consistent and trustworthy can also help to standardize the quality of treatment that patients receive, thereby minimizing the variability that is caused by human variables such as fatigue or subjective determination [24].
Enhancing patient outcomes and trust
One of the most important advantages of artificial intelligence in dental diagnostics is the possibility that it may improve patient outcomes [44]. It is possible that more effective treatments and an overall improvement in oral health could result from the accurate and early identification of caries [11, 15, 17]. On top of that, the use of artificial intelligence has the potential to instill a higher level of faith in diagnostic procedures among patients. This is due to AI's potential to serve as an unbiased second opinion, thereby enhancing patients' trust in their dentists' suggested treatment plans [7, 8, 44].
Sources of heterogeneity
In this systematic review, we found substantial variation among the studies that were included, as evidenced by high I2 values in the different meta-analyses. This heterogeneity indicates that the differences in research outcomes are not only random, but rather, are impacted by multiple significant factors that require additional examination.
-
a
One of the primary sources of heterogeneity is the diversity of A) models employed across the studies. The studies included in this review utilized different machine learning algorithms, such as CNNs, DNNs, and SVMs, each with its own architecture, training dataset, and performance characteristics. The performance of these models can vary significantly depending on factors such as the size and quality of the training data, the specific algorithmic parameters, and the type of image preprocessing used. As a result, studies using different AI models may report varying levels of accuracy, sensitivity, and specificity, contributing to the observed heterogeneity.
-
b
Another significant factor contributing to variation arises from the diverse categories of radiographic images examined in the research included. The studies analyzed in this study utilized periapical, bitewing, and panoramic radiographs, each of which provides distinct difficulties and advantages in detecting caries. Another systematic review also reported that, Bitewing radiographs are highly efficient in identifying interproximal caries, whereas panoramic radiographs offer a wider perspective of the dental arch but with reduced precision. The diverse diagnostic accuracy of AI models in different imaging modalities is likely a factor in the observed variability in the study results [45].
-
c
The studies included in the analysis exhibited variations in their design and the populations they investigated. Differences in study design, such as retrospective versus prospective cohort studies, along with variations in sample numbers, demographic features, and clinical settings, can influence the generalizability and applicability of the findings. Research conducted in specific groups with different levels of dental caries or in different geographical areas may show varying levels of accuracy in diagnosis, which adds to the overall diversity.
-
d
Furthermore, variations in the evaluation and documentation of results among different research may also contribute to the presence of heterogeneity. The main goal of most of the studies was to test how well AI models could diagnose problems. However, there were differences in the exact metrics used (like accuracy, sensitivity, specificity, and area under the curve [AUC]) and the levels set for finding cavities. Variability in the results can arise from inconsistent reporting of outcomes and potential biases in the selection of cases or controls, making it difficult to directly compare research.
Challenges
Despite the above advantages, a number of obstacles hinder the widespread implementation of artificial intelligence in dental diagnostics. Protection of personal information is of the utmost importance, particularly with regard to the management of sensitive patient data [46]. Additionally, the incorporation of AI technologies into pre-existing clinical workflows presents a number of important hurdles, all of which require an investment of both time and money [47]. AI needs to be incorporated with regard for demographic risk, social determinants, health care service, and economic variables to value dentistry practice. Therefore, the development of AI systems with great specificity should be given top priority independent of the frequency context. High specificity helps to reduce overtreatment and directs focus on lesions that truly call for attention, therefore assuring more efficient use of resources [48].
Limitations
We should acknowledge several limitations, even though our systematic review and meta-analysis offer valuable insights into the diagnostic accuracy of artificial intelligence (AI) systems in detecting dental caries.
First, the heterogeneity among the included studies is significant, stemming from variations in AI models, radiographic techniques, and study designs. This variability may affect the generalizability of our findings across different clinical settings.
Second, the studies included in our analysis predominantly focused on specific types of radiographic images (such as bitewing or panoramic radiographs), which may limit the applicability of AI models to other imaging modalities or newer technologies that were not covered.
Third, the potential for publication bias exists, as studies with positive outcomes are more likely to be published, which could skew the overall results of our meta-analysis. Additionally, while we employed the QUADAS-2 tool for assessing the risk of bias in individual studies, we did not utilize QUADAS-C, which might be more suitable for comparative analyses.
Finally, particularly in populations with low disease frequency, inadequate specificity artificial intelligence systems could cause unwarranted interventions. Future studies should thus give top priority to the creation of highly specific artificial intelligence models capable of precisely differentiating between circumstances needing intervention and those not so demanding. Furthermore, taken into account in the integration of artificial intelligence into clinical practice should be demographic risk, socioeconomic determinants of health, and financial limitations to guarantee fair and efficient application [48].
Future directions
As we look to the future, it is vital to continuously create and validate AI models in order to handle these difficulties. The development of standardized protocols for artificial intelligence training is necessary. These protocols should include a diversity of training datasets in order to improve the robustness and usability of AI systems across a variety of clinical situations and individual populations. Additionally, future research should concentrate on the incorporation of artificial intelligence tools that can supplement conventional diagnostic procedures. This would lead to the development of a comprehensive diagnostic framework that capitalizes on the strengths of both human expertise and AI capabilities.
Furthermore, regulatory frameworks need to be adapted in order to keep up with the rapid advancements in technology. This is necessary in order to guarantee that artificial intelligence products are both safe and effective for clinical usage. When it comes to developing standards that govern the ethical use of artificial intelligence in healthcare, collaborations between researchers, doctors, and policymakers are absolutely necessary. These guidelines will ensure that patients benefit from these technologies without having their privacy or autonomy compromised.
Conclusions
AI has the ability to greatly improve the diagnosis process in dentistry, especially in detecting dental caries. This review emphasizes the high sensitivity and specificity rates of AI, showcasing its potential to enhance diagnostic accuracy, ultimately resulting in improved patient outcomes and streamlined clinical workflows. Dental professionals must address difficulties such as maintaining data privacy, incorporating AI into clinical practices, and improving AI models through comprehensive research to fully utilize AI in dentistry. It is crucial for the dental profession to adopt and apply innovations in a conscientious and ethical manner to improve patient care as we progress. This review anticipates increased collaboration between human skills and artificial intelligence in dentistry in the future, leading to remarkable enhancements in the quality and effectiveness of dental treatment.
Data availability
No datasets were generated or analysed during the current study.
References
Patcas R, Bornstein MM, Schätzle MA, Timofte R. Artificial intelligence in medico-dental diagnostics of the face: a narrative review of opportunities and challenges. Clin Oral Invest. 2022;26(12):6871–9.
Agrawal P, Nikhade P, Nikhade PP. Artificial intelligence in dentistry: past, present, and future. Cureus. 2022;14(7):e27405. https://doiorg.publicaciones.saludcastillayleon.es/10.7759/cureus.27405.
Frencken JE, Sharma P, Stenhouse L, Green D, Laverty D, Dietrich T. Global epidemiology of dental caries and severe periodontitis–a comprehensive review. J Clin Periodontol. 2017;44:S94–105.
Kumari M, Rafia AK, Shree R. Changing concepts in the diagnosis of dental caries: a review. Sci Arch Dent Sci. 2022;5(1):29–35.
Wenzel A. Radiographic modalities for diagnosis of caries in a historical perspective: from film to machine-intelligence supported systems. Dentomaxillofacial Radiology. 2021;50(5): 20210010.
Dayo AF, Wolff MS, Syed AZ, Mupparapu M. Radiology of dental caries. Dental Clinics. 2021;65(3):427–45.
Patil S, Albogami S, Hosmani J, Mujoo S, Kamil MA, Mansour MA, et al. Artificial intelligence in the diagnosis of oral diseases: applications and pitfalls. Diagnostics. 2022;12(5):1029.
Mohammad-Rahimi H, Motamedian SR, Rohban MH, Krois J, Uribe SE, Mahmoudinia E, Schwendicke F. Deep learning for caries detection: a systematic review. J Dent. 2022;122:104115.
Schwendicke F, Elhennawy K, Paris S, Friebertshäuser P, Krois J. Deep learning for caries lesion detection in near-infrared light transillumination images: a pilot study. J Dent. 2020;92:103260.
Hassani H, Amiri Andi P, Ghodsi A, Norouzi K, Komendantova N, Unger S. Shaping the future of smart dentistry: from Artificial Intelligence (AI) to Intelligence Augmentation (IA). IoT. 2021;2(3):510–23.
Slosiarova N, Mesarcik M, Jurkacek P, Podrouzek J. Trustworthy AI in dental care beyond Artificial Intelligence Act. Proc. of the 2nds International Workshop on Imagining the AI Landscape After the AI Act – HHAI 23, Munich, Germany, June 26-27, 2023, CEUR-WS.org, online https://ceur-ws.org/Vol-3456/short1-2.pdf.
Anil S, Sudeep K, Saratchandran S, Sweety VK. Revolutionizing dental caries diagnosis through artificial intelligence. 2023. https://doiorg.publicaciones.saludcastillayleon.es/10.5772/intechopen.112979.
Ing ME. Dental caries. In: Dental science for the medical professional: an evidence-based approach. Cham: Springer International Publishing. 2023. pp. 69–87. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/978-3-031-38567-4_7.
Fatima A, Shafi I, Afzal H, Díez IDLT, Lourdes DRSM, Breñosa J, Ashraf I. Advancements in dentistry with artificial intelligence: current clinical applications and future perspectives. Healthcare. 2022;10(11): 2188.
Dixit S, Kumar A, Srinivasan K. A current review of machine learning and deep learning models in oral cancer diagnosis: recent technologies, open challenges, and future research directions. Diagnostics. 2023;13(7):1353.
Karobari MI, Batul R, Khan M, et al. Micro computed tomography (Micro-CT) characterization of root and root canal morphology of mandibular first premolars: a systematic review and meta-analysis. BMC Oral Health. 2024;24:(1).
Chen IDS, Yang CM, Chen MJ, Chen MC, Weng RM, Yeh CH. Deep learning-based recognition of periodontitis and dental caries in dental x-ray images. Bioengineering. 2023;10(8): 911.
Zhu H, Cao Z, Lian L, Ye G, Gao H, Wu J. CariesNet: a deep learning approach for segmentation of multi-stage caries lesion from oral panoramic X-ray image. Neural Comput & Applic. 2023;35;16051–9.
Bayraktar Y, Ayan E. Diagnosis of interproximal caries lesions with deep convolutional neural network in digital bitewing radiographs. Clin Oral Invest. 2022;26(1):623–32.
Huang Y, Lee S. Deep Learning for Caries Detection using Optical Coherence Tomography. medRxiv. 2021. Preprint. https://doiorg.publicaciones.saludcastillayleon.es/10.1101/2021.05.04.21256502.
Mao Y-C, Chen T-Y, Chou H-S, Lin S-Y, Liu S-Y, Chen Y-A, Liu Y-L, Chen C-A, Huang Y-C, Chen S-L, et al. Caries and restoration detection using bitewing film based on transfer learning with CNNs. Sensors. 2021;21:4613.
De Araujo Faria V, Azimbagirad M, Viani Arruda G, Fernandes Pavoni J, Cezar Felipe J, Dos Santos EMCMF, Murta Junior LO. Prediction of radiation-related dental caries through PyRadiomics features and artificial neural network on panoramic radiography. J Digit Imaging. 2021;34:1237–48.
Hur S-H, Lee E-Y, Kim M-K, Kim S, Kang J-Y, Lim JS. Machine learning to predict distal caries in mandibular second molars associated with impacted third molars. Sci Rep. 2021;11:15447.
Lee S, Oh S-I, Jo J, Kang S, Shin Y, Park J-W. Deep learning for early dental caries detection in bitewing radiographs. Sci Rep. 2021;11:16807.
Vinayahalingam S, Kempers S, Limon L, Deibel D, Maal T, Hanisch M, Bergé S, Xi T. Classification of caries in third molars on panoramic radiographs using deep learning. Sci Rep. 2021;11:12609.
Mertens S, Krois J, Cantu AG, Arsiwala LT, Schwendicke F. Artificial intelligence for caries detection: randomized trial. J Dent. 2021;115: 103849.
Moran M, Faria M, Giraldi G, Bastos L, Oliveira L, Conci A. Classification of approximal caries in bitewing radiographs using convolutional neural networks. Sensors. 2021;21: 5192.
Lian L, Zhu T, Zhu F, Zhu H. Deep learning for caries detection and classification. Diagnostics. 2021;11: 1672.
Zheng L, Wang H, Mei L, Chen Q, Zhang Y, Zhang H. Artificial intelligence in digital cariology: a new tool for the diagnosis of deep caries and pulpitis using convolutional neural networks. Ann Transl Med. 2021;9:763.
Bayrakdar IS, Orhan K, Akarsu S, Çelik Ö, Atasoy S, Pekince A, Odabaş A. Deep-learning approach for caries detection and segmentation on dental bitewing radiographs. Oral Radiol. 2021;1:1–12.
Devlin H, Williams T, Graham J, Ashley M. The ADEPT Study: a comparative study of dentists’ ability to detect enamel-only proximal caries in bitewing radiographs with and without the use of assist dent artificial intelligence software. Br Dent J. 2021;231:481–5.
Chen H, Li H, Zhao Y, Zhao J, Wang Y. Dental disease detection on periapical radiographs based on deep convolutional neural networks. Int J Comput Assist Radiol Surg. 2021;16:649–61.
Geetha V, Aprameya KS, Hinduja DM. Dental caries diagnosis in digital radiographs using back-propagation neural network. Health Inf Sci Syst. 2020;8:8.
Cantu AG, Gehrung S, Krois J, Chaurasia A, Rossi JG, Gaudin R, Elhennawy K, Schwendicke F. Detecting caries lesions of different radiographic extension on bitewings using deep learning. J Dent. 2020;100: 103425.
Choi J, Eun H, Kim C. Boosting proximal dental caries detection via combination of variational methods and convolutional neural network. J Signal Process Syst. 2018;90:87–97.
Lee JH, Kim D-H, Jeong S-N, Choi S-H. Detection and diagnosis of dental caries using a deep learning-based convolutional neural network algorithm. J Dent. 2018;77:106–11.
Srivastava MM, Kumar P, Pradhan L, Varadarajan S. Detection of tooth caries in bitewing radiographs using deep learning. arXiv preprint arXiv:1711.07312. 2017.
Alzubaidi L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, et al. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J Big Data. 2021;8:1–74.
Akhter MN, Hussain SS, Riaz N, Zulfiqar R. Using technological diagnostic tools to find early caries: a systematic review. Dinkum Journal of Medical Innovations. 2023;2(07):271–83.
Dashti M, Londono J, Ghasemi S, Zare N, Samman M, Ashi H, Amirzade-Iranaq MH, Khosraviani F, Sabeti M, Khurshid Z. Comparative analysis of deep learning algorithms for dental caries detection and prediction from radiographic images: a comprehensive umbrella review. PeerJ Computer Science. 2024;10: e2371.
Mahdi SS, Battineni G, Khawaja M, Allana R, Siddiqui MK, Agha D. How does artificial intelligence impact digital healthcare initiatives? A review of AI applications in dental healthcare. International Journal of Information Management Data Insights. 2023;3(1): 100144.
Schwendicke F, Rossi JG, Göstemeyer G, Elhennawy K, Cantu AG, Gaudin R, Chaurasia A, Gehrung S, Krois J. Cost-effectiveness of artificial intelligence for proximal caries detection. J Dent Res. 2021;100(4):369–76. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/0022034520972335.
Abogazalah N, Ando M. Alternative methods to visual and radiographic examinations for approximal caries detection. J Ora Sci. 2017;59(3):315–22.
Khanagar SB, Al-Ehaideb A, Maganur PC, Vishwanathaiah S, Patil S, Baeshen HA, Bhandi S. Developments, application, and performance of artificial intelligence in dentistry–A systematic review. Journal of dental sciences. 2021;16(1):508–22.
Ammar N, Kühnisch J. Diagnostic performance of artificial intelligence-aided caries detection on bitewing radiographs: a systematic review and meta-analysis. Jpn Dent Sci Rev. 2024;60:128–36. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jdsr.2024.02.001.
Klumpp M, Hintze M, Immonen M, Ródenas-Rigla F, Pilati F, Aparicio-Martínez F, Delgado-Gonzalo R. Artificial intelligence for hospital health care: application cases and answers to challenges in European hospitals. In Healthcare. 2021;9(8):961.
Najjar R. Redefining radiology: a review of artificial intelligence integration in medical imaging. Diagnostics. 2023;13(17): 2760.
La Rosa GRM. Artificial intelligence in demineralized lesion detection: evaluating clinical benefits and economic disadvantages of artificial intelligence-based models. J Am Dent Assoc. 2024;S0002–8177(24):00591–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.adaj.2024.10.007.
Acknowledgements
None.
Funding
The current research did not receive any funding. The author(s) would like to express their gratitude to Ajman University and The Deanship of Research and Graduate Studies for their generous support in covering the APC fees for this publication, which has greatly facilitated the dissemination of this research.
Author information
Authors and Affiliations
Contributions
A.M.L was responsible for conceptualization, resources, supervision, validation, and writing the original draft. Both A.M.L and N.N.F.R contributed to data curation, formal analysis, methodology, supervision, and writing review and editing. Additionally, N.N.F.R handled the visualization tasks.
Corresponding author
Ethics declarations
Ethics approval and consent to participate:
Not applicable.
Consent for publications
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Luke, A.M., Rezallah, N.N.F. Accuracy of artificial intelligence in caries detection: a systematic review and meta-analysis. Head Face Med 21, 24 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13005-025-00496-8
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13005-025-00496-8