Skip to main content

Validation of a digital, partly automated three-dimensional cast analysis for evaluation of orthodontic treatment assessment

Summary

Background

Plaster models have been considered the gold standard in traditional orthodontic model analysis. Modern imaging techniques and ever-advancing technologies have expanded the scope of digital diagnostic tools. These innovations allow the use of devices specifically designed for the diagnosis of 3D structures. The aim of this method comparison study was to determine the accuracy and efficiency of digital measurements compared to conventional manual measurements on plaster models.

Materials and methods

The present cohort constitutes the evaluation of pretherapeutic situation models of 247 orthodontically treated patients (129 females and 118 males, average age 16.76 +- 9.49 years) with mixed or permanent dentition who were treated at the University Hospital RWTH Aachen between January 2018 and December 2020. Plaster models were digitised using a model scanner, and an experienced examiner performed various measurements on blinded plaster models using a calliper and on digital models using the specially developed ‘Tooth width analysis Aachen’ patch in the OnyxCeph3TM-3D software. The intra-rater and inter-rater reliability were determined by a second, blinded assessor. Measurements included tooth width, crown height, arch width, arch length and arch circumference, as well as overjet and overbite. Differences between analogue and digital methods were calculated.

Results

Differences of up to 0.3 mm were observed between manual and partially automated digital measurements for sagittal, transversal and vertical parameters. Teeth with close proximal contact to adjacent teeth and teeth in jaws with a negative space analysis result showed an increased difference between manual and partially automated digital measurements, although this was not clinically relevant. The time required to perform digital measurements was statistically significantly reduced.

Conclusions

Partially automated digital impression analysis offers an accurate, highly efficient and time-saving alternative to traditional manual impression analysis.

Peer Review reports

Introduction

Three-dimensional model analyses, based on plaster casts of the upper and lower dentition of a patient, constitute an essential component of orthodontic diagnostics [1]. Conventionally, model analysis is performed on plaster models, which succeeded for many years in orthodontic practice [2]. Nevertheless, due to the inherent limitations of conventional plaster dental casts, including increased storage space requirements and the potential for damage due to handling, digital scanners for obtaining virtual models and several software programs for digital model analyses have been developed and disseminated in orthodontics. There are a number of methods for obtaining digital models, including scanning of the patient’s dentition using an intraoral scanner at the office, scanning of an alginate or silicone impression, scanning of a plaster model with a laboratory scanner, and scanning using Cone Beam Computed Tomography (CBCTs) following 3D imaging reconstruction [3,4,5]. The advent of digital technology has made it possible for both soft and hard tissues to be visually represented electronically in a modern orthodontic practice. As a consequence, digital models are emerging as the new standard for orthodontic medical documentation, potentially superseding plaster models [3, 4]. However, its validity compared to traditional model analysis using callipers remains to be assessed, especially as a variety of hardware and software solutions are available to the user. The current research situation is supported by several comprehensive systematic reviews that consistently confirm the reliability of digital workflows in clinical applications. This broad evidence base, supplemented by current validation studies, emphasises the advanced state of development of digital workflow solutions in a clinical context [3, 4, 6]. In recent times, technological advances have enabled the creation of digital three-dimensional dental models, which have gained considerable popularity [7]. This may be attributed to the numerous advantages offered by digital analysis, as outlined below. Whether it is a tooth width measurement or a Bolton analysis, the literature shows that digital model evaluation can save a significant amount of time - seconds or even minutes [7,8,9,10]. It is a common occurrence for orthodontists to encounter difficulties when attempting to store their patients’ records due to the limited space available. This is particularly problematic for those who have been in practice for several years and are required to adhere to the legal retention periods for patient records. Therefore, is the elimination of the storage problem of plaster models another advantage, which require physical storage space. In contrast, digital models can be stored efficiently on hard drives or portable storage devices. This facilitates data access, which in turn opens up new opportunities for interdisciplinary communication and collaboration [6, 11,12,13]. Digital methods are becoming increasingly pertinent for orthodontic cast analysis. However, many digital methods are still time-consuming due to the involvement of numerous manual steps. Consequently, the development of partially automated and self-measuring tools has become of great interest.

Therefore, as part of a retrospective cross-sectional study of orthodontically treated patient cases to validate the specially developed ‘Tooth width analysis Aachen’ patch in the OnyxCeph3TM-3D software, we wanted to compare traditional manual model analysis of plaster models and semi-automated digital model analysis to develop another gold standard for digital orthodontic model analysis to be used in everyday orthodontic practice. The existing plaster models were digitized for this purpose, while intraoral scans were not routinely available. This digital model analysis should efficiently generate precise, objective and reproducible data that provides important information on the orthodontic treatment situation; for pre-therapeutic orthodontic treatment planning, for re-evaluation as part of an interim diagnosis or post-therapeutically for comparison with the pre-therapeutic initial situation and assessment of the orthodontic treatment quality.

The primary objective of this retrospective study was to evaluate the degree of agreement and accuracy between digital and analogue model analysis. As secondary outcomes, risk factors that increase measurement deviation and the required measurement time were evaluated. To this end, a sample of 247 initial models, all in the mixed dentition phase, were analysed in both analogue and digital models.

Material and method

Study design

The present study employs a method-comparison approach to assess the degree of agreement between digital and analogue model analysis, as well as the required time of each method. Ethical approval was obtained from the ethics committee of Medical Faculty of RWTH Aachen with the ethical registration number EK 232/20.

Data acquisition

The power calculation was performed using G Power 3.1 [14]. For the power analysis we used the smallest tooth, the premolar 15 and the smallest measurement method, the tooth width analysis, as the SD has the greatest effect/disadvantage here. A total of 247 pseudonymised cases were included from patients being treated during the period from January 2018 to December 2020. The case records were obtained from the archives of the University Hospital Aachen, RWTH, Germany and consist of consist of pre- treatment plaster models with inclusion criteria described in the following: (i) Good condition of dental casts (not fractured or degraded), (ii) mixed or permanent dentition (without further differentiation with regard to e.g. tooth agenesis or tooth eruption) and (iii) Irrespective of crowding, malocclusion, and previous orthodontic treatment. Plaster models of insufficient quality or damaged models with incomplete teeth that could not be evaluated were not digitised and therefore not included in the digital database.

The plaster casts were also scanned with orthoX®scan (Dentaurum GmbH & Co. KG, Ispringen, Germany) in accordance with the manufacturer’s instructions. Three-dimensional virtual models were subjected to visual inspection for accuracy and, where necessary, scanning was conducted until high-quality models were acquired. Subsequently, the virtual models were imported into the software OnyxCeph3TM (Image Instruments GmbH, Chemnitz, Germany), which was used by one experienced investigator, familiarised with the software, for digital model analyses. Based on the studies of Coenen et al. [15, 16], the ‘Tooth Width Analysis Aachen’ patch for orthodontic model analysis was developed in collaboration with OnyxCeph3TM-3D for the further development and continuation of the digitalisation of orthodontic diagnostics, which is not yet freely accessible. The same plaster casts were subjected to manual analysis with a calliper (Dentaurum GmbH & Co. KG, reference number 042-751-00). A patient case consists of one upper and one lower jaw. 247 pseudonymised cases result in a total of 494 individual jaws (247 upper and 247 lower jaw), which were measured both digitally and by hand as part of the orthodontic model analysis. According to the analogue evaluation method, only the permanent teeth were measured and evaluated, but not the deciduous teeth.

Dental cast analysis

In accordance with the definitions listed in the Table 1, the measurements and the corresponding computations represented are commonly used parameters and were used both for an analogue and digital manner. In the context of arch dimensions, the terms “anterior arch width,” “arch length,” and “arch perimeter” are used. The measurements were taken to the nearest 0.1 mm, except for overjet, which was recorded to the nearest 0.5 mm. As far as the digital measurement is concerned, the software automatically provides suggestions for reference points for all required measurements, except for the overbite and the lower posterior arch width. Nevertheless, it is necessary to review the reference points and adjust them if required (partly automated). The time required for the measurement of 50 digital (excluded the time for digitizing the plaster model) and 50 analogue cast analyses is documented.

Table 1 Detailed explanation of the measurements carried out with the necessary reference points for standardising the digital and analogue measurement. The measurements were only carried out on permanent teeth

Reliability

To assess the intra- and inter-rater reliability of digital and analogue methods, 50 randomly selected pretreatment casts were subjected to repeated digital measurements, while another 50 plaster models were remeasured analogously by the same examiner after an interval of two months. Also, a second blinded examiner was briefed with the definitions of required measurements and performed 50 digital cast analyses. The results of the repeated measurements were compared between different methods and examiners. Intra- and interclass correlation coefficient was calculated with the software Python (Version 3.9.12 and pingouin library).

Statistical analysis

The plaster models were considered as the gold standard. The digital and analogue model analyses were compared directly based on all measurement parameters. In addition, the differences were calculated by subtracting the analogue value from the digital value. For further analysis the statistics software GraphPad PRISM Version 10.0.1/2023 (GraphPad Software Inc., San Diego, California, USA) was utilised. In order to examine the digital and analogue values for significant differences, all groups were first tested for normal distribution using the Shapiro-Wilk test. The direct comparison of the digital and analogue values (Table 2) was carried out using the Wilcoxon matched-pairs signed rank test and the bland-altmann plot (Supp.-Fig. 1a-k). The measurement differences between the digital and analogue values with regard to the different tooth groups (Fig. 2A), measurement criteria (Fig. 2B) and the amount of space in dental arch (Fig. 3B) were determined using the Kruskal-Wallis test. The accessibility of the measuring points (Fig. 3A) was statistically tested using the Mann-Whitney U-test. The comparison of the time required (Fig. 3C) was carried out using the paired T-test. P < 0.05 was considered statistically significant.

Table 2 Comparison of the results of the digital and analogue measurement methods. The tooth widths of the maxilla and mandible of 16–26, 36–46, the tooth crown height of 12–22, 32–42, the overjet and overjet, the midline shift in the maxilla and mandible and in relation to each other, the arch dimensions with anterior and posterior dental arch width for the maxilla and mandible, dental arch length and circumference were measured. The results show statistical significance using the Wilcoxon matched-pairs signed rank test, but are not clinically relevant due to their negligible size (see difference digital vs. analogue)
Fig. 1
figure 1

Methodological diagram of the three-dimensional digital model measurement method. A: Occlusal representation of the segmentation of a maxillary model to identify the areas of a closed scan surface belonging to a dental crown. B: Reference points are assigned to the three-dimensional dental crowns, which form an individual coordinate system and are used in the following diagnostic modules. The reference points are checked in all 3 planes. C: The digital model analysis is then carried out with determination of tooth widths, crown height, overjet/overbite, midline shifts, arch dimensions (anterior and posterior arch width, arch length, arch circumference)

Fig. 2
figure 2

The measurement differences between the analogue and digital methods were examined in detail regarding the different tooth groups and different measurement criteria. A: Differences between digital and analogue measurements [mm] for the different groups of teeth. The premolars show the greatest deviation and the canines the smallest; molars IQR: 0.10, median: -0.10; premolar IQR: 0.00, median:-0.10; canines IQR: 0.00, median: -0.10; incisors IQR: -0.10, median: 0.00 B: A comparison of the observed absolute difference between the digital and analogue methods for tooth width (IQR: -0.10, median: 0.00), crown height (IQR: 0.40, median: 0.20), overjet (IQR: 0.00, median: 0.00) and overbite (IQR: 0.50, median: 0.00) and arch dimensions (IQR: 0.30, median: 0.10). The largest measurement difference can be seen in the arch dimensions, while the smallest measurement difference is in the tooth widths

Fig. 3
figure 3

Differences between the digital and analogue measuring methods regarding the accessibility of the measuring points and the time required. A, B: Comparison of the deviation between digital and analogue tooth width measurement for teeth with easy access to the reference points (IQR: 0.10, median: 0.00) and close proximal contact to the adjacent teeth (IQR: 0.10, median: 0.00). The teeth with freely accessible measuring points show a smaller measurement difference between the digital and analogue measuring methods compared to the teeth with difficult-to-access reference points; upper crowding IQR: 0.143, median: -0.067; upper spacing IQR: 0.10, median: -0.042; lower crowding IQR: 0.125, median: -0.058; lower spacing IQR: -0.066, median: 0.042 C: Comparison of the time required [sec] between the digital and analogue measurement methods for the tooth width measurement (manual IQR: 157.50, median: 499.00; digital IQR: 39.70, median: 308.00) and the complete model analysis (manual IQR: 236.50, median: 889.00; digital IQR: 64.00, median: 513.55)

Results

247 pseudonymised patient cases from the study center Clinic for Orthodontics of the University Hospital RWTH Aachen, 129 female and 118 male patients with an average age of 16.67 +- 9.49 years with mixed or permanent dentition, were included in this retrospective cohort study and examined using a digital, semi-automated orthodontic model analysis with the OnyxCeph software. The digital measurements were compared with the gold standard of analog model analysis.

The different methods agree to within 0.3 mm for direct measurements and to within 0.66 for computed space parameters in model analysis. The results of the digital measurement method could be recorded in a shorter time.

Power

Power analyses with α (one-tailed) = 0.05, effect size = 0.23 and power (1 - β) = 0.95 resulted in a minimum sample size of N = 206. Therefore, we included a total of 247 pseudonymised cases from patients being treated at the University Hospital Aachen, RWTH.

Reliability

For digital measurements, an intraclass correlation coefficient (ICC) between 0.996 and 0.9995 was calculated for the different measurements. For repeated measurements performed on plaster models, the results varied between 0.991 and 0.9997. The agreement of the digital measurements by the second examiner was rated from 0.946 to 0.997.

Accuracy of measuring tooth size

A total of 5497 digitally (partly automatically) and analogously measured tooth widths from 247 cases were compared. Table 2 provides an overview of the digital and analogue measurement results for the tooth widths in the upper (teeth 16–26) and lower jaw (teeth 36–46), the Bolton ratio AR and OR, the crown height of teeth 12–22, 32–42, the overjet and overbite, the midline shift in the upper and lower jaw and the deviation from each other, the arch dimensions with anterior and posterior arch width, the arch length and circumference in the upper and lower jaw. There was an absolute average difference of 0.053 mm (+-0.025 mm) between the digital and analogue tooth width measurements. Thus, the digital method seems to provide larger values of about 0.023 mm on average. A Wilcoxon matched-pairs signed rank test was conducted, which revealed statistically significant differences between the values obtained using the two methods. The mean absolute and percentage differences for teeth 16–26 in the maxilla and 36–46 in the mandible are shown in Fig. 4A, B. Furthermore, the differences that occurred were analysed and specified further by tooth type regarding molars, premolars, canines and incisors (Fig. 2A) and based on the different measurement criteria (facewidth, crown height, overjet/overbite, arch dimensions) (Fig. 2B). It was found that premolars show a higher deviation between the measurement on the digital model and the plaster model than molars, canines or incisors (Fig. 2A). The largest measurement difference can be seen in the arch dimensions, while the smallest measurement difference is in the tooth widths (Fig. 2B).

Fig. 4
figure 4

Differences between the digital and analogue measuring methods regarding the tooth groups, the accessibility of the measuring points and the time required. A: The average absolute (left y-axis) and percentage (right y-axis) measurement difference between the digital and analogue measurement methods in relation to the upper jaw with teeth 16–27 and B: in relation to the lower jaw with teeth 36–46, with the premolars showing the greatest deviation in comparison

Bolton ratio

Based on our inclusion criteria for model selection and mixed dentition models or tooth agenesis, Bolton’s Analysis was performed using each method, and the values were examined to see if the values differed (Table 2). AR could be determined for 211 patients and OR for 167 of the 247 patients. The mean absolute difference was relative 0.57 for AR and 0.54 for OR. When the calculated Bolton ratios of all patients were compared between the digital and analogue methods, significant differences were found for AR. However, there was no significant difference for OR.

Comparison of measuring techniques

Bland-Altman plots were constructed for the values obtained by both methods (Supp.-Fig. 1a-k). These plots contain ‘limits of agreement’ that define a range that covers 95% of the differences between the values obtained by different methods or repeated measurements. The mean differences and limits of agreement resulting from the comparison of the two measurement methods (caliper and software) are shown in Supp.-Fig. 1a-k. The mean difference between all individual tooth measurements (tooth width, crown height) is + 0.05 mm for tooth width with 95% limits of agreement of + 0.31 and − 0.22 mm and − 0.12 mm for crown height with 95% limits of agreement of + 0.61 and − 0.84 mm. The measurements of inter- and intramaxillary individual distances (overjet, overbite, midline deviation, arch width, arch length, arch girth) showed the smallest mean difference of -0.03 mm with 95% limits of agreement of + 0.77 mm and − 0.83 mm for the midline deviation and the largest mean difference of -0.25 mm with 95% limits of agreement of + 0.41 mm and − 0.90 mm for the arch girth. The cumulative measurements showed a mean difference of -0.22 mm for the anterior Bolton ratio with greater 95% limits of agreement than the single tooth measurements and individual distances of + 1.63 mm and − 2.06 mm. The total Bolton ratio showed a mean difference of -0.25 mm with 95% limits of agreement of + 1.64 mm and − 1.63 mm.

Risk factors increasing the deviation of measurements

When comparing digital and analogue measurements, risk factors that can lead to increased measurement differences must be considered. As shown in Fig. 4A, crowding in an arch appeared to significantly increase the difference between digital and analogue measurements of tooth width in the maxilla and mandible. Crowding was defined as having a negative result when performing the spacing analysis. All other cases with a positive result were included in the spacing group. The group with present crowding in the maxilla or mandible shows an increased mean difference of 0.076 mm, compared to 0.044 mm in the maxilla and 0.057 mm in the mandible with spacing. In addition, an individual evaluation of each tooth was also documented. The observed differences between the two measurement methods were compared between two groups according to access to the reference point and the results are shown in Fig. 4B. Teeth with tight approximal contact showed a significantly higher deviation between the two measurement methods, averaging approximately 0.059 mm, than teeth with freely accessible reference points, averaging approximately 0.027 mm.

Time saving

A comparison was conducted between the required time for the digital and analogue methods, separately for tooth width measurement and complete model analysis. The results are provided in Fig. 4C. The mean time required for the measurement of tooth widths 16–26 and 36–46 on plaster models was 8:36 min. A significant reduction in the measurement time was observed in comparison with digital determination, with a reduction to 5:09 min. Consequently, a time saving of 3:27 min was achieved. The required time for the complete analysis of the plaster models was also analysed. This included the measurement of tooth widths and heights (12–22 and 32–42), as well as anterior and posterior arch widths, arch lengths, perimeter, overjet and overbite were considered. The digital model analysis was completed in a mean time of 8:33 min, whereas the manual analysis of plaster models required a mean time of 14:48 min. Consequently, a reduction in the required time of 6:15 min was demonstrated. The observed differences in required time were statistically significant.

Discussion

In this study, we investigated the accuracy, reliability and efficiency of digital model analysis based on virtual models digitised with the model scanner orthoXscan (Dentaurum GmbH & Co. KG, Ispringen, Germany) and analysed with the software OnyxCeph3TM (Image Instruments GmbH, Chemnitz, Germany), in comparison to conventional manual model analysis on plaster models and a digital calliper. Our findings indicate that partly automated digital cast analysis is a timesaving and therefore highly efficient process and represents an accurate and reliable alternative to traditional manual cast analysis, which are of particular interest to orthodontists.

A comparison of the measurement of tooth width between digital and plaster models revealed that the partially automated digital method delivered by an amount of 0.023 mm resulted in increased values according to Kardach et al. [17]. Studies have shown varying results in measuring tooth width, with some finding no statistical difference but a slight decrease in digital values up to 0.023mm [18]. Regardless of the statistical significance observed in the discrepancy between analogue and digital methods in measuring tooth width, the results obtained with the Bolton analysis demonstrated no significant differences in terms of OR [18]. In contrast to our results, Lo Giudice et al. found no significant differences in OR [19]. Conversely, the differences in AR were not significant, which is not consistent with our results and those of Kim et al. [20]. The lack of discrepancies in the overall Bolton analysis can be attributed to the assumption of a consistent bias in tooth width. After accounting for this assumption, the proportional value of OR effectively compensates for the differences in single tooth width [11, 17, 18]. In some cases, especially in the measurement of crown height, the digital measurement process proved to be challenging due to the occurrence of boundary blurring during the reconstruction of attached gingiva in the segmentation process. This resulted in difficulties in accurately defining the cervical reference point. Consequently, this can result in a higher absolute mean difference of 0.241 mm regarding to the crown height compared to Keating et al.´s (0.1 mm) and Camardella et al.´s (0.17 mm) findings [21, 22]. In this case, other measuring tool as the “calliper measurement’’ could be more accurate to measure the crown height of these teeth. Furthermore, is also important to note that the measurement ranges for tooth width and height are similar, but that for crown height, there is a nearly four times higher deviation. Although the discrepancy is below the 0.3 mm threshold [5], it is not clinically relevant. In our study the arch dimensions showed measurements with a significantly decreased difference of 0.297 mm. In disagreement to the literature, increased values for arch widths and arch perimeter were found [23]. This result can be attributed to potential misinterpretations during the selection of reference points (centre of the cusp), particularly in cases where teeth exhibit attrition on the reference cusp. The estimation of the arch perimeter in the context of space analysis represents a key aspect of the diagnostic and planning processes employed in orthodontics. Arch perimeter estimation, such as intermolar and intercanine widths, displayed significant differences between plaster and digitised models. The clinical relevance of the detected results was questioned by the authors [23]. This type of measurement revealed in our study the greatest absolute difference between analogue and digital measurements. It is important to note that the results should be interpreted in the context of the large distances that were analysed. Therefore the percentage difference is the smallest of all the measurements that were conducted, and it is evident that the results are not clinically significant when compared to the literature [23]. In contrast to other studies, the measurement of overjet was found to be statistically different in our study [11, 18]. Similarly, the measurement of the overbite differed with statistical significance in other studies, amounting to 0.3 mm or 0.49 mm, respectively [11, 18]. In contrast, our determined mean difference was much smaller, with 0.236 mm for the overbite. In the aforementioned studies [11, 18], the discrepancy was deemed to be of no clinical significance despite statistical significance. This may be indicative of the similarly insignificant discrepancy observed in our study.

In conclusion, this prompts a fundamental debate about the clinical significance of statistical significance. A perusal of Table 2 reveals that all the observed differences in measurements are statistically significant. However, it is not immediately evident whether these differences are automatically clinically relevant for orthodontists. To address this fundamental question, a review of the relevant literature was conducted. It was recommended by Bishara et al. that a maximum difference of 0.2 mm should be allowed for repeated measurements carried out by a single examiner. The discrepancy in measurements for tooth width and overjet between the digital and plaster methods in our study was below the threshold levels required for repeated measurements with the same method, as previously stipulated [24, 25]. Another author has stated that for a digital model in orthodontics, 0.3 mm is set as the threshold of required accuracy [5]. Deviation for non-calculated measurements of up to 0.5 mm is considered to be clinically irrelevant in the literature [11, 26]. All the differences in the measurements presented in Table 2 are below this value, thus satisfying the requirements. When analysing the measurement results, it can be assumed that the statistically significant but at the same time clinically irrelevant measured values should not significantly influence the treatment decision. It may be necessary to make an exception for the crown height. As previously stated, there are discrepancies between the findings of different studies, which raises the question of what factors are responsible for the observed discrepancies in study results.

When analysing the Bland-Altman plot, it should be noted that both limits of agreement increase when measuring individual distances (with the exception of arch width), and even more so when cumulative measurements are made. Whether this variability is clinically acceptable cannot be easily answered. The tolerance limit of ± 0.5 mm postulated in the literature [11] suggests that the non calculated measurements of tooth and arch widths are clinically acceptable. At the same time, it should be noted that more complex measurements (arch length or girth) and cumulative measurements were not included in the above study and therefore the tolerance limit of ± 0.5 mm may not be applicable. Our results suggest a complexity-dependent proportionality. The more complex and extensive the measured structure (from individual teeth to dental arches to commutative groups of teeth), the greater the percentage deviations. This suggests a proportional relationship in which measurement uncertainty increases with the size and complexity of the measured structure. In general, it can be said that the mean of difference/distortion is low and the relatively small range of the limits of agreement in relation to the complexity of the measurements indicates a low variability of the differences.

As described above, there are inconsistencies between different investigations, which leads to the question what factors are causing differences between study results.

One of the factors that can affect the accuracy of a measurement is the level of experience of the observer. In the context of our study, the rater was experienced, familiar with the software being used, and completed all measurements in the allotted time of two months. The research group has many years of experience in digital measurement with the OnyxCeph software, due to regular internal training by the developers of the software program itself, previous digital studies and prior research results, the daily use of digital analysis of patient cases in clinical orthodontic practice and to support, continuously optimise and further develop the software tool. It can be observed that observers with more experience tend to produce more consistent readings when undertaking repeated measurements, although this may not always be the case [27]. However, the results of repeated measurements can also vary, even if performed by the same observer. This is particularly true when the interval between repeated measurements is extended [28]. Furthermore, it is important to acknowledge that the process of marking reference points in a three-dimensional model on a two-dimensional computer screen may present certain challenges [29]. This is due to the fact that points may be evaluated differently depending on the adjusted point of view, which can be particularly problematic when working with complex models. The challenge lies in identifying an identical reference point in both analogue and digital forms [6, 30].

In the present study, teeth exhibiting close approximate contact with neighbour teeth and the presence of crowding demonstrated an elevated discrepancy. Regarding the issue of point positioning, callipers are unable to attain the maximum mesiodistal diameter of teeth, especially in the presence of crowding. Moreover, impression materials are unable to accurately reproduce the space between crowded teeth in plaster casts [31]. In contrast, software solutions for digital model analysis offer a range of functions, including zooming and view rotation particularly at proximal contacts, even in the presence of crowding. Conversely, one source for bias in this context can be the poor resolution of the approximate area between teeth in the plaster model, in which the software is making a proposal for the missing information delineating the merging teeth surfaces of adjacent teeth. This approach may diverge from that of the examiner when the calliper is applied. A review of the literature reveals that there are discrepancies between analog and digital measurements, which are attributed to varying degrees of crowding [19, 32, 33]. The authors also propose that these discrepancies have clinical implications [33]. Furthermore, it would be beneficial to discuss whether the selection of traditional plaster models can be considered the gold standard and using an intraoral scanner by a trained user may avoid this bias.

In the analysis of the time required to perform the measurements, a significant reduction was observed using the digital method, which is consistent with other studies [8,9,10]. Looking at the significant results of timekeeping that have been studied for more than 10 years, the technology has improved considerably. Today, the software places the reference points automatically and accurately, and only needs to be checked and adjusted by the user, which can explain another time saving in our study. The first step in the process chain, which includes the traditional impression or direct digital intraoral scanning, was not studied here. Recent studies on chairside time shows a differentiated picture of the time efficiency of digital workflows, which is determined by complex interactions between technology maturity, user experience, and clinical application context, which reveal situation-dependent delay potentials [34,35,36]. Again, the literature is clear on the benefits and efficiency of digital intraoral scanning, which should be kept in mind [37,38,39].

The comparison of measurement methods on plaster models represents a limitation in terms of bias, as the use of intraoral scans only would allow a more accurate analysis. For users with non-specialised digital equipment, the results may be of limited value.

Conclusion

In conclusion, the present study of orthodontic model analysis using plaster casts and a calliper and virtual models obtained with orthoX®scan and analysed with OnyxCeph3TM shows that partially automated digital cast analysis is an accurate, highly efficient and time-saving alternative to traditional manual cast analysis. The methods agree to within 0.3 mm for direct measurements and to within 0.66 for calculated space parameters in model analysis. Caution should be exercised with some calculated values due to bias associated with the assumption of cumulative error. With the exception of crown height, the measurement differences are not clinically relevant. Improvements in measurement accuracy and automatic reference point detection are expected as the technology develops. It is therefore necessary and appropriate for clinicians to gain further experience with digital systems. The further transition to automated and digital methods will be of increasing interest in future clinical orthodontics.

Data availability

The data underlying this article will be shared on reasonable request to the corresponding author.

References

  1. Ehrenfeld M, Sander FG, Schwenzer N, Janson I, Sander C, editors. Kieferorthopädie. 2., neu erstellte u. erw. Aufl. Stuttgart New York: Thieme; 2011.

  2. Lippold C, Kirschneck C, Schreiber K, Abukiress S, Tahvildari A, Moiseenko T, et al. Methodological accuracy of digital and manual model analysis in orthodontics - A retrospective clinical study. Comput Biol Med. 2015;62:103–9.

    Article  PubMed  Google Scholar 

  3. Rossini G, Parrini S, Castroflorio T, Deregibus A, Debernardi CL. Diagnostic accuracy and measurement sensitivity of digital models for orthodontic purposes: A systematic review. Am J Orthod Dentofac Orthop. 2016;149:161–70.

    Article  Google Scholar 

  4. Fleming PS, Marinho V, Johal A. Orthodontic measurements on digital study models compared with plaster models: a systematic review. Orthod Craniofac Res. 2011;14:1–16.

    Article  PubMed  CAS  Google Scholar 

  5. Hirogaki Y, Sohmura T, Satoh H, Takahashi J, Takada K. Complete 3-D reconstruction of dental cast shape using perceptual grouping. IEEE Trans Med Imaging. 2001;20:1093–101.

    Article  PubMed  CAS  Google Scholar 

  6. Koretsi V, Tingelhoff L, Proff P, Kirschneck C. Intra-observer reliability and agreement of manual and digital orthodontic model analysis. Eur J Orthod. 2018;40:52–7.

    Article  PubMed  Google Scholar 

  7. Tomita Y, Uechi J, Konno M, Sasamoto S, Iijima M, Mizoguchi I. Accuracy of digital models generated by conventional impression/plaster-model methods and intraoral scanning. Dent Mater J. 2018;37:628–33.

    Article  PubMed  CAS  Google Scholar 

  8. Radeke J, von der Wense C, Lapatki BG. Comparison of orthodontic measurements on dental plaster casts and 3D scans. J Orofac Orthop. 2014;75:264–74.

    Article  PubMed  Google Scholar 

  9. Tomassetti JJ, Taloumis LJ, Denny JM, Fischer JR. A comparison of 3 computerized Bolton tooth-size analyses with a commonly used method. Angle Orthod. 2001;71:351–7.

    PubMed  CAS  Google Scholar 

  10. Mullen SR, Martin CA, Ngan P, Gladwin M. Accuracy of space analysis with emodels and plaster models. Am J Orthod Dentofac Orthop. 2007;132:346–52.

    Article  Google Scholar 

  11. Santoro M, Galkin S, Teredesai M, Nicolay OF, Cangialosi TJ. Comparison of measurements made on digital and plaster models. Am J Orthod Dentofac Orthop. 2003;124:101–5.

    Article  Google Scholar 

  12. Jabri MA, Wu S, Pan Y, Wang L. An overview on the veracity of intraoral digital scanning system and utilization of iTero scanner for analyzing orthodontic study models both In-Vivo and Ex-Vivo. Niger J Clin Pract. 2021;24:1–7.

    Article  PubMed  CAS  Google Scholar 

  13. Peluso MJ, Josell SD, Levine SW, Lorei BJ. Digital models: an introduction. Semin Orthod. 2004;10:226–38.

    Article  Google Scholar 

  14. Faul F, Erdfelder E, Lang A-G, Buchner A. G*Power 3.1.7: A flexible statistical power analysis program for the social, behavioral and biomedical sciences. Beh Res Meth S. 2013;39:175–91.

    Article  Google Scholar 

  15. Coenen FA, Bartz JR, Niederau C, Craveiro RB, Knaup I, Wolf M. Orthodontic treatment quality evaluated by partially automated digital IOTN and PAR index determination: a retrospective multicentre study. Eur J Orthod. 2023;45:308–16.

    Article  PubMed  Google Scholar 

  16. Vorloeper J, Coenen FA, Lang NA, Niederau C, Knaup I, Craveiro RB, et al. Digital analyses of Bolton tooth size ratios and their association to gender, angle class, and other occlusal traits: a study using a partially automated digital 3D model analysis. Eur J Orthod. 2024;46:cjae046.

    Article  PubMed  Google Scholar 

  17. Kardach H, Szponar-Żurowska A, Biedziak B. A comparison of teeth measurements on plaster and digital models. J Clin Med. 2023;12:943.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Stevens DR, Flores-Mir C, Nebbe B, Raboud DW, Heo G, Major PW. Validity, reliability, and reproducibility of plaster vs digital study models: comparison of peer assessment rating and Bolton analysis and their constituent measurements. Am J Orthod Dentofac Orthop. 2006;129:794–803.

    Article  Google Scholar 

  19. Lo Giudice A, Muraglie S, Martina S, Ronsivalle V, Leonardi R. Accuracy and Reliability of Space Analysis Measurements in Digital Models with Different Degrees of Crowding. [cited 2025 Mar 1]; Available from: https://opendentistryjournal.com/VOLUME/13/PAGE/505/FULLTEXT/

  20. Kim J, Lagravére MO. Accuracy of Bolton analysis measured in laser scanned digital models compared with plaster models (gold standard) and cone-beam computer tomography images. Korean J Orthod. 2016;46:13–9.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Keating AP, Knox J, Bibb R, Zhurov AI. A comparison of plaster, digital and reconstructed study model accuracy. J Orthod. 2008;35:191–201. discussion 175.

    Article  PubMed  Google Scholar 

  22. Camardella LT, Ongkosuwito EM, Penning EW, Kuijpers-Jagtman AM, Vilella OV, Breuning KH. Accuracy and reliability of measurements performed using two different software programs on digital models generated using laser and computed tomography plaster model scanners. Korean J Orthod. 2020;50:13–25.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Quimby ML, Vig KWL, Rashid RG, Firestone AR. The accuracy and reliability of measurements made on computer-based digital models. Angle Orthod. 2004;74:298–303.

    PubMed  Google Scholar 

  24. Bishara SE, Fernandez Garcia A, Jakobsen JR, Fahl JA. Mesiodistal crown dimensions in Mexico and the united States. Angle Orthod. 1986;56:315–23.

    PubMed  CAS  Google Scholar 

  25. Bishara SE, Jakobsen JR, Abdallah EM, Fernandez Garcia A. Comparisons of mesiodistal and buccolingual crown dimensions of the permanent teeth in three populations from Egypt, Mexico, and the united States. Am J Orthod Dentofac Orthop. 1989;96:416–22.

    Article  CAS  Google Scholar 

  26. Casko JS, Vaden JL, Kokich VG, Damone J, James RD, Cangialosi TJ, et al. Objective grading system for dental casts and panoramic radiographs. American board of orthodontics. Am J Orthod Dentofac Orthop. 1998;114:589–99.

    Article  CAS  Google Scholar 

  27. Gesch D, Schröder W, Kocher T, Kirbschus A. Examiner differences in the assessment of different malocclusions. J Orofac Orthop. 2006;67:404–13.

    Article  PubMed  Google Scholar 

  28. Kieser JA, Groeneveld HT, McKee J, Cameron N. Measurement error in human dental mensuration. Ann Hum Biol. 1990;17:523–8.

    Article  PubMed  CAS  Google Scholar 

  29. Zilberman O, Huggare JAV, Parikakis KA. Evaluation of the validity of tooth size and arch width measurements using conventional and three-dimensional virtual orthodontic models. Angle Orthod. 2003;73:301–6.

    PubMed  Google Scholar 

  30. Costalos PA, Sarraf K, Cangialosi TJ, Efstratiadis S. Evaluation of the accuracy of digital model analysis for the American board of orthodontics objective grading system for dental casts. Am J Orthod Dentofac Orthop. 2005;128:624–9.

    Article  Google Scholar 

  31. Harris EF, Smith RN. Accounting for measurement error: a critical but often overlooked process. Arch Oral Biol. 2009;54(Suppl 1):S107–117.

    Article  PubMed  Google Scholar 

  32. Liang Y-M, Rutchakitprakarn L, Kuang S-H, Wu T-Y. Comparing the reliability and accuracy of clinical measurements using plaster model and the digital model system based on crowding severity. J Chin Med Assoc. 2018;81:842–7.

    Article  PubMed  Google Scholar 

  33. Yoon JH, Yu H-S, Choi Y, Choi T-H, Choi S-H, Cha J-Y. Model analysis of digital models in moderate to severe crowding: in vivo validation and clinical application. Biomed Res Int. 2018;2018:8414605.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Glisic O, Hoejbjerre L, Sonnesen L. A comparison of patient experience, chair-side time, accuracy of dental arch measurements and costs of acquisition of dental models. Angle Orthod. 2019;89:868–75.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Burhardt L, Livas C, Kerdijk W, van der Meer WJ, Ren Y. Treatment comfort, time perception, and preference for conventional and digital impression techniques: A comparative study in young patients. Am J Orthod Dentofac Orthop. 2016;150:261–7.

    Article  Google Scholar 

  36. Lee SJ, Jamjoom FZ, Le T, Radics A, Gallucci GO. A clinical study comparing digital scanning and conventional impression making for implant-supported prostheses: A crossover clinical trial. J Prosthet Dent. 2022;128:42–8.

    Article  PubMed  Google Scholar 

  37. Mangano A, Beretta M, Luongo G, Mangano C, Mangano F. Conventional vs digital impressions: acceptability, treatment comfort and stress among young orthodontic patients. Open Dent J. 2018;12:118–24.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Gjelvold B, Chrcanovic BR, Korduner E-K, Collin-Bagewitz I, Kisch J. Intraoral digital impression technique compared to conventional impression technique. A randomized clinical trial. J Prosthodont. 2016;25:282–7.

    Article  PubMed  Google Scholar 

  39. Jánosi KM, Cerghizan D, Mártha KI, Elekes É, Szakács B, Elekes Z, et al. Evaluation of intraoral Full-Arch scan versus conventional preliminary impression. J Clin Med. 2023;12:5508.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

The authors thank the medical faculty of the RWTH Aachen University for providing the required data and supporting this study. The authors thank OnyxCeph3TM for supplying the measurement software.

Funding

Open Access funding enabled and organized by Projekt DEAL.

No funding was received for this study.

Author information

Authors and Affiliations

Authors

Contributions

F.A.L. Conceptualization, Data curation, Formal analysis, Writing—original draft, Writing—review & editing, N.A.L. Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Visualization, Writing— review & editing, J.V. Data curation, Writing— review & editing, C.N. Investigation, Methodology, Visualization, Writing— review & editing, R.B.C. Investigation, Methodology, Project administration, Supervision, Visualization, Writing—review & editing, I.K. Investigation, Methodology, Supervision, Writing— review & editing, and Michael Wolf Conceptualization, Methodology, Project administration, Supervision, Writing—review & editing.

Corresponding author

Correspondence to Franziska A. Lang.

Ethics declarations

Ethical approval

The study was approved by the Ethics Committee of The Medical Faculty of the RWTH Aachen (EK 232/20).

Consent to participate

Due to the retrospective study design and anonymization, informed consent of the patients was not necessary.

Informed consent

For this type of study, formal consent is not required.

Competing interests

The authors declare no competing interests.

Clinical trial number

Not applicable.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lang, F.A., Lang, N.A., Vorloeper, J. et al. Validation of a digital, partly automated three-dimensional cast analysis for evaluation of orthodontic treatment assessment. Head Face Med 21, 36 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13005-025-00515-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13005-025-00515-8

Keywords