Material and Methods: 102 cases were selected from intraoperative consultation procedures performed at our institution between 2012 and 2020. The algorithm was tested by five observers. The observers examined the smears and frozen sections without the algorithm, and then with the algorithm.
Results: The percentage change in the rate of correct diagnoses made by the four observers (O) increased after using the algorithm (O2: 8%, O3: 5%, O4: 8% and O5: 13%), but decreased for only one observer (O1) (5%). The most common error made by the four observers was `grading of glial tumors` (O1: 40%; O2: 23%; O4: 40% and O5: 27.5%), and this group of errors was mostly corrected by using the algorithm (O1: 33%; O2: 3.8%; O4: 23% and O5: 10%). For two observers (O2 and O5), a statistically significant change in diagnostic levels was observed after using the algorithm (p=0.024 and p=0.040; respectively). In addition, thanks to the use of the algorithm, a high degree of agreement was found between the observers` diagnoses (77.7%, p<0.001).
Conclusion: In the intraoperative consultation of central nervous system lesions, algorithms can help to increase the accuracy of the diagnosis and reduce interobserver variability. This study demonstrates that an algorithmic approach is an effective method for pathologists in intraoperative consultation procedures.
IC of central nervous system (CNS) lesions requires correlation of clinical, radiologic, and histologic data. Lesions that occur in patients at a specific site, with a specific radiologic pattern, and in a specific age group significantly narrow the list of differential diagnoses. This increases the possibility of making a precise diagnosis with the data obtained in the pre-analysis.
In order to make the best possible contribution to the surgical procedure, the necessary pre-analytical phase (clear, precise and effective communication with the neurosurgeon, obtaining information about the patient`s clinical, radiological and intraoperative findings and submitting the correct and sufficient sample to the laboratory under appropriate conditions) and the analytical phase (assessment of the adequacy of the tissue, correct preparation of the smear – frozen sections and accurate microscopic examination) must have been carried out. All information about the importance of each of these steps and how they should be performed can be found in the previous study[1].
The best way to make the correct diagnosis during the IC procedure is to first determine the tumor type. For this purpose, it is important to look at the tumor background, especially in smear slides, and try to identify one of the four main tumor types (glial, mesenchymal, neuronal and epithelial) based on certain characteristics[1]. To perform the correct microscopic examination in the analytical phase, some previous studies have shown that smear preparations are important for determining cytologic features and frozen sections are important for determining structural features and that these two methods should be used together[1-5].
The accurate examination of smears and frozen sections prepared in the analytical phase and reaching the correct differential diagnosis may be possible by using an algorithm that helps the pathologist to combine all relevant information. To this end, in a previous study, we proposed an algorithm that the pathologist interprets by systematically evaluating the microscopic features with a series of standard questions (Figure)[1].
Figure 1: The algorithm of intraoperative consultation specimens for CNS tumors[1].
In the post-analytic phase, the crucial role of the surgical pathologist is to report the IC diagnosis accurately and promptly.
The aim of this study is to test the `algorithm` developed to contribute to an accurate microscopic examination in the analytical phase. It also aims to determine whether it is useful in increasing both the rate of correct diagnoses and consistency between observers among pathologists working independently and in different environments.
In the steps of the algorithm, the levels were defined as follows: Level 0: Only if the lesion can be defined as `neoplastic`; Level 1: If the tumor type can be determined; Level 2-3: If the tumor grade can be determined based on cytological + structural features, and Level 4: If a differential diagnosis list can be established (specific diagnosis with clinical and radiological findings)[1].
To test the effectiveness of the algorithm, first without the algorithm, then with the algorithm: a) the rates of correct diagnoses, b) the rates of incorrect diagnoses, c) diagnostic levels were determined. In addition, the agreement between the observers in the diagnoses obtained without and with the help of the algorithm was evaluated.
Statistical Analysis
The statistical analyses were performed with the SPSS 23.0
package program. In the applications with and without
the algorithm, the descriptive statistics are presented with
numbers and percentages. The agreements between the observers
are indicated with the intraclass correlation coefficient
and confidence intervals. The McNemar-Bowker test
was used to analyze the differences between the ratios of the
categorical variables in the addicted groups. P-values of less
than 0.05 were considered statistically significant.
The diagnostic level, misdiagnosis rate and accurate diagnosis rate of each observer in the smears and frozen sections of the 102 cases that they examined first without the algorithm and then with the algorithm are shown in (Table I).
The correct diagnosis rates of the observers before and after using the algorithm are as follows: O1: from 82% to 77%, O2: from 75% to 83%, O3: from 73% to 78%, O4: from 69% to % to 77 and O5: from 58% to 71%. (Table II)
Table II: The correct diagnosis rates of the observers before and after using the algorithm.
The rate of `correct diagnosis` increased for four observers, with a maximum of 13% (rates of increase: O2: 8%, O3: 5%, O4: 8% and O5: 13%). The rate of `correct diagnosis` of only one observer (O1) decreased with the use of the algorithm (decrease of 5% from 82% to 77%).
For only one observer (O3), the rate of `defer to permanent sections` decreased from 4% to 2% with the algorithm. There were no changes for the others.
Grading of glial tumors was the most frequent error in four observers (errors before the algorithm: O1: 40%; O2: 23%; O4: 40% and O5: 27.5%); the rate of correct diagnosis with the algorithm was also highest in this group (grading of glial tumors) (errors after the algorithm: O1: 33%; O2: 3.8%; O4: 23% and O5: 10%). O3, who mainly misdiagnosed metastatic tumors without using the algorithm, had the highest rate of correct diagnoses in this group with the help of the algorithm (error: from 33% to 12.5%).
Other tumors that were misdiagnosed were metastatic tumors, ependymomas or meningiomas.
Six cases were misdiagnosed by all observers: 2 were carcinoma metastases, 1 glioblastoma, 1 ependymoma with anaplastic areas, 1 neuroglial tissue (non-tumoral) and 1 lymphoma.
The diagnostic levels that the observers were able to achieve during the intraoperative consultations are shown in (Table III, IV, V, VI, VII). For cases such as metastases, lymphomas, melanomas, etc., diagnostic level 2-3 were not applicable. In these cases, observers were expected to make a diagnosis at level 0, level 1 or level 4. According to the algorithm, four observers showed a decrease in level 0 and 1 diagnoses (decrease: O2: 14.2%, O3: 14.2%, O4: 1.9% and O5: 8.8%), while there was an increase in level 2-3 and 4 diagnoses (increase: O2: 14.2%, O3: 15.6%, O4: 1.9% and O5: 8.8%). In contrast, only one observer (O1) showed an increase in diagnoses at levels 0 and 1 after using the algorithm, while diagnoses at levels 2-3 and 4 decreased. A statistically significant change was observed in O2 and O5 after the algorithm (p=0.024 and p=0.040, respectively).
Table IV: Diagnostic levels that could be reached by Observer-2 during intraoperative consultations.
Table V: Diagnostic levels that could be reached by Observer-3 during intraoperative consultations.
Table VI: Diagnostic levels that could be reached by Observer-4 during intraoperative consultations.
It was observed that the interobserver agreement increased when the slides were examined using the algorithm (from 68.2% to 77.7%). This is because for all observers except O1, both the rate of correct diagnoses and the diagnostic levels increased after the algorithm. However, interobserver consistency remained at the same level. Because the observers made some errors despite using the algorithm. Some of these were errors that had already been identified during the initial examination and some were other errors. It was found that the algorithm increased the diagnostic accuracy rates in different tumor groups in different observers. This suggests that the algorithm contributed differently for observers with different levels of knowledge. The interobserver consistency was statistically significant both in the examination with and without the algorithm (Table VIII).
Modi et al. reported in their study that most discrepancies between IC and permanent diagnoses occurred in astrocytomas, ependymomas, metastatic tumors, meningiomas and rhabdoid tumors[13,14].
In the study by Plesec and Prayson, <3% mismatch was found in 2156 cases over a period of 8 years[7]. According to this study, approximately 80% of the inconsistent cases were spindle cell lesions, astrocytoma versus oligodendroglioma, lymphoma, reactive processes versus neoplastic processes, and tumor grading errors[7].
Geramizadeh et al. found that the most frequent inconsistency occurred in CNS tumors (14 cases). Ten inconsistent cases (71.3%) in the CNS were `spindle cell tumors`, in which a distinction must be made between meningiomas and schwannomas[15]. Another common discordance in this study was between low-grade gliomas and reactive gliosis (3 cases; 21.3%).
We also had similar problems in our study. Four of the five observers most frequently erred in the `grading of glial tumors` and second most often in metastatic tumors. Metastatic tumors were most frequently confused with highgrade glial tumors, followed by meningiomas. Meanwhile, ependymomas were most commonly confused with other glial tumors, followed by mesenchymal tumors. In addition, meningiomas were most commonly confused with schwannomas.
However, thanks to the algorithm, these four observers showed a significant improvement in the grading of glial tumors, the most common error they made (O1: from 40% to 33%; O2: from 23% to 3.8%; O4: from 40% to 23%; O5: from 27.5% to 10%). The other observer also achieved the best improvement in her most common error (metastatic tumors) (error: from 33% to 12.5%).
When investigating the six cases that were misdiagnosed by all observers, it was found that one of the two carcinoma metastases and one ependymoma case with anaplastic areas were also misdiagnosed by the reference author who diagnosed the cases and signed the original reports.
Some of the diagnostic errors were due to lack of experience, while a significant part was due to technical reasons. These were sampling features (small tissue, partial representation of lesion characteristics) and freezing artifacts that occurred during the IC process. These technical difficulties also made it difficult for observers to recognize the lesion. For this reason, in our previous article[1] we emphasized what needs to be done in the pre-analytical phase. Improvement in this phase is one of the most important steps to make the correct diagnosis. This is because even if the algorithm is perfect, errors in tissue preparation could prevent the algorithm from being used.
The algorithm used during the IC procedure provided a standard checklist to follow and helped pathologists consider options that they may not have initially considered during the differential diagnosis. Thanks to the interpretation of the background in the smears, it was possible to start with a narrower list of differential diagnoses. Additionally, the algorithm aided in better recognition of both low- and high-grade features within the same tumor type. In this way, a descriptive diagnosis/category was obtained, which is a more appropriate diagnosis to be made during the surgical procedure.
The algorithm enabled the observers to increase their diagnostic rates at the advanced levels (levels 2-3 and 4) and consequently empowered them, so that they could provide the neurosurgeon with more specific treatment recommendations.
The situation of O1differs from the others in every respect. The rate of correct diagnoses of O1 decreased with the usage of the algorithm. It turned out that this observer could not benefit from the algorithm for any error. In addition, there was an increase in level 0 and 1 diagnoses for this observer after the algorithm, while a decrease was observed for level 2-3 and 4 diagnoses. O1, who was not helped by the algorithm, was the observer with the highest rate of correct diagnoses without using the algorithm. In contrast, the observer who had the lowest rate of correct diagnoses without the algorithm (O5) had the highest rate of correct diagnoses after using the algorithm. That is, observers with less basic knowledge showed the best improvement, while observers with better basic knowledge benefited less from the algorithm. These results made us think that the effectiveness of this algorithm might vary among pathologists with different levels of knowledge. At the same time, these observers who lack neuropathological practice, even if they are better at first sight, must have the basic knowledge that we use in IC and that we try to explain step by step in the algorithm (how to recognize the background of tumors, cytological and structural features of each tumor type -glial, mesenchymal, neuronal and epithelial). Otherwise, it will not be possible to use the algorithm effectively. For this reason, we plan to create a guide that contains this basic information as well as the algorithm.
The most limiting factor of the study is the imbalance representation of tumor diversity. Since the majority of the material sent for IC consisted of glial tumors, the study results were noticeably affected, with the highest degree of error found in grading of glial tumors. However, these disease rates are still encountered in everyday working life and reflect our daily practice. In addition, another noteworthy limitation is the lack of opportunity for observers to contact the doctor and check the radiological findings. Providing this information in daily practice would increase accuracy rates.
In summary, the algorithm has the potential to assist surgical pathologists to perform IC of CNS tumors. Accordingly, it will be possible to reduce the rate of misdiagnosis that occurs during IC of CNS tumors. Therefore, the best contribution could be made to the patient`s prognosis by offering appropriate guidance on whether to proceed with or cease the operation.
Acknowledgement
This research received no specific grant from any funding agency in
the public, commercial, or not-for-profit sectors.
Ethics Committee Approval
This study was approved by the Karadeniz Technical University
Ethics Committee with protocol number 2021/282.
Conflict of Interest
The authors declare that they have no conflicts of interest.
Authorship Contributions
Concept: EC, Design: EC, IS, AL, Supervision: IS, Data collection
and/or processing: EC, GT, MMU, Analysis and/or interpretation:
EC, IS, Literature search: EC, ZSY, ACM, Writing: EC, Approval: EC,
IS.
1) Çakir E, Oran G, Yüksek GE, Ding C, Tihan T. Intraoperative
Consultations of Central Nervous System Tumors: A Review for
Practicing Pathologists and Testing of an Algorithmic Approach.
Turk Patoloji Derg. 2019;35(3):173-84. PMID: 31107540. DOI:
10.5146/tjpath.2018.01460.
2) Lee HS, Tihan T. The basics of intraoperative diagnosis in neuropathology.
Surg Pathol Clin. 2015;8(1):27-47. PMID: 25783820.
DOI: 10.1016/j.path.2014.10.006
3) Burger PC. Smears and İntraoperative consultations in Surgical
Neuropathology: A manual 1st ed. Baltimore/Maryland, USA; Pb
Medical Publishing; 2009.
4) Savargaonkar P, Farmer PM. Utility of intra-operative consultations
for the diagnosis of central nervous system lesions. Ann
Clin Lab Sci. 2001;31(2):133-9. PMID: 11337901.
5) Kleinschmidt-DeMasters BK, Prayson RA. An algorithmic
approach to the brain biopsy--part I. Arch Pathol
Lab Med. 2006;130(11):1630-8. PMID: 17076524. DOI:
10.5858/2006-130-1630-AAATTB.
6) Al-Ajmi R, Al-Kindi H, George M, Thomas K. Correlation of
Intraoperative İntraoperative consultation Report and Histopathological
Diagnosis of Central Nervous System Tumors - A
Six-Year Retrospective Study. Oman Med J. 2016;31(6):414-
20. PMID: 27974956; PMCID: PMC5099397. DOI: 10.5001/
omj.2016.84.
7) Plesec TP, Prayson RA. Intraoperative consultation discrepancy
in the evaluation of central nervous system tumors. Arch
Pathol Lab Med. 2007;131(10):1532-40. PMID: 17922589. DOI:
10.5858/2007-131-1532-ICDITE.
8) Chaturvedi S, Pant I, Dua R, Gupta S. Analyzing agreement
patterns of intraoperative central nervous system lesion reporting
according to type and grade. J Cytol. 2013;30(3):179-84.
PMID: 24130410; PMCID: PMC3793355. DOI: 10.4103/0970-
9371.117677.
9) Perry A, Scheithauer BW, Stafford SL, Lohse CM, Wollan
PC. `Malignancy` in meningiomas: a clinicopathologic
study of 116 patients, with grading implications. Cancer.
1999;85(9):2046-56. PMID: 10223247. DOI: 10.1002/(sici)1097-
0142(19990501)85:9<2046:aid-cncr23>3.0.co;2-m.
10) Roessler K, Dietrich W, Kitz K. High diagnostic accuracy of cytologic
smears of central nervous system tumors. A 15-year experience
based on 4,172 patients. Acta Cytol. 2002;46(4):667-74.
PMID: 12146028. DOI: 10.1159/000326973.
11) Taratuto AL, Sevlever G, Piccardo P. Clues and pitfalls in stereotactic
biopsy of the central nervous system. Arch Pathol Lab Med.
1991;115(6):596-602. PMID: 2039344.
12) Powell SZ. Intraoperative consultation, cytologic preparations,
and intraoperative consultation in the central nervous system.
Arch Pathol Lab Med. 2005;129(12):1635-52. PMID: 16329736.
DOI: 10.5858/2005-129-1635-ICCPAF.
13) Modi M, Nilkanthe R, Trivedi P, Shah A, Shah J, Chaudhari J.
Importance of intraoperative consultation for the diagnosis of
central nervous system lesions and evaluation of its diagnostic accuracy.
J Cancer Res Ther. 2018;14(6):1176-9. PMID: 30488825.
DOI: 10.4103/0973-1482.191034.