TY - JOUR
T1 - Interobserver variability and diagnostic performance in US assessment of thyroid nodule according to size
AU - Park, S. J.
AU - Park, S. H.
AU - Choi, Y. J.
AU - Kim, D. W.
AU - Son, E. J.
AU - Lee, H. S.
AU - Yoon, J. H.
AU - Kim, E. K.
AU - Moon, H. J.
AU - Kwak, J. Y.
PY - 2012
Y1 - 2012
N2 - Purpose: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. Materials and Methods: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size <5mm), group 2 (5mm nodule size <10mm), group 3 (10mm nodule size <20mm), and group 4 (nodule size 20mm). Three experienced (7-10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. Results: Composition represented substantial or good agreement (k=0.719-0.89). Margin showed the lowest agreement (k=0.322-0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k=0.674), group 2 (k=0.596), group 3 (k=0.674), and group 4 (k=0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value <0.05) and lowest in group 1.NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p<0.05). Conclusion: Interobserver agreements were relatively good (k=0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5mm in maximum diameter.
AB - Purpose: To evaluate the interobserver variability for US assessments of thyroid nodules and analyze the diagnostic performances of US assessments in thyroid nodules according to nodule size. Materials and Methods: This was an IRB-approved retrospective study with waiver of informed consent. A total of 400 surgically-confirmed thyroid nodules were included. Nodules were divided into 4 groups by size; group 1 (nodule size <5mm), group 2 (5mm nodule size <10mm), group 3 (10mm nodule size <20mm), and group 4 (nodule size 20mm). Three experienced (7-10 years) radiologists retrospectively reviewed the US images. Agreement of each US descriptor and final US assessment, and diagnostic performances were calculated in each group and compared. Results: Composition represented substantial or good agreement (k=0.719-0.89). Margin showed the lowest agreement (k=0.322-0.365). Individual kappa values for final assessment according to nodule size were as follows: group 1 (k=0.674), group 2 (k=0.596), group 3 (k=0.674), and group 4 (k=0.673). Specificity, PPV, and accuracy were significantly different among the groups with different size (p value <0.05) and lowest in group 1.NPV, specificity, PPV and accuracy except PPV of observer 3 increased with nodule size (p<0.05). Conclusion: Interobserver agreements were relatively good (k=0.637) in final US assessment regardless of nodule size in experienced radiologists. High false-positive rate was observed in US assessment in nodules less than 5mm in maximum diameter.
UR - http://www.scopus.com/inward/record.url?scp=84871643783&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84871643783&partnerID=8YFLogxK
U2 - 10.1055/s-0032-1325404
DO - 10.1055/s-0032-1325404
M3 - Article
C2 - 23108925
AN - SCOPUS:84871643783
SN - 0172-4614
VL - 33
SP - E186-E190
JO - Ultraschall in der Medizin
JF - Ultraschall in der Medizin
IS - 7
ER -