Annika Gutsche, Martin Metz, Melba Munoz, Kit Wong, Ted Omachi, Rui Zhao, Marcus Maurer, Vasiliki Zampeli, Markus Magerl
{"title":"Assessing the reliability of the FricTest® 4.0 for diagnosing symptomatic dermographism","authors":"Annika Gutsche, Martin Metz, Melba Munoz, Kit Wong, Ted Omachi, Rui Zhao, Marcus Maurer, Vasiliki Zampeli, Markus Magerl","doi":"10.1002/clt2.70005","DOIUrl":null,"url":null,"abstract":"<p>To the Editor,</p><p>Symptomatic dermographism (SD), a common subtype of chronic inducible urticaria (CIndU), involves transient, strip-shaped wheals that itch and burn when the skin is stroked or scratched.<span><sup>1</sup></span> SD affects ≥0.5% of the population,<span><sup>2</sup></span> yet despite its high frequency and marked impact on quality of life, diagnostic tools and treatment options are limited.<span><sup>1, 3</sup></span></p><p>Diagnosis is based on the patient's medical history and provocation testing.<span><sup>2</sup></span> Historically, provocation testing used a smooth, blunt object to stroke the skin, but variations in individuals' disease presentation highlighted the need for validated, reproducible tools.<span><sup>3</sup></span> The FricTest®4.0<span><sup>4</sup></span> is a hand-held, flat plastic comb-like tool with four smooth pins (3.0–4.5 mm long) firmly stroked along the skin. The resulting wheals determine the critical friction threshold (CFT), the shortest pin length/minimum pressure that elicits a positive wheal response.<span><sup>5</sup></span></p><p>This study aimed to assess the reliability (reproducibility and repeatability) of FricTest inter-rater agreement (results from two different raters on the same patient at the same visit) and intra-rater agreement (results from the same rater on the same patient 7–14 days apart). Reliable results are important for monitoring treatment effects, helping patients understand triggers, and improving management. We assume that each patient's left and right forearms are the same, that visits are the same, and that previous provocation did not affect the reaction of subsequent tests.</p><p>This single-center study was conducted at the Urticaria Center of Reference and Excellence<span><sup>6</sup></span> at the Charité Hospital, Berlin, Germany. Adults had SD for >6 weeks, had active SD at enrollment, and gave written informed consent. The study followed the Declaration of Helsinki principles, and the Berlin Charité Ethics Committee approved the protocol.</p><p>The primary endpoint, inter-rater agreement, was the intraclass correlation coefficient (ICC) between the CFT assessments of two raters within the same patient. The CFT scale is 0–4 (0 = no response, 4 = maximum response). ICCs plus upper and lower 95% confidence intervals (CI) were calculated using a mixed-effects linear model methodology.<span><sup>7</sup></span> Rater A and B agreement was quantified using weighted kappa across four categories: left forearm, right forearm, Visit 1, and Visit 2. An ICC<0.4 indicated poor reliability, and 0.6–0.8 indicated substantial reliability.</p><p>Two raters randomized to the order of assessments (Forearm 1 [right], Forearm 2 [left]) administered and recorded all FricTest results. At Visit 1, Rater A applied the FricTest to Forearm 1, covered the arm, and left the room. Rater B then applied the FricTest to Forearm 2, covered the arm, and left the room. Ten minutes after application one, Rater A returned, uncovered Forearm 1 and documented the reaction before re-covering the arm. Rater B repeated this process. Both raters repeated these procedures at Visit 2 after 7–14 days, using the opposite arm to Visit 1.</p><p>Among 16 participants (62.5% female), we observed substantial inter-rater agreement when Rater A and B assessed the same patient during the same visit, with a weighted kappa of 0.86 for the left and 0.77 for the right forearm (Table 1<b>,</b> Figure 1). When the two raters assessed the same patient at different visits, the agreement was moderate at Visit 1 (weighted kappa of 0.56) but improved to substantial at Visit 2 (0.79; Table 1).</p><p>When the same rater assessed one patient at different visits, we found substantial intra-rater agreement. Raters A and B reported weighted kappas of 0.72 and 0.73, indicating consistent and substantial agreement between Visits 1 and 2.</p><p>Overall, there was high consistency in ICC measurements, with 0.89 (95% CI 0.71, 0.94) for inter-rater reliability and 0.81 (95% CI 0.60, 0.92) for intra-rater reliability. At Visits 1 and 2, the average mean differences between Rater A and B were 0.06 and 0.13, respectively. Similarly, the average mean differences for the left and right forearms were 0.06 and 0.25, respectively, indicating high agreement essential for reliable measurements.</p><p>Our study validates the FricTest, demonstrating its reliability as a diagnostic and monitoring tool in SD. Measurement noise from repeated tests by the same or different raters is a well-recognized source of error in medical assessments.<span><sup>8</sup></span> Our results showed almost perfect inter-rater and substantial intra-rater agreement when inducing wheals in SD patients. Accurately determining a patient's CFT could enhance SD diagnosis and ultimately lead to better management of the patient's condition.</p><p>We observed changes in one patient that primarily drove variability between visits. Despite these fluctuations, the consistency with which each rater detected changes in disease activity confirms the robustness of our assessment tool in identifying improvements and deteriorations in SD.</p><p>Our study has limitations, mostly relating to its small size, duration of follow-up, and applicability to real-world conditions. Itch was documented at baseline to ensure the correct diagnosis; it could not be measured as a result because the intensity of itch cannot be differentiated in such tightly condensed strips. Further real-world evidence needs to confirm the FricTest's reproducibility over longer periods and clinical validity in busy settings, which differ from the controlled conditions within this study. While we strove to ensure comparable physical environments at both visits, we cannot rule out minor differences in the experimental environment nor exclude disease-activity modifying factors between the two visits, which may have caused response fluctuations.</p><p>The simple, compact and affordable FricTest should be utilized in clinical studies, especially those investigating potential CIndU treatments, and in primary care settings, where continuity of care can be challenging.</p><p><b>Annika Gutsche</b>: Methodology; writing—review & editing; writing—original draft; data curation. <b>Martin Metz</b>: writing—original draft; methodology; writing—review & editing; data curation. <b>Melba Munoz</b>: Writing—original draft; methodology; writing—review & editing; data curation. <b>Kit Wong</b>: Methodology; writing—review & editing; data curation. <b>Ted Omachi</b>: Data curation; methodology; writing—review & editing. <b>Rui Zhao</b>: Methodology; data curation; writing—review & editing. <b>Marcus Maurer</b>: Methodology; writing—original draft; writing—review & editing; data curation. <b>Vasiliki Zampeli</b>: Study protocol; study investigator; methodology; writing—review & editing; writing—original draft; data curation. <b>Markus Magerl</b>: Study protocol; study investigator; writing—original draft; methodology; writing—review & editing; data curation.</p><p><b>A. Gutsche</b> declares no conflict of interest<b>. M. Metz</b> is or recently was a speaker and/or consultant for Amgen, AstraZeneca, Argenx, Celldex, Celltrion, Escient, Jasper Therapeutics, Novartis, Pharvaris, Regeneron, Sanofi, and Third Harmonic Bio. <b>M. Munoz</b> is or recently was a speaker, advisor and/or received research funding from Jasper Therapeutics, Celldex Therapeutics, Takeda, GA<sup>2</sup>LEN, UNEV, Astra Zeneca and Roche outside the submitted work. <b>K. Wong</b> is a former employee of Genentech Inc. <b>T. Omachi</b> is an employee of Genentech Inc. <b>R. Zhao</b> is a former employee of Genentech/Roche. <b>M. Maurer</b> was a speaker and/or advisor for and/or has received research funds from Allakos, Amgen, Aralez, ArgenX, AstraZeneca, Celldex, Centogene, CSL, Behring, FAES, Genentech, GI Innovation, Innate Pharma, Kyowa Kirin, Leo Pharma Lilly, Menarini, Moxie, Novartis, Roche, Sanofi/Regeneron, Third Harmonic Bio, UCB, and Urich. <b>V. Zampeli</b> was a speaker and/or advisor for and/or has received research funding from Pharming, Takeda, and CSL Behring. <b>M. Magerl</b> is or recently was a speaker and/or advisor for and/or has received research funding from Biocryst, Pharming, Takeda, CSL Behring, Ionis Pharmaceuticals, KalVista, and Pharvaris.</p>","PeriodicalId":10334,"journal":{"name":"Clinical and Translational Allergy","volume":"14 11","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560339/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Translational Allergy","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/clt2.70005","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ALLERGY","Score":null,"Total":0}
引用次数: 0
Abstract
To the Editor,
Symptomatic dermographism (SD), a common subtype of chronic inducible urticaria (CIndU), involves transient, strip-shaped wheals that itch and burn when the skin is stroked or scratched.1 SD affects ≥0.5% of the population,2 yet despite its high frequency and marked impact on quality of life, diagnostic tools and treatment options are limited.1, 3
Diagnosis is based on the patient's medical history and provocation testing.2 Historically, provocation testing used a smooth, blunt object to stroke the skin, but variations in individuals' disease presentation highlighted the need for validated, reproducible tools.3 The FricTest®4.04 is a hand-held, flat plastic comb-like tool with four smooth pins (3.0–4.5 mm long) firmly stroked along the skin. The resulting wheals determine the critical friction threshold (CFT), the shortest pin length/minimum pressure that elicits a positive wheal response.5
This study aimed to assess the reliability (reproducibility and repeatability) of FricTest inter-rater agreement (results from two different raters on the same patient at the same visit) and intra-rater agreement (results from the same rater on the same patient 7–14 days apart). Reliable results are important for monitoring treatment effects, helping patients understand triggers, and improving management. We assume that each patient's left and right forearms are the same, that visits are the same, and that previous provocation did not affect the reaction of subsequent tests.
This single-center study was conducted at the Urticaria Center of Reference and Excellence6 at the Charité Hospital, Berlin, Germany. Adults had SD for >6 weeks, had active SD at enrollment, and gave written informed consent. The study followed the Declaration of Helsinki principles, and the Berlin Charité Ethics Committee approved the protocol.
The primary endpoint, inter-rater agreement, was the intraclass correlation coefficient (ICC) between the CFT assessments of two raters within the same patient. The CFT scale is 0–4 (0 = no response, 4 = maximum response). ICCs plus upper and lower 95% confidence intervals (CI) were calculated using a mixed-effects linear model methodology.7 Rater A and B agreement was quantified using weighted kappa across four categories: left forearm, right forearm, Visit 1, and Visit 2. An ICC<0.4 indicated poor reliability, and 0.6–0.8 indicated substantial reliability.
Two raters randomized to the order of assessments (Forearm 1 [right], Forearm 2 [left]) administered and recorded all FricTest results. At Visit 1, Rater A applied the FricTest to Forearm 1, covered the arm, and left the room. Rater B then applied the FricTest to Forearm 2, covered the arm, and left the room. Ten minutes after application one, Rater A returned, uncovered Forearm 1 and documented the reaction before re-covering the arm. Rater B repeated this process. Both raters repeated these procedures at Visit 2 after 7–14 days, using the opposite arm to Visit 1.
Among 16 participants (62.5% female), we observed substantial inter-rater agreement when Rater A and B assessed the same patient during the same visit, with a weighted kappa of 0.86 for the left and 0.77 for the right forearm (Table 1, Figure 1). When the two raters assessed the same patient at different visits, the agreement was moderate at Visit 1 (weighted kappa of 0.56) but improved to substantial at Visit 2 (0.79; Table 1).
When the same rater assessed one patient at different visits, we found substantial intra-rater agreement. Raters A and B reported weighted kappas of 0.72 and 0.73, indicating consistent and substantial agreement between Visits 1 and 2.
Overall, there was high consistency in ICC measurements, with 0.89 (95% CI 0.71, 0.94) for inter-rater reliability and 0.81 (95% CI 0.60, 0.92) for intra-rater reliability. At Visits 1 and 2, the average mean differences between Rater A and B were 0.06 and 0.13, respectively. Similarly, the average mean differences for the left and right forearms were 0.06 and 0.25, respectively, indicating high agreement essential for reliable measurements.
Our study validates the FricTest, demonstrating its reliability as a diagnostic and monitoring tool in SD. Measurement noise from repeated tests by the same or different raters is a well-recognized source of error in medical assessments.8 Our results showed almost perfect inter-rater and substantial intra-rater agreement when inducing wheals in SD patients. Accurately determining a patient's CFT could enhance SD diagnosis and ultimately lead to better management of the patient's condition.
We observed changes in one patient that primarily drove variability between visits. Despite these fluctuations, the consistency with which each rater detected changes in disease activity confirms the robustness of our assessment tool in identifying improvements and deteriorations in SD.
Our study has limitations, mostly relating to its small size, duration of follow-up, and applicability to real-world conditions. Itch was documented at baseline to ensure the correct diagnosis; it could not be measured as a result because the intensity of itch cannot be differentiated in such tightly condensed strips. Further real-world evidence needs to confirm the FricTest's reproducibility over longer periods and clinical validity in busy settings, which differ from the controlled conditions within this study. While we strove to ensure comparable physical environments at both visits, we cannot rule out minor differences in the experimental environment nor exclude disease-activity modifying factors between the two visits, which may have caused response fluctuations.
The simple, compact and affordable FricTest should be utilized in clinical studies, especially those investigating potential CIndU treatments, and in primary care settings, where continuity of care can be challenging.
Annika Gutsche: Methodology; writing—review & editing; writing—original draft; data curation. Martin Metz: writing—original draft; methodology; writing—review & editing; data curation. Melba Munoz: Writing—original draft; methodology; writing—review & editing; data curation. Kit Wong: Methodology; writing—review & editing; data curation. Ted Omachi: Data curation; methodology; writing—review & editing. Rui Zhao: Methodology; data curation; writing—review & editing. Marcus Maurer: Methodology; writing—original draft; writing—review & editing; data curation. Vasiliki Zampeli: Study protocol; study investigator; methodology; writing—review & editing; writing—original draft; data curation. Markus Magerl: Study protocol; study investigator; writing—original draft; methodology; writing—review & editing; data curation.
A. Gutsche declares no conflict of interest. M. Metz is or recently was a speaker and/or consultant for Amgen, AstraZeneca, Argenx, Celldex, Celltrion, Escient, Jasper Therapeutics, Novartis, Pharvaris, Regeneron, Sanofi, and Third Harmonic Bio. M. Munoz is or recently was a speaker, advisor and/or received research funding from Jasper Therapeutics, Celldex Therapeutics, Takeda, GA2LEN, UNEV, Astra Zeneca and Roche outside the submitted work. K. Wong is a former employee of Genentech Inc. T. Omachi is an employee of Genentech Inc. R. Zhao is a former employee of Genentech/Roche. M. Maurer was a speaker and/or advisor for and/or has received research funds from Allakos, Amgen, Aralez, ArgenX, AstraZeneca, Celldex, Centogene, CSL, Behring, FAES, Genentech, GI Innovation, Innate Pharma, Kyowa Kirin, Leo Pharma Lilly, Menarini, Moxie, Novartis, Roche, Sanofi/Regeneron, Third Harmonic Bio, UCB, and Urich. V. Zampeli was a speaker and/or advisor for and/or has received research funding from Pharming, Takeda, and CSL Behring. M. Magerl is or recently was a speaker and/or advisor for and/or has received research funding from Biocryst, Pharming, Takeda, CSL Behring, Ionis Pharmaceuticals, KalVista, and Pharvaris.
期刊介绍:
Clinical and Translational Allergy, one of several journals in the portfolio of the European Academy of Allergy and Clinical Immunology, provides a platform for the dissemination of allergy research and reviews, as well as EAACI position papers, task force reports and guidelines, amongst an international scientific audience.
Clinical and Translational Allergy accepts clinical and translational research in the following areas and other related topics: asthma, rhinitis, rhinosinusitis, drug hypersensitivity, allergic conjunctivitis, allergic skin diseases, atopic eczema, urticaria, angioedema, venom hypersensitivity, anaphylaxis, food allergy, immunotherapy, immune modulators and biologics, animal models of allergic disease, immune mechanisms, or any other topic related to allergic disease.