UOM at the NTCIR-18 RadNLP Task

Wuraola Oyewusi; Eliana Vasquez Osorio; Gareth Price; Goran Nenadic

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

UOM at the NTCIR-18 RadNLP Task

https://doi.org/10.20736/0002002072

名前 / ファイル	ライセンス	アクション
12-NTCIR18-RADNLP-OyewusiW.pdf (897.4 KB)

アイテムタイプ

デフォルトアイテムタイプ（フル）(1)

公開日

2025-06-06

タイトル

UOM at the NTCIR-18 RadNLP Task

言語

作成者

Wuraola Oyewusi
Eliana Vasquez Osorio
Gareth Price
Goran Nenadic

内容記述

内容記述タイプ

Abstract

内容記述

The RadNLP 2024 (Natural Language Processing for Radiology) shared task at the international conference NTCIR-18 (English track) focuses on document classification for lung cancer staging, aiming to automatically determine the stage (i.e., the degree of progression) of lung cancer from radiology reports. Our approach involved data preprocessing, stratified data augmentation, and fine-tuning RadBERT—a transformer model pre-trained on radiology-specific text. We employed back-translation for data augmentation and 5-fold cross-validation to improve model robustness and address class imbalance. The results demonstrated that data augmentation significantly improved validation performance, with T accuracy increasing from 39.39% to 94.05% during K-fold validation and reaching 100% on the task validation set. However, a substantial performance gap was observed on the task test set, with joint accuracy dropping from 96.3% on the task validation set to 12.35%. This highlights challenges in model generalization due to limited dataset diversity and domain-specific language variability. This report details our methodology, results, and discusses the challenges encountered, highlighting the need for further research to improve the robustness and generalizability of automated lung cancer staging from limited radiology reports.

言語

出版者

NII Institutional Repository

言語

日付

2025-06-06

日付タイプ

Issued

言語

eng

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_5794

資源タイプ

conference paper

ID登録

10.20736/0002002072

ID登録タイプ

JaLC

Versions

Ver.1

2025-06-04 08:02:08.610806

Show All versions

Cite as

Other

エクスポート

OAI-PMH

JPCOAR 2.0
JPCOAR 1.0
DublinCore
DDI

Other Formats

インデックスリンク

インデックスツリー

アイテム

UOM at the NTCIR-18 RadNLP Task

× Wuraola Oyewusi

× Eliana Vasquez Osorio

× Gareth Price

× Goran Nenadic

Versions

Share

Cite as

Other

エクスポート

コミュニティ

メニューを最小化

インデックスリンク

インデックスツリー

アイテム

UOM at the NTCIR-18 RadNLP Task

× Wuraola Oyewusi

× Eliana Vasquez Osorio

× Gareth Price

× Goran Nenadic

Versions

Share

Cite as

Other

エクスポート

コミュニティ