| アイテムタイプ |
デフォルトアイテムタイプ(フル)(1) |
| 公開日 |
2025-06-06 |
| タイトル |
|
|
タイトル |
UOM at the NTCIR-18 RadNLP Task |
|
言語 |
en |
| 作成者 |
Wuraola Oyewusi
Eliana Vasquez Osorio
Gareth Price
Goran Nenadic
|
| 内容記述 |
|
|
内容記述タイプ |
Abstract |
|
内容記述 |
The RadNLP 2024 (Natural Language Processing for Radiology) shared task at the international conference NTCIR-18 (English track) focuses on document classification for lung cancer staging, aiming to automatically determine the stage (i.e., the degree of progression) of lung cancer from radiology reports. Our approach involved data preprocessing, stratified data augmentation, and fine-tuning RadBERT—a transformer model pre-trained on radiology-specific text. We employed back-translation for data augmentation and 5-fold cross-validation to improve model robustness and address class imbalance. The results demonstrated that data augmentation significantly improved validation performance, with T accuracy increasing from 39.39% to 94.05% during K-fold validation and reaching 100% on the task validation set. However, a substantial performance gap was observed on the task test set, with joint accuracy dropping from 96.3% on the task validation set to 12.35%. This highlights challenges in model generalization due to limited dataset diversity and domain-specific language variability. This report details our methodology, results, and discusses the challenges encountered, highlighting the need for further research to improve the robustness and generalizability of automated lung cancer staging from limited radiology reports. |
|
言語 |
en |
| 出版者 |
|
|
出版者 |
NII Institutional Repository |
|
言語 |
en |
| 日付 |
|
|
日付 |
2025-06-06 |
|
日付タイプ |
Issued |
| 言語 |
|
|
言語 |
eng |
| 資源タイプ |
|
|
資源タイプ識別子 |
http://purl.org/coar/resource_type/c_5794 |
|
資源タイプ |
conference paper |
| ID登録 |
|
|
ID登録 |
10.20736/0002002072 |
|
ID登録タイプ |
JaLC |
| 関連情報 |
|
|
関連タイプ |
isReferencedBy |
|
|
識別子タイプ |
URI |
|
|
関連識別子 |
https://research.nii.ac.jp/ntcir/ntcir-18/index.html |
|
|
言語 |
en |
|
|
関連名称 |
NTCIR-18 Conference |
| 開始ページ |
|
|
開始ページ |
none |
| 会議記述 |
|
|
|
会議名 |
NTCIR-18 Conference |
|
|
言語 |
en |
|
回次 |
18 |
|
|
主催機関 |
National Institute of Informatics |
|
|
言語 |
en |
|
|
開始年 |
2025 |
|
|
開始月 |
6 |
|
|
開始日 |
10 |
|
|
終了年 |
2025 |
|
|
終了月 |
6 |
|
|
終了日 |
13 |
|
|
開催期間 |
June 10-13, 2025 |
|
|
言語 |
en |
|
|
開催会場 |
National Institute of Informatics |
|
|
言語 |
en |
|
開催国 |
JPN |