Abstract
High-quality labeled textual data are vital for automatic mining and analysis of massive textual data produced by software systems. Several tools have been designed to facilitate manual labeling of textual data on different levels of granularity. However, these tools neither aim to provide statistics and analysis of labeled textual data, nor support collaboration among the coders to reduce the time cost in manual labeling and enhance the quality of labeling results. In this paper, we developed a Web-based labeling tool named CoolTeD (available at http://williamsriver.cn) for collaborative labeling of the textual datasets. Specifically, CoolTeD can be used: (1) to label textual data from the perspective of requirements types based on ISO 25010, (2) to review the labeling results with different confidence levels and contradictory labels, (3) to automatically calculate Cohen's Kappa coefficient of multiple coders, and (4) to visualize the labeling results. The tool demo is available at https://youtu.be/xVkrB_Cs1J8
| Original language | English |
|---|---|
| Title of host publication | 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) |
| Publisher | IEEE |
| Pages | 613-617 |
| Number of pages | 5 |
| ISBN (Electronic) | 978-1-6654-3786-8 |
| ISBN (Print) | 978-1-6654-3787-5 |
| DOIs | |
| Publication status | Published - 21 Jul 2022 |
| Event | 29th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2022 - Virtual Duration: 15 Mar 2022 → 18 Mar 2022 Conference number: 29 |
Publication series
| Name | Proceedings - 2022 IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2022 |
|---|
Conference
| Conference | 29th IEEE International Conference on Software Analysis, Evolution and Reengineering, SANER 2022 |
|---|---|
| Abbreviated title | SANER |
| Period | 15/03/22 → 18/03/22 |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
-
SDG 9 Industry, Innovation, and Infrastructure
-
SDG 12 Responsible Consumption and Production
Keywords
- Collaborative Labeling
- Data Labeling
- Data Visualization
- Requirements Engineering
- Textual Data
- 2024 OA procedure
Fingerprint
Dive into the research topics of 'CoolTeD: A Web-based Collaborative Labeling Tool for the Textual Dataset'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver