Indonesian Online News Comment: Abusive Text Identification (doi:10.34820/FK2/DQEVRR)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Entire Codebook

(external link)

Document Description

Citation

Title:

Indonesian Online News Comment: Abusive Text Identification

Identification Number:

doi:10.34820/FK2/DQEVRR

Distributor:

Root

Date of Distribution:

2022-03-05

Version:

1

Bibliographic Citation:

Romadhony, Ade, 2022, "Indonesian Online News Comment: Abusive Text Identification", https://doi.org/10.34820/FK2/DQEVRR, Root, V1, UNF:6:FL7AmAWefBkzMld2oMk8RA== [fileUNF]

Study Description

Citation

Title:

Indonesian Online News Comment: Abusive Text Identification

Identification Number:

doi:10.34820/FK2/DQEVRR

Authoring Entity:

Romadhony, Ade (Telkom University)

Distributor:

Root

Access Authority:

Romadhony, Ade

Depositor:

Romadhony, Ade

Date of Deposit:

2022-03-04

Study Scope

Keywords:

Computer and Information Science, Indonesian online news comment, abusive text identification

Abstract:

This dataset consists of comments that are in some of the top news stories in 2019. comments obtained from the kompas, kaskus, and detik. The labeling process is carried out by 10 people and each comment was labeled by 3 annotators. Each comment is labeled with: 1: 'not abusive' 2: 'abusive but not offensive' 3: 'abusive and offensive'

Methodology and Processing

Sources Statement

Data Access

Notes:

CC0 Waiver

Other Study Description Materials

Related Publications

Citation

Identification Number:

10.1109/ISRITI48646.2019.9034620

Bibliographic Citation:

Abusive language detection on Indonesian online news comments. Desrul, Dhamir Raniah Kiasati and Romadhony, Ade. 2019 International Seminar on Research of Information Technology and Intelligent Systems (ISRITI). 2019. IEEE

File Description--f4192

File: Abusive Language Detection on Indonesian Online News Comments Dataset .tab

  • Number of cases: 3184

  • No. of variables per record: 2

  • Type of File: text/tab-separated-values

Notes:

UNF:6:FL7AmAWefBkzMld2oMk8RA==

Variable Description

List of Variables:

Variables

comment

f4192 Location:

Variable Format: character

Notes: UNF:6:jrqa8zKMJ2P8/6w6qcfe7Q==

label

f4192 Location:

Summary Statistics: Mean 1.2135678391959805; Max. 3.0; Valid 3184.0; StDev 0.5891398196490077; Min. 1.0;

Variable Format: numeric

Notes: UNF:6:9K/TH+rbS8XymiOIA6qvfw==