TY - JOUR
T1 - Identifying and Translating Subjective Content Descriptions among Texts
AU - Bender, M.
AU - Braun, Tanya
AU - Gehrke, M.
AU - Kuhr, Felix
AU - Möller, R.
AU - Schiff, S.
PY - 2021
Y1 - 2021
N2 - An agent pursuing a task may work with a corpus of documents as a reference library. Subjective content descriptions (SCDs) provide additional data that add value in the context of the agent's task. In the pursuit of documents to add to the corpus, an agent may come across new documents where content text and SCDs from another agent are interleaved and no distinction can be made unless the agent knows the content from somewhere else. Therefore, this paper presents a hidden Markov model-based approach to identify SCDs in a new document where SCDs occur inline among content text. Additionally, we present a dictionary selection approach to identify suitable translations for content text and SCDs based on n-grams. We end with a case study evaluating both approaches based on simulated and real-world data.
AB - An agent pursuing a task may work with a corpus of documents as a reference library. Subjective content descriptions (SCDs) provide additional data that add value in the context of the agent's task. In the pursuit of documents to add to the corpus, an agent may come across new documents where content text and SCDs from another agent are interleaved and no distinction can be made unless the agent knows the content from somewhere else. Therefore, this paper presents a hidden Markov model-based approach to identify SCDs in a new document where SCDs occur inline among content text. Additionally, we present a dictionary selection approach to identify suitable translations for content text and SCDs based on n-grams. We end with a case study evaluating both approaches based on simulated and real-world data.
KW - Dictionary selection
KW - Text mining
KW - Inline subjective content descriptions
U2 - 10.1142/s1793351x21400122
DO - 10.1142/s1793351x21400122
M3 - Journal article
SN - 1793-351X
VL - 15
SP - 461
EP - 485
JO - International Journal of Semantic Computing
JF - International Journal of Semantic Computing
IS - 4
ER -