Healthc Inform Res.  2014 Oct;20(4):272-279. 10.4258/hir.2014.20.4.272.

Korean Anaphora Recognition System to Develop Healthcare Dialogue-Type Agent

Affiliations
  • 1Department of IT Convergence Engineering, Gachon University, Seongnam, Korea.
  • 2IT Department, Gachon University, Seongnam, Korea. lyh@gachon.ac.kr

Abstract


OBJECTIVES
Anaphora recognition is a process to identify exactly which noun has been used previously and relates to a pronoun that is included in a specific sentence later. Therefore, anaphora recognition is an essential element of a dialogue agent system. In the current study, all the merits of rule-based, machine learning-based, semantic-based anaphora recognition systems were combined to design and realize a new hybrid-type anaphora recognition system with an optimum capacity.
METHODS
Anaphora recognition rules were encoded on the basis of the internal traits of referred expressions and adjacent contexts to realize a rule-based system and to serve as a baseline. A semantic database, related to predicate instances of sentences including referred expressions, was constructed to identify semantic co-relationships between the referent candidates (to which semantic tags were attached) and the semantic information of predicates. This approach would upgrade the anaphora recognition system by reducing the number of referent candidates. Additionally, to realize a machine learning-based system, an anaphora recognition model was developed on the basis of training data, which indicated referred expressions and referents. The three methods were further combined to develop a new single hybrid-based anaphora recognition system.
RESULTS
The precision rate of the rule-based systems was 54.9%. However, the precision rate of the hybrid-based system was 63.7%, proving it to be the most efficient method.
CONCLUSIONS
The hybrid-based method, developed by the combination of rule-based and machine learning-based methods, represents a new system with enhanced functional capabilities as compared to other pre-existing individual methods.

Keyword

Anaphora Resolution; Anaphora Recognition; Reference Resolution; Dialogue Analysis; Natural Language Processing

MeSH Terms

Delivery of Health Care*
Natural Language Processing
Semantics

Figure

  • Figure 1 Anaphora resolution system architecture based on rule and semantic anaphora resolution strategies.

  • Figure 2 Structure of a machine leaning-based anaphora recognition system.

  • Figure 3 Structure of the hybrid anaphora recognition system. NLP: Natural Language Processing, WEKA: Waikato Environment for Knowledge Analysis.


Reference

1. Peral J, Rodriguez AF. Translation of pronominal anaphora between English and Spanish: Discrepancies and evaluation. J Artif Intell Res. 2003; 18:117–147.
Article
2. Zhou X, Han H, Chankai I, Prestrud A, Brooks A. Approaches to text mining for clinical medical records. In : Proceedings of the 2006 ACM Symposium on Applied Computing; 2006 Apr 23-27; Dijon, France. p. 235–239.
3. Song MH, Kim SH, Park DK, Lee YH. A multi-classifier based guideline sentence classification system. Healthc Inform Res. 2011; 17(4):224–231.
Article
4. Chakrabarti S, Dom B, Agrawal R, Raghavan P. Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies. VLDB J. 1998; 7(3):163–178.
Article
5. Chang JT, Schütze H, Altman RB. Creating an online dictionary of abbreviations from MEDLINE. J Am Med Inform Assoc. 2002; 9(6):612–620.
Article
6. Hsu WH. Activities of the KSU Bioinformatics and Medical Informatics (BMI) Working Group, 2001-2002. Manhattan (KS): Department of Computing and Information Science, Kansas State University;2001.
7. Wycoff DA, Wagner JR. Distributed laboratory computing: integration of a laboratory computer into a hospital information system. Am J Clin Pathol. 1978; 70(3):390–399.
Article
8. Burke MD. Clinical laboratory consultation: appropriateness to laboratory medicine. Clin Chim Acta. 2003; 333(2):125–129.
Article
9. Kratz A, Soderberg BL, Szczepiorkowski ZM, Dighe AS, Versalovic J, Laposata M. The generation of narrative interpretations in laboratory medicine: a description of service-specific sign-out rounds. Am J Clin Pathol. 2001; 116:Suppl. S133–S140.
10. Lim EM, Sikaris KA, Gill J, Calleja J, Hickman PE, Beilby J, et al. Quality assessment of interpretative commenting in clinical chemistry. Clin Chem. 2004; 50(3):632–637.
Article
11. Kratz A, Laposata M. Enhanced clinical consulting: moving toward the core competencies of laboratory professionals. Clin Chim Acta. 2002; 319(2):117–125.
Article
12. Hripcsak G, Friedman C, Alderson PO, DuMouchel W, Johnson SB, Clayton PD. Unlocking clinical data from narrative reports: a study of natural language processing. Ann Intern Med. 1995; 122(9):681–688.
Article
13. Lee SK, Kang BY, Kim HG, Son YJ. Predictors of medication adherence in elderly patients with chronic diseases using support vector machine models. Healthc Inform Res. 2013; 19(1):33–41.
Article
14. Kim S, Kim W, Park RW. A comparison of intensive care unit mortality prediction models through the use of data mining techniques. Healthc Inform Res. 2011; 17(4):232–243.
Article
15. Wang S, Zhu W, Liang ZP. Shape deformation: SVM regression and application to medical image segmentation. In : Proceedings of the 8th IEEE International Conference on Computer Vision; 2001 Jul 7-14; Vancouver, Canada. p. 209–216.
16. Do TN, Poulet F. Incremental SVM and visualization tools for bio-medical data mining. In : Proceedings of the European Workshop on Data Mining and Text Mining for Bioinformatics; 2003 Sep 23; Dubrovnik, Croatia. p. 14–19.
17. Li S, Fevens T, Krzyzak A. A SVM-based framework for autonomous volumetric medical image segmentation using hierarchical and coupled level sets. Int Congr Ser. 2004; 1268:207–212.
Article
18. Amaral IF, Coelho F, da Costa JF, Cardoso JS. Hierarchical medical image annotation using SVM-based approaches. In : Proceedings of the 10th IEEE International Conference on Information Technology and Applications in Biomedicine; 2010 Nov 3-5; Corfu, Greece. p. 1–5.
19. Evans R. Applying machine learning toward an automatic classification of it. Lit Linguist Comput. 2001; 16(1):45–57.
Article
20. Mojsilovic A, Gomes J. Semantic based categorization, browsing and retrieval in medical image databases. In : Proceedings of the 2002 International Conference on Image Processing; 2002 Sep 22-25; Rochester, NY. p. 145–148.
21. Lee Y, Patel C, Chun S, Geller J. Compositional knowledge management for medical services on semantic web. In : Proceedings of the 13th international World Wide Web Conference on Alternate Track Papers & Posters; 2004 May 17-20; New York, NY. p. 498–499.
22. Cimino JJ. Auditing the Unified Medical Language System with semantic methods. J Am Med Inform Assoc. 1998; 5(1):41–51.
Article
23. Yun JH, Ahn SJ, Kim Y. Development of clinical contents model markup language for electronic health records. Healthc Inform Res. 2012; 18(3):171–177.
Article
24. Hwang KH, Lee H, Choi D. Medical image retrieval: past and present. Healthc Inform Res. 2012; 18(1):3–9.
Article
25. Arch-int N, Arch-int S. Semantic ontology mapping for interoperability of learning resource systems using a rule-based reasoning approach. Expert Syst Appl. 2003; 40(18):7428–7443.
Article
26. Choi S, Choi J, Yoo S, Kim H, Lee Y. Semantic concept-enriched dependence model for medical information retrieval. J Biomed Inform. 2014; 47:18–27.
Article
Full Text Links
  • HIR
Actions
Cited
CITED
export Copy
Close
Share
  • Twitter
  • Facebook
Similar articles
Copyright © 2024 by Korean Association of Medical Journal Editors. All rights reserved.     E-mail: koreamed@kamje.or.kr