A machine learning approach for Biomedical Named Entity Recognition

Authors

  • Kanimozhi U Research Scholar, Department of Computer Science and Engineering, College of Engineering Guindy, Anna University,Chennai, Tamil Nadu 600025
  • Manjula D Professor, Department of Computer Science and Engineering, College of Engineering Guindy, Anna University, Chennai, Tamil Nadu 600025

DOI:

https://doi.org/10.7439/ijbr.v7i12.3746

Abstract

This paper aims to develop a Named Entity Recognition (NER) that creates new tags that facilitate fast query processing, information retrieval and data preprocessing of Biomedical Domain. We have used a machine learning approach that uses domain specific knowledge to train the data and label the entities with appropriate tags. Conditional Random Fields (CRF) is implemented and used to train the input domain specific data that yields good performance. Experimental and evaluation result shows that the learned model yields a Biomedical domain specific NER recall of 82%, precision of 85%, accuracy of 82% and Fmeasure of 83%. Thus learned CRF model builds a domain specific NER for Biomedical Domain and tags the domain keywords with appropriate tags.

Downloads

Download data is not yet available.

References

Curran, J. R. Clark, S. Language independent NER using a maximum entropy tagger. In Proceedings of the 7thCoNLL. 2003: 164

Derczynski, L., Maynard, D., Rizzo, G., Van Erp, M., Gorrell, G., Troncy, R., Bontcheva, K. Analysis of named entity recognition and linking for tweets. Information Processing and Management.2015;51: 32

Fog Computing World. 2014. Internet of Things to Generate Zettabytes of Data by 2018.

John, D. L. Andrew, M. Fernando, C. N. P. Conditional Random Fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, 2001; ICML

Kazama, J. Makino, T. Ohta, Y. Tsujii, J. Tuning Support Vector Machines for Biomedical Named Entity Recognition. Proceedings of the Workshop on Natural Language Processing in the Biomedical Domain, 2002; Philadelphia,pp. 1-8. Association for Computational Linguistics.

Konkol, M. Brychc

Leama, R. Wei CH, Lu, Z. tmChem: a high performance approach for chemical named entity recognition and normalization. J Cheminform.2014; 7(Suppl 1):S3.

Lee, K.J. Hwang, Y. S. and Rim, H.C. Two-Phase Biomedical NE Recognition based on SVMs. Proceedings of the ACL 2003 Workshop on NLP in Biomedicine, 2003; 33-40.http://www.aclweb.org/anthology/W03-1305.pdf

Li K, Ai W, Tang, Z. Hadoop recognition of biomedical named entity using conditional random fields. In: IEEE transaction parallel distribution system. 2015; pp 1

Paul, M. James, M. Entity extraction without language-specific resources. In proceedings of the 6th Conference on Natural Language Learning2002; 20, COLING-02: 1-4, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics. DOI=10.3115/1118853.1118873.

Payai, B. Aditi, S. Ashish, K. AGNER: Entity tagger in Agriculture domain, 2nd International Conference on Computing for Sustainable Global Development2015.

Robert, M. Christopher, D. M. Accurate unsupervised joint named-entity extraction unaligned parallel text. In Proceedings of the 4th Named Entity Workshop, NEWS

Saha, S.K., Sarkar, S., Mitra, P., Feature Selection Techniques for Maximum entropy based biomedical named entity recognition. Journal of Biomedical Informatics2009;42(5): 905-911.

Srihari, R. Niu, C. Li, W. A Hybrid Approach for Named Entity and Sub-Type Tagging. Proceedings of the sixth conference on Applied Natural Language Processing, ACM, 2000: 247-254. http://dl.acm.org/citation.cfm?id=974181

Wenhui, L. Sriharsha, V. A Simple Semi-supervised Algorithm for Named Entity Recognition, Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing, 2009: 58

Xavier, C. Lluis, M. Lluis, P. Named entity extraction using Adaboost. In proceedings of the 6th Conference on Natural Language Learning

Yang, M. Zhou, L. Yu, Z. Gao, S. and Guo, J. Lao Named Entity Recognition based on Conditional Random Fields with Simple Heuristic Information. 12th IEEE International conference on Fuzzy Systems and Knowledge Discovery2015.

Downloads

Published

2016-12-30

Issue

Section

Short Communication

How to Cite

1.
A machine learning approach for Biomedical Named Entity Recognition. Int Jour of Biomed Res [Internet]. 2016 Dec. 30 [cited 2024 Oct. 18];7(12):876-82. Available from: https://ssjournals.co.in/index.php/ijbr/article/view/3746

Most read articles by the same author(s)

1 2 3 4 5 6 7 8 9 10 > >>