A machine learning approach for Biomedical Named Entity Recognition
DOI:
https://doi.org/10.7439/ijbr.v7i12.3746Abstract
This paper aims to develop a Named Entity Recognition (NER) that creates new tags that facilitate fast query processing, information retrieval and data preprocessing of Biomedical Domain. We have used a machine learning approach that uses domain specific knowledge to train the data and label the entities with appropriate tags. Conditional Random Fields (CRF) is implemented and used to train the input domain specific data that yields good performance. Experimental and evaluation result shows that the learned model yields a Biomedical domain specific NER recall of 82%, precision of 85%, accuracy of 82% and Fmeasure of 83%. Thus learned CRF model builds a domain specific NER for Biomedical Domain and tags the domain keywords with appropriate tags.Downloads
References
Curran, J. R. Clark, S. Language independent NER using a maximum entropy tagger. In Proceedings of the 7thCoNLL. 2003: 164
Derczynski, L., Maynard, D., Rizzo, G., Van Erp, M., Gorrell, G., Troncy, R., Bontcheva, K. Analysis of named entity recognition and linking for tweets. Information Processing and Management.2015;51: 32
Fog Computing World. 2014. Internet of Things to Generate Zettabytes of Data by 2018.
John, D. L. Andrew, M. Fernando, C. N. P. Conditional Random Fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, 2001; ICML
Kazama, J. Makino, T. Ohta, Y. Tsujii, J. Tuning Support Vector Machines for Biomedical Named Entity Recognition. Proceedings of the Workshop on Natural Language Processing in the Biomedical Domain, 2002; Philadelphia,pp. 1-8. Association for Computational Linguistics.
Konkol, M. Brychc
Leama, R. Wei CH, Lu, Z. tmChem: a high performance approach for chemical named entity recognition and normalization. J Cheminform.2014; 7(Suppl 1):S3.
Lee, K.J. Hwang, Y. S. and Rim, H.C. Two-Phase Biomedical NE Recognition based on SVMs. Proceedings of the ACL 2003 Workshop on NLP in Biomedicine, 2003; 33-40.http://www.aclweb.org/anthology/W03-1305.pdf
Li K, Ai W, Tang, Z. Hadoop recognition of biomedical named entity using conditional random fields. In: IEEE transaction parallel distribution system. 2015; pp 1
Paul, M. James, M. Entity extraction without language-specific resources. In proceedings of the 6th Conference on Natural Language Learning2002; 20, COLING-02: 1-4, Stroudsburg, PA, USA, 2002. Association for Computational Linguistics. DOI=10.3115/1118853.1118873.
Payai, B. Aditi, S. Ashish, K. AGNER: Entity tagger in Agriculture domain, 2nd International Conference on Computing for Sustainable Global Development2015.
Robert, M. Christopher, D. M. Accurate unsupervised joint named-entity extraction unaligned parallel text. In Proceedings of the 4th Named Entity Workshop, NEWS
Saha, S.K., Sarkar, S., Mitra, P., Feature Selection Techniques for Maximum entropy based biomedical named entity recognition. Journal of Biomedical Informatics2009;42(5): 905-911.
Srihari, R. Niu, C. Li, W. A Hybrid Approach for Named Entity and Sub-Type Tagging. Proceedings of the sixth conference on Applied Natural Language Processing, ACM, 2000: 247-254. http://dl.acm.org/citation.cfm?id=974181
Wenhui, L. Sriharsha, V. A Simple Semi-supervised Algorithm for Named Entity Recognition, Proceedings of the NAACL HLT Workshop on Semi-supervised Learning for Natural Language Processing, 2009: 58
Xavier, C. Lluis, M. Lluis, P. Named entity extraction using Adaboost. In proceedings of the 6th Conference on Natural Language Learning
Yang, M. Zhou, L. Yu, Z. Gao, S. and Guo, J. Lao Named Entity Recognition based on Conditional Random Fields with Simple Heuristic Information. 12th IEEE International conference on Fuzzy Systems and Knowledge Discovery2015.
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
- An author must submit Copyright form After acceptance of the article.