TY - JOUR N1 - cited By 4; Conference of 12th International Workshop on Knowledge Management and Acquisition for Intelligent Systems, PKAW 2012, held in conjunction with 12th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2012 ; Conference Date: 5 September 2012 Through 6 September 2012; Conference Code:194509 SP - 106 TI - A lazy manâ??s way to part-of-speech tagging AV - none EP - 117 PB - Springer Verlag SN - 03029743 N2 - A statistical-based approach to word alignment involving automatically projecting part-of-speech (POS) tags is presented. The approach is referred to as the â??lazy manâ??s wayâ?? because it improves POS assignment for a resource-poor language by exploiting its similarity to a resource-rich one. This unsupervised learning method combines the N-gram and Dice Coefficient similarity functions in order to align English texts with Malay texts thus projecting the POS tags from English to Malay. It is a quick method that does not require the laborious effort needed to annotate the Malay dataset. A case study, an experiment done on 25 terrorism news articles written in Malay, has shown that leveraging pre-existing resources from a resource-rich language, i.e. English, to supplement a resource-poor language, i.e. Malay, is feasible and avoids building new text-processing tools from scratch. The system was tested on the Malay corpus, consisting of 5413 word tokens. The results reached values of 86.87 for precision, 72.56 for recall and 79.07 for F1-Score. This shows that the â??lazy manâ??s wayâ??, where a resource-poor language just exploits the rich linguistic information available in English, increases bitext projection accuracy significantly. © Springer-Verlag Berlin Heidelberg 2012. ID - scholars3124 KW - Intelligent systems; Knowledge management; Mergers and acquisitions; Natural resources; Text processing; Unsupervised learning KW - Dice coefficient; Linguistic information; Part of speech tagging; Part-of-speech tags; Resource-Rich; Similarity functions; Unsupervised learning method; Word alignment KW - Computational linguistics JF - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) A1 - Zamin, N. A1 - Oxley, A. A1 - Bakar, Z.A. A1 - Farhan, S.A. UR - https://www.scopus.com/inward/record.uri?eid=2-s2.0-84893005114&doi=10.1007%2f978-3-642-32541-0_9&partnerID=40&md5=6398989ce94cbfadf1367af131ac7120 VL - 7457 L Y1 - 2012/// ER -