Construction of Word Dictionary for Bangla Vowel Ended Roots and Its Verbal Inflexions in UNL Based Machine Translation Scheme
AbstractThis paper focuses on the development of word dictionary of Bangla vowel ended roots and their verbal inflexions to incorporating them into an interlingua representation called Universal Networking Language (UNL) processors. A considerable amount of work has been done on the development of Bangla morphological analysis on verbs, nouns, prefixes and suffixes for machine translation. As far as the researchers are aware, no attempts, however, have been made to integrate the previous developments on Bangla vowel ended roots and inflexions for a concrete computational output. This paper attempts to bridge the gap on Bangla vowel ended roots and inflexions in the framework of UNL system aiming to produce a Bangla word dictionary for UNL. The paper analyzes the Bangla vowel ended roots and verbal inflexions and develops their formats in the UNL structure. Dictionary entries of all vowel ended roots and their inflexions are developed in order to generate associated verbs for sentences. These verbs can be used to convert Bangla native language sentences into UNL expressions using semantic rules, which are then converted into required native languages using the language specific generation rules. Conversion of a Bangla language sentence into UNL expression has also been shown in this paper.
(1) Uchida, H., Zhu, M., and Senta, T. C. D., Universal Networking Language, UNDL Foundation, International environment house, 2005/6, Geneva, Switzerland.
(2) Choudhury, M. E. H., Ali, M. N.Y., Sarkar, M.Z.H., and Ahsan, R., Bridging Bangla to Universal Networking Language- a Human Language Neutral Meta- Language, International Conference on Computer and Information Technology (ICCIT), Dhaka, 2005, pp. 104-109.
(3) Ali, M.N.Y., Das, J.K., Mamun, S.M. A. A., and Nurannabi, A.M., Morphological Analysis of Bangla Words for Universal Networking Language, Third International Conference on Digital Information Management (ICDIM 2008), London, England. pp. 532-537.
(4) Ali, M. N. Y., Das, J. K., Al Mamun, S. M. A., and Choudhury, M. E.H., Specific Features of a Converter of Web Documents from Bengali to Universal Networking Language, International Conference on Computer and Communication Engineering 2008 (ICCCE’08), Kuala Lumpur, Malaysia, pp. 726-731.
(5) Serraset, G. and Boitet, C., UNL French Deconversion as Transfer and Generation from an Interlinguia with Quality Enhancement through Off-line Human Interaction, Machine translation Summit, Singapore, 1999.
(6) EnConverter Specification, version 3.3, UNL Center/UNDL Foundation, Tokyo 150-8304, Japan 2002.
(7) UNDL Foundation: The Universal Networking Language
(UNL) specifications, version 3.2., Japan 2003.
(8) DeConverter Specification, version 2.7, UNL Center, UNDL Foundation, Tokyo 150-8304, Japan 2002.
(9) Ali, M., and Ali, M. M., Development of machine translation dictionaries for Bangla Language, Proceedings of 7th International Conference on Computer and Information Technology (ICCIT), pp. 272-276.
(10) Saha, G. K., The E2B machine translation: A new approach to HLT. Ubiquity archive, Association of Computing Machineries (ACM) 6(32), New York, 2005.
(11) Uddin, M. G., Ashraf, H., Kamal. A. H. M, and Ali, M. M., New parameters for Bangla to English statistical machine translation. Proceedings of 3rd International Conference on Electrical & Computer Engineering, 2004, pp. 545-548.
(12) Shahidullah, D. M., Bangala Vyakaran, Maola Brothers Prokashoni, Dhaka, August 2003, pp.110-130.
(13) Shniti Kumar, D. C., Vasha-Prokash Bangla Vyakaran, Rupa and Company Prokashoni, Calcutta, July 1999, pp.170-175.
(14) Rameswar, D. S., Shadharan Vasha Biggan and Bangla Vasha, Pustok Biponi Prokashoni, November 1996, pp.358-377.
(15) Azad, H., Bakkotottoy, Second edition, Dhaka-1994.
(16) Karim, M. A., Kaykobad, M., and M. Murshed, Technical
Challenges and Design Issues in Bangla Language Processing, IGI Global, Disseminator of Knowledge, 2013, pp. 35-78.
(17) Bondopoddaye, H., Bongio Shobdokosh, Shahitto Okademy, Calcutta-2001.
(18) Asaduzzaman, M. M., and Ali, M. M., Morphological analysis of Bangla Words for Automatic Machine Translation, International Conference on Computer and Information Technology, Dhaka, Bangladesh, 2013, pp. 271-276.
(19) Khairunnahar, K., Morphological Analysis of Bangla Prefix, The Dhaka University Journal of Linguistic, 1(2), 2008, pp. 157-168.
(20) Ali, M. N. Y., Sorwar, G., and Shamsujjoha, M., Formation of Word Dictionary of Bangla Vowel Ended Roots for First Person for Universal Networking Language, the 2015 International Conference on Information and Knowledge Engineering, Las Vegas, USA.
(21) Ali, M. N. Y., Sorwar, G., Toru, A. and Islam, M. A., and Shamsujjoha, M., Morphological Rules of Bangla Repetitive Words for UNL Based Machine Translation, 6th International Conference on Swarm Intelligence, ICSI, Beijing, China, June 25-28, 2015, pp-401-408.
(22) Parikh, J., Khot, J., Dave, S., and Bhattacharyya, P., Predicate Preserving Parsing, Department of Computer Science and Engineering, Indian Institute of Technology, Bombay, India.
(23) Earley, J., An Efficient Context Free Parsing Algorithm, Communications of the ACM, 1970.
(24) Ali, M. N. Y., Ali, M. A., Nurannabi, M. A., and Das, J. K., Algorithm for Conversion of Bangla Sentence to Universal Networking Language, International Conference on Asian Language Processing, IALP, , IEEE Computer Society, Harbin, China, 2010, pp. 118-121.
(25) UNDL, available at http://www.undl.org/, Last access: 02-11-2016.