relation: https://khub.utp.edu.my/scholars/10215/ title: HALA: Hive advanced lexical analyzer creator: Amirthalingam, T. creator: Rais, H.M. description: Big Data Analytics has become an integral part of technology in the past decade. It has been the driving force behind some of the major advancements made, including machine learning, artificial intelligence, human behavior prediction and even genomic studies. Furthermore, the popularity of Hadoop and its ability to provide a highly scalable and available distributed computing on commodity machines has attracted various efforts into optimizing the framework itself. This research focuses on bridging the gap between the user experience seen in a relational database management system and the Hadoop framework. It is added as an external component to Apache Hive's architecture and introduces a novel tool, Hive Advanced Lexical Analyzer, or HALA, for 3 added functionalities: (1) Lexical Analysis, (2) Syntactic Analysis and (3) Multi-query processing. This paper also presents the results of a thorough analysis of the applicability for each of these components. 5 queries from the TPC-H series were selected and used as the base query model for multiple variations for information extraction. In addition to extracting query information, the authors have also performed detailed analysis on the syntax verifying component as well. Multiple runs of different query variations have returned a non-existent false positives and false negatives. Additionally, the modularity of this tool allows the usage of either or all of the components to realize the desired outcome. © 2017 IEEE. publisher: Institute of Electrical and Electronics Engineers Inc. date: 2018 type: Conference or Workshop Item type: PeerReviewed identifier: Amirthalingam, T. and Rais, H.M. (2018) HALA: Hive advanced lexical analyzer. In: UNSPECIFIED. relation: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050222571&doi=10.1109%2fUIC-ATC.2017.8397641&partnerID=40&md5=b44704fb2d9c248890be3e3f8d16f2d9 relation: 10.1109/UIC-ATC.2017.8397641 identifier: 10.1109/UIC-ATC.2017.8397641