TY - JOUR SN - 18761100 JF - Lecture Notes in Electrical Engineering TI - Speech Recognizing Comparisons Between Web Speech API and FPT.AI API N1 - cited By 0; Conference of 12th National Technical Seminar on Unmanned System Technology, NUSYS 2020 ; Conference Date: 24 November 2020 Through 25 November 2020; Conference Code:266059 ID - scholars17841 N2 - Nowadays, people use speech recognition services for many purposes in their daily lives, such as learning foreign languages, communicating, etc. Therefore, they need to decide which ones to use. High accuracy and short processing time speech recognition service will help improve the work effectively as the time to re-check output results and the delay time between recognition tasks. For Vietnamese speech recognition, Web Speech API and FPT.AI API are popular. Web Speech API supports multiple languages, while FPT.AI API focuses on Vietnamese as FPT.AIâ??s products are developed exclusively for the Vietnamese market. In order to assist people in choosing a suitable Vietnamese speech recognition service, in this paper, the speech recognizing accuracy and processing time between Web Speech API and FPT.AI API has been compared. 307 audio files containing Vietnamese speeches which are obtained from FPT Open Speech Dataset were chosen to test the accuracy and the processing time of both APIs. For the accuracy test, FPT.AI API was 0.57 more precise than Web Speech API. However, in the processing time test, Web Speech API was 50.99 faster than FPT.AI API. For Web Speech API, it was mostly accurate to process 12â??14-second-long audio files, while FPT.AI API did best when process 2â??4-second-long audio files. The audio files with duration values between 2 and 8 seconds are optimal for both APIs to proceed with STT conversions. © 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. EP - 865 UR - https://www.scopus.com/inward/record.uri?eid=2-s2.0-85116480522&doi=10.1007%2f978-981-16-2406-3_64&partnerID=40&md5=a1ffad457cfbb1bb01be650287375c0d SP - 853 PB - Springer Science and Business Media Deutschland GmbH A1 - Tran, D.C. A1 - Nguyen, D.L. A1 - Ha, H.S. A1 - Hassan, M.F. KW - Application programming interfaces (API); Character recognition; Machine learning; Speech; Statistical tests KW - Applications programming interfaces; Audio files; Daily lives; Foreign language; FPT.; Processing time; Speech-to-text; Vietnamese; Vietnamese speech; Web speech KW - Speech recognition Y1 - 2022/// AV - none VL - 770 ER -