End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application

Chung, T.D. and Drieberg, M. and Bin Hassan, M.F. and Khalyasmaa, A. (2020) End-to-end Conversion Speed Analysis of an FPT.AI-based Text-to-Speech Application. In: UNSPECIFIED.

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

In this paper, an FPT.AI-based text-to-speech (TTS) application is developed that converts Vietnamese text into spoken words. The application is developed based on Django for Python and in the form of an interactive web page which is connected to an FPT.AI server through its application programming interface (API). The application supports conversion of text to seven different Vietnamese speeches. Four out of seven voices can be used to convert up to 500 characters in a single transaction while the others support that of 400 characters. Based on the results obtained, the first conversion time takes up to 10 s to convert 400-character text into speech while the subsequent times, given same text, it takes under 1.8 s for the conversion. This is applicable to all voices. © 2020 IEEE.

Item Type: Conference or Workshop Item (UNSPECIFIED)
Additional Information: cited By 11; Conference of 2nd IEEE Global Conference on Life Sciences and Technologies, LifeTech 2020 ; Conference Date: 10 March 2020 Through 12 March 2020; Conference Code:159607
Uncontrolled Keywords: Websites, Conversion speed; Conversion time; End to end; ITS applications; Spoken words; Text to speech; Vietnamese; Vietnamese speech, Application programming interfaces (API)
Depositing User: Mr Ahmad Suhairi UTP
Date Deposited: 10 Nov 2023 03:27
Last Modified: 10 Nov 2023 03:27
URI: https://khub.utp.edu.my/scholars/id/eprint/13388

Actions (login required)

View Item
View Item