About Me
Hello, and thank you for visiting my page. I am a third-year Ph.D. student at the City University of Hong Kong. My main area of focus is music and soundscape.
I worked as a software developer (Level II) at Vocera Communications. Before that, I graduated with an MS (by research) degree from the Department of Computer Science and Engineering at the Indian Institute of Technology Kharagpur, India. The topic of my master's thesis was Prosody Modeling for Storytelling Style Speech Synthesis, and the research was carried out under the supervision of Dr. Krothapalli Sreenivasa Rao. I received my B.Tech in Information Technology from the Department of Information Technology at North Eastern Hill University, India.
My research interests include speech synthesis, speech recognition, natural language processing, and machine learning.
Professional Experience
● Speech Recognition Developer Level 2 | Vocera, Bangalore | R&D Products Engineering Group (Oct '16 - Dec '20)
● Senior Scientific Officer | SRIC-IIT Kharagpur | Department of Information Technology, Govt. of India (Sep '12 - Sep '16)
Thesis
Topic: Prosody Modeling for Storytelling Style Speech Synthesis
Abstract | Demos
Major contributions
Publications
● Kumud Tripathi, Parakrant Sarkar, and K. Sreenivasa Rao, "Sentence Based Discourse Classification for Hindi Story Text-to-Speech (TTS) System", in Thirteenth International Conference on Natural Language Processing (ICON-2016), IIT (BHU), Varanasi, India, 19-20 December, 2016. pdf
● Parakrant Sarkar, and K. Sreenivasa Rao, "Development of Story Text-to-Speech System based on Story Genres", in Workshop on Machine Learning in Speech and Language Processing (MLSLP 2016), Google San Francisco, USA, 13 September, 2016. pdf
● Parakrant Sarkar, and K. Sreenivasa Rao, "Analysis and Modeling Pauses for Synthesis of Storytelling Speech based on Discourse modes", in Proceedings of the IEEE International Conference on Contemporary Computing (IC3 2015), JIIT Noida, India, 11-13 August, 2015. pdf
● Parakrant Sarkar, and K. Sreenivasa Rao, "Modeling Pauses for Synthesis of Storytelling Style Speech Using Unsupervised Word Features", Second International Symposium on Computer Vision and the Internet (VisionNet'15), Procedia Computer Science, Volume 58, Pages 42-49, Kochi, India, August, 2015. pdf
● Parakrant Sarkar, and K. Sreenivasa Rao, "Data-Driven Pause Prediction for Synthesis of Storytelling Style Speech based on Discourse Modes", in Proceedings of the IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT 2015), IIIT Bangalore, India, 10-11 July, 2015. pdf
● Parakrant Sarkar, and K. Sreenivasa Rao, "Data-Driven Pause Prediction for Speech Synthesis in Storytelling Style Speech", in Proceedings of the IEEE 21st National Conference on Communication (NCC-2015), IIT Bombay, India, 27 February to 1 March, 2015. pdf
● Rashmi Verma, Parakrant Sarkar, and K. Sreenivasa Rao, "Conversion of Neutral speech to Storytelling Style speech", in Proceedings of the eighth IEEE International Conference on Advances in Pattern Recognition (ICAPR 2015), ISI Kolkata, India, 04-07 January, 2015. pdf
● Parakrant Sarkar, Arijul Haque, Arup Kumar Dutta, Gurunath Reddy M., Harikrishna D. M., Prasenjit Dhara, Rashmi Verma, N. P. Narendra, Sunil Kr. S.B., Jainath Yadav, K. Sreenivasa Rao, "Designing prosody rule-set for converting neutral TTS speech to storytelling style speech for Indian languages Bengali, Hindi and Telugu", in Proceedings of the IEEE International Conference on Contemporary Computing (IC3 2014), JIIT Noida, India, 20-24 August, 2014. pdf