Web Scraping for Ovarian Cancer Detection: Utilizing Open-Source Whisper AI for Identifying Relevant Terminology and Improving Early Diagnosis

Authors

  • Vijayshri Khedkar, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • Pooja Bagane, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • Sonali Kothari, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • Anubha Gupta, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • Utkarsh Singh, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • Sahil Gupta, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • Tanya Agrawal, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India
  • M. Karthikeyan, CSIR-NCL, Pune, India

Keywords

Automatic Speech Recognition, Chemical Named Entity Recognition, Ovarian Cancer, Natural Language Processing, Web Scraping

Abstract

This paper investigates how effectively automatic speech recognition (ASR), using the open-source OpenAI Whisper model, detects chemical named entities related to ovarian cancer in human speech. Ovarian cancer is a deadly disease for which early detection is critical to successful treatment. The proposed ASR system is based on deep learning models capable of recognizing complex speech patterns and distinguishing between different chemical terms related to ovarian cancer. The detected chemical entities are then used as queries for web content search and retrieval, which can help in discovering useful information related to ovarian cancer. The study highlights the potential of ASR technology for the early detection and accurate identification of ovarian cancer-related chemical entities and for using them to retrieve relevant information from the web. It also opens new avenues for developing intelligent systems for disease diagnosis and treatment.
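
The pipeline described in the abstract (speech to text, chemical term detection, then web search) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the term lexicon, the function names, and the `https://example.org/search` endpoint are all hypothetical, and a simple dictionary lookup stands in for whatever chemical named entity recognition method the paper actually uses. Only `whisper.load_model` and `model.transcribe` come from the open-source `openai-whisper` package.

```python
import re
from urllib.parse import urlencode

# Illustrative lexicon only (assumption): the abstract does not list
# the chemical vocabulary used in the study.
CHEMICAL_TERMS = {"carboplatin", "paclitaxel", "cisplatin", "bevacizumab", "olaparib"}


def transcribe(audio_path, model_name="base"):
    """Transcribe an audio file with the open-source Whisper package."""
    # Imported lazily so the other helpers run without the package installed
    # (pip install openai-whisper).
    import whisper

    model = whisper.load_model(model_name)
    return model.transcribe(audio_path)["text"]


def find_chemical_terms(transcript, lexicon=CHEMICAL_TERMS):
    """Return lexicon terms found in the transcript, in order of first appearance."""
    found = []
    for token in re.findall(r"[a-z][a-z0-9\-]*", transcript.lower()):
        if token in lexicon and token not in found:
            found.append(token)
    return found


def build_search_url(term, base_url="https://example.org/search"):
    """Build a search URL for a detected term (hypothetical endpoint)."""
    return f"{base_url}?{urlencode({'q': term + ' ovarian cancer'})}"


if __name__ == "__main__":
    # A fixed transcript stands in for transcribe("recording.wav").
    text = "The patient received Carboplatin and paclitaxel; carboplatin was repeated."
    for term in find_chemical_terms(text):
        print(term, build_search_url(term))
```

Keeping transcription, term detection, and query construction as separate functions lets each stage be swapped independently, for example replacing the lexicon lookup with a trained chemical entity recognizer.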

Published

21.09.2023

How to Cite

Khedkar, V., Bagane, P., Kothari, S., Gupta, A., Singh, U., Gupta, S., Agrawal, T., & Karthikeyan, M. (2023). Web Scraping for Ovarian Cancer Detection: Utilizing Open-Source Whisper AI for Identifying Relevant Terminology and Improving Early Diagnosis. International Journal of Intelligent Systems and Applications in Engineering, 11(4), 652–658. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3600

Section

Research Article
