Deep Reinforcement Learning: Bridging the Gap with Neural Networks


  • P. Venkateswara Rao Associate Professor, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Andhra Pradesh, India
  • Vybhavi B. Assistant Professor, Department of Computer Science, New Horizon College, Kasturinagar, Bangalore, Karnataka, India
  • Manjeet Associate Professor, Department of Electrical & Electronics Engineering, Krishna Vidyapeeth of Management & Technology, Siwani, Bhiwani, Haryana, India
  • Arhath Kumar Assistant Professor, Department of MCA, NMAM Institute of Technology, Nitte (Deemed to be University), Udupi, Karnataka, India
  • Manisha Mittal Associate Professor, Department of Electronics and Communication Engineering, Guru Tegh Bahadur Institute of Technology, New Delhi, India
  • Amit Verma University Centre for Research and Development, Chandigarh University, Gharuan Mohali, Punjab, India
  • Dharmesh Dhabliya Professor, Department of Information Technology, Vishwakarma Institute of Information Technology, Pune, Maharashtra, India


Deep learning, Reinforcement learning, Artificial Intelligence, Neural Network


Deep Reinforcement Learning (DRL) represents a paradigm shift in artificial intelligence, combining the strengths of neural networks with the decision-making process of reinforcement learning. This paper explores the symbiotic relationship between DRL and neural networks, elucidating their collaborative potential in addressing complex problems. Neural networks serve as powerful function approximators, enabling DRL systems to efficiently handle high-dimensional state spaces and intricate relationships within raw input data. The key components of DRL, including value and policy networks, leverage neural networks to enhance learning efficiency. Applications span diverse domains, from robotics and game playing to autonomous systems. However, challenges such as sample inefficiency and interpretability persist. This abstract encapsulates the transformative synergy between DRL and neural networks, showcasing their potential to bridge gaps in traditional problem-solving paradigms and propel the field toward novel applications and advancements. As ongoing research addresses challenges, this collaboration promises to unlock new frontiers in intelligent decision-making and problem-solving. Deep Reinforcement Learning (DRL) represents a groundbreaking paradigm in artificial intelligence, seamlessly marrying the power of neural networks with reinforcement learning principles. This paper explores the synthesis of these two domains, elucidating the synergies that arise when deep neural networks serve as function approximators for value and policy functions in reinforcement learning tasks. The essence of DRL lies in its capacity to handle high-dimensional input spaces, learn hierarchical representations, and autonomously discover complex strategies through interaction with the environment. The paper also surveys key advancements and breakthroughs, highlighting the evolution of DRL from early successes to recent state-of-the-art methodologies. Moreover, this paper investigates the applicability of DRL across diverse domains, including robotics, gaming, and decision-making problems.


Download data is not yet available.


A. Mishra and S. Gangwar, “Lung Cancer Detection and Classification using Machine Learning Algorithms,” Int. J. Recent Innov. Trends Comput. Commun., vol. 11, no. 6 S, pp. 277–282, 2023, doi: 10.17762/ijritcc.v11i6s.6920.

M. Sangwan, S. Gambhir, and S. Gupta, “Lung cancer detection using deep learning techniques,” Appl. AI-Based IoT Syst. to Simulation-Based Inf. Retr., pp. 143–168, 2023, doi: 10.4018/978-1-6684-5255-4.ch009.

M. Dirik, “Machine learning-based lung cancer diagnosis,” Turkish J. Eng., vol. 7, no. 4, pp. 322–330, 2023, doi: 10.31127/tuje.1180931.

R. Yamashita, M. Nishio, R. K. G. Do, and K. Togashi, “Convolutional neural networks: an overview and application in radiology,” Insights into Imaging, vol. 9, no. 4. Springer Science and Business Media LLC, pp. 611–629, Jun. 22, 2018. doi: 10.1007/s13244-018-0639-9.

A. Saha and R. K. Yadav, “Study on segmentation and prediction of lung cancer based on machine learning approaches,” Int. J. Exp. Res. Rev., vol. 30, pp. 1–14, 2023, doi: 10.52756/ijerr.2023.v30.001.

J. Yanase and E. Triantaphyllou, “The seven key challenges for the future of computer-aided diagnosis in medicine,” International Journal of Medical Informatics, vol. 129. Elsevier BV, pp. 413–422, Sep. 2019. doi: 10.1016/j.ijmedinf.2019.06.017.

Y. Yuan and Y.-C. Lo, “Improving Dermoscopic Image Segmentation With Enhanced Convolutional-Deconvolutional Networks,” IEEE Journal of Biomedical and Health Informatics, vol. 23, no. 2. Institute of Electrical and Electronics Engineers (IEEE), pp. 519–526, Mar. 2019. doi: 10.1109/jbhi.2017.2787487.

Y. N. Fu’adah, N. C. Pratiwi, M. A. Pramudito, and N. Ibrahim, “Convolutional Neural Network (CNN) for Automatic Skin Cancer Classification System,” IOP Conference Series: Materials Science and Engineering, vol. 982, no. 1. IOP Publishing, p. 012005, Dec. 01, 2020. doi: 10.1088/1757-899x/982/1/012005.

M. Aharonu and R. L. Kumar, “Systematic Review of Deep Learning Techniques for Lung Cancer Detection,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 3, pp. 725–736, 2023, doi: 10.14569/IJACSA.2023.0140384.

V. Rajasekar, M. P. Vaishnnave, S. Premkumar, V. Sarveshwaran, and V. Rangaraaj, “Lung cancer disease prediction with CT scan and histopathological images feature analysis using deep learning techniques,” Results Eng., vol. 18, no. August 2022, p. 101111, 2023, doi: 10.1016/j.rineng.2023.101111.

S. Wankhade and V. S., “A novel hybrid deep learning method for early detection of lung cancer using neural networks,” Healthc. Anal., vol. 3, no. January, p. 100195, 2023, doi: 10.1016/

A. Gupta, D. Kaushik, M. Garg and A. Verma, "Machine Learning model for Breast Cancer Prediction," 2020 Fourth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC), Palladam, India, 2020, pp. 472-477, doi: 10.1109/I-SMAC49090.2020.9243323

E. Benhamou, D. Saltiel, S. Ungari, and A. Mukhopadhyay, “Bridging the gap between Markowitz planning and deep reinforcement learning.” arXiv, 2020. doi: 10.48550/ARXIV.2010.09108.

V. Veeraiah, K. R. Kumar, P. Lalitha Kumari, S. Ahamad, R. Bansal and A. Gupta, "Application of Biometric System to Enhance the Security in Virtual World," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 719-723, doi: 10.1109/ICACITE53722.2022.9823850.

H. Tercan, A. Guajardo, J. Heinisch, T. Thiele, C. Hopmann, and T. Meisen, “Transfer-Learning: Bridging the Gap between Real and Simulation Data for Machine Learning in Injection Molding,” Procedia CIRP, vol. 72. Elsevier BV, pp. 185–190, 2018. doi: 10.1016/j.procir.2018.03.087.

V. Veeraiah, G. P, S. Ahamad, S. B. Talukdar, A. Gupta and V. Talukdar, "Enhancement of Meta Verse Capabilities by IoT Integration," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 1493-1498, doi: 10.1109/ICACITE53722.2022.9823766.

M. Hausknecht and P. Stone, “Deep Reinforcement Learning in Parameterized Action Space.” arXiv, 2015. doi: 10.48550/ARXIV.1511.04143.

Kaushik Dushyant; Garg Muskan; Annu; Ankur Gupta; Sabyasachi Pramanik, "Utilizing Machine Learning and Deep Learning in Cybesecurity: An Innovative Approach," in Cyber Security and Digital Forensics: Challenges and Future Trends , Wiley, 2022, pp.271-293, doi: 10.1002/9781119795667.ch12.

Z. Ding and H. Dong, “Challenges of Reinforcement Learning,” Deep Reinforcement Learning. Springer Singapore, pp. 249–272, 2020. doi: 10.1007/978-981-15-4095-0_7.

V. Jain, S. M. Beram, V. Talukdar, T. Patil, D. Dhabliya and A. Gupta, "Accuracy Enhancement in Machine Learning During Blockchain Based Transaction Classification," 2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC), Solan, Himachal Pradesh, India, 2022, pp. 536-540, doi: 10.1109/PDGC56933.2022.10053213.

D. Pujol-Perich et al., "IGNNITION: Bridging the Gap between Graph Neural Networks and Networking Systems," in IEEE Network, vol. 35, no. 6, pp. 171-177, November/December 2021, doi: 10.1109/MNET.001.2100266.

V. Talukdar, D. Dhabliya, B. Kumar, S. B. Talukdar, S. Ahamad and A. Gupta, "Suspicious Activity Detection and Classification in IoT Environment Using Machine Learning Approach," 2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC), Solan, Himachal Pradesh, India, 2022, pp. 531-535, doi: 10.1109/PDGC56933.2022.10053312.

A. H. Qureshi, Y. Miao, A. Simeonov and M. C. Yip, "Motion Planning Networks: Bridging the Gap Between Learning-Based and Classical Motion Planners," in IEEE Transactions on Robotics, vol. 37, no. 1, pp. 48-66, Feb. 2021, doi: 10.1109/TRO.2020.3006716.

P. R. Kshirsagar, D. H. Reddy, M. Dhingra, D. Dhabliya and A. Gupta, "A Review on Comparative study of 4G, 5G and 6G Networks," 2022 5th International Conference on Contemporary Computing and Informatics (IC3I), Uttar Pradesh, India, 2022, pp. 1830-1833, doi: 10.1109/IC3I56241.2022.10073385.

Y. Duan, J. Schulman, X. Chen, P. L. Bartlett, I. Sutskever, and P. Abbeel, “RL$^2$: Fast Reinforcement Learning via Slow Reinforcement Learning.” arXiv, 2016. doi: 10.48550/ARXIV.1611.02779.

P. R. Kshirsagar, D. H. Reddy, M. Dhingra, D. Dhabliya and A. Gupta, "A Scalable Platform to Collect, Store, Visualize and Analyze Big Data in Real- Time," 2023 3rd International Conference on Innovative Practices in Technology and Management (ICIPTM), Uttar Pradesh, India, 2023, pp. 1-6, doi: 10.1109/ICIPTM57143.2023.10118183.

S. Singh, R. Kumar, S. Payra, and S. K. Singh, “Artificial Intelligence and Machine Learning in Pharmacological Research: Bridging the Gap Between Data and Drug Discovery,” Cureus. Springer Science and Business Media LLC, Aug. 30, 2023. doi: 10.7759/cureus.44359.

P. R. Kshirsagar, D. H. Reddy, M. Dhingra, D. Dhabliya and A. Gupta, "A Review on Application of Deep Learning in Natural Language Processing," 2022 5th International Conference on Contemporary Computing and Informatics (IC3I), Uttar Pradesh, India, 2022, pp. 1834-1840, doi: 10.1109/IC3I56241.2022.10073309.

J. Tobin, R. Fong, A. Ray, J. Schneider, W. Zaremba and P. Abbeel, "Domain randomization for transferring deep neural networks from simulation to the real world," 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 2017, pp. 23-30, doi: 10.1109/IROS.2017.8202133.

K. A. Shukla, S. Ahamad, G. N. Rao, A. J. Al-Asadi, A. Gupta and M. Kumbhkar, "Artificial Intelligence Assisted IoT Data Intrusion Detection," 2021 4th International Conference on Computing and Communications Technologies (ICCCT), Chennai, India, 2021, pp. 330-335, doi: 10.1109/ICCCT53315.2021.9711795.

M. Abdelshiheed, J. W. Hostetter, T. Barnes, and M. Chi, “Bridging Declarative, Procedural, and Conditional Metacognitive Knowledge Gap Using Deep Reinforcement Learning.” arXiv, 2023. doi: 10.48550/ARXIV.2304.11739.

V. Veeraiah, V. Talukdar, S. B. Talukdar, J. Kotti, M. K. Dharani and A. Gupta, "IoT Framework in a Blockchain dependent Cloud Environment," 2023 14th International Conference on Computing Communication and Networking Technologies (ICCCNT), Delhi, India, 2023, pp. 1-6, doi: 10.1109/ICCCNT56998.2023.10308158.

Z. Zhang, "Improved Adam Optimizer for Deep Neural Networks," 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), Banff, AB, Canada, 2018, pp. 1-2, doi: 10.1109/IWQoS.2018.8624183.

V. Veeraiah, H. Khan, A. Kumar, S. Ahamad, A. Mahajan and A. Gupta, "Integration of PSO and Deep Learning for Trend Analysis of Meta-Verse," 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE), Greater Noida, India, 2022, pp. 713-718, doi: 10.1109/ICACITE53722.2022.9823883.




How to Cite

Rao, P. V. ., B., V. ., Manjeet, M., Kumar, A. ., Mittal, M. ., Verma, A. ., & Dhabliya, D. . (2024). Deep Reinforcement Learning: Bridging the Gap with Neural Networks. International Journal of Intelligent Systems and Applications in Engineering, 12(15s), 576 –. Retrieved from



Research Article

Most read articles by the same author(s)