Embodied Understanding of Large Language Models using Calibration Enhancement

Anurag  Sinha; Kamatchi  K. S.; A.  Deepak; Harish  S.; Dibyhash  Bordoloi; Meenakshi  Sharma; Anurag  Shrivastava

Authors

Anurag Sinha Department of Computer Science, IGNOU, New Delhi, India
Kamatchi K. S. 2Associate Professor, Department of Computer Science and Engineering, KCG College of Technology, Karapakkam, Chennai, Tamil Nadu, 600097
A. Deepak Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamilnadu
Harish S. Associate Professor, Dept of ECE., R L JALAPPA INSTITUTE OF TECHNOLOGY, DODDABALLAPUR, KARNATAKA
Dibyhash Bordoloi Associate Professor, Department of Computer Science & Engineering, Graphic Era Deemed to be University, Dehradun, Uttarakhand
Meenakshi Sharma Professor, RNB Global University, Bikaner
Anurag Shrivastava Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, Tamilnadu

Keywords:

Fine-tuning, Embodied Agents, Chain-of-Thought Prompting, Simulated Representations, Physics Engine Mathematical Word Problems (MWPs), ChatGPT, SayCan

Abstract

In our research pursuit, we explore the inherent capacity of Large Language Models (LLMs) to develop an innate understanding of the physical realm—an essential prerequisite for empowering embodied agents to adeptly navigate real-world challenges. This paper introduces an extensive dataset encompassing diverse physical scenarios, establishing AuPPLE (Augmented Physical Priors via Learned Enhancement) as a robust benchmark. It serves as a comprehensive evaluative framework for assessing and amplifying the physical intuition of LLMs, including scenarios involving free fall and projectile motion. Within this benchmark, questions are framed in various formats, spanning MultiQA, binary classification, and continuous number prediction, thereby facilitating a comprehensive evaluation of LLMs' proficiency in comprehending physical dynamics. Moreover, we conduct a fine-tuning process on LLMs like Flan-T5-Large and DeBERTa, employing succinct physics-based prompts to instill a nuanced understanding of environmental physics. Our empirical findings underscore a notable improvement in the performance of LLMs fine-tuned on these physics-centric scenarios, particularly when confronted with questions rooted in the intricacies of the physical domain. This substantiates the effectiveness of our approach, indicating that strategic fine-tuning through physics-based prompts, in conjunction with external methodologies, significantly reinforces LLMs' intuitive grasp of the physical environment and enhances their efficacy in addressing tasks with a distinct physical dimension.

Downloads

Download data is not yet available.

References

Jurafsky, D., & Martin, J. H. (2020). "Speech and Language Processing." Pearson.

Manning, C. D., & Schütze, H. (1999). "Foundations of Statistical Natural Language Processing." MIT Press.

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding." arXiv preprint arXiv:1810.04805.

Liu, Y., Ott, M., Goyal, N., Du, J., Joshi, M., Chen, D., ... & Stoyanov, V. (2019). "RoBERTa: A Robustly Optimized BERT Approach." arXiv preprint arXiv:1907.11692.

Kennedy, J., & Eberhart, R. (1995). "Particle swarm optimization." Proceedings of ICNN'95-International Conference on Neural Networks, 4, 1942-1948.

Shi, Y., & Eberhart, R. (1998). "A modified particle swarm optimizer." Proceedings of the IEEE Congress on Evolutionary Computation, 69-73.

Goodfellow, I., Bengio, Y., Courville, A., & Bengio, Y. (2016). "Deep Learning." MIT press Cambridge.

LeCun, Y., Bengio, Y., & Hinton, G. (2015). "Deep learning." Nature, 521(7553), 436-444.

Goldberg, Y. (2016). "A Primer on Neural Network Models for Natural Language Processing." Journal of Artificial Intelligence Research, 57, 345-420.

Young, T., Hazarika, D., Poria, S., & Cambria, E. (2018). "Recent trends in deep learning based natural language processing." IEEE Computational Intelligence Magazine, 13(3), 55-75.

Shrivastava, A., Chakkaravarthy, M., Shah, M.A..A Novel Approach Using Learning Algorithm for Parkinson’s Disease Detection with Handwritten Sketches. In Cybernetics and Systems, 2022

Shrivastava, A., Chakkaravarthy, M., Shah, M.A., A new machine learning method for predicting systolic and diastolic blood pressure using clinical characteristics. In Healthcare Analytics, 2023, 4, 100219

Shrivastava, A., Chakkaravarthy, M., Shah, M.A.,Health Monitoring based Cognitive IoT using Fast Machine Learning Technique. In International Journal of Intelligent Systems and Applications in Engineering, 2023, 11(6s), pp. 720–729

Shrivastava, A., Rajput, N., Rajesh, P., Swarnalatha, S.R., IoT-Based Label Distribution Learning Mechanism for Autism Spectrum Disorder for Healthcare Application. In Practical Artificial Intelligence for Internet of Medical Things: Emerging Trends, Issues, and Challenges, 2023, pp. 305–321

Boina, R., Ganage, D., Chincholkar, Y.D., .Chinthamu, N., Shrivastava, A., Enhancing Intelligence Diagnostic Accuracy Based on Machine Learning Disease Classification. In International Journal of Intelligent Systems and Applications in Engineering, 2023, 11(6s), pp. 765–774

Shrivastava, A., Pundir, S., Sharma, A., ...Kumar, R., Khan, A.K. Control of A Virtual System with Hand Gestures. In Proceedings - 2023 3rd International Conference on Pervasive Computing and Social Networking, ICPCSN 2023, 2023, pp. 1716–1721

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). "Attention is all you need." In Advances in neural information processing systems (pp. 5998-6008).

M. Ramish, A. Sinha, J. Desai, A. Raj, Y. S. Rajawat and P. Punia, "IT Attack Detection and Classification using Users Event Log Feature And Behavior Analytics through Fourier EEG Signal," 2022 IEEE 11th International Conference on Communication Systems and Network Technologies (CSNT), Indore, India, 2022, pp. 577-582, doi: 10.1109/CSNT54456.2022.9787637.

A. Sinha, M. Bhargavi, N. K. Singh, N. Garg, S. Pal and A. Verma, "Comparative Analysis of Machine Learning and Data Mining based Multi-Models for Diabetes Risk Prediction," 2022 IEEE International Conference for Women in Innovation, Technology & Entrepreneurship (ICWITE), Bangalore, India, 2022, pp. 1-7, doi: 10.1109/I M.

Bhargavi, A. Sinha, J. Desai, N. Garg, Y. Bhatnagar and P. Mishra, "Comparative Study of Consumer Purchasing and Decision Pattern Analysis using Pincer Search Based Data Mining Method," 2022 13th International Conference on Computing Communication and Networking Technologies (ICCCNT), Kharagpur, India, 2022, pp. 1-7, doi: 10.1109/ICCCNT54827.2022.9984410.

A. Sinha, V. Kumar, V. Sharma and A. Alkhayyat, "QML-FFSD: A Novel Approach for Early Detection of SCDs through Feature Fusion of Antibiotics Composition and Symptoms Data using Quantum ML," 2023 IEEE IAS Global Conference on Emerging Technologies (GlobConET), London, United Kingdom, 2023, pp. 1-7, doi: 10.1109/GlobConET56651.2023.10150112.

A. Sinha, M. Ramish, S. Kumari, P. Jha and M. K. Tiwari, "ANN-ANT-LION-MLP Ensemble Transfer Learning Based Classifier for Detection and Classification of Oral Disease Severity," 2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 2022, pp. 530-535, doi: 10.1109/Confluence52989.2022.9734176.

Anurag Sinha et al., "MTD-DHJS: Makespan-Optimized Task Scheduling Algorithm for Cloud Computing With Dynamic Computational Time Prediction," in IEEE Access, vol. 11, pp. 105578-105618, 2023, doi: 10.1109/ACCESS.2023.3318553.

Embodied Understanding of Large Language Models using Calibration Enhancement

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Similar Articles

Announcements

Information for Authors

ijisae

Information

trindex