Prediction of Gas Hydrate Formation Condition by Data-Driven Modeling: Different Machine Learning Models with Vector Quantization and Cuckoo Search Algorithm

Document Type : Research Article

Authors

1 University of Bonn, Bonn, GERMANY

2 Gas Research Division, Research Institute of Petroleum Industry, Tehran, I.R. IRAN

3 Technology and Innovation Group, Research Institute of Petroleum Industry, Tehran, I.R. IRAN

Abstract

Greenhouse gases can be defined as air pollutants that cause global climate warming. In order to reduce their harmful effects, these gases like methane and carbon dioxide can be stored in the form of compact gas hydrates. Prediction of gas hydrate formation conditions is very important for gas hydrate production and storage in industries. The goal of this study is to develop machine learning methods based on support vector regression and adaptive boosting models for predicting gas hydrate formation conditions for CO2 and natural gas. In this regard, SVR, AdaBoost.R2, VQ-SVR, VQ-AdaBoost.R2, CS-VQ-SVR, and CS-VQ-AdaBoost.R2 models have been developed and compared to obtain a model with the best performance. The cuckoo search optimization algorithm and vector quantization technique have also been utilized to determine the optimal values of the models’ hyper-parameters, reduce the computation time, and improve the accuracy and robustness of the models. As a result, since the values of the coefficient of determination and root mean square error for the CS-VQ-SVR model are 0.0215 and 0.9995, respectively, and the best agreement between predicted and actual values in this model’s graphs is obtained, it can be concluded that the CS-VQ-SVR model has the best accuracy and robustness among other developed models in predicting gas hydrate formation pressure with time. These results show that machine learning is viable for predicting the conditions of gas hydrate formation and preventing greenhouse gas emissions in industries.

Keywords

Main Subjects


[1] Yu Y.S., Zhang X., Liu J.W., Lee Y., Li  X.S., Natural Gas Hydrate Resources and Hydrate Technologies: A Review and Analysis of the Associated Energy and Global Warming Challenges, Energy Environ. Sci., 14: 5611-5668 (2021).
[2] Sloan E.D., Koh C.A., “Clathrate Hydrates of Natural Gases”, Third ed. Taylor & Francis Group, New York (2008).
[4] Rahimi Mofrad H., Ganji H., Nazari K., Kameli M., Rezaie Rod A., Kakavand M., Rapid Formation of Dry Natural Gas Hydrate with High Capacity and Low Decomposition Rate Using a New Effective Promoter, J. Pet. Sci. Eng., 147: 756–759 (2016).
[5] Wu Y., Shang L., Pan Z., Xuan Y., Baena-Moreno  F.M., Zhang Z., Gas Hydrate Formation in the Presence
of Mixed Surfactants and Alumina Nanoparticles
, J. Nat. Gas Sci. Eng., 94: 104049 (2021).
[6] Zhang Z.J., Duraisamy K., ”Machine Learning Methods for Data-Driven Turbulence Modeling”, 22nd AIAA Computational Fluid Dynamics Conference, Dallas (2015).
[7] Alsina E.F., Chica M., Trawinski K., Regattieri A., On the Use of Machine Learning Methods to Predict Component Reliability from Data-Driven Industrial Case Studies, The Int. J. Adv. Man. Tech., 94: 2419–2433 (2018).
[8] Bodendorf F., Merkl P., Franke J., Intelligent Cost Estimation by Machine Learning in Supply Management: A Structured Literature Review, Comp. & Ind. Eng., 160: 107601 (2021).
[10] Gui G., Pan H., Lin Z., Li Y., Yuan Z., Data-Driven Support Vector Machine with Optimization Techniques for Structural Health Monitoring and Damage Detection, KSCE J. Civil Eng., 21(2): 523-534 (2017).
[11] G Abo-Khalil A., Lee D.C., MPPT Control of Wind Generation Systems Based on Estimated Wind Speed Using SVR, IEEE Transactions on Industrial Electronics, 55(3): 1489-1490 (2008).
[12] Hong W.C., Dong Y., Chen L.Y., Wei S.Y., SVR with Hybrid Chaotic Genetic Algorithms for Tourism Demand Forecasting, Applied Soft Computing, 11(2): 1881-1890 (2011).
[13] Lee K., Cho S., Asfour S., Web-based Algorithm for Cylindricity Evaluation using Support Vector Machine Learning, Comp. & Ind. Eng., 60(2): 228-235 (2011).
[15] Moslem B., Khalil M.O., Diab M., Chkeir A., Marque C., “Combining Multiple Support Vector Machines for Boosting the Classification Accuracy of Uterine EMG Signals”, 18th IEEE International Conference on Electronics, Circuits, and Systems, Beirut, Lebanon, 11-14 December, (2011).
[16] Rustempasic I., Can M., Diagnosis of Parkinson's Disease Using Principal Component Analysis and Boosting Committee Machines, Southeast Europe Journal of Soft Computing, 2(1): 102-109 (2013).
[17] Shrestha D.L., Solomatine D.P., Experiments with AdaBoost.RT, an Improved Boosting Scheme for Regression,  Neural Computation, 18(7): 1678-1710 (2006).
[18] Hu Y.H., Hwang J.N., “Handbook for Neural Network Signal Processing”, CRC Press, 1st Edition, (2001).
[19] Schapire R.E., The Strength of Weak Learnability, Machine Learning, 5: 197-227 (1990).
[20] Kaur R., Schaye C., Thompson K., Yee D., Zilz R., Sreenivas R.S., Sowers R., Machine Learning and Price-Based Load Scheduling for an Optimal IoT Control in the Smart and Frugal Home, Energy and AI, 3: 100042 (2021).
[21] Freund Y., Schapire R., A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, J. Comp. System Sci., 55(1): 119-139 (1997).
[22] Drucker H., Improving Regressors Using Boosting TechniquesIcml, 97: 107-115 (1997).
[23] Patil S., Patil A., Phalle V., “Life Prediction of Bearing by Using Adaboost Regressor”, An International Conference on Tribology, TRIBOINDIA-2018, VJTI, Mumbai, India, (2018).
[24] Vapnik V.N., “The Nature of Statistical Learning Theory”, Springer, New York, 1st Edition (1995).
[25] Wang J.Y.,  "Application of Support Vector Machines in Bioinformatics", Master Thesis, National Taiwan University (2002).
[26] Rajasekaran S., Gayathri S., Lee T.L., Support Vector Regression Methodology for Storm Surge Predictions, Ocean Engineering, 35(16): 1578-1587 (2008).
[30] Huang Q., Mao J., Liu Y., An Improved Grid Search Algorithm of SVR Parameters Optimization, IEEE 14th International Conference on Communication Technology, Chengdu, China, 9-11 November (2012).
[31] Yang X.S., Deb S., Cuckoo Search via Lévy flights, World Congress on Nature & Biologically Inspired Computing (NaBIC), Coimbatore, India, 9-11 December (2009).
[32] Laha D., N.D. Gupta J., An Improved Cuckoo Search Algorithm for Scheduling Jobs on Identical Parallel Machines, Comp. Ind. Eng., 126: 348-360 (2018).
[33] Valian E., Tavakoli S., Mohanna S., Haghi A., Improved Cuckoo Search for Reliability Optimization Problems, Comp. Ind. Eng., 64(1): 459-468 (2013).
[34] Jiang M., Luo J., Jiang D., Xiong J., Song H., Shen J., A Cuckoo Search-Support Vector Machine Model for Predicting Dynamic Measurement Errors of Sensors,  IEEE Access, 4: 5030 – 5037 (2016).
[36] Buzo A., Gray A., Gray R., Markel J., Speech Coding Based Upon Vector Quantization, IEEE Transactions on Acoustics, Speech, and Signal Processing, 28(5): 562-574 (1980).
[37] Krishnamurthy  A.K., Ahalt S.C., Melton D.E., Chen P., Neural Networks for Vector Quantization of Speech and Images, IEEE Journal on Selected Areas in Communications, 8(4): 1449-1457 (1990).
[38] Tan C., Yu D., Gao X., Song W., A Mechanism based Data-Driven Model for Prediction of Hydrate Formation. Proceedings of the 2nd International Conference on Industrial Control Network and System Engineering Research, pp 84-93 (2020).
[40] Monday C.U., Odutola T.O., “Application of Machine Learning in Gas-Hydrate Formation and Trendline Prediction”, SPE Symposium: Artificial Intelligence - Towards a Resilient and Efficient Energy Industry, OnePetro, (2021).
[41] Sadi M., Fakharian H., Ganji H., Kakavand M., Evolving Artificial Intelligence Techniques to Model the Hydrate-based Desalination Process of Produced Water, J. Water Reuse and Desal., 9(4): 372-384 (2019).
[43] Xu P., Xiao Z., Zhang Z., Huffman M., Wang Q., Prediction of Methane Hydrate Formation Conditions in Salt Water using Machine Learning Algorithms, Comp. Chem. Eng., 151: 107358 (2021).
[45] Kumari A., Madhaw M., Pendyala V.S., Prediction of Formation Conditions of Gas Hydrates using Machine Learning and Genetic Programming, Machine Learning for Societal Improvement, Modernization, and Progress, 200-224 (2022).
 [46] Ghorbani M., Zargar G., Jazayeri-Rad H., Prediction of Asphaltene Precipitation using Support Vector Regression Tuned with Genetic Algorithms, Petroleum, 2(3): 301-306 (2016).
[47] Zheng H., Zhou Y., A Novel Cuckoo Search Optimization Algorithm Base on Gauss Distribution. Journal of Computational Information Systems, 8(10): 4193–4200 (2012).