# A Review on Robust Artificial Neural Network Forecasting Models towards Outliers Problem

## DOI:

https://doi.org/10.14419/ijet.v8i1.7.25985## Published:

2019-01-18## Keywords:

Outliers, Backpropagation, robust estimators, evolutionary algorithms, forecasting## Abstract

Neurocomputing have been adjusted in time arrangement estimating field, however the nearness of exceptions that for the most part happen in information time arrangement might be hurtful to the information organize preparing. This is on the grounds that the capacity to consequently discover any examples without earlier suppositions and loss of all inclusive statement. In principle, the most well-known preparing calculation for backpropagation calculations inclines toward lessening ordinary least squares estimator (OLS) or all the more particularly, the mean squared error (MSE). In any case, this calculation is not completely hearty when exceptions exist in preparing information, and it will prompt false estimate future esteem. In this paper, the effect of time series outliers in backpropagation training is discussed. The comparisons and related issues of autoregressive moving average (ARIMA) to artificial neural network (ANN) are also discussed briefly in this paper. Moreover, the background of the basic backpropagation neural network (BPNN) time series models; nonlinear autoregressive (NAR) and nonlinear autoregressive moving average (NARMA), is also discussed in this paper. The critical part of the paper is the application of metaheuristics algorithms, mainly the Firefly Algorithm (FFA) to improve the backpropagation models. There are also highlights of latest research works on the robustification of backpropagation using modern optimization algorithms.

Â

Â

## References

[1] Z. Zhang, "Parameter estimation techniques: a tutorial with application to conic fitting",Image and Vision Computing, vol. 15, no. 1, pp. 59-76, 1997.

[2] R. Pearson, "Outliers in process modeling and identification", IEEE Transactions on Control Systems Technology, vol. 10, no. 1, pp. 55-63, 2002.

[3] A.Rusiecki, â€œRobust learning algorithm based on LTA estimator,â€Neurocomputing, vol. 120, pp. 624-632, 2013.

[4] M. El-Melegy, M. Essai, A. Ali, "Robust training of artificial feedforward neural networks" in Foundations of Computational Intelligence: Learning and Approximation (Studies in Computational Intelligence), USA, New York:Springer-Verlag, vol. 1, pp. 217-242, 2009.

[5] Rusiecki, M. Kordos, T. KamiÅ„ski, K. GreÅ„, "Training neural networks on noisy data" in Artificial Intelligence and Soft Computing, Berlin, Germany:Springer-Verlag, vol. 8467, pp. 131-142, 2014.

[6] Abraham and B. Nath, "A neuro-fuzzy approach for modelling electricity demand in Victoria", Applied Soft Computing, vol. 1, no. 2, pp. 127-138, 2001.

[7] D. Delen, G. Walker and A. Kadam, "Predicting breast cancer survivability: a comparison of three data mining methods", Artificial Intelligence in Medicine, vol. 34, no. 2, pp. 113-127, 2005.

[8] R. Gareta, L. Romeo and A. Gil, "Forecasting of electricity prices with neural networks", Energy Conversion and Management, vol. 47, no. 13-14, pp. 1770-1778, 2006.

[9] S. Hamid and Z. Iqbal, "Using neural networks for forecasting volatility of S&P 500 Index futures prices", Journal of Business Research, vol. 57, no. 10, pp. 1116-1125, 2004.

[10] S. Srinivasulu and A. Jain, "A comparative analysis of training methods for artificial neural network rainfallâ€“runoff models", Applied Soft Computing, vol. 6, no. 3, pp. 295-306, 2006.

[11] G. Zhang, M. Keil, A. Rai and J. Mann, â€œPredicting information technology project escalation: A neural network approachâ€,European Journal of Operational Research, vol. 146, pp. 115-129, 2003.

[12] L. Ma and K. Khorasani, "New training strategies for constructive neural networks with application to regression problems", Neural Networks, vol. 17, no. 4, pp. 589-609, 2004.

[13] H. Hippert, C. Pedreira and R. Souza, "Neural networks for short-term load forecasting: a review and evaluation",IEEE Transactions on Power Systems, vol. 16, no. 1, pp. 44-55, 2001.

[14] G. Zhang, "Business Forecasting with Artificial Neural Networks: An overview", in: G. P Zhang(Ed.), Neural Networks in Business Forecasting, Idea Group Publishing , Hershey, PA, 2004

[15] H. Allende, C. Moraga, and R. Salas, â€œArtificial neural networks in time Series forecasting: A comparative analysisâ€, Kybernetika, vol. 38, no. 6, pp. 685-707, 2002.

[16] H. Allende, C. Moraga and R. Salas, "Robust Estimator for the Learning Process in Neural Networks Applied in Time Series", Artificial Neural Networks â€” ICANN 2002, pp. 1080-1086, 2002.

[17] H. Jang, E. Topal and Y. Kawamura, "Unplanned dilution and ore loss prediction in longhole stoping mines via multiple regression and artificial neural network analyses", Journal of the Southern African Institute of Mining and Metallurgy, vol. 115, no. 5, pp. 449-456, 2015.

[18] A.Aliahmadi, M. Jafari-Eskandari, A. Mozafari, and H. Nozari, â€œComparing Linear Regression and Artificial Neural Networks To Forecast Total Productivity Growth In Iranâ€, International Journal of Information, Business and Management, vol. 8, no. 1, pp. 93-113, 2016.

[19] E. Maasoumi, A. Khotanzed and A. Abaye, "Artificial neural networks for some macroeconomic series: A first report",Econometric Reviews, vol. 13, no. 1, pp. 105-122, 1994.

[20] N. Kohzadi, M. Boyd, I. Kaastra, B. Kermanshahi, andD. Scuse, â€œNeural networks for forecasting: an introductionâ€, Canadian Journal of Agricultural Economics/Revue canadienne d'agroeconomie, vol. 43, no. 3, pp. 463-474, 1995.

[21] N. Swanson and H. White, "A Model Selection Approach to Real-Time Macroeconomic Forecasting Using Linear Models and Artificial Neural Networks", Review of Economics and Statistics, vol. 79, no. 4, pp. 540-550, 1997.

[22] J. Hansen, J. McDonald and R. Nelson, "Time Series Prediction With Genetic-Algorithm Designed Neural Networks: An Empirical Comparison With Modern Statistical Models", Computational Intelligence, vol. 15, no. 3, pp. 171-184, 1999.

[23] G. Box and D. Pierce, "Distribution of Residual Autocorrelations in Autoregressive-Integrated Moving Average Time Series Models", Journal of the American Statistical Association, vol. 65, no. 332, pp. 1509-1526, 1970.

[24] J. McDonald and Y. Xu, "Some forecasting applications of partially adaptive estimators of ARIMA models", Economics Letters, vol. 45, no. 2, pp. 155-160, 1994.

[25] G. Tkacz, S. Hu, "Forecasting GDP Growth Using Artificial Neural Networks", Working Paper 99-3 Department of Monetary and Financial Analysis Bank of Canada, 1999.

[26] G. Tkacz, "Neural network forecasting of Canadian GDP growth", International Journal of Forecasting, vol. 17, no. 1, pp. 57-69, 2001.

[27] G. Zhang, "Time series forecasting using a hybrid ARIMA and neural network model", Neurocomputing, vol. 50, pp. 159-175, 2003.

[28] M. Z. Mat Junoh, "Predicting GDP Growth in Malaysia Using Knowledge-Based Economy Indicators: A Comparison Between Neural Network And Econometric Approaches", Sunway College Journal, vol. 1, pp. 39-50, 2004.

[29] K. Mohammadi, H. Eslami, and S. Dardashti, â€œComparison of Regression, ARIMA & ANN Models for Reservoir Inflow Forecasting using Snowmelt Equivalent (a case study of Karaj)â€,Journal of Agriculture, Science & Technology, vol. 7, no. 17, 2005.

[30] G. Rutka, "Network Traffic Prediction using ARIMA and Neural Networks Models", Electronics and Electrical Engineeging, vol. 4, pp. 47-52, 2008.

[31] G. Cybenko, "Approximation by superpositions of sigmoidal functions", Math. Control Signals Syst., vol. 2, no. 4, pp. 303-314, 1989.

[32] K. Funahashi, "On the approximate realization of continuous mappings by neural networks", Neural Networks, vol. 2, no. 3, pp. 183-192, 1989.

[33] D. White and D. Sofge, â€œHandbook of Intelligent Control: Neural, Fuzzy, and Adaptative Approachesâ€, Van Nostrand Reinhold Company, 1992.

[34] H. White and M. Stinchcombe,â€œApproximating and learning unknown mappings using multilayer feedward networks with bounded weightsâ€, In Arcial neural networks: approximations and learning theory, H. White ed., Blackwell, Oxford, UK, 1992.

[35] Gallant, H. White, "On learning the derivatives of an unknown mapping with multilayer feedforward networks", Neural Networks, vol. 5, no. 1, pp. 129-138, 1992.

[36] H. White, Artificial neural networks: approximation and learning theory. Blackwell Publishers, 1992.

[37] Baccigalupi, L. Bedini, C. Burigana, G. De Zotti, A. Farusi, D. Maino, M. Maris, F. Perrotta, E. Salerno, L. Toffolatti and A. Tonazzini, "Neural networks and the separation of cosmic microwave background and astrophysical signals in sky maps", Monthly Notices of the Royal Astronomical Society, vol. 318, no. 3, pp. 769-780, 2000.

[38] R. Fierro and F. Lewis, "Control of a nonholonomic mobile robot using neural networks", IEEE Transactions on Neural Networks, vol. 9, no. 4, pp. 589-600, 1998.

[39] R. Lippmann, "An introduction to computing with neural nets", IEEE ASSP Magazine, vol. 4, no. 2, pp. 4-22, 1987.

[40] M. Khashei and M. Bijari, "An artificial neural network (p,d,q) model for timeseries forecasting", Expert Systems with Applications, vol. 37, no. 1, pp. 479-489, 2010.

[41] M. Khashei and M. Bijari, "A novel hybridization of artificial neural networks and ARIMA models for time series forecasting", Applied Soft Computing, vol. 11, no. 2, pp. 2664-2675, 2011.

[42] Specht, "A general regression neural network", IEEE Transactions on Neural Networks, vol. 2, no. 6, pp. 568-576, 1991.

[43] Alon, M. Qi, R. J. Sadowsik, "Forecasting aggregate retail sales: A comparison of artificial neural networks and traditional methods", J. Retailing Consumer Services, vol. 8, no. 3, pp. 147-156, 2001.

[44] M. Thompson and M. Kramer, â€œModeling chemical processes using prior knowledge and neural networksâ€, AIChE Journal, vol. 40, no. 8, pp. 1328-1340, 1994.

[45] KTam and M. Kiang, â€œManagerial applications of neural networks: the case of bank failure predictionsâ€, Management science, vol. 38, no. 7, pp. 926-947, 1992.

[46] G. Spellman, â€œAn application of artificial neural networks to the prediction of surface ozone concentrations in the United Kingdomâ€, Applied Geography, vol. 19, no. 2, pp. 123-136, 1999.

[47] Chau, â€œParticle swarm optimization training algorithm for ANNs in stage prediction of Shing Mun Riverâ€, Journal of hydrology, vol. 329, no. 3, pp. 363-367, 2006.

[48] Chau, â€œApplication of a PSO-based neural network in analysis of outcomes of construction claimsâ€, Automation in Construction, vol. 16, no.5, pp. 642-646, 2007.

[49] Storrie-Lombardi, O. Lahav, L. Sodre and L. Storrie-Lombardi, â€œMorphological classification of galaxies by artificial neural networksâ€, Monthly Notices of the Royal Astronomical Society, vol. 259, no. 1, pp. 8P-12P, 1992.

[50] G. Foody, "Using prior knowledge in artificial neural network classification with a minimal training set", International Journal of Remote Sensing, vol. 16, no. 2, pp. 301-312, 1995.

[51] Guresen, G. Kayakutlu and T. Daim, "Using artificial neural network models in stock market index prediction", Expert Systems with Applications, vol. 38, no. 8, pp. 10389-10397, 2011.

[52] P. Atkinson and A. Tatnall, "Introduction Neural networks in remote sensing",International Journal of Remote Sensing, vol. 18, no. 4, pp. 699-709, 1997.

[53] G. Zhang, M. Y. Hu, B. Eddy Patuwo and D. C. Indro, "Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis", European Journal of Operational Research, vol. 116, no. 1, pp. 16-32, 1998.

[54] K. Fanning and K. Cogger, "A Comparative Analysis of Artificial Neural Networks Using Financial Distress Prediction", Intelligent Systems in Accounting, Finance and Management, vol. 3, no. 4, pp. 241-252, 1994.

[55] O. Nelles, â€œNonlinear system identification: from classical approaches to neural networks and fuzzy modelsâ€. Springer Science & Business Media, 2001.

[56] Zhang, Artificial higher order neural networks for economics and business. Hershey, IGI Global (701 E. Chocolate Avenue, Hershey, Pennsylvania, 17033, USA), 2008.

[57] Boely and R. Botez, "New Approach for the Identification and Validation of a Nonlinear F/A-18 Model by Use of Neural Networks", IEEE Transactions on Neural Networks, vol. 21, no. 11, pp. 1759-1765, 2010.

[58] J. Michalkiewicz, "Modified Kolmogorov's Neural Network in the Identification of Hammerstein and Wiener Systems", IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 4, pp. 657-662, 2012.

[59] G. Wen and Y. Liu, "Adaptive fuzzy-neural tracking control for uncertain nonlinear discrete-time systems in the NARMAX form", Nonlinear Dynamics, vol. 66, no. 4, pp. 745-753, 2011.

[60] M. Shafiq and N. Butt, "Utilizing higher-order neural networks in U-model based controllers for stable nonlinear plants", International Journal of Control, Automation and Systems, vol. 9, no. 3, pp. 489-496, 2011.

[61] Z. Chen and Y. Ni, "On-board Identification and Control Performance Verification of an MR Damper Incorporated with Structure", Journal of Intelligent Material Systems and Structures, vol. 22, no. 14, pp. 1551-1565, 2011.

[62] M. Gonzalez-Olvera and Y. Tang, "Black-Box Identification of a Class of Nonlinear Systems by a Recurrent Neurofuzzy Network", IEEE Transactions on Neural Networks, vol. 21, no. 4, pp. 672-679, 2010.

[63] L. YuMin, T. DaZhen, X. Hao and T. Shu, â€œProductivity matching and quantitative prediction of coalbed methane wells based on BP neural networkâ€,Science China Technological Sciences, vol. 54, no. 5, pp. 1281-1286, 2011.

[64] M. Han, J. Fan and J. Wang, "A Dynamic Feedforward Neural Network Based on Gaussian Particle Swarm Optimization and its Application for Predictive Control", IEEE Transactions on Neural Networks, vol. 22, no. 9, pp. 1457-1468, 2011.

[65] X. Zong-yi, Q. Yong, P. Xue-miao, J. Li-min and Z. Yuan, "Modelling of the Automatic Depth Control Electrohydraulic System Using RBF Neural Network and Genetic Algorithm", Mathematical Problems in Engineering, vol. 2010, pp. 1-16, 2010.

[66] GÃ³mez-Gil, J. RamÃrez-Cortes, S. Pomares HernÃ¡ndez and V. AlarcÃ³n-Aquino, "A Neural Network Scheme for Long-Term Forecasting of Chaotic Time Series", Neural Processing Letters, vol. 33, no. 3, pp. 215-233, 2011.

[67] X. Zhi-Bin, â€œParticle-swarm-optimization-based selective neural network ensemble & its application to modeling resonant frequency of microstrip antennaâ€.inMicrostrip Antennas, pp. 69-82, 2011.

[68] Z. Chen and Y. Ni, â€œOn-board Identification & Control Performance Verification of an MR Damper Incorporated with Structureâ€,Journal of Intelligent Material Systems & Structures, vol. 22, pp. 1551-1565, 2011.

[69] B. Cosenza, â€œDevelopment of a neural network for glucose concentration prevision in patients affected by type 1 diabetesâ€,Bioprocess & Biosystems Engineering, pp. 1-9, 2012.

[70] S. Chatterjee, S. Nigam, J. Singh and L. Upadhyaya, "Software fault prediction using Nonlinear Autoregressive with eXogenous Inputs (NARX) network", Applied Intelligence, vol. 37, no. 1, pp. 121-129, 2012.

[71] M. Rashid, M. Frasca, A. Ali, R. Ali, L. Fortuna and M. Xibilia, "Nonlinear model identification for Artemia population motion", Nonlinear Dynamics, vol. 69, no. 4, pp. 2237-2243, 2012.

[72] S. Geman, E. Bienenstock and R. Doursat, "Neural Networks and the Bias/Variance Dilemma", Neural Computation, vol. 4, no. 1, pp. 1-58, 1992.

[73] C. Granger, â€œStrategies for modeling nonlinear time-series relationshipsâ€,The Economic Record, vol. 69, no. 206, pp. 233-238, 1993.

[74] R. Hoptroff, "The principles and practice of time series forecasting and business modelling using neural nets", Neural Computing & Applications, vol. 1, no. 1, pp. 59-66, 1993.

[75] R. Ashley, "Statistically significant forecasting improvements: how much out-of-sample data is likely necessary?", International Journal of Forecasting, vol. 19, no. 2, pp. 229-239, 2003.

[76] Azoff,â€œNeural Network Time Series Forecasting of Financial Marketsâ€. Chichester, New York, Wiley, 1994.

[77] W. Gorr, D. Nagin and J. Szczypula, "Comparative study of artificial neural network and statistical models for predicting student grade point averages", International Journal of Forecasting, vol. 10, no. 1, pp. 17-34, 1994.

[78] Komarek andA. Moore, â€œMaking logistic regression a core data mining tool with tr-irlsâ€,IEEE International Conference on Data Mining (ICDM'05), pp. 4-10, 2005.

[79] Zhang and M. Qi,â€œPredicting consumer retail sales using neural networksâ€, In Smith & Gupta (Eds.), Neural Networks in Business: Techniques and Applications, Hershey, PA: Idea Group Publishing, pp. 26-40, 2002.

[80] Kaastra and M. Boyd, "Designing a neural network for forecasting financial and economic time series", Neurocomputing, vol. 10, no. 3, pp. 215-236, 1996.

[81] Zhang and M. Hu, "Neural network forecasting of the British Pound/US Dollar exchange rate", Omega, vol. 26, no. 4, pp. 495-506, 1998.

[82] Coakley and C. Brown, "Artificial neural networks in accounting and finance: modeling issues", International Journal of Intelligent Systems in Accounting, Finance & Management, vol. 9, no. 2, pp. 119-144, 1999.

[83] W. Remus and M. Oâ€™Connor, â€œNeural networks for time series forecastingâ€,in Armstrong JS (ed) Principles of forecasting: a hand book for researchers & practitioners. Kluwer, Norwell, MA, pp 245â€“256, 2001.

[84] Y. Pan, T. Pohlen andS. Manago, â€œHybrid neural network model in forecasting aggregate US retail salesâ€, inAdvances in business & management forecasting, vol. 9, 153-170, 2013.

[85] S. Anbazhagan and N. Kumarappan, â€œDay-ahead deregulated electricity market price forecasting using recurrent neural networkâ€,Systems Journal, IEEE, vol. 7, no. 4, pp. 866-872, 2013.

[86] Azad, S. Mekhilef and V. Ganapathy, "Long-Term Wind Speed Forecasting and General Pattern Recognition Using Neural Networks", IEEE Transactions on Sustainable Energy, vol. 5, no. 2, pp. 546-553, 2014.

[87] Heydari and F. Keynia, â€œPrediction of Wind Power Generation Through Combining Particle Swarm Optimization & Elman Neural Network (El-PSO)â€,International Energy Journal, vol. 15, no. 2, 2015.

[88] Chatfield,â€œTime-Series Forecastingâ€. Boca Raton, Chapman & Hall/ CRC, FL, 2001.

[89] Lennon, G. Montague, A. Frith, C. Gent and V. Bevan, â€œIndustrial applications of neural networks-An investigationâ€,Journal of Process Control, vol. 11, pp. 497-507, 2001.

[90] G. Zhang, B. Patuwo and M. Hu, â€œA simulation study of artificial neural networks for nonlinear time-series forecastingâ€,Computers & Operations Research, vol. 28, no. 4, pp. 381-396, 2001.

[91] G. Zhang, â€œAn investigation of neural networks for linear time-series forecastingâ€,Computers & Operations Research, vol. 28, pp. 1183-1202, 2001.

[92] S. Arlot andA. Calisse, â€œA survey of cross-validation procedures for model selectionâ€, Statistic Surveys, vol. 4, pp. 40-79, 2010.

[93] Bakirtzis, V. Petridis, S. Kiartzis, M. Alexiadis and A. Maissis, â€œA neural network short term load forecasting model for the Greek power systemâ€, IEEE Transactions on Power Systems, vol. 11, no. 2, pp. 858-863, 1996.

[94] Khotanzad, R. Afkhami-Rohani, T. Lu, A. Abaye, M. Davis and D. Maratukulam, â€œANNSTLF-A neural-network-based electric load forecasting systemâ€, IEEE Transactions on Neural Networks, vol. 8 no. 4, pp. 835-846, 1997.

[95] Bishop, â€œNeural networks for pattern recognitionâ€, Oxford university press, 1995.

[96] S. Abid, F. Fnaiech and M. Najim, â€œA fast feedforward training algorithm using a modified form of the standard backpropagation algorithmâ€, IEEE Transactions on Neural Networks, vol. 12, no. 2, pp. 424-430, 2001.

[97] M. Hagan and M. Menhaj, â€œTraining feedforward networks with the Marquardt algorithmâ€, IEEE Transactions on Neural Networks, vol. 5, no. 6, pp. 989-993, 1994.

[98] T. Kwok and D. Yeung, â€œObjective functions for training new hidden units in constructive neural networksâ€,IEEE Transactions on Neural Networks, vol. 8, no. 5, pp. 1131-1148, 1997.

[99] Y. Yamamoto andP. Nikiforuk, â€œA new supervised learning algorithm for multilayered and interconnected neural networksâ€IEEE Transactions on Neural Networks, Vol. 11, pp. 36â€“46, 2000.

[100] Yam and T. Chow,â€œExtended least squares based algorithm for training feedforward networksâ€, IEEE Transactions on Neural Networks, Vol. 8, pp. 806â€“810, 1997.

[101] Yam and T. Chow, â€œA weight initialization method for improving training speed in feedforward neural networksâ€, Neurocomputing, Vol. 30, pp. 219â€“232, 2000.

[102] Yam and T. Chow, â€œFeedforward networks training speed enhancement by optimal initialization of the synaptic coefficientsâ€,IEEE Transactions on Neural Networks, Vol. 12, pp. 430â€“434, 2001.

[103] X. Yu, M. Efe and O. Kaynak, â€œA general backpropagation algorithm for feedforward neural networks learningâ€, IEEE Transactions on Neural Networks, vol. 13, no. 1, pp. 251-254, 2002.

[104] T. Kathirvalavakumar and P. Thangavel, â€œA Modified Backpropagation Training Algorithm for Feedforward Neural Networks*â€,Neural Processing Letters, vol. 23, no. 2, pp. 111-119, 2006.

[105] T. Williams, â€œPredicting changes in construction cost indexes using neural networksâ€, Journal of Construction Engineering and Management, vol. 120, no. 2, pp. 306â€“320, 1994.

[106] V. Asari, â€œTraining of a feedforward multiple-valued neural networks by error backpropagation with a multilevel threshold functionâ€,IEEE Transactions on Neural Networks, vol. 12, pp. 1519â€“1521, 2001.

[107] Chowdhury, Y. Singh and R. Chansarkar, â€œDynamic tunneling technique for efficient training of multilayer perceptronsâ€,IEEE. Transactions on Neural Networks, vol. 10, pp. 48â€“55, 1999.

[108] G. Hinton, â€œConnectioniest Learning Procedure in Machine Learning: Paradigms & Methodsâ€, In: J. G. Carbonell, (ed.), pp. 185â€“234, Cambridge MA: MIT press, 1989.

[109] T. Kathirvalavakumar andP. Thangavel, â€œA new learning algorithm using simultaneous perturbation with weight initializationâ€,Neural Processing Letters, vol. 17, pp. 55â€“68, 2003.

[110] Krogh and J. Hertz, â€œGeneralization in a linear perceptron in the presence of noiseâ€,Journal of Physics A â€“ Mathematical & General, vol. 25, pp. 1135â€“1147, 1992.

[111] G. Lera andM. Pinzolas, â€œNeighborhood based Levenbergâ€“Marquardt algorithm for neural network trainingâ€IEEE Transactions on Neural Networks, vol. 13, pp. 1200â€“1203, 2002.

[112] Parisi, E. Di Claudio, G. Orlandi and B. Rao, "A generalized learning paradigm exploiting the structure of feedforward neural networks", IEEE Transactions on Neural Networks, vol. 7, no. 6, pp. 1450-1460, 1996.

[113] Reed andR. Marks, â€œNeural Smithing Supervised Learning in Feedforward Artificial Neural Networksâ€, Cambridge: MIT, 1999.

[114] P. Thangavel and T. Kathirvalavakumar, â€œA new learning algorithm using simultaneous perturbation with weight initializationâ€, Neural processing letters, 17(1), 55-68, 2003.

[115] P. Thangavel and T. Kathirvalavakumar, â€œTraining feedforward networks using simultaneous perturbation with dynamic tunnelingâ€, Neurocomputing, 48(1), 691-704, 2002.

[116] Bryson, â€œApplied optimal control: optimization, estimation & controlâ€,CRC Press, 1975.

[117] E. Vamsidhar, K. Varma, P. Rao and R. Satapati, â€œPrediction of rainfall using backpropagation neural network modelâ€,International Journal on Computer Science & Engineering, vol. 2, no. 4, pp. 1119-1121,2010.

[118] Duy, Y. Sato and Y. Inoguchi, â€œImproving accuracy of host load predictions on computational grids by artificial neural networksâ€, International Journal of Parallel, Emergent and Distributed Systems, vol. 26, no. 4, pp. 275-290, 2011.

[119] S. Ubaidillah, R. Sallehuddin and N. Ali, â€œCancer Detection Using Aritifical Neural Network & Support Vector Machine: A Comparative Studyâ€,JurnalTeknologi, vol. 65, no. 1, 2013.

[120] Elmi, S. Ibrahim andR. Sallehuddin, â€œDetecting sim box fraud using neural networkâ€IT Convergence and Security 2012, pp. 575-582, 2013.

[121] R. Alwee, S. Hj Shamsuddin and R. Sallehuddin, â€œHybrid support vector regression and autoregressive integrated moving average models improved by particle swarm optimization for property crime rates forecasting with economic indicatorsâ€,The Scientific World Journal, 2013.

[122] H. Maier and G. Dandy, â€œNeural network models for forecasting univariate time seriesâ€, Neural Network World, vol. 5, pp. 747-772, 1996.

[123] O. Valenzuela, I. Rojas, F. Rojas, H. Pomares, L. Herrera, A. GuillÃ©n and M. Pasadas, â€œHybridization of intelligent techniques and ARIMA models for time series predictionâ€,Fuzzy Sets and Systems, vol. 159, no. 7, pp. 821-845, 2008.

[124] Khashei, M. Bijari and G. Raissi Ardali,â€œImprovement of auto-regressive integrated moving average models using fuzzy logic & artificial neural networks (ANNs)â€,Neurocomputing, vol. 72, no. 4, pp. 956-967, 2009.

[125] Weruaga, â€œMultilayer Neural Networksâ€, Beamer presentation, 2006.

[126] Bylander, â€œUniversal Approximationâ€, 2003,http://www.cs.utsa.edu/~bylander/cs4793 /univ-approx.pdf

[127] Bonnell, â€œ Implementation of a New Sigmoid Function in Backpropagation Neural Networksâ€ (Doctoral dissertation, East Tennessee State University), 2011.

[128] T. Windeatt and C. Zor, â€œMinimising added classification error using Walsh coefficientsâ€,IEEE Transactions on Neural Networks, vol. 22, no. 8, pp. 1334-1339,2011.

[129] Panklib, C. Prakasvudhisarn and D. Khummongkol, â€œElectricity Consumption Forecasting in Thail & Using an Artificial Neural Network & Multiple Linear Regressionâ€,Energy Sources, Part B: Economics, Planning, & Policy, vol. 10, no. 4, pp. 427-434, 2015.

[130] S. Khan, T. Ayub andS. Rafeeqi, â€œPrediction of compressive strength of plain concrete confined with ferrocement using artificial neural network (ANN) & comparison with existing mathematical modelsâ€, American Journal of Civil Engineering & Architecture, vol. 1, no. 1, pp. 7-14, 2013.

[131] T. Ayub, M. Nurrudin, S. Khan and F. Memon, â€œPrediction of Compressive Strength of Plain Concrete Confined with Ferrocement using Artificial Neural Network (ANN)â€Special issues of the International Journal of Soft Computing & Software Engineering [JSCSE], vol. 3, no. 3, pp. 663-667, 2013.

[132] Maia, F. de Carvalho and T. Ludermir, â€œForecasting models for interval-valued time seriesâ€,Neurocomputing, vol. 71, no. 16, pp. 3344-3352, 2008.

[133] Wang and Y. Mei, â€œModel for forecasting construction cost indices in Taiwanâ€Construction Management & Economics,vol. 16, no. 2, pp. 147-157, 1998.

[134] P. Hartono and S. Hashimoto, â€œLearning from imperfect dataâ€,Applied Soft Computing, vol. 7, pp. 353-363, 2007.

[135] Pham and D. Karaboga, â€œIntelligent optimization techniques: genetic algorithms, tabu search, simulated annealing & neural networksâ€,Springer Science & Business Media, 2012.

[136] G. Zhang, M. Y. Hu, B. E.Patuwo andD. C. Indro, â€œArtificial neural networks in bankruptcy prediction: General framework and cross-validation analysisâ€, European journal of operational research, 116(1), 16-32, 1999.

[137] T. Kim, K. Oh, C. Kim and J. Do, â€œArtificial neural networks for non-stationary time seriesâ€,Neurocomputing, vol. 61, pp. 439-447, 2004.

[138] Cadenas and W.Rivera, â€œWind speed forecasting in three different regions of Mexico, using a hybrid ARIMAâ€“ANN modelâ€,Renewable Energy, vol. 35, no. 12, pp. 2732-2738, 2010.

[139] S. Sfetsos, â€œA novel approach for the forecasting of mean hourly wind speed time seriesâ€,Renewable energy, Elsevier, pp. 163-174, 2002.

[140] C. Wu, K. Chau and C. Fan, "Prediction of rainfall time series using modular artificial neural networks coupled with data-preprocessing techniques", Journal of Hydrology, vol. 389, no. 1-2, pp. 146-167, 2010.

[141] Wang, H. Zou, J. Su, L. Li and S. Chaudhry,â€œAn ARIMAâ€ANN Hybrid Model for Time Series Forecastingâ€,Systems Research & Behavioral Science,vol. 30, no. 3, pp. 244-259, 2013.

[142] Yolcu, C. Aladag, E. Egrioglu and V. Uslu, â€œTime-series forecasting with a novel fuzzy time-series approach: an example for Istanbul stock marketâ€,Journal of Statistical Computation & Simulation, vol. 83, no. 4, pp. 599-612, 2013.

[143] Claveria and S. Torra, â€œForecasting tourism dem & to Catalonia: Neural networks vs. time series modelsâ€, Economic Modelling, vol. 36, pp. 220-228, 2014.

[144] K. Gairaa, A. Khellaf, Y. Messlem and F. Chellali, â€œEstimation of the daily global solar radiation based on Boxâ€“Jenkins & ANN models: A combined approachâ€,Renewable & Sustainable Energy Reviews, vol. 57, pp. 238-249, 2016.

[145] Ardakani and M. Ardehali, â€œLong-term electrical energy consumption forecasting for developing & developed economies based on different optimized models & historical data typesâ€,Energy, vol. 65, pp. 452-461, 2014.

[146] J. Pati, K. K. Shukla, "A comparison of ARIMA neural network and a hybrid technique for Debian bug number prediction", Proc. Comput. Commun. Technol., pp. 47-53, 2014.

[147] R. Sharda and R. Patil, â€œNeural networks as forecasting experts: An empirical testâ€ In: Proceedings of the International Joint Conference on Neural Networks. Washington, D.C., vol. 2, pp. 491-494, 1990.

[148] R. Sharda and R. Patil, â€œConnectionist approach to time series prediction: An empirical testâ€,Journal of Intelligent Manufacturing, vol. 3, pp. 317-323, 1992.

[149] Z. Tang, C. Almeida and P. Fishwick, â€œTime series forecasting using neural networks vs Box-Jenkins methodologyâ€, Simulation, vol. 57, no. 5, pp. 303â€“310, 1991.

[150] Z. Tang and P. Fishwick, â€œFeedforward neural nets as models for time series forecastingâ€,ORSA Journal on Computing, vol.5, no. 4, pp. 374â€“385, 1993.

[151] S. Kang, â€œAn Investigation of the Use of Feedforward Neural Networks for Forecastingâ€, Ph.D. Thesis, Kent State University, 1991.

[152] Kohzadi, M. Boyd, B. Kermanshashi and I. Kaastra, â€œA comparison of artificial neural network & time series models for forecasting commodity pricesâ€,Neurocomputing, vol. 10, pp. 169-181, 1996.

[153] T. Hill, M. Oâ€™ConnorandW. Remus, â€œNeural Network Models for Time Series Forecastsâ€,Management Science, 42, 7, 1082-1092, 1996.

[154] T. Hill, L. Marquez, M. O'Connor and W. Remus, â€œArtificial neural network models for forecasting and decision makingâ€,International Journal of Forecasting, vol. 10, no.1, pp. 5-15, 1994.

[155] T. Hill, M. Oâ€™Connor and W. Remus, â€œNeural network models for time series forecastsâ€Management Sciences, vol. 42, no. 7, pp. 1082-1092, 1996.

[156] Caire, G. Hatabian and C. Muller, â€œProgress in forecasting by neural networksâ€,Proceedings of the International Joint Conference on Neural Networks, vol. 2, pp. 540â€“545, 1992.

[157] T. Hann andE. Steurer, â€œMuch ado about nothing? Exchange rate forecasting: Neural networks vs. linear models using monthly & weekly dataâ€,Neurocomputing,vol. 10, pp. 323-339, 1996.

[158] S. Balkin and J. Ord, â€œAutomatic neural network modeling for univariate time seriesâ€International Journal of Forecasting, vol. 16, no. 4, pp. 509-515, 2000.

[159] Harun, â€œShort term load forecasting using Artificial Neural Network (ANN) with multiple time lags, stationary time series & Principal Component Analysis (PCA)â€,(Masters dissertation, Universiti Teknologi MARA), 2011.

[160] B. Barbosa, L. Aguirre, C. Martinez and Braga, â€œBlack and Gray-Box Identification of a Hydraulic Pumping Systemâ€IEEE Trans. Control Systems Technology, vol. 19, no. 2, pp. 398-406, 2011.

[161] S. Ahmadi and M. Karrari, â€œNonlinear identification of synchronous generators using a local model approachâ€Przeglad Elektrotechniczny (Electrical Review), vol. 8, pp. 166-170, 2011.

[162] L. Teslic, B. Hartmann, O. Nelles and I. Skrjanc, â€œNonlinear system identification by gustafsonâ€“kessel fuzzy clustering and supervised local model network learning for the drug absorption spectra processâ€ IEEE Transactions on Neural Networks, vol. 22, no. 12, pp. 1941-1951, 2011.

[163] C. Safak, V. Topuz and A. Baba, â€œPneumatic motor speed control by trajectory tracking fuzzy logic controllerâ€Sadhana, vol. 35, no. 1, pp. 75-86, 2010.

[164] K. Tanabe, B. LuÄiÄ‡, D. AmiÄ‡, T. Kurita, M. Kaihara, N. Onodera and T. Suzuki, â€œPrediction of carcinogenicity for diverse chemicals based on substructure grouping & SVM modelingâ€,Molecular diversity, vol. 14, no. 4, pp. 789-802, 2010.

[165] K. Tanabe, B. LuÄiÄ‡, D. AmiÄ‡, T. Kurita, M. Kaihara, N. Onodera and T. Suzuki, â€œPrediction of carcinogenicity for diverse chemicals based on substructure grouping & SVM modelingâ€, Molecular diversity, vol. 14, no. 4, pp. 789-802, 2010.

[166] T. Tsai, C. Lee and C. Wei,â€œNeural network based temporal feature models for short-term railway passenger demand forecastingâ€Expert Systems with Applications, vol. 36, no. 2, pp. 3728-3736, 2009.

[167] M. Cogollo and J. Velasquez, â€œMethodological advances in artificial neural networks for time series forecastingâ€,IEEE Latin America Transactions, vol. 12, no. 4, 764-771,2014.

[168] Yassin, A. Zabidi, M. Salleh and N. Khalid, â€œMalaysian tourism interest forecasting using Nonlinear Auto-Regressive (NAR) modelâ€, 2013 IEEE 3rd International Conference on System Engineering & Technology (ICSET), pp. 32-36, 2013.

[169] Pawlus, H. Karimi and K. Robbersmyr, â€œData-based modeling of vehicle collisions by nonlinear autoregressive model & feedforward neural networkâ€,Information Sciences, vol. 235, pp. 65-79, 2013.

[170] H. Chen and X. Liu, â€œMelt Index Prediction Based on Two Compensation by Compound Basis Function Neural Network & Hidden Markov Modelâ€, 2014.

[171] Saberian, H. Hizam, M. Radzi, M. Ab Kadir and M. Mirzaei, â€œModelling and prediction of photovoltaic power output using artificial neural networksâ€International Journal of Photoenergy, pp. 1-11,2014.

[172] S. Kamaruddin, N. Ghani and N. Ramli, â€œModified nonlinear neural network forecasting models on Malaysian s & costs indicesâ€, Proceedings of 2015 IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE), pp. 191-196, 2015.

[173] AcuÃ±a, C. Ramirez and M. Curilem,â€œComparing NARX and NARMAX models using ANN and SVM for cash demand forecasting for ATMâ€The IEEE 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1-6, 2012.

[174] Goyal and D. Kaur, â€œA Novel Vehicle Classification Model for Urban Traffic Surveillance Using the Deep Neural Network Modelâ€, I.J. Education & Management Engineering, vol.1, 18-31,2016.

[175] M. Rahiman, M. Taib and Y. Salleh, â€œSelection of training data for modeling essential oil extraction system using NNARX structureâ€,In 2007 Int. Conf. Control, Automation & Systems (COEX 2007), Seoul, Korea, pp. 1691-1695, 2007.

[176] Chiras, C. Evans and D. Rees, â€œNonlinear gas turbine modeling using NARMAX structuresâ€,IEEE Trans. Instrumentation and Measurement, vol. 50, no. 4, pp. 893-898, 2001.

[177] K. Ahn and H. P. H. Anh, â€œInverse Double NARX Fuzzy Modeling for System Identificationâ€,IEEE/ ASME Transactions on Mechatronics, 15(1), 136-148, 2010.

[178] N. Kadir, N. Md Tahir, M. assin and A. Zabidi, â€œMalaysian tourism interest forecasting using Nonlinear Auto-Regressive Moving Average (NARMA) modelâ€,2014 IEEE Symposium on Wireless Technology & Applications (ISWTA), pp. 193-198, 2014.

[179] Dimogianopoulos, J. Hios and S. Fassois, â€œFDI for Aircraft Systems Using Stochastic Pooled-NARMAX Representations, Design and Assessmentâ€,IEEE Trans. Control Systems Technology, vol. 17, no.6, pp. 1385-1397, 2009.

[180] Hornik, M. Stinchcombe and H. White, â€œMultilayer feedforward networks are universal approximatorsâ€, Neural networks, vol. 2, no.5, pp. 359-366, 1989.

[181] F. Hampel, E. Ronchetti, P. Rousseeuw and W. Stahel, â€œRobust Statistics, The approach based on influence functionsâ€,Wiley, New York, 1986.

[182] C. Chen and G. Yin, â€œComputing the Efficiency & Tuning Constants for M-Estimationâ€, In Proceedings of the 2002 Joint Statistical Meetings, vol. 478, p. 482, 2002.

[183] J. William, â€œ Introduction to Robust & Quasi-Robust Statistical Methodsâ€, Springer, Berlin, Heidelberg, 1983.

[184] Huber, â€œRobust Statisticsâ€, John Wiley & Sons, New York, 1981.

[185] Farcomeni and L. Ventura, â€œAn overview of robust methods in medical researchâ€Statistical Methods in Medical Research,vol. 21, no. 2, pp. 111-133, 2012.

[186] D. Adedia, Comparison of robust regression estimators. Unpublished MPhil thesis, KNUST, Kumasi, 2014.

[187] Kantar and V. Yildirim, â€œRobust Estimation for Parameters of the Extended Burr Type III Distributionâ€Communications in Statistics-Simulation and Computation, vol. 44, no. 7, pp. 1901-1930, 2015.

[188] Kordos and A. Rusiecki, â€œReducing noise impact on MLP trainingâ€, Soft Computing, pp. 1-17, 2015.

[189] Kordos and A. Rusiecki, â€œImproving MLP neural network performance by noise reduction. In Theory & Practice of Natural Computingâ€, Springer Berlin Heidelberg, pp. 133-144, 2013.

[190] Rusiecki, â€œRobust Learning Algorithm Based on Iterative Least Median of Squaresâ€,Neural Process Lett, vol. 36, pp. 145-160, 2012.

[191] S. Kamaruddin, N. Ghani and N. Ramli, â€œEnhancing Backpropagation of ANN-NAR & ANN-NARMA using Robust Estimators with Application on Real Industrial Dataâ€,Proceedings of The 3rd International Conference on Computer Science & Computational Mathematics (ICCSCM 2014), vol. 3, pp. 256-266, 2014.

[192] Bruna, â€œShort term load forecasting using non-linear modelsâ€Master Thesis Report, Measurement & Control Group, Faculty of Electrical Engineering, Eindhoven University of Technology, 1994.

[193] Yang, â€œEngineering Optimization: An Introduction with Metaheuristic Applicationsâ€John Wiley and Sons, USA, 2010.

[194] X. Yang, â€œA New Metaheuristic Bat-inspired algorithmâ€,Nature Inspired Cooperative Strategies for Optimization, 284, pp. 65-74, 2010.

[195] X. Yang, â€œNature-Inspired Metaheuristic Algorithmsâ€ Luniver Press, UK, 2008.

[196] C. Blum, M. Aguilera, A. Roli and M.Sampels, â€œHybrid Metaheuristics, An Emerging Approach to Optimizationâ€, Springer, 2008.

[197] S. Pal, C. Rai and P. Amrit, â€œComparative study of Firefly Algorithm & Particle Swarm Optimization for Noisy Nonlinear Optimization Problemsâ€,International Journal of Intelligent Systems & Applications, vol. 10, pp. 50-57, 2012.

[198] M. Yuanbin, â€œParticle Swarm Optimization for Cylinder Helical GearMulti-objective Design problemsâ€,Applied Mechanics and Materials, vol. 109, pp. 216-221, 2012.

[199] X. Yang, and X. He, â€œFirefly Algorithm: Recent Advances and Applicationsâ€, International Journal Swarm Intelligence, vol. 1, no.1, pp. 36-50, 2013.

[200] Kennedy and R. Eberhart, â€œA discrete binary version of the particle swarm algorithmâ€,Systems, Man, and Cybernetics, vol. 5, pp. 4104-4108, 1997.

[201] X. Yang, â€œFirefly algorithms for multimodal optimizationâ€, InStochastic algorithms: foundations and applications, pp. 169-178, 2009.

[202] X. Yang and S. Deb, â€œCuckoo Search via Levy Flightsâ€,Proceedings of World Congress on Nature & Biologically Inspired Computing, pp. 210-214, 2009.

[203] Bansal and K. Deep, â€œOptimization of directional overcurrent relay times by particle swarm optimizationâ€,Swarm Intelligence Symposium, pp. 1-7, 2008.

[204] C. Floudas and P. Pardalos,â€œState of the art in global optimization: computational methods and applicationsâ€, Springer Science & Business Media, vol. 7, 2013.

[205] Gandomi, X. Yang and A. Alavi, â€œMixed variable structural optimization using firefly algorithmâ€Computers & Structures, vol. 89, no. 23, pp. 2325-2336,2011.

[206] Parpinelli and H. Lopes, â€œNew inspirations in swarm intelligence: a surveyâ€,International Journal of Bio-Inspired Computation, vol. 3, no. 1, pp. 1-16, 2011.

[207] X. Yang, â€œMultiobjective firefly algorithm for continuous optimizationâ€,Engineering with Computers, vol. 29, no.2, pp. 175-184, 2013.

[208] M. Spasos, K. Tsiakmakis, N. Charalampidis and R. Nilavalan, â€œRF-MEMS switch actuation pulse optimization using Taguchiâ€™s methodâ€,Microsystem Technologies, vol. 17, pp. 1351-1359, 2011.

[209] F. Ducatelle, G. Caro and L. Gambardella, â€œPrinciples and applications of swarm intelligence for adaptive routing in telecommunications networksâ€,Swarm Intelligence, vol. 4, pp. 173-198, 2010.

[210] M. Arjona, M.Cisneros-GonzÃ¡lez and C. HernÃ¡ndez, â€œParameter Estimation of a Synchronous Generator Using a Sine Cardinal Perturbation and Mixed Stochasticâ€“Deterministic Algorithmsâ€,IEEE Trans. Industrial Electronics, vol. 58, no. 2, pp. 486-493, 2011.

[211] C. Liao, â€œEnhanced RBF Network for Recognizing Noise-Riding Power Quality Eventsâ€,IEEE Trans. Instrumentation & Measurement, vol. 59, no.6, pp. 1550-1561, 2010.

[212] M. Wan, L. Li, J. Xiao, C. Wang and Y. Yang, â€œData clustering using bacterial foraging optimizationâ€,Journal of Intelligent Information Systems, vol. 38, no.2, pp. 321-341, 2012.

[213] D. Karaboga and B. Basturk, â€œA Powerful and Efficient Algorithm for Numerical Function Optimization: Artificial Bee Colony (ABC) Algorithmâ€,Journal of Global Optimization, vol. 39, no. 3, pp. 459-471, 2007.

[214] Brajevic, and M. Tuba, â€œAn upgraded artificial bee colony (ABC) algorithm for constrained optimization problemsâ€,Journal of Intelligent Manufacturing, vol. 10 pp. 1-12, 2012.

[215] W. Yunna and S. Zhaomin, â€œApplication of Radial Basis Function Neural Network Based on Ant Colony Algorithm in Credit Evaluation of Real Estate Enterprisesâ€,Proc. 2008 Int. Conf. on Management Science and Engineering, pp. 1322-1329, 2008.

[216] Udhayakumar and S. Kumanan, â€œSequencing & scheduling of job & tool in a flexible manufacturing system using ant colony optimization algorithmâ€,International Journal of Advanced Manufacturing Technology, vol. 50, pp. 1075-1084, 2010.

[217] M. Al-Meshah, â€œFirefly algorithms with probabilistic neural network for classification problemsâ€, PhD Thesis. Malaysian National University, 2013.

[218] X. S. Yang and N. I. M. Algorithms, Luniver Press. Beckington, UK, 242-246, 2008.

[219] Zigiaris, â€œBusiness process reengineering BPRâ€,Report produced for the EC funded project, InnoRegio Project, BPR Hellas SA, January, 2000.

[220] M. Basu, â€œA simulated annealing-based goal-attainment method for economic emission load dispatch of fixed head hydrothermal power systemsâ€,International Journal of Electrical Power & Energy Systems, vol. 27, no. 2, pp. 147-153, 2005.

[221] Chen and C. Huang, â€œBiobjective power dispatch using goal-attainment method & adaptive polynomial networksâ€,IEEE Transactions on Energy Conversion, vol. 19, no. 4, pp. 741â€“747, 2004.

[222] Bouktir, R. Labdani and L. Slimani, â€œEconomic power dispatch of power system with pollution control using multi-objective particle swarm optimizationâ€,Journal of Pure & Applied Sciences, vol. 4, no. 2, pp. 57â€“77, 2007.

[223] El-Wahed, A. Mousa, and M. Elsisy, â€œSolving economic emissions load dispatch problem by using hybrid ACO-MSM approachâ€, The Online Journal on Power and Energy Engineering, vol. 1, no. 1, pp.36-53, 2007.

[224] Xie, S. Wang and Z. Wu, â€œStudy on economic, rapid and environmental power dispatch based on fuzzy multi-objective optimizationâ€,Modern Applied Science, vol. 3, no. 6, pp. 38-44, 2009.

[225] Alsumait and J. Sykulski, â€œSolving economic dispatch problem using hybrid GA-PS-SQP methodâ€. Proceedings of the International Conference on Computer as a Tool (EUROCON â€™09), pp. 351â€“ 356, 2009.

[226] Sudhakaran, S. Slochanal, R. Sreeram and N. Chandrasekhar, â€œApplication of refined genetic algorithm to combined economic & emission dispatchâ€, Journal of the Institution of Engineers, vol. 85, no. 2, pp. 115â€“119, 2004.

[227] Goncalves, C. Almeida, J. Kuk, and M. Delgado, â€œSolving economic load dispatch problem by natural computing intelligent systemsâ€, in Proceedings of the 15th International Conference on Intelligent System Applications to Power Systems (ISAP â€™09), vol. 1, pp.257-264, 2009.

[228] E. Osaba, X. Yang, F. Diaz, E. Onieva, A. Masegosa and A. Perallos, â€œA discrete firefly algorithm to solve a rich vehicle routing problem modelling a newspaper distribution system with recycling policyâ€, Soft Computing, pp. 1-14, 2016.

[229] Alb, P. Alotto, C. Magele, W. Renhart, K. Preis and B. Trapp, â€œFirefly Algorithm for Finding Optimal Shapes of Electromagnetic Devicesâ€, IEEE Transactions on Magnetics, vol. 52, no. 3, pp. 1-4, 2016.

[230] GÃ¡lvez and A. Iglesias, â€œNew memetic self-adaptive firefly algorithm for continuous optimizationâ€,International Journal of Bio-Inspired Computation, vol. 8, no.5, pp. 300-317, 2016.

[231] Åukasik and S. Å»ak, â€œFirefly algorithm for continuous constrained optimization tasks. In Computational Collective Intelligenceâ€, Semantic Web, Social Networks & Multiagent Systems. Springer Berlin Heidelberg, pp. 97-106, 2009.

[232] Saadi, M. Nor Azura and M. Norazan, â€œThe Superiority of Evolutionary Algorithms to Robustify Backpropagation Neural Network Learning Algorithmâ€,Journal of Applied Environmental and Biological Sciences, vol. 6, no. 25, pp. 17-31, 2016.

[233] M. M.Basri, N. M. Nawi, M. Mamat and N. A. Hamid, â€œA New Class of Nonlinear Conjugate Gradient for Training Back Propagation Algorithmâ€, In International Conference on Soft Computing and Data Mining, pp. 200-212, 2018.

[234] R. Heravi and G. A. Hodtani, â€œ A New Correntropy-Based Conjugate Gradient Backpropagation Algorithm for Improving Training in Neural Networksâ€, IEEE Transactions on Neural Networks and Learning Systems, 2018.

[235] Hubara, E. Hoffer and D. Soudry, â€œQuantized Back-Propagation: Training Binarized Neural Networks with Quantized Gradientsâ€, (2018).

[236] F. Bordes, T. Berthier, L. Di Jorio, P. Vincent and Y. Bengio, â€œIteratively unveiling new regions of interest in Deep Learning modelsâ€, 2018.

[237] Aljarah, H. Faris and S. Mirjalili, â€œ Optimizing connection weights in neural networks using the whale optimization algorithmâ€, Soft Computing, 22(1), 1-15, 2018.