Knowledge Discovery and Intelligent Data Mining

Sasi Kumar M

Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Alathukombai, Sathyamangalam, Erode, Tamilnadu, India

Corresponding Author:sasikumarmurugan02@gmail.com

Sasi Kumar V

Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Alathukombai, Sathyamangalam, Erode, Tamilnadu, India

Corresponding Author:sasikumarskvs@gmail.com,

Samyukthaa LK

Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Alathukombai, Sathyamangalam, Erode, Tamilnadu, India

Corresponding Author:samyusamyukthaa@gmail.com,

Gokul Karthik S

PDepartment of Computer Science and Engineering, Bannari Amman Institute of Technology, Alathukombai, Sathyamangalam, Erode, Tamilnadu, India

Corresponding Author:gokulkarthik48@gmail.com

Abirami A

Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Alathukombai, Sathyamangalam, Erode, Tamilnadu, India

Corresponding Author:abirarmia@bitsathy.ac.in

Lakshmanaprakash S

Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Alathukombai, Sathyamangalam, Erode, Tamilnadu, India

Corresponding Author:lakshmanaprakashs@bitsathy.ac.in

Abstract :

Knowledge Discovery in Databases (KDD) is a programmed, exploratory examination and demonstrating of enormous information storehouses. KDD is the coordinated course of recognizing legitimate, novel, valuable, and justifiable examples from enormous and complex informational collections. Data Mining (DM) is the core of the KDD interaction. The model is utilized for figuring out peculiarities from the information, investigation and expectation. The bringing together objective of the KDD cycle is to extricate information from information with regards to huge data sets. KDD is a fantastic tool for keeping organizations and sectors up to date on consumer demands, behaviours, and actions. There are several obvious benefits to employing the KDD technique, as well as some drawbacks. The Intelligent Data Mining and Analysis is a trend setting innovation in data handling to remove rules and information from huge data sets deliberately and break down the nonlinear connection among info and result factors in complex issues or peculiarities. Data mining is a full-grown application in different regions like marketing. The proposed model comprises of extricating Twitter client information utilizing programmable bookkeeping sheet apparatuses like Google Docs. This content purposes the Twitter API alongside passing a bunch of boundaries, similar to client handle or hashtags to bring client metadata. In any case, straightforwardly utilizing Twitter API is additionally conceivable yet it builds the intricacy of the system.

Keywords:
  • KDD Process,,
  • Data Mining,
  • Extract Knowledge,
  • Non-linear Connection,
  • Growing technique,
  • Vehicles-to-Everything (V2X) communication system,
  • twitter client information
Reference

[1] Model, Four Element, Graham J. Williams, and Zhexue Huang. "Modelling the KDD Process." (1996).

[2] Keeney, R. L. (2006). "Value-Focused Thinking: Identifying Decision Opportunities and Creating Alternatives." European Journal of Operations Research 92: 537- 549.

[3] Berry, M. and G. Linoff (2007). Data Mining Techniques for Marketing, Sales and Customer Support, John Wiley and Sons.

[4] Osei-Bryson, K.-M. (2004). "Evaluation of Decision Trees." Computers and Operations Research 31: 1933-1945

[5] Lavrac, N., Bohanec, M., Pur, A., Cestnik, B., Debeljak, M. &Kobler, A. (2007).Data mining and visualization for decision support and modeling of public health-care resources.Journal of Biomedical Informatics, 40, 438-447. doi:10.1016/j.jbi.2006.10.003 Schenkerman, S. (2007). "Inducement of nonexistent order by the analytic hierarchy process." Decision Sciences 28(2): 475-482. 6.

[6] Inmon, W. H. (2012). Building the Data Warehouse.New York, Wiley

[7] Mirza S, Mittal S, Zaman M. A review of data mining literature. International Journal of Computer Science and Information Security (IJCSIS). 2016 Nov;14(11):437-42.

[8] Reddy DL. A Review on Data mining from Past to the Future. International Journal of Computer Applications. 2011;975(2011):8887.

[9] Silwattananusarn T, Tuamsuk K. Data mining and its applications for knowledge management: a literature review from 2007 to 2012. arXiv preprint arXiv:1210.2872. 2012 Oct 10.

[10] Parvathi I, Rautaray S. Survey on data mining techniques for the diagnosis of diseases in medical domain. International Journal of Computer Science and Information Technologies. 2014;5(1):838-46.

[11] Solieman OK. Data mining in sports: A research overview. Dept. of Management Information Systems. 2006 Aug.

[12] Giachanou A, Crestani F. Like it or not: A survey of twitter sentiment analysis methods. ACM Computing Surveys (CSUR). 2016 Jun 30;49(2):1-41.

[13] Agarwal A, Xie B, Vovsha I, Rambow O, Passonneau RJ. Sentiment analysis of twitter data. InProceedings of the workshop on language in social media (LSM 2011) 2011 Jun (pp. 30-38).

[14] Fayyad U. Knowledge discovery in databases: An overview. Relational data mining. 2001:28-47.

[15] Zhong N, Liu C, Kakemoto Y, Ohsuga S. KDD Process Planning. InKDD 1997 Aug 14 (pp. 291-294).

[16] Verma IS. Knowledge Data Discovery and Its Issues. Expansion, Impact and Challenges of IT & CS. 2015 Sep 21;88.

[17] Shafique U, Qaiser H. A comparative study of data mining process models (KDD, CRISP-DM and SEMMA). International Journal of Innovation and Scientific Research. 2014 Nov 20;12(1):217-22.

[18] Düntsch I, Gediga G, Nguyen HS. Rough set data analysis in the KDD process. InProc. of IPMU 2000 (Vol. 1, pp. 220-226).

[19] Wagner C, Saalmann P, Hellingrath B. Machine condition monitoring and fault diagnostics with imbalanced data sets based on the KDD process. IFAC-PapersOnLine. 2016 Jan 1;49(30):296-301.

[20] Chen Z. Intelligent Data Warehousing: From data preparation to data mining. CRC press; 2001 Dec 13.

[21] Ruan D, Chen G, Kerre EE, Wets G, editors. Intelligent data mining: techniques and applications. Springer Science & Business Media; 2005 Aug 24.

[22] Liu X. Intelligent data analysis. InIntelligent Information Technologies: Concepts, Methodologies, Tools, and Applications 2008 (pp. 308-314). IGI Global.

[23] Ma S, Chowdhury SK. Application of LC–high-resolution MS with ‘intelligent’data mining tools for screening reactive drug metabolites. Bioanalysis. 2012 Mar;4(5):501-10.

[24] McCormick K, Salcedo J. IBM SPSS Modeler essentials: Effective techniques for building powerful data mining and predictive analytics solutions. Packt Publishing Ltd; 2017 Dec 26.

[25] Dietz C, Berthold MR. KNIME for open-source bioimage analysis: a tutorial. Focus on Bio-Image Informatics. 2016:179-97.

[26] Mohammad MN, Sulaiman N, Muhsin OA. A novel intrusion detection system by using intelligent data mining in weka environment. Procedia Computer Science. 2011 Jan 1;3:1237-42.

[27] Sarlan A, Nadam C, Basri S. Twitter sentiment analysis. InProceedings of the 6th International conference on Information Technology and Multimedia 2014 Nov 18 (pp. 212-216). IEEE.

[28] Wang L, Niu J, Yu S. SentiDiff: combining textual information and sentiment diffusion patterns for Twitter sentiment analysis. IEEE Transactions on Knowledge and Data Engineering. 2019 Apr 26;32(10):2026-39.

[29] Pawar KK, Shrishrimal PP, Deshmukh RR. Twitter sentiment analysis: A review. International Journal of Scientific & Engineering Research. 2015 Apr 4;6(4):957-64.

[30] Philander K, Zhong Y. Twitter sentiment analysis: Capturing sentiment from integrated resort tweets. International Journal of Hospitality Management. 2016 May 1;55(2016):16-24.

[31] Rauchfleisch A, Kaiser J. The false positive problem of automatic bot detection in social science research. PloS one. 2020 Oct 22;15(10):e0241045.

[32] Geeng C, Yee S, Roesner F. Fake news on Facebook and Twitter: Investigating how people (don't) investigate. InProceedings of the 2020 CHI conference on human factors in computing systems 2020 Apr 21 (pp. 1-14).

[33] Maryn AG, Dover TL. Who gets canceled? Twitter responses to gender-based violence allegations. Psychology of Violence. 2022 Jun 6.

[34] Jeffares S. Interpreting hashtag politics: Policy ideas in an era of social media. Springer; 2014 May 15.

[35] Jain AP, Katkar VD. Sentiments analysis of Twitter data using data mining. In2015 International Conference on Information Processing (ICIP) 2015 Dec 16 (pp. 807-810). IEEE.

[36] Foysal A, Islam S, Rahaman T. Classification of AI powered social bots on Twitter by sentiment analysis and data mining through SVM. International Journal of Computer Applications. 2019;117:13-9

© The Author(s), under exclusive license to Technoarete Publishers 2022
  • ISBN -
  • Instant PDF download
  • Readable on all devices
doi.org/10.36647/MLAIDA/2022.12.B1.Ch012