Title: Data mining methods with trees
Authors: Žambochová, Marta
Citation: E+M. Ekonomie a Management = Economics and Management. 2008, č. 1, s. 126-131.
Issue Date: 2008
Publisher: Technická univerzita v Liberci
Document type: článek
article
URI: http://www.ekonomie-management.cz/download/1331826666_7ed4/14_zambochova.pdf
http://hdl.handle.net/11025/17222
ISSN: 1212-3609 (Print)
2336-5604 (Online)
Keywords: získávání dat;rozhodovací stromy;algoritmus ID3
Keywords in different language: data mining;decision trees;ID3 algorithm
Abstract in different language: Present world is characterized by ever growing volume of data collected and saved into data- bases. Data often can‘t be analysed by using standard statistical methods because they contain many missing figures or are in qualitative units, and because some databases are in very wide usa- ge. Each organization must be able to extract important information from an extensive database. These were the main reasons why data mining was initiated. Tree structures are used in many diverse areas. Tree structures are frequently used in statistical data analysis, particularly in data mining. This paper describes decision trees, their data structure and their implementation in statistical data analysis. Decision trees offer a non-algebraic method for partitioning data. Using decision trees is attractive because they offer visualization, simplicity of interpretation and high accuracy. We can utilize them to solve various classificatory and predictive exercises. They are a perfect instrument to help managers in the decision-making processes. The decision trees are also used to form different groups of clients in order to prepare special offers and campaigns. Their potential lies in the ability to predict potential debtors on which may be decided whether to give or reject a loan or insurance to a particular costumer. The decision trees are also used to predict the potency for a new product designed for targeted customer, detect an insurance fraud, or foretell the number of people, who want to attend the competition and so on. There are quite a few algorithms, which have been described and are being used to form deci- sion trees. The following two are among the basic ones: algorithm ID3 and its improved version C4.5. The author is J. R. Quinlan. The first one is very illustrative and it is really important in order to acquire the basic understanding in decision trees problematic. The article contains an example of this ID3 algorithm application.
Rights: © Technická univerzita v Liberci
CC BY-NC 4.0
Appears in Collections:Číslo 1 (2008)
Číslo 1 (2008)

Files in This Item:
File Description SizeFormat 
14_zambochova.pdfPlný text290,06 kBAdobe PDFView/Open


Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/17222

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.