Vortrag aus Archiv

Knowledge Discovery from Vague Data using Dominance-based Rough Set Approach

29.04.2013

Getting knowledge from massive data is nowadays a primary challenge for information processing. The goal of knowledge discovery from data describing decision situations is to help making better decisions. One of the difficulties in knowledge discovery is a vague character of data due to inconsistency. The Dominance-based Rough set Approach (DRSA) is a methodology for reasoning about vague data, which handles monotonic relationships between values of condition and decision attributes, typical for data describing decision situations. The origin of the vagueness is inconsistency due to violation of the dominance principle which requires that (assuming a positive monotonic relationship) if object x has an evaluation at least as good as object y on all condition attributes, then it should not get evaluation worse than y on all decision attributes. We show that DRSA is a natural continuation of the Pawlak’s concept of rough set, which builds on the ideas coming from Leibniz, Frege, Boole, Łukasiewicz and Zadeh. We also show that the assumption admitted by DRSA about the ordinal character of evaluations on condition and decision attributes is not a limiting factor in knowledge discovery from data. In particular, it is an obvious assumption in decision problems, like multicriteria classification or ranking, multiobjective optimization, and decision under risk and uncertainty. Moreover, even when the ordering of data seems irrelevant, the presence or the absence of a property can be represented in ordinal terms, because if two properties are related, the presence, rather than the absence, of one property should make more (or less) probable the presence of the other property. This is even more apparent when the presence or the absence of a property is graded or fuzzy, because in this case, the more credible the presence of a property, the more (or less) probable the presence of the other property. This observation leads to a straightforward hybridization of DRSA with fuzzy sets. Since the presence of properties, possibly fuzzy, is the base of information granulation, DRSA can also be seen as a general framework for granular computing. We also comment on stochastic version of DRSA, and on algebraic representations of DRSA, as well as on topology for DRSA.

References

[1] S. Greco, B. Matarazzo, R. Słowiński: Rough sets theory for multicriteria decision analysis. European Journal of Operational Research, 129 (2001) 1-47.

[2] S. Greco, B. Matarazzo, R. Słowiński: Dominance-based rough set approach to decision under uncertainty and time preference. Annals of Operations Research, 176 (2010) 41-75.

[3] R. Słowiński, S. Greco, B. Matarazzo: Rough Sets in Decision Making. [In]: R.A. Meyers (ed.): Encyclopedia of Complexity and Systems Science, Springer, New York, 2009, pp. 7753-7786.

[4] W. Kotłowski, K. Dembczyński, S. Greco, R. Słowiński: Stochastic dominance-based rough set model for ordinal classification. Information Sciences, 178 (2008) 4019-4037.[5] J. Błaszczyński, S. Greco, R. Słowiński: Inductive discovery of laws using monotonic