ENBIS: European Network for Business and Industrial Statistics
Forgotten your password?
Not yet a member? Please register
ENBIS-8 in Athens
21 – 25 September 2008 Abstract submission: 14 March – 11 August 2008The following abstracts have been accepted for this event:
-
On Applications of the Relative Linkage Disequilibrium
Authors: Ron S. Kenett and Silvia Salini
Affiliation: KPA Ltd., Raanana, Israel and University of Torino, Torino, Italy, email: ron@kpa.co.il, Department of Economics, Business and Statistics. University
Primary area of focus / application:
Relative Linkage Disequilibrium (RLD) was originally proposed as an approach to analyse both quantitatively and graphically general two way contingency tables (Kenett 1983). It was later expanded to the Data Mining context to evaluate Association Rules (Kenett and Salini, 2008). RLD can be interpreted graphically using a simplex representation leading to powerful graphical display of association relationships. Moreover the statistical properties of RLD are known so that confirmatory statistical tests of significance or basic confidence intervals can be applied. In this work we present several applications of RLD such as Risk Management, Kansei Engineering, Text mining, web clickstream analysis.
(Some) References
Hahsler, M., Gr¨un, B., and Hornik, K. (2005). arules – A computational environment for mining association rules and frequent item sets. Journal of Statistical Software, 14(15):1–25. ISSN 1548-7660. URL http://www.jstatsoft.org/v14/i15/.
Kenett, R. (1983). On an Exploratory Analysis of Contingency Tables. The Statistician, 32, pp. 395-403.
Kenett, R and Salini, S., "Relative Linkage Disequilibrium: A New measure for association rules" (March 2008). UNIMI - Research Papers in Economics, Business, and Statistics. Statistics and Mathematics. Working Paper 32. http://services.bepress.com/unimi/statistics/art32
Multi-Industry Semantic-Based Business Intelligence Solutions (2008), http://www.musing.eu/download-area/musing-public-documentation/it-operational-risk-bi-system-executive-summary-mar-2007-v1-0-kpa/at_download/file
Nagamachi, M. (1995), Kansei Engineering: a new ergonomic consumer-oriented technology for product development. International Journal of Industrial Ergonomics, 15: 3-11
Omiecinski, E. (2003). Alternative interest measures for mining associations in databases. IEEE Transactions on Knowledge and Data Engineering, 15(1):57–69.
Shimada, K., Hirasawa K, and Hu J. (2006) Association Rule Mining with Chi-Squared Test Using Alternate Genetic Network Programming, ICDM2006.
Van Lottum, C., Pearce, K., Coleman, S. (2006), Features of Kansei Engineering Characterizing its Use in Two Studies: Men’s Everyday Footwear and Historic Footwear. Quality and Reliability Engineering International, 22: 629-650. -
Advances in Operational Risk Management: Statistical aspects of the MUSING project on combining semantic and quantitative data in risk assessment
Authors: Ron Kenett and Yossi Raanan
Affiliation: KPA Ltd., Univ of Torino, Italy and College of Management, Buisness School, Rishon LeZion, Israel
Primary area of focus / application:
MUSING (MUlti-industry, Semantic-based next generation business INtelliGence) is an R&D initiative co-funded by the European Commission in the context of the VI Framework Programme (http://www.musing.eu). The main objective of the 4 year project, started in 2006, is to design, develop, and test Business Intelligence services combining unstructured data such as text, with structured data, such as balance sheets. The innovation in MUSING is in the use of advanced statistical and data mining methods, such as Bayesian Networks, with new sophisticated technologies based on the semantic analysis, such as ontologies and XBRL. MUSING tools have been applied to data and information gathered from internal databases, as well as external data sources (Internet, financial newspapers, etc.), thus supporting and helping several time and money consuming activities, such as
- Automatic extraction of information from several digital information sources
- Identification and mapping of Operational Risks (OpR)
- Development of practical solutions for SME for OpR mitigation
- Web based services in support of SME internationalisation
The talk will cover the main innovation in MUSING, with an emphasis on the role of statistical methods in integrating qualitative, semantic based, data and quantitative information, to produce a comprehensive Operational Risk Management System. Specifically, we will cover a case study based on an application of MUSING methodology to a telecom operator. -
Kernel based confidence intervals for survival function estimation
Authors: Bagkavos Dimitris, Ioannides Dimitris Kalamatianou Aglaia.
Affiliation: Accenture Marketing in Athens, University of Macedonia and Pantion University .
Primary area of focus / application:
Submitted at 22-Apr-2008 07:49 by Dimitris Ioannides
Accepted
-
The r-out-of-m S Control Chart
Authors: Antzoulakos D.L. and Rakitzis C.A.
Affiliation: Department of Statistics & Insurance Science, University of Piraeus, Greece
Primary area of focus / application:
-
Quality improvement of land cover data bases: a sequential approach via agreement measures
Authors: E. Carfagna J. Marzialetti
Affiliation: University of Bologna
Primary area of focus / application:
In this paper, we use quality control for continuously improving the data base production process and to this purpose we propose an adaptive sequential sample design which allows reaching high precision of estimates with the smallest sample size and in the shortest time. However, adaptive sequential procedures do not allow unbiased estimates of the quality parameters because their efficiency is due to sample selection dependent on previously selected units and stopping rules based on the quality parameter.
In Carfagna and Marzialetti (2007) we have proposed an adaptive sequential procedure with permanent random numbers which allows unbiased and efficient estimates of the percentage of area correctly photo-interpreted. In this procedure the sample size per stratum is dependent on the previously selected units but the sample selection is not, and the stopping rule is not based on the estimates of the quality parameter.
In this paper we apply the same adaptive sequential procedure to some measures of agreement, such as the Cohen’s Kappa and its weighted version, where the weights are an inverse function of the distance among the classes of the photo-interpretation legend. Both agreement measures can be applied to confusion matrices based on the area of polygons as well as on the number of polygons.
We have performed several simulations corresponding to different selections of polygons from a land cover data base of photo-interpreted satellite data. The behaviour of the standard deviations of the Cohen’s Kappa and the Weighted Kappa has showed that the sequential procedure allows us to reach a specified precision of the estimates with a smaller sample size with respect to the classical two-step procedure where the sample size necessary to obtain that specific precision is computed on the basis of an initial sample. However, for some simulations, the trend of the standard deviations is not strictly monotone. Some simulations show a small increase of the standard deviation, some others show jumps of the standard deviation to higher values.
With reference to confusion matrices based on the number of polygons, we have used also a log-linear model to describe the “structure” of the agreement (Tanner and Young 1985), where we consider two terms expressing respectively the agreement obtained by chance alone and the effective agreement existing between the photo-interpreter and the controller. If we hypothesize that the level of the agreement does not change among the classes of the legend we can use the homogeneous model, otherwise a non-homogeneous model is more appropriate. We propose a sequential approach for the log-linear model too.
References
Carfagna E. and Marzialetti J. (2007) Sequential Design in Quality Control and Validation of Land Cover Data Bases, Proc. of the ENBIS-DEINDE Conference “Computer Experiments versus Physical Experiments”, Torino, G. Vicario and E. D. Isaia (eds.).
Tanner M. A. and Young M. A. (1985) Modeling agreement among raters, Journal of the American Statistical Association, 80, 175-180. -
INFORMS: Quality Statistics and Reliability Section Panel Discussion
Authors: Russell Barton, Irad Ben-Gal, Yu Ding, and Jan Shi
Affiliation: Institute for Operations Research and the Management Sciences (INFORMS)
Primary area of focus / application:
Keywords: Quality, Reliability, Teaching
Submitted at 23-Apr-2008 16:07 by Irad Ben-Gal
Accepted