CIDM 2009 Program

Session CIDM-1: CI/Probabilistic/Statistical and other methods I

Monday, March 30, 8:30AM-10:30AM, Room: Hermitage C, Chair: Barry Chen, Lawrence Livermore National Laboratory, USA

8:30AM   Building Ultra-Low False Alarm Rate Support Vector Classifier Ensembles Using Random Subspaces [#9004]
Barry Chen, Tracy Lemmond and William Hanley
Lawrence Livermore National Laboratory, United States
8:54AM   Collaborative Filtering with Fine-grained Trust Metric [#9017]
Su Chen, Tiejian Luo, Wei Liu and Yanxiang Xu
Graduate University of Chinese Academy of Sciences, China
9:18AM   HybridSOM: A Generic Rule Extraction Framework for Self-Organizing Feature Maps [#9021]
Willem S. van Heerden and Andries P. Engelbrecht
University of Pretoria, South Africa
9:42AM   Assessing the Influence Probability between Objects: A Random Walker Approach [#9022]
Pei Li, Zhixu Li, Hongyan Liu, Jun He and Xiaoyong Du
Renmin University of China, China; Tsinghua University, China

Session CIDM-2: CI/Probabilistic/Statistical and other methods II

Monday, March 30, 11:00AM-1:00PM, Room: Hermitage C, Chair: Oliver Schulte, Simon Fraser University, Canada

11:00AM   Intelligent Feature Extraction and Knowledge Mining by Multivariate Analyses [#9023]
Yisong Chen and Hong Cui
Key Laboratory of Machine Perception (Ministry of Education), Peking University, China; Unis Nast Developing Co. Ltd, Tsinghua Unisplendour Group, China
11:24AM   Efficient Model Selection for Support Vector Machine with Gaussian Kernel Function [#9027]
Tang Yaohua, Guo Weimin and Gao Jinghuai
Henan Electric Power Research Institute, China; School of Electronic and Information Engineering, Xi'an Jiaotong University, China
11:48AM   A Fault Tolerant Peer-to-Peer Distributed EM Algorithm [#9030]
Behrooz Safarinejadian, Mohammad Bagher Menhaj and Mehdi Karrari
Amirkabir University of Technology, Iran
12:12PM   A New Hybrid Method for Bayesian Network Learning [#9036]
Oliver Schulte, Gustavo Frigo, Russell Greiner and Hassan Khosravi
School of Computing Science, Simon Fraser University, Canada; Department of Computing Science, University of Alberta, Canada
12:36PM   A Pillar Algorithm for K-Means Optimization by Distance Maximization for Initial Centroid Designation [#9038]
Ali Ridho Barakbah and Yasushi Kiyoki
Graduate School of Media and Governance, Keio University, Japan; Faculty of Environmental Information, Keio University, Japan

Session CIDM-3: CI/Probabilistic/Statistical and other methods III

Monday, March 30, 2:00PM-4:00PM, Room: Hermitage C, Chair: Giuseppe Di Fatta, University of Reading, UK

2:00PM   An Adaptive Ensemble Classifier for Concept Drifting Stream [#9043]
Dengyuan Wu, Ying Liu, Ge Gao, Zhendong Mao, Weishan Ma and Tao He
Institute of Computing Technology, Chinese Academy of Sciences, China; Fictitious Economy and Data Science Research Center, Graduate University of Chinese Academy of Sciences, China; University of Virginia, United States
2:24PM   Missing Traffic Flow Data Prediction using Least Squares Support Vector Machines in Urban Arterial Streets [#9045]
Yang Zhang and Yuncai Liu
Shanghai Jiao Tong University, China
2:48PM   Dynamic classifier selection using clustering for spam detection [#9053]
Mehrnoush Famil Saeedian and Hamid Beigy
Sharif University of Technology, Iran
3:12PM   Skill Rating by Bayesian Inference [#9060]
Giuseppe Di Fatta, Guy Haworth and Kenneth Regan
The University of Reading, United Kingdom; University at Buffalo, The State University of New York, United States
3:36PM   Clustering-based Activity Classification with a Wrist-worn Accelerometer Using Basic Features [#9066]
Pekka Siirtola, Perttu Laurinen, Eija Haapalainen, Juha Roning and Hannu Kinnunen
University of Oulu, Finland; Polar Electro Oy, Finland

Session CIDM-4: Data Understanding, rule extraction, logical models I

Monday, March 30, 4:30PM-6:30PM, Room: Hermitage C, Chair: Andrei Olaru, University Politehnica of Bucharest, Romania

4:30PM   Decision Trees as Information Source for Attribute Selection [#9013]
Kyoko Fukuda and Brent Martin
University of Canterbury, New Zealand
4:54PM   An Enhanced Data Mining Life Cycle [#9018]
Markus Hofmann and Brendan Tierney
Institute of Technology Blanchardstown, Ireland; Dublin Institute of Technology, Ireland
5:18PM   Local Mining of Association Rules with Rule Schemas [#9047]
Andrei Olaru, Claudia Marinica and Fabrice Guillet
Department of Computer Science, University Politehnica of Bucharest, Romania; LINA - Ecole polytechnique de l'Universite de Nantes, France
5:42PM   Extracting Hot Spots of Basic and Complex Topics From Time Stamped Documents [#9089]
Wei Chen and Parvathi Chundi
University of Nebraska at Omaha, United States

Session CIDM-5: Applications to biomedicine, e-commerce, engineering, etc I

Tuesday, March 31, 8:30AM-10:30AM, Room: Hermitage C, Chair: Anna L Buczak, JHU APL, USA

8:30AM   Hierarchically Classifying Documents with Multiple Labels [#9007]
Andrew Mayne and Russell Perry
Hewlett Packard Labs, United Kingdom
8:54AM   Determining the Strength of the Propensities of a Blog Network [#9024]
Seok-Ho Yoon, Sang-Wook Kim and Sunju Park
Hanyang University, Korea (South); Yonsei University, Korea (South)
9:18AM   Mining Electronic Medical Records for Patient Care Patterns [#9076]
Anna Buczak, Linda Moniz, Brian Feighner and Joseph Lombardo
JHU APL, United States
9:42AM   Gender Identification from E-mails [#9081]
Na Cheng, Xiaoling Chen, Rajarathnam Chandramouli and K.P. Subbalakshmi
Stevens Institute of Technology, United States

Session CIDM-6: Applications to biomedicine, e-commerce, engineering, etc II

Tuesday, March 31, 11:00AM-1:00PM, Room: Hermitage C, Chair: Sangoh Jeong, Samsung Information Systems America, USA

11:00AM   Generalized Random Rotation Perturbation for Vertically Partitioned Data Sets [#9015]
Zhenmin Lin, Jie Wang, Lian Liu and Jun Zhang
University of Kentucky, United States
11:24AM   Mining for Insider Threats in Business Transactions and Processes [#9046]
William Eberle and Lawrence Holder
Tennessee Technological University, United States; Washington State University, United States
11:48AM   Density-Based Clustering of Polygons [#9086]
Deepti Joshi, Ashok Samal and Leen-Kiat Soh
University of Nebraska-Lincoln, United States
12:12PM   Non-collaborative Interest mining for Personal Devices [#9091]
Sangoh Jeong, Doreen Cheng, Henry Song and Swaroop Kalasapur
Samsung Information Systems America, United States

Session CIDM-7: Applications to biomedicine, e-commerce, engineering, etc III

Tuesday, March 31, 2:00PM-4:00PM, Room: Hermitage C, Chair: Alessandro Sperduti, University of Padova, Italy

2:00PM   An Empirical Study of Bagging and Boosting Ensembles for Identifying Faulty Classes in Object-Oriented Software [#9059]
Hamoud Aljamaan and Mahmoud Elish
King Fahd University of Petroleum and Minerals, Saudi Arabia
2:24PM   Automatic Analysis of Eye Tracking Data for Medical Diagnosis [#9061]
Filippo Galgani, Yiwen Sun, Pier Luca Lanzi and Jason Leigh
Politecnico di Milano, Italy; University of Illinois at Chicago, United States
2:48PM   Application of the Preference Learning Model to a Human Resources Selection Task [#9062]
Fabio Aiolli, Michele De Filippo and Alessandro Sperduti
University of Padova, Italy
3:12PM   Practical Fuzzy Decision Trees [#9069]
Na'el Abu-Halaweh and Robert Harrison
Georgia State University, United States

Session CIDM-8: Data Understanding, rule extraction, logical models II

Tuesday, March 31, 4:30PM-6:30PM, Room: Hermitage C, Chair: Janusz Kacprzyk, Polish Academy of Sciences, Poland

4:30PM   Data Mining via Protoform Based Linguistic Summaries: Some Possible Relations to Natural Language Generation [#9064]
Janusz Kacprzyk and Slawomir Zadrozny
Systems Research Institute, Polish Academy of Sciences, ul. Newelska 6, 01-447 Warsaw, Poland
4:54PM   Handling Continuous Attributes in Ant Colony Classification Algorithms [#9072]
Fernando Otero, Alex Freitas and Colin Johnson
University of Kent, United Kingdom
5:18PM   A Novel Data Clustering Algorithm Based on Electrostatic Field Concepts [#9077]
Masoumeh Kalantari Khandani, Parvaneh Saeedi, Yaser P. Fallah and Mehdi K. Khandani
Simon Fraser University, Canada; University of California Berkeley, United States; University of Maryland, College Park, United States
5:42PM   Evolving Decision Trees Using Oracle Guides [#9079]
Ulf Johansson and Lars Niklasson
University of Boras, Sweden; University of Skovde, Sweden
6:06PM   Ensemble Member Selection Using Multi-Objective Optimization [#9082]
Tuve Lofstrom, Ulf Johansson and Henrik Bostrom
University of Boras, Sweden; University of Skovde, Sweden

Session CIDM-9: Text, graph and web mining

Wednesday, April 1, 8:30AM-10:30AM, Room: Hermitage C, Chair: Vladimir Bartik, Brno University of Technology, Czech Republic

8:30AM   Association Based Classification for Relational Data and Its Use in Web Mining [#9063]
Vladimir Bartik
Brno University of Technology, Czech Republic
8:54AM   Empirical Comparison of Graph Classification Algorithms [#9071]
Nikhil Ketkar, Lawrence Holder and Diane Cook
Washington State University, United States
9:18AM   Faster Computation of the Direct Product Kernel for Graph Classification [#9084]
Nikhil Ketkar, Lawrence Holder and Diane Cook
Washington State University, United States
9:42AM   Dependable Performance Analysis for Fuzzy Clustering of Web Usage Data [#9090]
Amir Ketata, Sudhir Mudur and Nematollaah Shiri
Department of Computer Science and Software Engineering, Concordia University, Montreal, Quebec, Canada
10:06AM   Discovering Relational Knowledge from Two Disjoint Sets of Literatures Using Inductive Logic Programming [#9095]
Supphachai Thaicharoen, Tom Altman, Katheleen Gardiner and Krzysztof Cios
University of Colorado Denver, United States; Virginia Commonwealth University, United States

Session CIDM-10: Mining spatial and spatial-temporal data

Wednesday, April 1, 11:00AM-1:00PM, Room: Hermitage C, Chair: Rachsuda Jiamthapthaksin, University of Houston, USA

11:00AM   LD-BSCA: A Local-Density Based Spatial Clustering Algorithm [#9031]
Wei Guiyi and Liu Haiping
College of Computer Science and Information Engineering, Zhejiang Gongshang University, China
11:24AM   Trajectory Clustering in Road Network Environment [#9050]
Jung-Im Won, Sang-Wook Kim, Ji-Haeng Baek and Junghoon Lee
Hanyang University, Korea (South); Jeju National University, Korea (South)
11:48AM   An Architecture and Algorithms for Multi-Run Clustering [#9051]
Rachsuda Jiamthapthaksin, Christoph Eick and Vadeerat Rinsurongkawong
Computer Science Department, University of Houston, United States
12:12PM   MWASP: Multiple-Width Approximate Sequential Patterns [#9068]
Kelly Kingchi Yip and David Nembhard
Department of Industrial and Manufacturing Engineering, Pennsylvania State University, United States
12:36PM   Fuzzy P-Mode Prototypes: A Generalization of Frequency-Based Cluster Prototypes for Clustering Categorical Objects [#9094]
Mahnhoon Lee
Thompson Rivers University, Canada

Session CIDM-11: Mining of very large datasets, scalability

Wednesday, April 1, 2:00PM-4:00PM, Room: Hermitage C, Chair: P. Krishna Reddy, International Institute of Information Technology - Hyderabad, India

2:00PM   Diversity Analysis on Imbalanced Data Sets by Using Ensemble Models [#9014]
Shuo Wang and Xin Yao
School of Computer Science, University of Birmingham, United Kingdom
2:24PM   Large-scale Attribute Selection using Wrappers [#9025]
Martin Guetlein, Eibe Frank, Mark Hall and Andreas Karwath
Albert-Ludwigs-Universitaet Freiburg, Germany; University of Waikato, New Zealand; Pentaho Corporation, United States
2:48PM   An Improved Multiple Minimum Support Based Approach to Mine Rare Association Rules [#9074]
Uday Kiran Rage and Krishna Reddy Polepally
International Institute of Information Technology - Hyderabad, India
3:12PM   Data Mining with Ensembles of Fuzzy Decision Trees [#9075]
Christophe Marsala
University Pierre et Marie Curie - Paris 6, France
3:36PM   The Locality of RBF-SVM for Incremental Learning [#9087]
Wael Emara and Mehmed Kantardzic
University of Louisville, United States

Session CIDM-12: Feature extraction, selection, aggregation, construction

Wednesday, April 1, 4:30PM-6:30PM, Room: Hermitage C, Chair: Christophe Marsala, Universite Pierre et Marie Curie, France

4:30PM   Parametric Subspace Analysis for dimensionality reduction and classification [#9033]
Nhat Vo, Duc Vo, Bill Moran and Subhash Challa
Melbourne University, Australia; University of Technology, Sydney, Australia
4:54PM   Incremental Wrapper-based Subset Selection with Replacement: an advantageous alternative to sequential forward selection [#9052]
Pablo Bermejo, Jose Antonio Gamez and Jose Miguel Puerta
Universidad de Castilla-La Mancha, Spain
5:18PM   Analysis and Visualization of the Geographical Distribution of Atlantic Forest Bromeliads Species [#9073]
Stainam Brandao, Wagner Silva, Luis Silva, Vladimir Fagundes and Carlos Mello
COPPE/UFRJ - Computer Science Department, Graduate School of Engineering, Federal University of Rio de Janeiro, Brazil
5:42PM   Maintaining Only Frequent Itemsets to Mine Approximate Frequent Itemsets over Online Data Streams [#9085]
Yongyan Wang, Kun Li and Hongan Wang
Institute of Software, Chinese Academy of Sciences, China

Session CIDM-13: Mining of signals and data stream

Thursday, April 2, 8:30AM-10:30AM, Room: Hermitage C, Chair: Xin Feng, Marquette University, USA

8:30AM   Extended Extreme Learning Machine [#9020]
Wanyu Deng
Xi'an Jiaotong University, China
8:54AM   Relevance Weighting of Multi-Term Queries for Vector Space Model [#9048]
Louis Wang
International School of Minnesota, United States
9:18AM   An approximation algorithm for finding skeletal points for density based clustering approaches [#9093]
Soheil Hassas Yeganeh, Jafar Habibi, Hassan Abolhassani, Mahdi Abbaspour Tehrani and Jamshid Esmaelnezhad
Mr., Iran; Dr., Iran
9:42AM   Detecting Multiple Temporal Patterns And Predictability Analysis of Complex Time-Evolving Systems [#9099]
Xin Feng and Odilon Senyana
Marquette University, United States

Tutorial CIDM-T: Temporal pattern mining in symbolic time point and time interval data

Thursday, April 2, 11:00AM-1:00PM, Room: Hermitage C, Instructor: Fabian Moerchen, Siemens Corporate Research, USA

IEEE SSCI 2009     March 30 – April 2, 2009     Sheraton Music City Hotel, Nashville, TN, USA