Call for participation: AusDM 2015, Sydney, 8-9 August

*************************************************************
The 13th Australasian Data Mining Conference (AusDM 2015)
Sydney, Australia, 8–9 August 2015
URL: http://ausdm15.ausdm.org/
*************************************************************

The Australasian Data Mining Conference is devoted to the art and science of intelligent data mining: the meaningful analysis of (usually large) data sets to discover relationships and present the data in novel ways that are compact, comprehensible and useful for researchers and practitioners.

This conference will bring together the Data Mining and Business Analytics community researchers and practitioners to share and learn of research and progress in the local context and new breakthroughs in data mining algorithms and their applications.

Keynotes
========

Discovering Negative Links on Social Networking Sites
Prof Huan Liu, Arizona State University

Large Scale Metric Learning using Locality Sensitive Hashing
Prof Ramamohanarao Kotagiri, University of Melbourne

Big Data for Everyone
Prof Jian Pei, Simon Fraser University

Big Data Mining and Data Science
Prof Yong Shi, Chinese Academy of Sciences

Deep Broad Learning – Big Models for Big Data
Prof Geoff Webb, Monash University

Algorithm acceleration for high throughout biology
Prof Wei Wang, University of California, Los Angeles

Big Data Analytics in Business Environments
Prof Hui Xiong, State University of New Jersey

On Mining Heterogeneous Information Networks
Prof Phillip Yu, University of Illinois at Chicago

Resource Management in Cloud Computing Systems
Prof Albert Zomaya, University of Sydney

Big Data Algorithms and Clinical Applications
A/Prof Yixin Chen, Washington University

Defining Data Science
Prof Yangyong Zhu, Fudan University

Learning with Big Data by Incremental Optimization of Performance Measures
Prof Zhi-Hua Zhou, Nanjinf University

Accepted Papers
=================

Research Track:

FSMEC: A Feature Selection Method based on the Minimum Spanning Tree and Evolutionary Computation
Amer Abu Zaher, Regina Berretta, Ahmed Shamsul Arefin and Pablo Moscato

Mining Productive Emerging Patterns and Their Application in Trend Prediction
Vincent Mwintieru Nofong

Detection of Structural Changes in Data Streams
Ross Callister, Mihai Lazarescu and Duc-Son Pham

Multiple Imputation on Partitioned Datasets
Michael Furner and Md Zahidul Islam

Particle Swarm Optimisation for Feature Selection: A Size-Controlled Approach
Bing Xue and Mengjie Zhang

Complement Random Forest
Md Nasim Adnan and Zahid Islam

Aspect-Based Opinion Mining from Product Reviews Using Conditional Random Fields
Amani Samha, Yuefeng Li and Jinglan Zhang

On Ranking Nodes using kNN Graphs, Shortest-paths and GPUs
Ahmed Shamsul Arefin, Regina Berretta and Pablo Moscato

Link Prediction and Topological Feature Importance in Social Networks
Stephan Curiskis, Thomas Osborn and Paul Kennedy

AWST: A Novel Attribute Weight Selection Technique for Data Clustering
Md Anisur Rahman and Md Zahidul Islam

Genetic Programming Using Two Blocks To Extract Edge Features
Wenlong Fu, Mengjie Zhang and Mark Johnston

Designing a knowledge-based schema matching system for schema mapping
Sarawat Anam and Byeong Ho Kang

A Differentially Private Decision Forest
Sam Fletcher and Md Zahidul Islam

Industry Track:

Improving Bridge Deterioration Modelling Using Rainfall Data from the Bureau of Meteorology
Qing Huang, Kok-Leong Ong and Damminda Alahakoon

An Industrial Application of Rotation Forest: Transformer Health Diagnosis
Tamilalagan Natarajan, Duc-Son Pham and Mihai Lazarescu

Non-Invasive Attributes Significance in the Risk Evaluation of Heart Disease Using Decision Tree Analysis
Mai Shouman and Tim Turner

An Improved SMO Algorithm for Credit Risk Evaluation
Jue Wang, Aiguo Lu and Xuemei Jiang

Join us on LinkedIn:
http://www.linkedin.com/groups/AusDM-4907891

Posted in Big Data, Data Mining | Tagged , | Leave a comment

The 2015 Big Data Summit, 9-10 August 2015, collocated with ACM KDD 2015, Sydney

The 2015 Big Data Summit
9-10 August 2015
collocated with ACM KDD 2015, Sydney
URL: http://2015.bigdatasummit.co/

We take this privilege opportunity to invite you to
participate in the 2015 Big Data Summit:
• Co-located with ACM KDD2015
• Plenary sessions and keynote speeches by world
industrial and academic leaders
• Big data best practices and highlights in Australia
and New Zealand
• “Big Data in China” Forum
• “Data Science in India” Forum
• “Big Data in Asia” Panel

The theme of this year’s Big Data Summit is “Data to
Economy”.

Since the Summit’s inception in 2012, we have seen
increasing interest and investment within both
industry and government in data-led innovation and
industralisation to deeply explore big data universe,
invent data science, train data engineers and scientists,
and develop the data economy.

This year’s event aims to provide analytics professionals
and academia with a global and regional perspective to
outline the big data research, education and development
in the Asia Pacific region, showcase best practices,
explore thought-provoking insights, and demonstrate
solutions and lessons learned across industry,
government and academia.

Who Should Attend?
• Data modellers and business analysts
• Analytics professionals
• Business decision makers
• Policy executives
• Senior Government Representatives
• Academics (including research students)

What Are the Trends and Topics?
• The Data2Economy Agenda: Challenges, Trends and
Prospects
• Latest Scientific Development in Data and Analytics
Science
• The progress and future of big data in Australia and
New Zealand
• The progress and future of big data in China
• The progress and future of data science in India
• Future of Data Science and Analytics Science
• Data Economy and Industrial Transformation
• Competency, Policies and Processes
• Data Analytics Case Studies and Showcases

Why You Should Attend?
Started in 2012, the 2013 and 2014 Big Data Summit
(Sydney and Canberra) attracted over 250-300
participants from industry, government and academia.
This annual Australian Summit provides a premier and
unique forum for bridging the gaps between academia,
industry and government, and independent insights on the
advancement, best practices, trends and controversies
about data science, big data and data economy.

With very prestigious speakers across academic, industry
and government from China, India, Australia, and USA
and Europe, the 2015 Summit will cover a broad spectrum
of big data and analytics aspects and domains. The three
regional Forums organized by India, China and ANZ will
present first-hand view about progress and opportunities
in the Asia Pacific region. The “Big Data in Asia” Panel
will feature world leaders from both the Asia Pacific and
global communities, to draw a big picture of big data
innovation, services, education and economy.

Co-located with ACM KDD2015 in Hilton Sydney, BDS2015
attracts global interest, will mark a unique and high
quality opportunity for you and your organization to grasp
the cutting-edge and thought-leading progress, network
with peers and thought leaders, and most importantly,
dig out more insights and value from your big data and
lift your competency in the increasingly competitive and
challenging market and environment.

For More Information
For more details about the Summit, please visit the Website
(www.bigdatasummit.co).

Registration to BDS2015 will be free of charge, please Check
and Register via http://www.bigdatasummit.co.

For any other inquiries about the Summit, please feel free to
Contact us(BDS2015@bigdatasummit.co).

Posted in Big Data | Tagged | 1 Comment

R and Data Mining workshop at Deakin University

I will run a workshop on R and Data Mining for students in the Master of Business Analytics course at Deakin University in Melbourne on Thursday 28 May.

The workshop will cover:
– Introduction to Data Mining with R and Data Import/Export in R
– Data Exploration and Visualization with R
– Regression and Classification with R
– Data Clustering with R
– Association Rule Mining with R
– Text Mining with R — Twitter Data Analysis (updated for tm v0.6)

Workshop slides can be downloaded at
http://www.rdatamining.com/training

Posted in Data Mining, R | Tagged , | 2 Comments

AusDM 2015 submission deadline extended to Thursday 30 April

The 13th Australasian Data Mining Conference (AusDM 2015)
Sydney, Australia, 8–9 August 2015 (co-located with SIGKDD’15)
URL: http://ausdm15.ausdm.org/

The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. It is devoted to the art and science of intelligent analysis of (usually big) data sets for meaningful (and previously unknown) insights. This conference will enable the sharing and learning of research and progress in the local context and new breakthroughs in data mining algorithms and their applications across all industries.

Since AusDM’02 the conference has showcased research in data mining, providing a forum for presenting and discussing the latest research and developments. Since 2006, all proceedings have been printed as volumes in the CRPIT series. Built on this tradition, AusDM’15 will facilitate the cross-disciplinary exchange of ideas, experience and potential research directions. Specifically, the conference seeks to showcase: Research Prototypes; Industry Case Studies; Practical Analytics Technology; and Research Student Projects. AusDM’15 will be a meeting place for pushing forward the frontiers of data mining in academia and industry. AusDM’15 is proud to be co-located with SIGKDD 2015 in Sydney, Australia.

Publication and topics
We are calling for papers, both research and applications, and from both academia and industry, for presentation at the conference.All papers will go through double–blind, peer–review by a panel of international experts. Accepted papers will be published in an up–coming volume (Data Mining and Analytics 2015) of the Conferences in Research and Practice in Information Technology (CRPIT) series by the Australian Computer Society which is also held in full–text on the ACM Digital Library and will also be distributed at the conference. For more details on CRPIT please see http://www.crpit.com. Please note that we require that at least one author for each accepted paper will register for the conference and present their work.

AusDM invites contributions addressing current research in data mining and knowledge discovery as well as experiences, novel applications and future challenges. Topics of interest include, but are not restricted to:

• Applications and Case Studies — Lessons and Experiences
• Big Data Analytics
• Biomedical and Health Data Mining
• Business Analytics
• Computational Aspects of Data Mining
• Data Integration, Matching and Linkage
• Data Mining Education
• Data Mining in Security and Surveillance
• Data Preparation, Cleaning and Preprocessing
• Data Stream Mining
• Evaluation of Results and their Communication
• Implementations of Data Mining in Industry
• Integrating Domain Knowledge
• Link, Tree, Graph, Network and Process Mining
• Multimedia Data Mining
• New Data Mining Algorithms
• Professional Challenges in Data Mining
• Privacy-preserving Data Mining
• Spatial and Temporal Data Mining
• Text Mining
• Visual Analytics
• Web and Social Network Mining

Keynote speakers
As is tradition for AusDM we have lined up an excellent keynote speaker program. Each speaker is a well known research and/or practitioner in data mining and related disciplines. The keynote program provides an opportunity to hear from some of the world’s leaders on what the technology offers and where it is heading.

Submission of papers
We invite two types of submissions for AusDM 2015:
• Academic submissions: Normal academic submissions reporting on research progress, with a paper length of between 8 and 12 pages in CRPIT style, as detailed below. Academic submissions we will use a double-blinded review process, i.e. paper submissions must NOT include authors names or affiliations (and also not acknowledgements referring to funding bodies). Self-citing references should also be removed from the submitted papers (they can be added on after the review) for the double blind reviewing purpose.
• Industry submissions: Submissions from governments and industry can report on specific data mining implementations and experiences. Submissions in this category can be between 4 and 8 pages in CRPIT style, as detailed below. These submissions do not need to be double-blinded. A special committee made of industry representatives will assess industry submissions.

Paper submissions are required to follow the general format specified for papers in the CRPIT series by the Australian Computer Society. Submission details are available from http://crpit.com/AuthorsSubmitting.html. LaTeX styles and Word templates may be found on this site. LaTeX is the recommended typesetting package. The electronic submissions must be in PDF only, and made through the AusDM’15 Submission Page. Note that, to be co-located with SIGKDD, this year’s submission deadline is earlier than usual.

Important Dates
Submission of full papers: Thursday 30 April 2015 (midnight PST)
Notification of authors: Sunday June 7 2015
Final version and author registration: Sunday June 28 2015
Conference: 8–9 August 2015

Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891

Posted in Data Mining | Tagged | 1 Comment

Seminar of IAPA Canberra Chapter: Big Data in the Social Sciences, Wednesday 29 April

Date: Wednesday 29 April
Time: 5.45pm for a 6pm start
Cost: Nil
Where: SAS Offices, 12 Moore Street, Canberra, ACT 2600
RSVP link: http://www.iapa.org.au/Event/ACTChapterEvent29AprilBigDataInSocialSciences

Join us for our next ACT Chapter Event on the 29th of April. We have guest speaker Robert Ackland presenting on Big Data in the Social Sciences. Robert will share on the challenges, give insight into new opportunities and discuss some of the tools that are available for social science big data research.

More about the event: Social scientists are increasingly using large-scale datasets from the Web (e.g. Twitter, WWW hyperlinks, Facebook etc.) to seek answers to long-standing questions about social, economic and political behaviour. An example was how social media data was used to study social inequality, diurnal and seasonal mood changes and the spread of protest during the Arab Spring. This presentation aims to highlight the methodological challenges and opportunities of big data (in particular, social media data) in empirical social science research.

About Robert Ackland

Robert Ackland is an Associate Professor in the Australian Demographic and Social Research Institute at the Australian National University and leads the Virtual Observatory for the Study of Online Networks (http://voson.anu.edu.au). His PhD was in economics, focusing on index number theory in the context of cross-country comparisons of income and inequality. Robert has been studying online social and organisational networks since 2002 and his research has been funded by five Australian Research Council grants. His research has appeared in journals such as the Review of Economics and Statistics, Social Networks, Computational Economics, Social Science Computer Review, and the Journal of Social Structure. VOSON was established in 2005, and aims to advance the social science of the Internet by conducting research, developing research tools, and providing research training. The VOSON software for hyperlink network construction and analysis has been publicly available since 2006 and has been used by around 2000 researchers worldwide. Robert established the Social Science of the Internet specialisation in the ANU’s Master of Social Research in 2008, and his book Web Social Science: Concepts, Data and Tools for Social Scientists in the Digital Age (SAGE) was published in July 2013. He is CEO of Uberlink Corp (http://www.uberlink.com) which he established to commercialise the VOSON software.

Posted in Big Data, Data Mining | Tagged , | 1 Comment

Hadoop and Neo4j

Hadoop is being widely used for processing big data and Neo4j is a popular open-source graph database. When doing social network analysis on big data, a “natural” thought is to use them together. Unfortunately, Neo4j cannot work directly on HDFS or HBase. Is it good to use them together for social network analysis of big data? If yes, any pros/cons and how to do it efficiently? Or shall we try other options, such as Hadoop + Giraph, or Spark + GraphX? Please share your ideas, and all suggestions or experiences will be appreciated. Thanks.

Anyway, to know more about how Neo4j and Hadoop can work together, I came across two presentations below, which might be interested to those who are doing social network analysis of big data.

Serious network analysis using Hadoop and Neo4j
http://neo4j.com/news/serious-network-analysis-using-hadoop-and-neo4j/

I Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop
http://2013.berlinbuzzwords.de/sessions/i-mapreduced-neo-store-creating-large-neo4j-databases-hadoop

Posted in Big Data, Data Mining | Tagged , | 1 Comment

CFP: 13th Australasian Data Mining Conference (AusDM 2015)

The 13th Australasian Data Mining Conference (AusDM 2015)
Sydney, Australia, 8-9 August 2015
co-located with SIGKDD’15
URL: http://ausdm15.ausdm.org/
Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891

The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. It is devoted to the art and science of intelligent analysis of (usually big) data sets for meaningful (and previously unknown) insights. This conference will enable the sharing and learning of research and progress in the local context and new breakthroughs in data mining algorithms and their applications across all industries.

Publication and topics
We are calling for papers, both research and applications, and from both academia and industry, for presentation at the conference. All papers will go through double-blind, peer-review by a panel of international experts. Accepted papers will be published in an up-coming volume (Data Mining and Analytics 2015) of the Conferences in Research and Practice in Information Technology (CRPIT) series by the Australian Computer Society which is also held in full-text on the ACM Digital Library and will also be distributed at the conference. For more details on CRPIT please see http://www.crpit.com. Please note that we require that at least one author for each accepted paper will register for the conference and present their work. AusDM invites contributions addressing current research in data mining and knowledge discovery as well as experiences, novel applications and future challenges.

Topics of interest include, but are not restricted to:
– Applications and Case Studies – Lessons and Experiences
– Big Data Analytics
– Biomedical and Health Data Mining
– Business Analytics
– Computational Aspects of Data Mining
– Data Integration, Matching and Linkage
– Data Mining Education
– Data Mining in Security and Surveillance
– Data Preparation, Cleaning and Preprocessing
– Data Stream Mining
– Evaluation of Results and their Communication
– Implementations of Data Mining in Industry
– Integrating Domain Knowledge
– Link, Tree, Graph, Network and Process Mining
– Multimedia Data Mining
– New Data Mining Algorithms
– Professional Challenges in Data Mining
– Privacy-preserving Data Mining
– Spatial and Temporal Data Mining
– Text Mining
– Visual Analytics
– Web and Social Network Mining

Submission of papers
We invite two types of submissions for AusDM 2015:

Academic submissions: Normal academic submissions reporting on research progress, with a paper length of between 8 and 12 pages in CRPIT style, as detailed below. Academic submissions we will use a double-blinded review process, i.e. paper submissions must NOT include authors names or affiliations (and also not acknowledgements referring to funding bodies). Self-citing references should also be removed from the submitted papers (they can be added on after the review) for the double blind reviewing purpose.

Industry submissions: Submissions from governments and industry can report on specific data mining implementations and experiences. Submissions in this category can be between 4 and 8 pages in CRPIT style, as detailed below. These submissions do not need to be double-blinded. A special committee made of industry representatives will assess industry submissions.

Paper submissions are required to follow the general format specified for papers in the CRPIT series by the Australian Computer Society. Submission details are available from http://crpit.com/AuthorsSubmitting.html.

Important Dates
Submission of full papers: Monday 20 April 2015 (midnight PST)
Notification of authors: Sunday June 7 2015
Final version and author registration: Sunday June 28 2015
Conference: 8-9 August 2015

Posted in Data Mining | Tagged | 1 Comment