Data quality issues trace back their origin to the early days of computing. A wide range of domain-specific techniques to assess and improve data quality exist in the literature. However, these solutions primarily target data which resides in relational databases and data warehouses.
The emergence of big data and renaissance in machine learning challenges the suitability of database-centric approaches to data quality assessment. Minor data errors can cause major damage in Big Data applications. Damages manifest in various forms including loss of revenue, operational inefficiency, and regulatory compliance failure. Moreover, these errors cascade through downstream applications and exacerbate damages.
Machine Learning (ML)/ Artificial Intelligence (AI) brings to fore unique data quality issues. In addition to conventional data quality dimensions -- missing data, incomplete data, inconsistent data, inaccurate data, duplicate and dated data -- AI ushers in issues such as outliers, data imbalance, correlations between features, and curse of dimensionality. Gender bias in AI data is another big concern.
The goal of this workshop is to bring together data quality researchers and industry practitioners to share their ideas and best practices, identify and define important problems to further the field.
The workshop Call for Papers webpage is at:
The 2017 IEEE International Conference on Big Data webpage is at:
Dr. Sergiy Vilkomir has been invited to be a member of the program committee of the following conferences: The 28th International Conference on Software Engineering and Knowledge Engineering (SEKE 2016), July 1 - July 3, 2016, Redwood City, CA, USA. http://ksiresearchorg.ipage.com/seke/seke16.html The 11th International Conference on Software Engineering and Applications (ICSOFT-EA 2016), 24 - 26 July, 2016, Lisbon, Portugal. http://www.icsoft-ea.org/
Dr. Mark Hills presented a paper titled "Variable Feature Usage Patterns in PHP" at the 30th IEEE/ACM International Conference on Automated Software Engineering (ASE 2015), 9 - 13 November 2015 Lincoln, Nebraska, USA.
Seth Butler and John Dixon, both Computer Science seniors, were awarded an all expenses paid scholarship to attend the ACM SIGPLAN conference on Systems, Programming, Languages and Applications: Software for Humanity (SPLASH) conference (http://2015.splashcon.org/) and Programming Languages Mentoring Workshop, in Pittsburgh, PA, 27 - 30 October, 2015.
Dr. Mark Hills will present an extended abstract titled "Supporting PHP Dynamic Analysis in PHP AiR" at the Workshop on Dynamic Analysis at the Systems, Programming, Languages and Applications: Software for Humanity (SPLASH) conference, 30 October 2015, Pittsburgh, Pennsylvania, USA.
Ms. Galen Pennell, a Computer Science senior, presented her research at Sigma Xi Student Research Conference held in Kansas City, Missouri, October 23-24, 2015 (https://www.sigmaxi.org/meetings-events/student-research-conference). Her presentation is titled "Empirical Investigation of Pair-wise Testing for Mixed Logical Expressions." This research was performed under the guidance of Dr. S. Vilkomir. Ms. Pennell started this research in Summer 2015 under ECU Computer Science REU program funded by the National Science Foundation, with Dr. Junhua Ding as the Principal Investigator. She continued her research in Fall 2015 semester as a member of Software Testing Research Group (STRG - http://core.ecu.edu/STRG/).
Dr. Gudivada coauthored a research paper titled "Understanding Search-Engine Optimization." IEEE Computer, Issue 10, Vol. 48, pp. 67 - 76, October 2015.
Dr. Gudivada coauthored a chapter titled "Big Data Driven Natural Language Processing Research and Applications" for a research monograph. The monograph is titled "Big Data Analytics," edited by V. Govindaraju and V. Raghavan and C. R. Rao, and published by Elsevier. pp. 203 - 238, July 2015.
Shawn Braddy, a graduate student in Software Engineering did an internship at Boeing this past summer. He was selected for the Boeing IT Intern Profiles on Boeing's website. Here is the link: http://www.boeing.com/careers/college/it-internships-profiles.page#/profile-03
The paper "Effectiveness of Multi-Device Testing Mobile Applications" by S. Vilkomir, K. Marszalkowski, C. Perry, and S. Mahendrakar was published in the Proceedings of the 2nd ACM International Conference on Mobile Software Engineering and Systems (MobileSoft 2015), May 16-17, 2015, Florence, Italy, pp. 44-47, in conjunction with the 37th International Conference on Software Engineering (ICSE’15). The paper has been presented by Dr. Vilkomir.
The paper "Relationship between pair-wise and MC/DC testing: Initial experimental results" by S. Vilkomir and D. Anderson was published in the Proceedings of the IEEE 8th International Conference on Software Testing, Verification and Validation Workshops (ICSTW 2015), 13-17 April 2015, Graz, Austria.