Services & Resources

Big Data Analytics and Data Mining Services

Small-scale analysis that takes a maximum of four (4) hours in total to complete.

Basic Data Analysis
Analysis of basic data sets using commonly utilized bioinformatics software tools.

Advanced Data Analysis
Analysis of more complex data sets, or more advanced analysis of basic data sets. Includes the use of highly specialized bioinformatics software tools.

Expert Data Analysis
Expert analysis of basic or complex data sets using highly specialized bioinformatics software tools.

High Dimensional and Integrative Data Analysis
Analysis of high-dimensional data sets that typically involve more than one data source. Includes the use of highly specialized bioinformatics software tools.

Bioinformatics Scripting
Small- to medium-scale scripting to help establish a pipeline or analytical method in a lab.

Bioinformatics Tool Development
Complete development of specialized bioinformatics tools (such as iBIS and AQWA).

Organizational Analysis Tool Development
Development of a new analysis tool that involves integration with, and interfaces that feed data into, available software.


The group offers expertise in general data mining, pattern discovery, machine learning, and other algorithmic development for data analysis. In particular, the following core technology areas are covered by the group:

  • Association rule mining. This refers to techniques for discovering frequently occurring combinations of attributes and then producing inferences based on the combinations discovered.
  • Data Classification. This refers to techniques for building classification models for labeled data. Typical techniques in this area are:
    • support vector machines,
    • K-nearest neighbors, and
    • Gaussian mixture models.
  • Data Clustering. This refers to techniques for organizing data into groups that share similar patterns. Standard approaches to clustering include self-organizing maps and k-means. More advanced techniques include:
    • concurrent clustering of data points and data attributes,
    • clustering with constraints, and
    • subspace clustering.
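
As an illustration of the association rule mining item in the list above, the following is a minimal sketch of the two steps it describes: finding frequently occurring combinations of attributes, then deriving inferences (rules) from them. It is not the group's own tooling. The function names, the attribute labels "A", "B", "C", and the support and confidence thresholds are all hypothetical and chosen only for the example.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Return {itemset: support} for attribute combinations of up to
    max_size items that occur in at least min_support of the records."""
    n = len(transactions)
    counts = Counter()
    for record in transactions:
        items = sorted(set(record))
        for k in range(1, max_size + 1):
            for combo in combinations(items, k):
                counts[frozenset(combo)] += 1
    return {s: c / n for s, c in counts.items() if c / n >= min_support}


def association_rules(itemsets, min_confidence):
    """Derive rules (antecedent -> consequent) from frequent itemsets.

    Confidence is support(itemset) / support(antecedent). Every subset
    of a frequent itemset is itself frequent, so the lookup is safe."""
    rules = []
    for itemset, support in itemsets.items():
        if len(itemset) < 2:
            continue
        for k in range(1, len(itemset)):
            for antecedent in combinations(sorted(itemset), k):
                a = frozenset(antecedent)
                confidence = support / itemsets[a]
                if confidence >= min_confidence:
                    rules.append((a, itemset - a, support, confidence))
    return rules


# Hypothetical records: each is the set of attributes observed together.
transactions = [
    {"A", "B", "C"},
    {"A", "B"},
    {"A", "C"},
    {"B", "C"},
    {"A", "B", "C"},
]

itemsets = frequent_itemsets(transactions, min_support=0.6)
found = association_rules(itemsets, min_confidence=0.7)

for antecedent, consequent, support, confidence in found:
    print(sorted(antecedent), "->", sorted(consequent),
          f"support={support:.2f}", f"confidence={confidence:.2f}")
```

This brute-force enumeration is only practical for small attribute sets. Algorithms such as Apriori or FP-growth prune the search and are the usual choice for larger data.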

Contact Us

For assistance with any Services and/or Resources available through the Data Mining Program, please email CCS-Data Mining.