My main goal in teaching is to bridge the gap between the curricula of the two related fields of (big) data systems and data analytics and retrieval. In order to address the convergence in the application domains of multimedia and data and information management disciplines, I see an impending need for undergraduate and graduate curricula that will not only educate students separately in each individual domain, but will also provide a common perspective. Most importantly, I believe the curricula must introduce students to the concept of heterogeneity in data, data sources, and systems, as the convergence of different application domains generally means an increase in the heterogeneity of their underlying, relatively less-complex, domains.


  • --

    Design and Development Team, PhD in Data Science, Analysis, and Engineering (DSAE)

    This PhD program is designed to train scientists and engineers in development of new systems and algorithms for collecting, cleaning, storing, valuing, aggregating, fusing, summarizing, managing and drawing inferences from high dimension, high volume, heterogeneous data streams for knowledge discovery. The program’s educational objective is to develop each student’s ability to perform original research in the development and execution of data-driven methods for solving major societal problems. This includes the ability to identify research needs, adapt existing methods and create new methods as needed for data analytics and engineering.

  • Present2015

    Coordinator, Big Data Systems Concentration (MS and MCS)

    This  Computer Science MS/MCS program is designed for graduate students who want to pursue a thorough education and research in the area of big data systems. The goal of this concentration is to provide students the knowledge, skills and the advanced research expertise in designing scalable (parallel, distributed, and real-time) systems for acquiring, storing, processing, and accessing large-scale heterogeneous multi-source data and in using analytical tools to mine information from the data. The graduates will be able to pick and deploy the appropriate data management, processing, and analysis systems  with the suitable structured or unstructured data model for the particular task and domain application needs. The concentration will meet the growing need for data scientists and engineers that can architect, implement, and manage large data systems and, thus, students will have a competitive advantage to secure employment.

  • --

    CSE510 Database Management System Implementation

    The purpose of this course is to study established techniques for implementing database management systems through a semester-long project and reading materials covering the classic as well as cutting-edge literature in the area of database systems. The course not only educates students in the internals of database management systems, but it also highlights why and how these systems may be modified in the near future to accommodate the needs of new applications.

  • --

    CSE515 Multimedia and Web Databases

    The goal if this course is to teach the state of the art in nontraditional database applications, including multimedia and Web, and prepare students for research in database technologies that address the needs of these applications. This course will cover techniques and challenges associate with storage and retrieval of multimedia and Web data and query processing and optimization techniques for inexact data retrieval.

  • Present2018

    Introduction to Data Exploration and Visualization - Coursera

    This Coursera course answers the questions, What is data visualization and What is the power of visualization? It also introduces core concepts such as dataset elements, data warehouses and exploratory querying, and combinations of visual variables for graphic usefulness, as well as the types of statistical graphs, —tools that are essential to exploratory data analysis.

  • Present2018

    Temporal and Hierarchical Data Analysis - Coursera

    This Coursera course covers the representation schemes of hierarchies and algorithms that enable analysis of hierarchical data, as well as provides opportunities to apply several methods of analysis.