精东传媒app

UniSQ Logo
The current and official versions of the course specifications are available on the web at .
Please consult the web for updates that may occur during the year.

CSC8002 Big Data Management

Semester 2, 2022 Toowoomba On-campus
Units : 1
Faculty or Section : Faculty of Health, Engineering and Sciences
School or Department : School of Mathematics, Physics & Computing
Grading basis : Graded
Course fee schedule : /current-students/administration/fees/fee-schedules

Staffing

Examiner:

Requisites

Pre-requisite or Co-requisite: (CSC1401 or CSC5020) and (STA2300 or STA1003 or STA8170) or equivalent program and statistical knowledge and skills or students are enrolled in MCYS.

Overview

Businesses and scientists are collecting more data than ever before. This course is designed to equip the students with computers skills and concepts required to ensure that the data collected is secure, stored and retrieved efficiently, and presented in a useful form. It is also designed to ensure that graduates have the qualitative and information technology skills needed to handle the challenge of data volume, velocity, and variety.

This course is intended for students experienced in statistical analysis, experimental design, and basic systems design, and focuses on the coordination, management and utilization of data using modern computer data base management systems. This course, in emphasizing the reliable, scalable, distributed and efficient handling of data of any size, develops the pragmatics of managing data, alongside with retrieval and analysis of information.

Course learning outcomes

On successful completion of this course students should be able to:

  1. demonstrate advanced and integrated understanding of data modelling, storage, and retrieval methods and apply knowledge and skills to retrieve information from data storage
  2. apply knowledge and skills to design and complete a project to coordinate and manage large data sets
  3. analyse critically and interpret the knowledge from large data sets
  4. interpret and transmit information and knowledge in the application discipline to specialist and non-specialist audiences
  5. analyse critically and reflect on the issues of privacy and ethics of Big Data.

Topics

Description Weighting(%)
1. Introduction to Big Data Management 10.00
2. Programming for Big Data 20.00
3. Modern methods of distributed processing of large data sets (such as Hadoop and MapReduce) 25.00
4. Modern distributed database for large tables 25.00
5. Manage, store and retrieve processed data in a variety of common formats 10.00
6. Privacy, ethics and professionalism 10.00

Text and materials required to be purchased or accessed

Watson, R T 2013-2015, Data management, databases and organizations, 6th edn, eGreens Press, Athens, Georgia.
(Please follow the links on the textbook author's web site . This book is also available in print.)
Watson, RT 2016, Data Management, Databases and Organizations, 6th edn, Prospect Press.

Student workload expectations

To do well in this subject, students are expected to commit approximately 10 hours per week including class contact hours, independent study, and all assessment tasks. If you are undertaking additional activities, which may include placements and residential schools, the weekly workload hours may vary.

Assessment details

Approach Type Description Group
Assessment
Weighting (%) Course learning outcomes
Assignments Practical Tech and/or scntific artefact 1 No 25 1
Assignments Practical Tech and/or scntific artefact 2 No 25 1,2,3,5
Assignments Practical Tech and/or scntific artefact 3 No 50 1,2,3,4,5
Date printed 10 February 2023