GEO8026 : Data analysis for Geoscience
GEO8026 : Data analysis for Geoscience
- Offered for Year: 2024/25
- Module Leader(s): Dr Matthew Perks
- Co-Module Leader: Prof. Steve Juggins
- Owning School: Geography, Politics & Sociology
- Teaching Location: Newcastle City Campus
Semesters
Your programme is made up of credits, the total differs on programme to programme.
Semester 1 Credit Value: | 20 |
ECTS Credits: | 10.0 |
European Credit Transfer System | |
Pre-requisite
Modules you must have done previously to study this module
Pre Requisite Comment
N/A
Co-Requisite
Modules you need to take at the same time
Co Requisite Comment
N/A
Aims
This module will provide students with working knowledge of software widely-used for numerical analysis in the Geosciences and within a range of industries (e.g. data science, environmental consultancy/engineering). By the end of the module, students will be equipped with knowledge and skills to be able to organise, query, analyse and display environmental datasets. This skillset will be developed through the completion of practical exercises using research datasets acquired from across the sub-disciplines of the Geosciences. Following acquisition of the core skills, students will apply their knowledge to solve real-world problems.
This module is split into two parts:
Part 1
Students will be provided with skills and experience of working with the MATLAB programming language for data import, manipulation, processing, and analysis using a range of environmental datasets. The focus of this section is the development of code capable of processing datasets that are large, geo-spatial and/or time series.
Part 2
This section teaches key statistical methods and graphical techniques for the analysis of geoscience data using the R language and statistical computing environment. The first lecture and practical will provide an introduction (or refresher) to basic statistical methods and R/RStudio software. Subsequent sections will explore advanced regression methods and techniques for the analysis and display of time series and multivariate data.
Outline Of Syllabus
Part 1:
• Fundamentals of MATLAB: Key commands, variable types, structure.
• Matrix and vector operations.
• Data handling: Data import, quality control.
• Time series analysis: Managing time, interpolation, periodicity, filtering and smoothing, automation, data export.
• Statistics: Hypothesis testing, uni-variate and bi-variate tests, quantifying uncertainty.
• Image analysis: Image pre-processing, enhancement, registration, movement detection.
• Visualisation: Production of publication quality plots from 2D and 3D datasets.
Part 2:
• Introduction to R and RStudio: using tidyverse packages for data import, cleaning, restructuring and
summary.
• Visualising earth science data: traditional and modern graphics, publication quality plots and
interactive web graphics.
• Refresher on basic statistical methods using R (correlation, least squares regression, t-test,
analysis of variance)
• Advanced regression: multiple regression and model building, generalised linear models (GMLs),
generalised additive models (GAMs), classification and regression trees (CARTs).
• Analysis of multivariate data: ordination and cluster analysis; analysis of time-series data.
Learning Outcomes
Intended Knowledge Outcomes
By the end of the module students will have:
Knowledge of the main statistical methods used for the analysis of numerical geoscience data.
Knowledge of techniques for the automated analysis of numerical geoscience data.
The ability to critically evaluate the appropriateness of a statistical method for a particular data set or analysis.
The ability to select and apply an appropriate statistical method for analysis of their own data.
The knowledge and confidence to use literature and internet sources to seek additional help in the
analysis of their own data.
Intended Skill Outcomes
By the end of the module students will be able to:
Import, process, and visualize a range of environmental datasets.
Apply programming skills and knowledge of a range of statistical methods to messy, real-world data
using MATLAB and R computing software.
Adapt, develop, and apply code for the analysis of independently acquired datasets within their own
research projects.
Teaching Methods
Teaching Activities
Category | Activity | Number | Length | Student Hours | Comment |
---|---|---|---|---|---|
Scheduled Learning And Teaching Activities | Lecture | 10 | 1:00 | 10:00 | PiP |
Scheduled Learning And Teaching Activities | Practical | 10 | 2:00 | 20:00 | A - follow lecture in same room - PiP |
Scheduled Learning And Teaching Activities | Practical | 10 | 3:00 | 30:00 | B - morning on day after lecture + A - PiP |
Guided Independent Study | Project work | 2 | 65:00 | 130:00 | N/A |
Scheduled Learning And Teaching Activities | Drop-in/surgery | 10 | 1:00 | 10:00 | Follow B on same day |
Total | 200:00 |
Teaching Rationale And Relationship
Part One: The module will be taught over a two-week period. This will take the form of five, three-hour blocks of teaching comprising of a lecture to provide background and underpinning theory, followed by a two hour practical session where students will apply the techniques to a single practical example. Following this, students will be provided access to additional resources where they will be expected to apply their knowledge and skills to an extended problem. Students will be allocated time to work on this problem independently the following day (Practical B) and a drop-in surgery will be available in the afternoon to troubleshoot any problems and offer further guidance.
Part Two: Staff teaching will take place in the form of short (30 minute) lectures followed by a 30 minute “taster” practical where the students can apply methods learned in lectures to real-world data. Students will then work independently during the afternoon on more substantial practical exercises. There will be a one-hour surgery session to troubleshoot problems and provide feedback on student work.
Reading Lists
Assessment Methods
The format of resits will be determined by the Board of Examiners
Other Assessment
Description | Semester | When Set | Percentage | Comment |
---|---|---|---|---|
Portfolio | 1 | M | 50 | 1500 word equivalent practical portfolio report |
Portfolio | 1 | M | 50 | 1500 word equivalent practical portfolio report based on statistical analysis of bespoke datasets |
Assessment Rationale And Relationship
The practical portfolio will evaluate students’ ability to analyse a series of bespoke geoscience datasets using a range of statistical techniques. The portfolio will test their knowledge in terms of selecting an appropriate statistical method, assessing the assumptions of the method, interpreting the output, and relating results to the wider environmental context of the problem. The portfolio will also test students’ programming, graphical and other practical skills to perform the analysis and produce tabular and/or graphical summaries of results. A practical report is essential given the practical nature of the module.
Timetable
- Timetable Website: www.ncl.ac.uk/timetable/
- GEO8026's Timetable
Past Exam Papers
- Exam Papers Online : www.ncl.ac.uk/exam.papers/
- GEO8026's past Exam Papers
General Notes
N/A
Welcome to Newcastle University Module Catalogue
This is where you will be able to find all key information about modules on your programme of study. It will help you make an informed decision on the options available to you within your programme.
You may have some queries about the modules available to you. Your school office will be able to signpost you to someone who will support you with any queries.
Disclaimer
The information contained within the Module Catalogue relates to the 2024 academic year.
In accordance with University Terms and Conditions, the University makes all reasonable efforts to deliver the modules as described.
Modules may be amended on an annual basis to take account of changing staff expertise, developments in the discipline, the requirements of external bodies and partners, and student feedback. Module information for the 2025/26 entry will be published here in early-April 2025. Queries about information in the Module Catalogue should in the first instance be addressed to your School Office.